Even a large insurer with a significant market share has difficulty measuring performance reliably because of data limitations, according to this study. This suggests that mechanisms must be developed for multiple stakeholders to collaborate and pool patient data.
Most Pay for Performance (P4P) programs in the U.S. are implemented by a single insurer. Each insurer uses the pooled data from the medical groups it covers to assess the performance of each group. But there is concern that a single insurer does not have sufficient data to create accurate and reliable performance standards. Using seven years (2001-2007) of patient data (a total of 197,905 “person-years”) from a single insurer that covers 20 medical groups in Washington state, this study examines whether annual sample sizes are large enough to establish reliable standards for eight clinical care process measures.
The authors note that the “fragmented organization and implementation” of P4P initiatives carries a “high risk” of unreliable measurement of medical group performance, as this study shows. They call for the development of collaborative mechanisms that allow multiple payers and insurers to pool data, but acknowledge that there are significant competitive and practical challenges.