Summary box.
Performance-based financing (PBF) is widely implemented despite limited available evidence on its effectiveness.
There is a need to assess how PBF works in different contexts within and across settings in order to inform country-level decisions as to whether PBF may be a suitable health financing option and how it should be shaped to suit different contexts within a country.
This commentary intends to inspire research looking beyond average programme effects into understanding heterogeneity in PBF programmes and their effects.
As a starting point, we propose to document and analyse all potential sources of heterogeneity in PBF.
We further provide initial guidance to conduct systematic analyses of heterogeneity of PBF by measuring and understanding differential effects and what drives or explains them.
Introduction
Over the last decade, performance-based financing (PBF) has gained momentum as a health financing innovation, which combines linking healthcare payments to performance with increased provider autonomy and supervision.1 2 The combination of these elements is expected to boost supply-side efforts towards increasing quantity and quality of service provision, triggering a demand-side response towards improved service utilisation.1 3 4
A recent paper by Paul et al has critically questioned the widespread introduction of PBF in light of the limited available evidence on its effectiveness.5 The response to this paper has been varied, with authors advancing arguments for and against PBF. Some African PBF implementers have claimed that PBF is an evolving strategy with potential benefits on health systems despite its existing challenges.6 Others have drawn attention to the unintended consequences of PBF7 or to the need to assess the economic value of such an approach.8 Beyond their diverse arguments, however, most authors have concurred with Paul et al5 on the limited scope of currently available evidence and have postulated the need to better assess how PBF works under different contextual constraints within and across settings.9 10
Our commentary positions itself against this background, acknowledges the limited scope of current evidence on PBF, and explicitly argues in favour of devoting more effort to unravel heterogeneity across and within settings. Our argument is based on the recognition that by virtue of how impact evaluations are designed, the focus has been on the average effect, which masks important heterogeneity across settings, providers and users.11–13 To date, only a handful of studies have assessed heterogeneity of PBF effects across population subgroups4 14 15 or across health providers.16–18 Similarly, little attention has been devoted to understanding which factors can explain heterogeneity in the response to PBF or why PBF stimulates changes in some instances, but not in others.3 4 10
In light of the above, we call for more systematic analyses of heterogeneity, defined in relation to both the need to report differential effects and the need to understand what drives or explains such differential effects within and across settings. We first define and outline potential sources of heterogeneity and then offer initial guidance on how to measure and understand heterogeneity.
Potential sources of heterogeneity in PBF
PBF interventions are not uniformly designed or implemented across settings. Variations in design, implementation models, and settings, inevitably lead to substantial heterogeneity in PBF programmes and eventually effects.3 4 19 However, publications rarely detail these variations, inevitably omitting underlying programme heterogeneity and hence nurturing a false view of PBF as a monolithic intervention. Furthermore, within a single setting, PBF addresses organisational units and actors whose intrinsic diversity represents an additional source of heterogeneity for PBF implementation and effects. Hereafter, we address three key sources of heterogeneity.
Design features of PBF
First, we highlight the need to account for PBF design features as part of a researcher’s core engagement into unravelling heterogeneity. PBF programmes vary in many aspects, all worth reporting given their potential to shape responses to the programme: what services are incentivised and how (restricted vs extensive service package; case based vs target payments); who is eligible for incentives (individual providers vs health facilities vs both); the level of autonomy granted to providers; frequency and content of supervision and verification visits.19 It is also relevant to describe how performance is measured and along which dimensions and to consider whether PBF is implemented on its own or in conjunction with demand-side interventions.
Contextual and structural factors
Relevant contextual characteristics can be identified at international, national, local and institutional level, as well as at the level of the individual actors.3 At national level, countries differ in terms of health system setup (eg, financing mechanisms, resource availability) and pre-PBF performance. In some instances, heterogeneity arises as PBF is implemented alongside another health financing reform (eg, health insurance, user fee removal) or in a fragile and conflict-affected region.9 At subnational level, PBF effects may for instance vary substantially across districts and facilities, levels of care or public and private facilities. This heterogeneity may be due to variation in: resource availability; complexity and scope of service provision; health staff’s availability, competence and motivation; facility remoteness and spread of catchment populations; and pre-PBF performance in regard to service coverage and quality.15 17 18
Implementation processes and progress
Variations in implementation fidelity and/or implementation failures can also explain heterogeneity both within and across PBF programmes and settings.3 Process analyses are useful to detect differences in implementation fidelity across settings as well as within single programmes. Researchers are advised to document losses in implementation fidelity at different levels (from programme orientation to implementing activities to timeliness of PBF payments) as an important potential source of heterogeneity.
How to measure heterogeneity in PBF
Having acknowledged diverse sources of heterogeneity, we provide some initial practical guidance on how to look beyond average programme effects using both quantitative and qualitative methods, the former being oriented at measurement and the latter at in-depth understanding. The propositions below can be applied to assess heterogeneity across a wide range of PBF-related outcomes, including providers’ motivation and performance, health service utilisation and quality of service provision.
Capture health systems factors, community and population characteristics
One’s capacity to assess heterogeneity begins with having access to comprehensive information on the characteristics of both healthcare providers implementing PBF and target communities. Similarly, information on contextual elements, implementation models and implementation fidelity at different levels of the health system is needed. Depending on the specific focus of one’s analysis, this information can be captured quantitatively through use of routine data sources (eg, Health Management Information Systems), facility-based and population-based surveys, or qualitatively through means of document reviews, observations and interview methods.
Identify relevant subgroups to analyse heterogeneity of PBF effects
The identification of potential sources of heterogeneity can be done through data disaggregation. This entails categorising data by key actors (eg, different sets of providers/users), by setting (eg, rural–urban, different districts), by different implementation models and different levels of implementation fidelity, or by any other conceptually pertinent factor, including, for example, facilities baseline performance levels or managerial capacity. Assessing heterogeneity quantitatively or qualitatively effectively entails having sufficient information to allow for subgroup analysis (comparison between groups) in relation to any of the above-mentioned categories. This of course relies on having engaged in comprehensive data collection efforts from the onset of the study and on defining relevant subgroups in relation to relevant sources of heterogeneity, the specific research questions being addressed, the study conceptual model or the intervention broader contextual elements.
Select an appropriate analytical approach
When assessing heterogeneity quantitatively, the choice for an appropriate quantitative analytical approach largely relies on the typology of the data available—time series data, panel data or repeated cross-sectional data—as well as on the specific research question being addressed. Within the PBF literature in low-income and middle-income countries, only a few studies to date have explicitly used interactions terms and/or stratified analysis to assess differential impacts across subgroups of providers and users.14–17 Some authors have proposed additional analytical options to estimate heterogeneity of programme effects. For instance, some authors have used quantile-specific treatment effects,11 13 while others have used hierarchical linear models on propensity score strata to estimate the distributional impact of policies.20
Assessing heterogeneity qualitatively means assessing how different constituencies within and across settings define and experience PBF in light of broader contextual and implementation-specific elements. Thematic descriptive qualitative analysis has largely dominated this body of literature.18 We note that with a few exceptions, quantitative and qualitative analytical approaches have to date rarely been merged into a single research effort to explore specifically heterogeneity in PBF effects.
Conclusion
With this brief commentary, we wish to inspire the research and policy community to look beyond average programme effects, devoting more attention towards measuring and understanding heterogeneity in PBF programmes and their effects. Understanding heterogeneity represents an essential step towards establishing the needed evidence base to inform country-level decisions as to whether PBF may be a suitable health financing option for a given setting. In addition, understanding heterogeneity and its sources also represents the initial step towards adjusting design and implementation to take into account the different actors and realities that may co-exist within a single setting.
Footnotes
Twitter: @peter_binyaruka, @_julia_lohmann_, @ManuelaDeAllegr
Contributors: PB conceptualised this commentary together with MDA. PB wrote the first draft of the commentary, while JL and MDA revised the commentary. All authors read and approved the final version.
Funding: The authors have not received any funding for writing this commentary.
Competing interests: None declared.
Patient consent for publication: Not required.
Provenance and peer review: Not commissioned; internally peer reviewed.
Data availability statement: This article did not use any data.
References
- 1.Meessen B, Soucat A, Sekabaraga C. Performance-based financing: just a donor FAD or a catalyst towards comprehensive health-care reform? Bull World Health Organ 2011;89:153–6. 10.2471/BLT.10.077339 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Fritsche G, Soeters R, Meessen B. Performance-based financing toolkit. Washington DC: The World Bank, 2014. [Google Scholar]
- 3.Witter S, Toonen J, Meessen B, et al. . Performance-based financing as a health system reform: mapping the key dimensions for monitoring and evaluation. BMC Health Serv Res 2013;13:367. 10.1186/1472-6963-13-367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Renmans D, Holvoet N, Orach CG, et al. . Opening the 'black box' of performance-based financing in low- and lower middle-income countries: a review of the literature. Health Policy Plan 2016;31:1297–309. 10.1093/heapol/czw045 [DOI] [PubMed] [Google Scholar]
- 5.Paul E, Albert L, Bisala Badibanga N'Sambuka, et al. . Performance-based financing in low-income and middle-income countries: isn't it time for a rethink? BMJ Glob Health 2018;3:e000664. 10.1136/bmjgh-2017-000664 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mayaka Ma-Nitu S, Tembey L, Bigirimana E, et al. . Towards constructive rethinking of PBF: perspectives of implementers in sub-Saharan Africa. BMJ Glob Health 2018;3:e001036. 10.1136/bmjgh-2018-001036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Turcotte-Tremblay A-M, De Allegri M, Gali-Gali IA, et al. . The unintended consequences of combining equity measures with performance-based financing in Burkina Faso. Int J Equity Health 2018;17:109. 10.1186/s12939-018-0780-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chi Y-L, Gad M, Bauhoff S, et al. . Mind the costs, too: towards better cost-effectiveness analyses of PBF programmes. BMJ Glob Health 2018;3:e000994. 10.1136/bmjgh-2018-000994 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bertone MP, Falisse J-B, Russo G, et al. . Context matters (but how and why?) a hypothesis-led literature review of performance based financing in fragile and conflict-affected health systems. PLoS One 2018;13:e0195301. 10.1371/journal.pone.0195301 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Borghi J, Singh NS, Brown G, et al. . Understanding for whom, why and in what circumstances payment for performance works in low and middle income countries: protocol for a realist review. BMJ Glob Health 2018;3:e000695. 10.1136/bmjgh-2017-000695 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Djebbari H, Smith J. Heterogeneous impacts in PROGRESA. J Econom 2008;145:64–80. 10.1016/j.jeconom.2008.05.012 [DOI] [Google Scholar]
- 12.Markovitz AA, Ryan AM. Pay-for-performance: disappointing results or masked heterogeneity? Med Care Res Rev 2017;74:3–78. 10.1177/1077558715619282 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bitler MP, Gelbach JB, Hoynes HW. What mean impacts miss: Distributional effects of welfare reform experiments. Am Econ Rev 2006;96:988–1012. 10.1257/aer.96.4.988 [DOI] [Google Scholar]
- 14.Lannes L, Meessen B, Soucat A, et al. . Can performance-based financing help reaching the poor with maternal and child health services? the experience of rural Rwanda. Int J Health Plann Manage 2016;31:309–48. 10.1002/hpm.2297 [DOI] [PubMed] [Google Scholar]
- 15.Binyaruka P, Robberstad B, Torsvik G, et al. . Who benefits from increased service utilisation? examining the distributional effects of payment for performance in Tanzania. Int J Equity Health 2018;17:14. 10.1186/s12939-018-0728-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sherry TB, Bauhoff S, Mohanan M. Multitasking and heterogeneous treatment effects in pay-for-performance in health care: evidence from Rwanda. Am J Health Econ 2017;3:192–226. 10.1162/AJHE_a_00072 [DOI] [Google Scholar]
- 17.Binyaruka P, Robberstad B, Torsvik G, et al. . Does payment for performance increase performance inequalities across health providers? A case study of Tanzania. Health Policy Plan 2018;33:1026–36. 10.1093/heapol/czy084 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.De Allegri M, Bertone MP, McMahon S, et al. . Unraveling PBF effects beyond impact evaluation: results from a qualitative study in Cameroon. BMJ Glob Health 2018;3:e000693. 10.1136/bmjgh-2017-000693 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Eijkenaar F. Key issues in the design of pay for performance programs. Eur J Health Econ 2013;14:117–31. 10.1007/s10198-011-0347-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhou X, Xie Y. Heterogeneous treatment effects in the presence of self-selection: a propensity score perspective. SSRN 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]