Health insurance claims data—created for administrative, financial, and reimbursement purposes—are an important source of information and yield insights into a multitude of complex health system and public health problems. Typically, researchers use claims data to explore issues relevant to the costs and effects of health care delivery. These data are viewed as a best fit for answering questions about health care utilization in an eligible user population, not the broader public.
Claims data typically include person-level demographics, medications, inpatient utilization, diagnoses, procedure codes, equipment and supplies, other services, and costs. This rich information set can help us answer research questions about policies and interventions that affect determinants of health and outcomes relevant to public health practitioners. However, public health researchers may not fully embrace claims data’s potential to answer these questions—in part because common myths exist about the relevance and quality of the data. Therefore, we address three prevailing myths regarding claims data to raise awareness about their use in creating reliable evidence that supports public health practice.
MYTH 1: RELEVANT OUTCOMES ARE NOT MEASURED
The first myth is that claims data do not measure outcomes relevant to public health researchers. An important criterion for improving public health is the ability to analyze the effect of an intervention, program, or policy on the determinants of health outcomes or the outcomes themselves. Relevant public health outcomes include mortality, morbidity, prevalence of disease, and health care costs. Claims data can answer research questions that assess theorized changes affecting medical care utilization, costs, mortality, and morbidity. Additionally, they can be used for epidemiological research identifying chronic diseases such as hypertension.1 Claims data can span multiple years, thus helping researchers create a robust longitudinal database. This offers the researcher versatility to examine the consequences of policy changes, such as the effects of Medicaid expansion on unauthorized immigrant women receiving prenatal care.2 Thus, the utility of claims data allows the study of relevant questions related to identifying a chronic disease or analyzing the effect of a change on the determinants of health and health outcomes.
MYTH 2: UNRELIABLE EVIDENCE
The second myth is that the nature of claims data makes their evidence unreliable. To generate reliable evidence, the researcher must carefully design a study to address claims data’s inherent limitations. Claims data contain standardized billing codes that physicians, pharmacies, hospitals, and other health providers submit for payment. In some cases, claims data do not adequately capture the patient’s actual diagnosis and treatment because of code precision, clerical error, and omission of comorbidity codes. “Upcoding” refers to fraudulent assignment of a billing code that is more expensive than the service performed. When analyzing claims data, upcoding potentially makes the population being studied seem sicker than they actually are. Additionally, claims are criticized for lacking critical social determinants of health.
To overcome these limitations, researchers need to apply a few principles to design a valid claims-based study. First, they should define the determinants of health, outcomes, and other measures of interest on the basis of an appropriate theory. Then, they should carefully assess whether these measures exist in claims data and how they are measured. Often, limitations such as the lack of social determinants or issues with coding precision can be managed by merging claims data with other sources from the US Census Bureau or the electronic medical record. Next, researchers need to carefully decide and explain how the sample is identified. This allows increased transparency in understanding the population the sample generalizes to. Selection of appropriate statistical methods also can ensure that other issues that limit the findings’ validity, such as confounding, are properly addressed. Two studies that apply sound design principles assess the effect of the Affordable Care Act (ACA) on access to health care and medication use among low-income populations.3,4 Once findings are generated, their careful interpretation in the context of any remaining limitations that could not be addressed through study design is required to promote accurate use of evidence in public health practice.
MYTH 3: LACKS INSIGHT INTO PRACTICE
The third myth is that evidence from claims data cannot provide insight into public health practice. Researchers can use administrative data to produce evidence that provides meaningful insights into public health practice. These data offer advantages for researchers when randomized controlled trials are not feasible or pose ethical concerns. The aforementioned studies relied on the use of claims data partly because of these concerns.2–4 To transform public health, practitioners need to be able to use the evidence that researchers generate to anticipate the impact of policy changes so they can respond appropriately.5
One example in which evidence from claims data was able to translate into practitioners’ understanding implications for changing policy involves a provision of the ACA regarding concurrent care for children enrolled in Medicaid and their families (section 2302).6 Concurrent care enables curative and palliative care therapies to be administered simultaneously. Before the ACA, children with chronic, complex conditions had to choose between continuing curative therapies and receiving hospice care services.6 Researchers were able to use claims data to demonstrate that the California Medicaid system could provide comprehensive care for children at end of life but also that hospice and home health services were often underutilized.7
Such evidence profiling children with chronic, complex conditions facilitates practitioners’ understanding of the destabilizing effects that repealing the ACA would have. Destabilization would be caused by the elimination of concurrent care for children and their families at the end of life. More specifically, it enabled clinicians, caregivers, and practitioners to anticipate and prepare for children needing hospice services but not being able to access them because of the repeal of the ACA.6 Having an ability to anticipate the effects of policy changes allows appropriate responses that in some cases may include advocacy.
CONCLUSIONS
Claims data have the potential to help researchers measure relevant outcomes, produce reliable evidence, and lend meaningful insight into public health practice. Therefore, these data are an important source to consider in designing research studies, especially in situations in which prospective designs may be infeasible. Using claims data to address a relevant public health research question and create evidence through the application of appropriate, rigorous, and transparent analysis can be particularly rewarding. Only by going through this process can public health researchers overcome some common misconceptions and use claims data to their full potential to support public health practice.
ACKNOWLEDGMENTS
This publication was made possible by the National Institute of Nursing Research (grant R01NR017848).
The authors thank Ronnie Horner, professor, Health Services Policy and Management Department, Arnold School of Public Health, University of South Carolina, for his helpful insights.
Note. The contents of this editorial are solely the responsibility of the authors and do not necessarily represent the official views of the National Institute of Nursing Research or National Institutes of Health.
CONFLICTS OF INTEREST
The authors have no conflicts of interest to declare.
REFERENCES
- 1.Tessier-Sherman B, Galusha D, Taiwo OA et al. Further validation that claims data are a useful tool for epidemiologic research on hypertension. BMC Public Health. 2013;13:51. doi: 10.1186/1471-2458-13-51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Atkins DN, Held ML, Lindley LC. The impact of expanded health insurance coverage for unauthorized pregnant women on prenatal care utilization. Public Health Nurs. 2018;35(6):459–465. doi: 10.1111/phn.12524. [DOI] [PubMed] [Google Scholar]
- 3.Kominski GF, Nonzee NJ, Sorensen A. The Affordable Care Act’s impacts on access to insurance and health care for low-income populations. Annu Rev Public Health. 2017;38:489–505. doi: 10.1146/annurev-publhealth-031816-044555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ghosh A, Simon K, Sommers BD. The effect of health insurance on prescription drug use among low-income adults: evidence from recent Medicaid expansions. J Health Econ. 2019;63:64–80. doi: 10.1016/j.jhealeco.2018.11.002. [DOI] [PubMed] [Google Scholar]
- 5.Erwin PC, Brownson RC. The public health practitioner of the future. Am J Public Health. 2017;107(8):1227–1232. doi: 10.2105/AJPH.2017.303823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Keim-Malpass J, Lindley L. Repeal of the Affordable Care Act will negatively impact children at the end of life. Pediatrics. 2017;140(3):e20171134. doi: 10.1542/peds.2017-1134. [DOI] [PubMed] [Google Scholar]
- 7.Lindley LC, Lyon ME. A profile of children with complex chronic conditions at end of life among Medicaid beneficiaries: implications for health care reform. J Palliat Med. 2013;16(11):1388–1393. doi: 10.1089/jpm.2013.0099. [DOI] [PMC free article] [PubMed] [Google Scholar]