Abstract
This study uses administrative claims data to describe trends in use of ICD-10-CM diagnosis codes for novel coronavirus patients in January-May 2020, before and afer the April 1 release of the U07.1 code to facilitate billing for and case monitoring of COVID-19.
Administrative claims may provide an understanding of the clinical and economic burden of novel coronavirus disease 2019 (COVID-19). However, their limited sensitivity in identifying conditions, such as sepsis,1 suggests the need to assess whether billing codes reliably capture COVID-19 discharges.
A new diagnosis code for COVID-19 (International Statistical Classification of Diseases, Tenth Revision, Clinical Modification [ICD-10-CM] code U07.1) was introduced on April 1, 2020, to facilitate billing and case monitoring.2 For discharges prior to April 1, 2020, the Centers for Disease Control and Prevention advised coding for the clinical syndrome (eg, pneumonia) and “other coronavirus as the cause of diseases classified elsewhere” (ICD-10-CM code B97.29).3
We examined the uptake in COVID-19–specific coding (ICD-10-CM code U07.1), the transition from legacy coding, and the accuracy of the COVID-19–specific code using severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) polymerase chain reaction (PCR) testing as the reference standard.
Methods
The Premier Healthcare Database, an administrative all-payer repository that covers approximately 20% of all US hospitalizations from 48 states, was queried for inpatient discharges between January 1, 2020, and May 31, 2020, with COVID-19–specific coding or legacy coding. Only hospitals that reported data for every week during the study period were included. A descriptive analysis was performed by week to examine when and to what extent each hospital transitioned from legacy coding to COVID-19–specific coding.
Discharges from hospitals that also reported microbiology data through the TheraDoc clinical surveillance system also were queried for SARS-CoV-2 PCR test results from respiratory specimens. A positive SARS-CoV-2 PCR test result during or up to 4 weeks prior to the hospitalization was used as the reference standard against which the sensitivity, specificity, positive predictive value, and negative predictive value of ICD-10-CM code U07.1 were calculated. All analyses were performed using SAS version 9.4 (SAS Institute Inc). Given the use of deidentified data exclusively, the study was deemed not to require ethics board review based on the policy of the Office of Human Subjects Research Protections, National Institutes of Health, under the revised Common Rule.
Results
Between January 1, 2020, and May 31, 2020, 3 128 064 inpatient discharges were identified in 875 hospitals, of which 74 084 (2.37%) recorded specific and 8656 (0.28%) recorded legacy coding for COVID-19. Legacy coding captured the early increase in putative COVID-19 discharges prior to the use of ICD-10-CM code U07.1, increasing from 706 discharges in February 2020 to 6636 discharges in March 2020 (Figure). However, hospitals rapidly began using ICD-10-CM code U07.1 within 2 weeks of its release, at which point legacy coding decreased to prepandemic levels.
Among 150 hospitals reporting microbiology data, 51 907 inpatient discharges with SARS-CoV-2 PCR test results were identified during April and May 2020. Within this cohort, the sensitivity of ICD-10-CM code U07.1 was 98.01% (95% CI, 97.62%-98.39%), the specificity was 99.04% (95% CI, 98.95%-99.13%), the positive predictive value was 91.52% (95% CI, 90.77%-92.27%), and the negative predictive value was 99.79% (95% CI, 99.75%-99.83%) (Table). Among patients tested between April and May 2020, the mean percentage of positive SARS-CoV-2 PCR test results was 9.57% (4965/51 907).
Table. Accuracy of the ICD-10-CM COVID-19–Specific Diagnosis Code in Capturing Inpatient Discharges Displaying PCR-Confirmed COVID-19 From 150 Hospitals Between April 1, 2020, and May 31, 2020.
SARS-CoV-2 PCR test result | Total | ||
---|---|---|---|
Positive | Negative | ||
No. of patients by use of ICD-10-CM code U07.1 | |||
Code useda | 4866 | 451 | 5317 |
Code not used | 99 | 46 5491 | 46 590 |
Total No. of patients | 4965 | 46 942 | 51 907 |
Abbreviations: COVID-19, coronavirus disease 2019; ICD-10-CM, International Statistical Classification of Diseases, Tenth Revision, Clinical Modification; PCR, polymerase chain reaction; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2.
The sensitivity of ICD-10-CM code U07.1 was 98.01% (95% CI, 97.62%-98.39%), the specificity was 99.04% (95% CI, 98.95%-99.13%), the positive predictive value was 91.52% (95% CI, 90.77%-92.27%), and the negative predictive value was 99.79% (95% CI, 99.75%-99.83%).
Discussion
Despite its off-cycle release, the diagnosis code for COVID-19 (ICD-10-CM code U07.1) was widely adopted across study hospitals and rapidly replaced a stop-gap approach in coding for COVID-19 that relied on a preexisting nonspecific code for coronavirus. The COVID-19–specific code showed high sensitivity and specificity compared with the PCR test results. The lower positive predictive value reflects discharges with the COVID-19–specific ICD-10-CM code U07.1 and discharges with negative SARS-CoV-2 PCR test results. These patients may have been diagnosed outside the included hospitals or diagnosed more than 4 weeks prior to admission, or the diagnosis was based on clinical rather than laboratory grounds.
The World Health Organization has provided a specific code (ICD-10-CM code U07.2) for patients with negative SARS-CoV-2 PCR test results,4 but this code is not currently available for use in the US. Limitations include the potential for incomplete data due to ongoing data collection, selection bias in those with PCR test results available, and false-negative PCR test results.5 Furthermore, hospitals reporting microbiology data may not have been representative of all US hospitals.
The findings suggest that hospitals appear to provide reasonably accurate COVID-19 diagnosis codes in administrative data, and these codes may be a reasonable means to track inpatient discharges and costs associated with COVID-19. As the pandemic continues to evolve, it will be important to reassess the reliability of ICD-10-CM code U07.1 to capture COVID-19 cases.
Section Editor: Jody W. Zylke, MD, Deputy Editor.
References
- 1.Rhee C, Dantes R, Epstein L, et al. ; CDC Prevention Epicenter Program . Incidence and trends of sepsis in US hospitals using clinical vs claims data, 2009-2014. JAMA. 2017;318(13):1241-1249. doi: 10.1001/jama.2017.13836 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Centers for Disease Control and Prevention ICD-10-CM official coding and reporting guidelines: April 1, 2020 through September 30, 2020. Accessed October 9, 2020. https://www.cdc.gov/nchs/icd/icd10cm.htm
- 3.Centers for Disease Control and Prevention ICD-10-CM official coding guidelines—supplement coding encounters related to COVID-19 coronavirus outbreak (effective: February 20, 2020). Accessed October 9, 2020. https://www.cdc.gov/nchs/icd/icd10cm.htm
- 4.World Health Organization Emergency use ICD codes for COVID-19 disease outbreak. Accessed October 9, 2020. https://www.who.int/classifications/icd/covid19/en/
- 5.Woloshin S, Patel N, Kesselheim AS. False negative tests for SARS-CoV-2 infection—challenges and implications. N Engl J Med. 2020;383(6):e38. doi: 10.1056/NEJMp2015897 [DOI] [PubMed] [Google Scholar]