Skip to main content
JAMA Network logoLink to JAMA Network
. 2018 Aug 9;4(9):1293–1295. doi: 10.1001/jamaoncol.2018.2979

Assessment of the Accuracy of Disease Coding Among Patients Diagnosed With Sarcoma

Heather G Lyu 1, Leah A Stein 2, Lily V Saadat 1, Sheila N Phicil 2, Adil Haider 1, Chandrajit P Raut 1,3,, for the Dana-Farber/Brigham and Women’s Cancer Center Sarcoma Surgery Group
PMCID: PMC6143007  PMID: 30098150

Abstract

This case study compares diagnosis codes with pathology reports for patients diagnosed with sarcoma to assess the accuracy of disease coding at a health care center.


The rarity of sarcoma makes performing appropriately powered studies challenging and increases the significance of accurate data collection. Tumor registries and population-based databases are increasingly used to determine sarcoma incidence, treatment patterns, and outcomes.1,2,3 The utility of these databases is contingent on meticulous data collection. Although the validity of large databases has been questioned,4 little is documented about the initial coding process. This study characterizes inaccuracies in coding practices that result in incorrect sarcoma surgical diagnostic codes and tumor registry data at a high-volume health care center. Identification of coding practice errors has implications for the validity of larger oncology databases.

Methods

The Brigham and Women’s Hospital Institutional Review Board approved the study and waived the need for patient consent. Patients who underwent resection of primary or recurrent sarcoma between January 1, 2012, and December 31, 2016, by 5 sarcoma surgeons (including C.P.R.) were identified using prospectively collected data from Brigham and Women’s Hospital and Dana-Farber Cancer Institute. Demographic data were not collected.

Diagnoses were confirmed by comparing Brigham and Women’s Hospital operative diagnosis codes (International Classification of Diseases, Ninth Revision [ICD-9], and International Classification of Diseases and Related Health Problems, Tenth Revision [ICD-10]) with pathology reports. Each patient was labeled as true positive, false negative, or true negative. Using the true positive data set, each patient’s Dana-Farber Cancer Institute diagnosis code (International Classification of Diseases for Oncology, Third Revision [ICD-O-3]) was collected to determine the accuracy of the tumor registry. The ICD-O-3 codes are an extension of International Classification of Diseases codes and specify the site and histologic characteristics of neoplasms. Statistical analyses were conducted in May 2017 using Stata, version 13.0 (StataCorp).

Results

During the study period, 2715 patients with soft-tissue and bone oncologic cases were treated by 3 surgical oncologists (1856 cases) and 2 orthopedic oncologists (859 cases). Of these, 1237 patients (855 treated by surgical oncologists, 382 treated by orthopedic oncologists) had a sarcoma diagnosis confirmed by pathologic findings.

On the basis of ICD-9 and ICD-10 codes, 764 of 1237 patients (61.8%) had cases that were accurately coded as sarcoma, 208 of 1237 patients (16.8%) had a nononcologic diagnosis, and 265 of 1237 patients (21.4%) had an organ site–based malignancy code; 487 of 855 patients (57.0%) treated by surgical oncologists and 277 of 382 patients (72.5%) treated by orthopedic oncologists had cases that were accurately coded. Organ-confined sarcoma was commonly coded with a nonsarcoma, organ-site ICD-9 or ICD-10 code (Table). For instance, 49 of 156 (31.4%) gastric gastrointestinal stromal tumor cases and 24 of 46 (52.2%) breast angiosarcoma cases were coded as gastric and breast cancer, respectively (Figure).

Table. ICD-9, ICD-10, and ICD-O-3 Codification of Sarcomas by Specialty.

Diagnosis No. (%) of Cases
ICD-9 and ICD-10 ICD-O-3
Surgical Oncology (n = 855) Orthopedic Oncology (n = 382) Total (N = 1237) Surgical Oncology (n = 718) Orthopedic Oncology (n = 337) Total (N = 1055)
Sarcoma 487 (57.0) 277 (72.5) 764 (61.8) 428 (59.6) 203 (60.2) 631 (59.8)
Nononcologic diagnosis or not listed 109 (12.7) 99 (25.9) 208 (16.8) 269 (37.5) 129 (38.3) 398 (37.7)
Oncologic diagnosis 259 (30.3) 6 (1.6) 265 (21.4) 21 (2.9) 5 (1.5) 26 (2.5)
Gastrointestinal 110 (12.9) 1 (0.3) 111 (41.9) 6 (0.8) 1 (0.3) 7 (0.7)
Breast 32 (3.7) 0 32 (12.1) 5 (0.7) 1 (0.3) 6 (0.6)
Cutaneous 29 (3.4) 0 29 (10.9) 1 (0.1) 1 (0.3) 2 (0.2)
Genitourinary 17 (2.0) 1 (0.3) 18 (6.8) 1 (0.1) 0 1 (0.1)
Gynecologic 13 (1.5) 0 13 (4.9) 4 (0.6) 0 4 (0.4)
Hematologic 9 (1.1) 1 (0.3) 10 (3.8) 3 (0.4) 0 3 (0.3)
Neurologic 8 (0.9) 1 (0.3) 9 (3.4) 1 (0.1) 1 (0.3) 2 (0.2)
Thoracic 0 1 (0.3) 1 (0.4) 0 1 (0.3) 1 (0.1)
Other 41 (4.8) 1 (0.3) 42 (15.8) 0 0 0

Abbreviations: ICD-9, International Classification of Diseases, Ninth Revision; ICD-10, International Classification of Diseases and Related Health Problems, Tenth Revision; ICD-O-3, International Classification of Diseases for Oncology, Third Revision.

Figure. Classification of Gastrointestinal Stromal Tumors and Breast Angiosarcomas.

Figure.

Based on ICD-O-3 codes from the Dana-Farber Cancer Institute tumor registry during an overlapping 4-year period, 631 of 1055 patients (59.8%) had cases that were accurately coded, 26 of 1055 patients (2.5%) had cases that were coded with an other cancer diagnosis, and 398 of 1055 patients (37.7%) had cases that were not listed in the registry.

Discussion

This study emphasizes that the vague nature of definitions for diseases can lead to coding inaccuracies that can be propagated through data sets, which is an issue that possibly extends beyond any single institution. Our study has several limitations. First, the coding inaccuracies as identified in this study may be specific to our institution. Our findings may not be generalizable to all sarcoma centers, and confirmation from other institutions is needed. However, tumor registrars are trained uniformly by American Joint Committee on Cancer guidelines, which raises concern that this issue could be widespread. Consequently, national data sets may not be as comprehensive or useful as expected for studying population-based outcomes for sarcoma. Nevertheless, properly framed questions may still be valid within the limitations of such data sets. Potential reasons for our findings include the heterogeneity, number of histologic subtypes, and variable nomenclature of sarcoma, which renders accurate characterization of cases challenging.5,6 Sarcoma may be inaccurately classified on the basis of the organ site rather than on the basis of the pathologic findings.

National databases—including the National Inpatient Sample; the American College of Surgeons’ and American Cancer Society’s National Cancer Database; the Centers for Disease Control and Prevention National Program of Cancer Registries; and the Surveillance, Epidemiology, and End Results Program—that rely on International Classification of Diseases codes are vulnerable to limitations attributable to inaccurate coding. Gross underestimation in coding data representing sarcoma resections may also contribute to skewed market forecasts. Our findings, if validated by others, suggest that the number of sarcoma cases may be higher than that reported by studies that use these data sets. At present, the net effect of coding errors is unknown. Discussions among surgeons, pathologists, coders, and tumor registrars about how to specify sarcomas are encouraged.

References

  • 1.Ferrari A, Sultan I, Huang TT, et al. . Soft tissue sarcoma across the age spectrum: a population-based study from the Surveillance Epidemiology and End Results database. Pediatr Blood Cancer. 2011;57(6):943-949. doi: 10.1002/pbc.23252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Thiels CA, Bergquist JR, Krajewski AC, et al. . Outcomes of primary colorectal sarcoma: a national cancer data base (NCDB) review. J Gastrointest Surg. 2017;21(3):560-568. doi: 10.1007/s11605-016-3347-0 [DOI] [PubMed] [Google Scholar]
  • 3.Datar M, Khanna R. Inpatient burden of gastrointestinal stromal tumors in the United States. J Gastrointest Oncol. 2012;3(4):335-341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Cooke CR, Iwashyna TJ. Using existing data to address important clinical questions in critical care. Crit Care Med. 2013;41(3):886-896. doi: 10.1097/CCM.0b013e31827bfc3c [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Dancsok AR, Asleh-Aburaya K, Nielsen TO. Advances in sarcoma diagnostics and treatment. Oncotarget. 2017;8(4):7068-7093. doi: 10.18632/oncotarget.12548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Jo VY, Fletcher CD. WHO classification of soft tissue tumours: an update based on the 2013 (4th) edition. Pathology. 2014;46(2):95-104. doi: 10.1097/PAT.0000000000000050 [DOI] [PubMed] [Google Scholar]

Articles from JAMA Oncology are provided here courtesy of American Medical Association

RESOURCES