Lambin et al. introduced the term radiomics in 2012 (1) to describe the extraction of biologically relevant quantitative features from radiological images. They suggested that radiomic features (RFs), the invisible tissue infrastructural components of the objects being imaged, might be a valuable way to study cancer using computed tomography (CT) and other modalities. Such imaging studies are easily repeated and provide in vivo visualization and quantitative analysis of RFs throughout an imaged mass. Thus, they could support a personalized precision medicine approach to cancer diagnosis and serial assessment and prediction of response to treatment. Even so, radiomics has not evolved into a widely used, reliable component of cancer evaluation. The complicated nature of radiomics and its validation have raised questions, as have the reproducibility and generalizability of texture analysis and other fundamental components of the radiomics signature (2). In this issue of the Journal, Dercle et al. (3) demonstrated progress in several aspects of cancer analysis by radiomics and provided insights into future validation work.
In oncology, radiomics analysis involves the calculation of RFs from the images of the cancers. A wide variety of such features can be calculated. Examples include lesion size and shape, spatial histograms, and descriptors of the spatial arrangement of pixel intensities (texture) such as gray-scale co-occurrence matrices. These calculations are enabled by using several available software packages (3), but did not include pyradiomics (4). For example, in the Dercle study, the authors reported considering 1757 RFs and a nearly equal number of deep learning features. From these, the authors selected the four features most closely correlated with their image findings for analysis and validation. In 13 radiomics cancer studies summarized by Lambin et al. in 2017 (5), the features that proved useful varied, even in the same type of cancer. The development of a standard group of features and criteria for evaluating those features would be useful, although the biological variations of cancer may make this difficult. Standardization clearly will not be achieved unless the components of the radiomic analysis itself are more standard. To support this, Lambin et al. (5) have recommended a rigorous 16-component radiomics quality score (RQS) and transparent reporting of clinical radiomics study methods. This RQS was used and reported by Dercle et al. (3).
Dercle et al. (3) reported data involving 667 patients with metastatic colon cancer. These patients were part of a prospective blinded clinical trial comparing first-line treatment response of patients who received chemotherapy alone or in combination with an epidermal growth factor receptor (EGFR) inhibitor. Repeat CT scans at baseline and after 8 weeks of therapy were used to monitor change in the size and RFs of hepatic metastases. The CT scans were acquired at several different institutions following the same CT protocols, but the scan quality varied. Experienced radiologists blinded to outcomes were later asked to rank all CT images as high quality or standard in quality. In patients who received EGFR inhibitors, RFs derived from both high- and standard-quality CTs provided statistically significantly better prediction of treatment response and overall survival than change in tumor size or the baseline KRAS mutational status. Because these RFs were acquired from multiple CT scanners of varying quality in several institutions, the authors concluded that their radiomics signatures were resilient to CT technological variations and likely generalizable.
Dercle et al. (3) reported in Supplementary Materials that their RQS analysis yielded a conformity score of 78% (28 of 36 points) with the 16 components of the RQS. Reporting such compliance with the RQS is a positive step and should be a requirement of future radiomics research. CT texture phantoms, which could have helped document and possibly compensate for CT inconsistencies, are part of the RQS but were not available in the Dercle study. Such phantoms for CT radiomics have been developed (6) and validated (7) and should be part of future radiomics studies whenever possible.
Although Dercle reported results from 667 patients with metastatic colon cancer, there were only 100 patients in the validation group receiving anti-EGFR therapy. Samples of this size can be problematic in studies that employ deep learning approaches that are prone to overfitting. Dercle applied proper approaches to minimize overfitting, but whether overfitting was controlled remains uncertain. Validation of their RFs by prospective analysis of larger independent datasets of patients with metastatic colon cancer is warranted. In addition, retrospective validation of RFs from Dercle or other radiomic studies against large datasets with known tissue diagnoses—such as datasets that might be available from the National Institutes of Health, the American College of Radiology Imaging Network (8), or other sources—also could provide opportunities for radiomic validation.
The Dercle study shows many aspects of the rigor and attention to scan quality needed to allow radiomics to make consistent contributions to personalized cancer imaging. It also shows the complexity of radiomics analysis. Is radiomics too complex to become generally established in the radiological community? The work of Lambin et al. (1,5), that of the Image Biomarker Standardization Initiative (9), and others suggests that standardization and rigorous quality control are needed for radiomics to succeed. Will this be sufficient, or will the complexities of radiomics lead to it being available only in specialized imaging centers that serve as analysis referral sites? Further work is warranted to determine if the potential shown by the Dercle study and other radiomics work can be broadly validated and to establish the proper approach to radiomics in cancer evaluation.
Funding
This editorial work was supported in part by the Intramural Research Program of the National Institutes of Health Clinical Center.
Notes
Dr Alderson is Dean Emeritus of the School of Medicine at Saint Louis University in St Louis, MO, and is the corresponding author. He was on the faculty in the Department of Radiology at Columbia University Medical Center (CUMC) until 2008 but has had no ongoing relationship with CUMC since that time. Dr Alderson has no other conflicts of interest to report. Dr Summers is a Senior Investigator in the Imaging Biomarkers and Computer-Aided Diagnosis Laboratory in the Department of Radiology at the National Institutes of Health Clinical Center, Bethesda, MD. He receives royalties from iCAD, Philips, ScanMed, and PingAn. His laboratory receives support from PingAn and has received GPU card donations from NVIDIDA.
The funder had no role in the writing of this editorial or decision to submit it for publication.
References
- 1. Lambin P, Rios-Velazquez E, Leijenaar R, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Summers RM. Texture analysis in radiology: does the emperor have no clothes? Abdom Radiol. 2017;42(2):342–345. [DOI] [PubMed] [Google Scholar]
- 3. Dercle L, Lu L, Schwartz LH, et al. Radiomic response signature for identification of metastatic colon cancer sensitive to therapies targeting the EGFR pathway. J Natl Cancer Inst. 2020;112(9):902–912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Van Greithuysen JJM, Federov A, Parmar C, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77( 21):e104–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Lambin P, Leijenaar RTH, Deist TM, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. 2017;14(12):749–762. [DOI] [PubMed] [Google Scholar]
- 6. Mackin D, Fave X, Zhang L, et al. Measuring computed tomography scanner variability of radiomics features. Invest Radiol. 2015;50(11):757–765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Hassan MS, Zhang GG, Latifi K, et al. Intrinsic dependencies of CT radiomic features on voxel size and number of gray levels. Med Phys. 2017;44(3):1050–1062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Hillman BJ, Gatsonis C.. The American College of Radiology Imaging Network (ACRIN) – Clinical Trials of diagnostic imaging and image-guided treatment. Semin Onc. 2008;35(5):460–469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Vallieres M, Zwanenburg A, Badic B.. Responsible radiomics research for faster clinical translation. J Nucl Med. 2018;59(2):189–193. [DOI] [PMC free article] [PubMed] [Google Scholar]