. 2021 Apr 7;28:97–115. doi: 10.1016/j.ctro.2021.03.006

Table 3.

Recommendations for conducting and reporting studies that investigate radiogenomic associations with tumor immune phenotypes.

| Process | Considerations | Recommendations |
| --- | --- | --- |
| Study design | Study registration | Pre-register studies in databases such as the Open Science Framework (OSF) |
| | Cohort selection | Focusing on specific molecular subtypes or subclasses of cancers may enable more accurate radiogenomic models |
| | | Meta-analysis of multiple cohorts can be used to achieve more generalizable models |
| | Study design | A prospective study design that enables longitudinal feature assessment may be ideal for generating models to predict immunotherapy response and identify biomarkers of resistance |
| | | For retrospective study designs, statistical and modeling approaches should be decided a priori |
| Evaluating molecular data | Tumor and TME gene expression data procurement and processing | Use RNA-seq for assessing gene expression; refer to Conesa et al. 2016 for a review of good data practices [121] |
| | | RNA-seq may eventually be supplanted by single-cell RNA-seq, which can improve the ability to distinguish tumor versus immune cell gene expression |
| | Pathway and immune infiltration analysis | Software such as Gene Set Enrichment Analysis (GSEA), Ingenuity Pathway Analysis, DAVID, and Metascape is standard for pathway enrichment analysis |
| | | Approaches including single-sample GSEA (ssGSEA), CIBERSORT, and Immunoscore are useful for more specific quantification of the types of tumor immune cell infiltration |
| | Cell markers by IHC | Specific staining of cell surface markers remains the gold standard for quantifying immune cell infiltration |
| | | To increase staining throughput, consider using tissue microarrays and multiplexed IHC |
| | Quantifying TILs by H&E | H&E allows for good quantitation of TILs, but is often subject to clinician-reader bias |
| | | Best clinical practices are outlined in Salgado et al. 2015 [122] |
| Image acquisition, processing, and extraction | Image acquisition parameters | Use standardized acquisition parameters |
| | Image pre-processing | Normalize voxel intensities of images, particularly MRI, to extract radiomic features more accurately and reproducibly |
| | Feature definition and extraction | Use feature standardization platforms, such as MITK Phenotyping and the Image Biomarker Standardization Initiative |
| | Tumor segmentation | Use multiple independent observers if segmenting manually, or consider semi-automatic/automatic approaches to maximize reproducibility |
| | Deep learning | Use algorithm visualization methods, such as saliency maps, to increase interpretability/explainability/transparency |
| Modeling and data analysis | Feature selection | Reduce feature dimensionality, for example through regression modeling (e.g. LASSO Cox, Elastic Net) or intra-class feature similarity measures (e.g. intra-class correlation coefficient), to prevent overfitting and improve feature reliability |
| | Model design | The best-performing models for predicting prognosis and immunotherapy response are likely achieved by combining radiogenomic models with other covariates into composite models |
| | | Correct for multiple hypothesis testing where appropriate |
| | Machine learning | Use hold-out data sets to evaluate models and prevent any data leakage from training to evaluation sets |
| | | Validate on data that are independent of the training set and ideally from multi-institutional sources |
| Data transparency and reporting | Public data and code repositories | Share code in open-source repositories like GitHub |
| | | Share imaging data in public repositories like the Imaging Data Commons (IDC) |
| | Radiomics quality score (RQS) | Report the RQS (out of 36) developed by Sanduleanu et al. 2018 [1] |
| | Study reporting checklists | Use the TRIPOD 22-item checklist for model development and validation [120] |

Legend: DAVID: database for annotation, visualization, and integrated discovery; IHC: immunohistochemistry; TIL: tumor infiltrating lymphocyte; H&E: hematoxylin and eosin; MITK: medical imaging interaction toolkit; LASSO: least absolute shrinkage and selection operator; TRIPOD: Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis.
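
Two of the modeling recommendations in Table 3 (holding out data without leakage, and correcting for multiple hypothesis testing) can be illustrated with a minimal Python sketch. It rests on assumptions that the table does not state: leakage is avoided here by splitting at the patient level, so that all lesions or scans from one patient fall on the same side of the split, and the multiple-testing correction shown is the Benjamini-Hochberg false discovery rate procedure, which is one common choice among several. The function names `patient_level_holdout` and `benjamini_hochberg`, the patient labels, and the p-values are hypothetical and serve only as illustration.

```python
import random


def patient_level_holdout(patient_ids, test_fraction=0.2, seed=0):
    """Split sample indices so that no patient appears in both sets.

    patient_ids holds one entry per sample (e.g. per lesion or scan);
    samples sharing an ID are always assigned to the same side.
    """
    unique_patients = sorted(set(patient_ids))
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(unique_patients)
    n_test = max(1, round(test_fraction * len(unique_patients)))
    test_patients = set(unique_patients[:n_test])
    train_idx = [i for i, p in enumerate(patient_ids) if p not in test_patients]
    test_idx = [i for i, p in enumerate(patient_ids) if p in test_patients]
    return train_idx, test_idx


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the
    # adjusted values stay monotone in the rank of the raw p-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical example: seven lesions from five patients.
patients = ["A", "A", "B", "C", "C", "D", "E"]
train_idx, test_idx = patient_level_holdout(patients, test_fraction=0.2, seed=0)

# Hypothetical p-values from four feature-versus-phenotype association tests.
q_values = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

Any preprocessing that learns from the data, such as intensity normalization parameters or feature selection, would then be fitted on `train_idx` only and applied unchanged to `test_idx`; fitting it on the full cohort would reintroduce the leakage the split is meant to prevent.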