Abstract
Malignant lymphomas are a family of heterogenous disorders caused by clonal proliferation of lymphocytes. 18F-FDG-PET has proven to provide essential information for accurate quantification of disease burden, treatment response evaluation, and prognostication. However, manual delineation of hypermetabolic lesions is often a time-consuming and impractical task. Applications of artificial intelligence (AI) may provide solutions to overcome this challenge. Beyond segmentation and detection of lesions, AI could enhance tumor characterization and heterogeneity quantification, as well as treatment response prediction and recurrence risk stratification. In this scoping review, we have systematically mapped and discussed the current applications of AI (such as detection, classification, segmentation as well as the prediction and prognostication) in lymphoma PET.
Keywords: Artificial intelligence, Deep learning, Detection, Lymphoma, Positron emission tomography (PET), Radiomics, Radiophenomics, Segmentation
Introduction
Lymphomas are a diverse group of hematologic malignancies, which can be broadly categorized into Hodgkin (HL) and non-Hodgkin diseases (NHL) and have a wide range of clinical presentations.1 2-deoxy-2-[Fluorine-18]fluoro-d-glucose (18F-FDG) positron emission tomography/computed tomography (PET/CT) is extensively used for staging and response assessment in HL and NHL.2, 3, 4, 5 The accurate and precise quantification of tumor burden in lymphoma is critical for prognosis and treatment response evaluation and prediction. 18F-FDG PET scans provide valuable information about the metabolism of lesions. This functional information combined with structural (CT or MRI) data can be used to assess the global disease burden in Alzheimer’s disease,6 Crohn’s disease,7 knee inflammation8 as well as lymphoma.9 Therefore, 18F-FDG PET/CT is extremely valuable in the noninvasive assessment of disease burden.10
To determine global disease burden, segmentation of all tumor lesions is a vital step that allows the measurement of metabolically active tumor volumes (MTV), mean activity of the lesion (SUVmean), lesion partial volume corrected metabolic volume product (PVC-MVP: calculated as the product of lesion MTV and lesion PVC-SUVmean), total metabolic tumor volume (TMTV: calculated as the sum of MTV of all lesions), total lesion glycolysis (TLG),11 whole-body metabolic burden (WBMB: calculated as the sum of lesion PVC-MVP of all lesions),10,12 metabolic heterogeneity (MH)13,14 and lesion dissemination (Dmax).15,16 There are, however, various methods for the segmentation of tumor lesions (e.g. manual, thresholding-based, region-based, or boundary-based)17,18 each with high inter-observer variability depending on the operator and segmentation method.19,20 Furthermore, even for an expert, manual segmentation takes time (30–45 minutes per patient depending on tumor burden) as summary measurements of each lesion must be aggregated.19 Lymphoma lesion segmentation is a challenging task due to the large variability in number, size, distribution, uptake, the shape of lesions, and different degrees of glucose metabolism (Fig. 1)21, 22, 23 Normal biodistribution of 18F-FDG creates physiologic intense activity either due to high metabolic rate (such as brain) or high concentration of excreted radiotracer (such as renal collecting ducts and bladder). This normal pattern significantly deteriorates the performance of the crude intensity-based detection and segmentation methods.24,25 Automated segmentation and feature extraction approaches may be an exciting avenue to limit measurement discrepancies and cut image analysis to a fraction of the current required time.19
Fig. 1.

Examples of different sizes and distributions of tumor in 5 patients with diffuse large B-cell lymphoma25.
(From Barrington SF, Meignan M. Time to Prepare for Risk Adaptation in Lymphoma by Standardizing Measurement of Metabolic Tumor Burden. J Nucl Med. 2019;60(8):1096–1102: under Open Access Creative Commons License https://creativecommons.org/licenses/by/4.0/)
In general, the most critical aspects that should be evaluated during lymphoma PET are as follows: (1) quantification of disease burden,26,27 (2) evaluation of therapy response,28 and (3) extraction of additional image information used for prognosis and diagnosis of lymphoma.29,30 Artificial Intelligence (AI) has ample potential to achieve the aforementioned goals by first performing automatic quantification, which entails (1) automatically identifying the location of the abnormality,31 (2) automatically segmenting the lesion,32,33 (3) summarizing each lesion to other dimensions (SUVmean, SUV max).10,34 Finally, AI can enable registration at multiple points in time,35,36 scaling from one space to another. This allows evaluation of lymphoma before and after diagnosis or therapy.37,38 Given such significant promise that AI has in other fields of medical imaging and sporadic relevant evidence specific to lymphoma PET, it is important to scope and map current applications of AI in lymphoma PET imaging. However, at the time of this publication, there has not been a review of various applications of AI as it pertains to lymphoma.
Thus, in this article, we first aim to scope the breadth of evidence and systematically map literature on the topic of AI applications in lymphoma PET to identify key concepts and disseminate research evidence on various AI models. We then depict the potential clinical utility of AI in PET imaging and anticipate future directions that can be expected for AI applications in lymphoma.
A list of abbreviations used in this article are shown in Box 1.
Box 1. Abbreviations.
| AI | Artificial Intelligence | ML | Metabolic Heterogeneity |
| CNN | Convolutional Neural Network | MTV | Metabolic Tumor Volume |
| DL | Deep Learning | NHL | Non-Hodgkin Lymphoma |
| DLBCL | Diffuse Large B-cell Lymphoma | NM | Nuclear Medicine |
| FP | False Positive | OS | Overall Survival |
| FN | False Negative | PFS | Progression-Free Survival |
| 18F-FDG | 18F-fluorodeoxyglucose | RF | Random Forest |
| HL | Hodgkin Lymphoma | SVM | Support Vector Machine |
| HiNA | High Normal Activity | TLG | Total Lesion Glycolysis |
| ML | Machine Learning | TMTV | Total Metabolic Tumor Volume |
Methods
This scoping review was conducted following the preferred reporting items for systematic reviews and meta-analysis extension for scoping reviews (PRISMA-ScR) guidelines.39
Search Strategies
Bibliographic searches were performed in PubMed, EMBASE, Cochrane Library, and Google Scholar for articles published before September 1st, 2021. In PubMed/Medline, Medical Subject Headings (MESH) in all fields were searched for “Artificial Intelligence” (or Deep Learning or Machine Learning or Support Vector Machine (SVM) or Convolutional Neural Network (CNN) or Artificial Neural Network (ANN)) and “Positron Emission Tomography” (PET or PET-CT or PET-MR) and “lymphoma.” The remainder of the studies were identified through manual searches of bibliographies and citations until no further relevant studies were found. One investigator (N.H) independently screened titles and abstracts and selected relevant citations for full-text review.
Eligibility and Exclusion Criteria
Studies that reported the diagnostic measurement of an AI/ML/DL algorithm to investigate any type of lymphoma using PET were sought. Articles were excluded if the study was not written in English. All the nonpeer-reviewed material such as nonpeer-reviewed conference articles and archives as well as studies irrelevant to applications of AI in lymphoma PET imaging were excluded. Studies that comprised a development or assessment of an AI algorithm on PET imaging in human populations diagnosed with lymphoma or any subtypes were eligible. The search included all primary articles since the beginning of 2009.
Data Extraction and Analysis
Key study characteristics (such as tasks/models, methods, results) for selected papers are summarized. According to the specification of the task—segmentation, classification, prognostication—the articles were categorized (Table 1). The details of methods and the AI architecture proposed were recorded. Study characteristics extracted were the purpose of the article, authors, year of publication, AI model design, proposed AI application, and ground truth (GT). Also extracted were information regarding sample size, training sample, testing, and validation samples as well as figures of merit (FoM) such as specificity, sensitivity, dice similarity coefficient (DSC), and Hausdorff distance (HD) (Table 2) depending on the proposed application of the algorithm (see Table 1).
Table 1.
Summary of characteristics of selected literature
| Author (Year) | Tasks Performed | Task Specific Input/Out | FoM | Details (Model-Related) | Details (Ground Truth, Sample Size) |
|---|---|---|---|---|---|
| CNN Models | |||||
| Pinochet et al,43 2021 | Classification (Radiophenomics) | Input: 2D; WB; axial/sagittal/coronal 18F-FDG PET slice Output: slice-level 3-category classification (Benign, Malignant, Equivocal lymph nodes) |
AUC = 0.62 | Evaluate PET Assisted Reporting System (PARS-PET) by Siemens on DLBCL patients CNN model: PET Assisted Reporting System (PARS) |
GT: 2 NM physicians segmented DLBCL lesions Testing: 119 patients (research cohort) + 430 patients (routine cohort) |
| Segmentation | Input 1: 2D; WB; axial/sagittal/coronal 18F-FDG PET slices Input 2: detection map provided by PARS prototype software Output: 2D; segmented lesions with borders masked on PET slice |
DSC = 0.65 (research cohort) DSC = 0.48 (routine cohort) TMTV ICC = 0.68 (research cohort) TMTV ICC = 0.61 (routine cohort) |
|||
| Statistical Classification (Prediction/Prognosis) | Input: TMTV value from WB PET Image (Method: TMTV thresholding) Output: Prognostication (PFS, OS) |
OS Hazard ratio = 2.4 PFS Hazard ratio = 2.1 |
|||
| Sadik et al,44 2021 | Classification (Radiophenomics) | Input: 2D; WB; sagittal/coronal/axial; 18F-FDG PET slice and CT slice (2 channel input) Output: Pt-level 4-class classification [high vs low diffuse bone marrow uptake] × [presence vs absence of focal lesion]) |
PA = 0.85 Kappa = 0.41 |
Highlight foci of skeletal and bone marrow uptake in Hodgkin’s Lymphoma patients CNN model: based on RECOMIA prototype |
GT: 10 independent experienced NM clinicians classified lesions Training: 156 patients Testing: 49 patients |
| Guo et al,45 2021 | Characterization (deepRadiomics) | Input: Manually segmented lesions (3D; axial; 1 channel: Rank 3 Tensor [combined 18F-FDG PET and CT]) Output: 16 × 8 feature maps, total of 128 features |
AUC = 0.88 (for PSI) PSI-based PFS prediction: Spec = 0.80, Sens = 0.83, Accuracy = 0.85 |
Extraction of feature maps surrogates for prognosis prediction in nasal ENKTL. Proposes PSI be a predictor of PFS; PSI is the ratio of the PPV to NPV Model: Weakly supervised deep learning (WSDL) based on Residual Network-18 (ResNet-18) and PNU classifier. |
GT: 1 NM physician (15 y experience) segmented nasal ENKTL lesions Training sample: 64 patients Testing: 20 patients |
| Statistical Classification (Radiophenomics) | Input: Prediction similarity index (PSI) derived from image features Output: Relapsed vs nonrelapsed classes for ENKTL |
||||
| Yuan et al,46 2021 | Detection | Input: 2D; axial; neck/chest/abdomen; 18F-FDG PET slice and CT slices (2 channel input) Output: 2D; axial; detection map with lesions in rectangular boxes |
Sens (chest) = 83.2%, Spec (chest) = 99.75%, Accuracy = 99.5% |
Hybrid Learning for feature fusion of DLBCL Segmentation Hybrid CNN models can create feature fusion maps and quantify the spatial contributions of each modality. PET and CT image feature-based hybrid learning CNN model architecture |
GT: 1 physician manually segmented DLBCL lesions Training and Validation: Cross-validation using a dataset with total of 1242 PET-CT slice pairs from 45 PET-CT samples |
| Segmentation | Input 1: 2D; axial; neck/chest/abdomen; 18F-FDG PET slice and CT slices (2 channel input) Input 2: Detection map results Output: lesions border segmentation map |
DSC = 0.73, MHD = 4.38 mm |
|||
| Zhou et al,38 2021 | Detection | Input: 2D; axial/coronal; WB 18F-FDG PET/CT (1 channel) or 18F-FDG PET alone Output: Map of detected mantle cell lymphoma |
Sens = 0.88 FP/patient = 15 For outside-institute patients Sens = 0.84 FP/patient = 14 |
Xception-based U-Net Localized lesions on PET/CT and labeled each pixel as MCL or not MCL. High FPs/patient needs to be corrected through physicians’ inspection |
GT: 3 NM physicians each with more than 10 y of experience identified and contoured MCL lesions Training: 110 patients Validation: 5-fold cross-validation Testing: 32 outside-institute patients |
| End-to-end Segmentation | Input: 3D; coronal; WB; 18F-FDG PET and 3D CT 2 separate channels Output: Mask of segmented lesions with calculated TMTV on 18F-FDG PET/CT |
JSC = 0.60, DSC = 0.73 Predicted TMTV R = 0.88, 0.82 in first cohort, second cohort, respectively |
Fully automatic segmentation of DLBCL lesions for total MTV prediction – 3D 18FDG-PET/CT | GT: masks were manually obtained with 41% SUVmax adaptive thresholding. TMTV protocol from LIFEx used for VOI semiautomatically segmented. 2 experienced physicians reviewed clustering results and remove physiologic uptakes Training: 639 patients Validation: 5-fold cross-validation Testing: 94 patients |
|
| Weisman et al,48 2020 | End-to-end Segmentation | Input: 3D; coronal; WB; 18F-FDG PET and CT image (2 channels) (Hidden input 2: integrated detection of lymph node map by DeepMedic) Output: Map of masked segmented lesions |
DSC = 0.86 | Measures PET imaging features in pediatric lymphoma PET/CT scans in a fully-automated fashion. Model: an ensemble of 3 DeepMedics |
GT: 1 NM physician with 11 yrs of experience segmented and determined malignancy status at lymph nodes Training/validation: 80 patients Testing: 20 patients |
| Characterization (Radiomics) | Input: 3D; coronal; WB; PET-CT slices with segmented lesions Output: SUVmax, MTV, TLG, SA/MTV, measure of disease spread (Dmaxpatient) |
R = 0.95 | |||
| Weisman et al49 (with Kieler) 2020 | Detection | Input: 2D; coronal; WB; 18F-FDG PET slice (from PET/CT) Output: Lymph nodes probability map contoured |
TPR = 0.85 4 FP/patient |
Automated detection of diseased lymph node Burden in lymphoma patients – PET/CT Model: an ensemble of 3 DeepMedics |
GT: 1 NM physician with 11 yrs of experience segmented and determined malignancy status at lymph nodes Training: 58 patients Testing: 90 patients |
| Sibille et al,50 2020 | Detection (localization + 4-category classification suspicious vs nonsuspicious for lung cancer or lymphoma) | Input: 2D; coronal; WB; 18F-FDG PET slice fused with CT, MIP, anatomic atlas Output: Map of detected lesions classified under [suspicious or nonsuspicious] × [lung cancer or lymphoma] |
For Localization: Sens = 0.81, Spec = 0.97, accuracy = 0.96 (for body parts), 0.87 (for region), 0.81 (for subregion) For Classification: FP = 1.47 (96 of 65), FN = 1.76 (of 65), AUC = 0.98 |
18F-FDG Uptake Classification in Lymphoma and Lung Cancer – using CT, PET, MIP, and atlas information | GT: 2 NM physicians annotated and segmented foci with increased 18F-FDG uptake specified the anatomic location and classified. Training: 380 patients Validation:126 patients Testing: 123 patients |
| Li et al,51 2019 | End-to-end Segmentation | Input 1: 2D; axial; WB; 18F-FDG PET and CT slices (6 channels) (Hidden input: single pixel probability map of lesions) Output: segmentation map of lymphoma |
DSC = 0.73, Precision = 0.70, Recall = 0.81 |
End-to-end lymphoma segmentation – WB PET/CT Model details: DenseX-Net |
GT: 3 clinicians delineated images, then verified and revised by 1 nuclear medicine expert Validation: 5-fold cross-validation Testing: 80 patients |
| Sadik,52 2019 | Segmentation | Input 1: 2D axial/sagittal, coronal CT (3 channels) Input 2: manually detected liver and aorta Output: Segmentation of liver and aorta |
DSC = 0.95 | Automated calculation of liver and aortic 18F-FDG uptake levels to serve as a reference for therapy response classification in HL and NHL CT segmentation maps were resampled to fit the 18F-FDG-PET image in order to calculate SUVmedian Model details: CNN adopted from Goodfellow et al,53 2016 |
GT: 2 radiologists segmented images Training: 80 patients Validation: 6 patients |
| Bi et al,24 2017 | Detection | Input: 3D; WB; coronal; 18F-FDG PET with CT slices (2 channels) Output: 3D; WB; coronal; map of sFEPU regions (ie, Left, and right kidneys, bladder, brain, heart) |
DSC: 0.92 | Automatic detection of superpixel regions of FDG uptake of lymphoma regions Model details: MSE + CFSC |
GT: 1 experienced operator manually identified ROI using PERCIST thresholding and the diagnostic report of PET-CT scan Training: 1.5 million nonmedical images, validated: 50,000 nonmedical images Testing: 11 patients |
| Classic Machine Learning Models | |||||
| Annunziata et al,22 2021 | Statistical Classification (Prediction/Prognosis) | Input: Deauville Score, qPET, MTV0, slope (slope of a linear function of MTV) features from 3D; axial/coronal end-of-treatment than beginning-of-treatment 18F-FDG PET and CT slices Output: Patient-level 2-class prediction (relapse vs progression) |
PPV = 0.55, NPV = 0.83 (for DS 4–5) PPV = 0.89, NPV = 0.82 (for positive qPET) R = 0.63 (for ANN) |
Assess the prognostic capacity of post-treatment 18F-FDG-PET/CT in DLBCL patients Model details: multi-regression model, ANN |
GT: 2 NM physicians independently evaluated using a dedicated fusion and display software Training: 26 patients Testing: 11 patients K-fold cross-validation |
| Lippi et al,54 2020 | Classification (Radiophenomic) | Input: 3D; WB; coronal 18F-FDG PET slices Output: Patient-level 4-class classification of malignant lymphoma (DLBCL, HL, follicular and mantle cell lymphoma) |
Sens = 0.97, PPV = 0.94 |
Texture analysis and classification of malignant lymphoma Model details: SVM + RF |
GT: 1 NM physician with 5 y of experience extracted VOIs using a 40%-threshold of SUVmax Evaluation: leave-one-out procedure, whereby each patient was used, in turn, as the test set, and all the other patients constituted the training set. |
| Mayerhoefer et al,55 2019 | Statistical Classification (Prediction/Prognosis) | Input: TMTVs, SUVmean, TLG, entropy, and 15 other textural radiomic features Output: Patient-level 3-category metabolic risk (low, intermediate, high) of progression |
AUC = 0.72 | Radiomic features for prediction of outcome in mantle cell lymphoma International prognostic indices for MCL = MIPI and MIPI-b Model details: Multilayer perceptron feed-forward ANN |
GT: TMTV protocol used to semiautomatically construct with 41% SUVmax threshold Training: 75 patients Testing: 32 patients |
| Hu et al,56 2019 | Detection | Input: 3D; WB; coronal/axial/sagittal 18F-FDG PET slices and CT slices (2 Channels) Output: 3D probability map of the segmented lesion (normal organ and tumors) |
Sens = 0.80, DSC = 0.59 |
Physical spatial characteristics of the lesions along with prior knowledge were used to optimize the technique. Density-based spatial clustering of applications with noise (DBSCAN) |
GT: Segmentation ground truth obtained by 41% SUV max thresholding, no information on the physicians Testing: 48 patients |
| Segmentation | Input1: 3D; WB; coronal/axial/sagittal 18F-FDG PET slices and CT slices (2 Channels) Input2: Detection results Output: 3D, coronal/axial/sagittal slice with segmented normal organ and tumor lesions. |
Dref (DSC) = 0.74, Dglobal = 0.50, Volumesup = .39 |
|||
| Yu et al,57 2018 | Detection | Input: 2D; WB; coronal/axial/sagittal 18F-FDG PET slices and CT slices (2 Channels) Output: probability map of detected lymphoma lesions |
Sens = 1.0 | Semiautomatic lymphoma detection and segmentation | GT: 1 physician contoured images Training/validation: 11 patients |
| Segmentation | Input1: PET/CT images with physiologic hypermetabolic organs removed. Input2: Detection results Output: Border mask segmentation visualized on software on axial, sagittal, and coronal |
DSC = 0.84 | Model details: FC-CRF | ||
| Grossiord et al,58 2017 | Classification (Radiophenomics) | Input: 2D; coronal; 18F-FDG PET and CT slices (2 channel) Output: slice-level 3-class classification (Organ, tumor, nonrelevant) |
Sens = 0.65, Spec = 0.92, Accuracy = 0.86 |
Automated 3D lymphoma lesion segmentation - PET/CT Model details: PET/CT feature extraction, random forest classification, mixed spatial-spectral space of component-trees |
GT: 1 expert manually expert segmentation at 41% SUVmax Training/validation : 43 patients Leave-one patient-out cross-validation for classification task |
| Segmentation | Input 1: 2D; WB; coronal 18F-FDG PET and CT slices (2 channel) Input 2: single cluster representative of each lesion Output: 2D, WB, coronal segmentation map |
DSC = 0.75 | |||
| Desbordes et al,59 2016 | End-to-end Segmentation | Input: 2D; WB; coronal/axial/sagittal PET slices and CT slices (2 Channels) (Hidden input: single pixel representative each lesion (based on automatic seed definition)) Output: 2D, WB, lesion segmentation map |
DSC = 0.80 | Cellular automata define tumor seed within ROI to obtain final segmentation by iterative growth. Model details: auto-initialization cellular automata |
GT: 1 NM physician manually selected and segmented the ROI Testing: 12 patients |
| Lartizien et al,60 2014 | Classification (Radiophenomics) | Input: 3D; supraclavicular; axial 18F-FDG PET slices and CT slice images Output: 2-class classification (benign vs cancer) |
AUC = 0.91 | SVM classifier applied on 12 most discriminant 1st and 2nd order textural features derived from the registered PET and contrast CT images | Evaluation set: 156 lymphomatous and 32 suspicious 25 (11 males and 14 females) baselines with B-cell lymphoma or HL. |
Abbreviations: 18F-FDG, 18F-fluorodeoxyglucose; ANN, artificial neural network; AUC, area under curve; CFSC, class-driven feature-selection & classification; DLBCL, diffuse large B-cell lymphoma; DSC, dice similarity coefficient; ENKTL, extranodal NK/T cell lymphoma; nasal type; FC-CRF, fully connected conditional random fields; FCN, fully convolutional network; FoM, figures of merit; FROC, free-response receiver operating characteristic; ICC, intraclass correlation coefficient, JSC, Jaccard similarity coefficient; MH, metabolic heterogeneity; MHD, modified Hausdorff distance; MIP, maximum intensity projection, MLP, multilayer perception; MM, multi-regression model, MSE, multi-scale super pixel-based encoding; MTV, metabolic tumor volume; NHL, non-Hodgkin lymphoma; NM, nuclear medicine; NPV, negative predictive value; OS, overall survival; PA, percentage agreement; PERCIST, PET response criteria in solid tumors; PFS, progression-free survival; PPV, positive predictive value, PSI, prediction similarity index; R, Pearson correlation coefficient; RF, random forest; ROI, region of interest; Sens, sensitivity, sFEPU, sites of FDG excretion and physiologic FDG uptake; Spec, Specificity; SVM, Support Vector Machine, TLG, Tumor lesion glycolysis; TMTV, total Metabolic Tumor volume.
Table 2.
The mathematical definitions for the evaluation metrics used in the reviewed articles
| Evaluation Measure | Mathematical Definition |
|---|---|
| Sensitivity | |
| Specificity | |
| PPV or Precision | |
| Dice similarity coefficient (DSC) (Synonyms: Dice similarity index, Sorensen–Dice coefficient, F1-Score, Sorensen–Dice index, Dice’s Coefficient) | |
| Dice Ref (equivalent to Dice) F(M) is defined as the number of elements in set M. G = ground truth regions composed of gi voxels. B = set of detected lymphoma regions consisting of bi voxels.56 |
|
| Dice Global56 Same as the DiceRef reference but includes false-positive regions. S is the set of all detected false regions. |
|
| Volume Sup56 For evaluating the volume of the false-positive region. |
|
| Jaccard similarity coefficient (JSC) |
Results
Search Results
We retrieved 1122 documents from initial searches; 1089 met the eligibility criteria for the title and abstract review and 75 met the final criteria for full-text review as shown in Fig. 2. After the screening process, 20 articles were included; these covered both the AI development and clinical assessment fields (see Fig. 2). All 20 papers examined AI applications in lymphoma PET imaging. These studies either developed an original model or evaluated a previously proposed AI model to perform detection, classification, segmentation, characterization, prediction/prognosis, or a combination of these tasks on PET/CT or PET images. The definition of the aforementioned tasks is provided in “Terminology for Elucidating Algorithm Aim” under the Results section.
Fig. 2.

Demonstrates the summary of the literature search strategies and the results at each stage.
Overview: Key Literature Characteristics
In this section, we will systematically examine how each study performed these tasks assigned to AI. We first identify the reported AI task (detection, segmentation, classification, radiophenomics), then determine the model’s input and output for that specific task. For instance, regarding the detection task, we identify the AI’s input, which is frequently in the form of pixels, and the output would be detecting areas suspicious for cancer (high FDG uptake). Table 1 summarized the results of each study as it pertains to the proposed model (such as CNN, ANN, and so forth), task (classification, detection, segmentation, prediction, prognosis, and so forth), FoM provided specific to each task (see Table 2), and the GT definition for evaluating the results. Based on the current literature, the 2 main applications for radiomics in lymphoma: distinguishing lymphomas as separate form other tumors, and prediction or prognostication of lymphomas.40
Appropriate GT and label for an imaging AI application are highly related to AI objectivity. PET images are often visually analyzed, and this may often lead to high inter-and intraoperator variability. Thus, it is a challenge to define optimal GT for datasets to be used for AI training, and a suboptimal GT will hamper the predictive accuracy of the model.41,42 For these reasons, here, we present a definition of GT specific to the AI objective as provided by each of the studies (see Table 1).
Terminology for Elucidating Algorithm Aim
Detection
Detection as a task refers to locating an area within an image that contains an object of interest with a stated level of certainty. This task often involves a combination of localization and some level of classification, finding a nodule in the lung is an example of a detection task. As referred to in Table 1, the input to an AI algorithm that performs detection should be a type of image (pixel/voxel), and the output should also be a location containing the object of desire.24 For example, Bi and colleagues use 3D WB, coronal 18F-FDG PET with CT slices (2 channels) as an input to detect individual regions of High Normal Activity (HiNA) (also referred to as sites of FDG excretion and physiologic FDG uptake (sFEPU) by Bi and colleagues) using their multi-scale superpixel encoding CNN model.24 In addition, Sibille and colleagues proposed a model that automatically detects HiNA and lesions suspicious of lung cancer and lymphoma lesions50 (Fig. 3). Similarly, Wiseman and colleagues proposed a 3D DeepMedic model that implicitly learned information about the HiNA regions during model training achieving 85% detection TPR on average.49 Due to the increased heterogeneity of HiNA regions below the diaphragm (for example in bladder, kidney, and ureter) this model performed better above the diaphragm than the below.49
Fig. 3.

Maximum intensity projection 18F-FDG PET/CT images were processed in 2 patients using the constructed CNN. The test data consists of patients with both lung cancer and lymphoma; the detected lesions are color coded accordingly. IASLC is the abbreviation for the International Association for the Study of Lung Cancer.
(From Sibille L, Seifert R, Avramovic N, et al. 18F-FDG PET/CT Uptake Classification in Lymphoma and Lung Cancer by Using Deep Convolutional Neural Networks. Radiology. 2020;294(2):445–452; with permission.)
In this scoping review, we found 7 studies for lymphoma lesion detection in PET/CT imaging (see Table 1 - Yuan and colleagues (2021),46 Zhou and colleagues (2021)38 Weisman and colleagues (2021),61 Sibille and colleagues (2020),50 Hu and colleagues (2019),56 Yu and colleagues (2018),57 Bi and colleagues (2017)24).
Segmentation
Delineation of the boundary of an object of interest given its location is referred to as segmentation.62 Accurate segmentation of lymphoma is an important task as it permits the extraction of both lesion-level (such as SUVmax and SUVmean) and whole-body quantitative metrics (such as TMTV) which provide important predictive and prognostic information.63,64 The image input data for the segmentation task can be as large as a 3D WB PET image or as small as a group of pixels.
The segmentation task which is the combination of localization and pixel/voxel level classification often has two inputs (1) the image that contains an object of interest (input 1) (2) the location of that object of interest with a certain level of certainty (input 2). Input 2 can either be a probability map of coarse region of the lesions, or single-pixel locations representative of the lesion locations which can be derived from a localization or a detection task. The output of segmentation is an image that encodes the membership of each voxel to the object of interest (this output can be referred to as a segmentation map).
The second input for the tumor segmentor can be entered manually (eg, Sadik (2019)52) or automatically (eg, Yuan and colleagues)46 into the system. In case of automatic input, the detection probability map results can be fed into the algorithm. This can be called a cascaded approach whereby all of the steps are performed separately and sequentially by one or several neural networks (Fig. 4). These techniques often divide the segmentation process into detection, and segmentation phases and provide specific evaluation metrics for each step along the way.
Fig. 4.

Schematic of a proposed cascaded model for PET tumor segmentation; Module 1, classifies the axial slices to suspicious and non-suspicious ones; Module 2, detects the lymphoma lesions in axial slices that are a candidate by Module 1. In Module 3, the 3D PET image and detection results are given to the tumor segmentation algorithm to segment the lesions inside the bounding boxes provided by Module 2.
In this scoping review, six studies performed the segmentation task (see Table 1- Pinochet and colleagues (2021)43 Yuan and colleagues (2021)46 Sadik (2019)52 Hu and colleagues (2019)56 Yu and colleagues (2018)57 Grossiord and colleagues (2017).58)
End-to-end segmentation
When the segmentation task is done in one step, we use the term end-to-end segmentation in which case the only input is the image data. These approaches optimize for efficiency and performance in terms of memory consumption and in case of limited access to well-annotated training data.65 An example, Weisman and colleagues proposed an end-to-end segmentation model that receives WB 18F-FDG PET and CT image through 2 channels and without an additional detection step, the CNN is able to provide a map of masked segmented lymphoma lesions with DSC of 0.86 (Fig. 5).48
Fig. 5.

CT and coronal PET multicenter images are input to 3 segment layers, there are then 8 convolution layers and two fully connected layers that subsequently generate a probability map for lymphoma lesions as shown in purple.
(From Weisman AJ, Kim J, Lee I, et al. Automated quantification of baseline imaging PET metrics on FDG PET/CT images of pediatric Hodgkin lymphoma patients. EJNMMI Phys. 2020;7(1):76: under Open Access Creative Commons License http://creativecommons.org/licenses/by/4.0/.)
In this review, there were 4 studies that performed the end-to-end lesion segmentation task (see Table 1 - Blanc-Durand and colleagues (2020),47 Weisman and colleagues (2020),48 Li and colleagues (2019),51 and Desbordes and colleagues (2016)59).
Classification
Through reviewing the literature, we recognized 2 distinct usages of the word classification. To avoid ambiguity, we have to clarify these terms here: in a mathematical and statistical context, categorization of members of a set to various classes is defined as “classification.” This is a broad meaning of this term, and we refer to this as “statistical classification.” However, in computer vision, classification has a narrower meaning. It refers to the categorization of an image. We refer to this meaning as “image classification.” For example, we refer to the process of using radiomic feature inputs to assign patients to different diagnostic or prognostic groups by the term “statistical classification” (Refer to Section “Prediction and Prognosis” under Results section). For example, Annunziata and colleagues22 used image features such as Deauville Score, qPET, MTV0 from end-of-treatment 18F-FDG PET, and CT to classify the prognosis of patients into “relapse” or “progression” classes. In contrast, the process of categorizing an image input (such as the axial slice of the PET) to normal versus abnormal is “image classification.” For example, Lippi and colleagues used a machine-learning algorithm that classified the lesions within the image into 4 malignant lymphoma subtypes: DLBCL, HL, follicular and mantle cell lymphomas.54 Radiomics signatures such as PET and CT textural features have shown very good performance to classify the disease sites from physiologic uptake sites and inflammatory nonlymphomatous sites. For instance, to classify an 18F-FDG avid lesion (benign vs malignant), Lartizien and colleagues developed an SVM and Random Forest (RF) model based on 12 different radiomic features extracted from PET and CT scans achieving Area Under the Curve (AUC) of 0.9160.
In this scoping review, there were 5 studies that performed image classification (see Table 1.—Pinochet and colleagues (2021),43 Sadik and colleagues (2021),44 Guo and colleagues (2021),45 Lippi and colleagues (2019),54 Grossiord and colleagues (2017),58 and Lartizien and colleagues (2014).60
Prediction and prognosis
Recurrence is common in patients with HL and NHL, emphasizing the importance of risk stratification, prognostication, and relapse prediction based on PET studies. In the context of this paper, the task of prediction and prognosis of lymphoma based on PET imaging is performed by statistical classification. The inputs in statistical classification are radiomic features (eg, SUV, TMTV, TLG, entropy among others). These inputs are used to inform the output in the form of a prognostic or predictive classification.65
Baseline TMTV can be used for risk stratification and be a prognostic factor in a range of lymphomas (DLBCL, primary mediastinal B-cell lymphoma (PMBCL) and HL).11,66 This is exemplified by Vercellino and colleagues, whereby the authors analyzed the prognostic capability of TMTV in patients aged 60 to 80 with DLBCL and found that a high baseline TMTV indicates poorer PFS and OS.66
In addition to the usage of a singular radiomic feature to predict and prognose lymphoma, some studies may combine various radiomic features to perform the same task. Mayerhoefer and colleagues used both the maximal standardized uptake value (SUVmax) and entropy to predict survival in mantle cell lymphoma (MCL).22,43,55 Entropy is a measure of glucose metabolism heterogeneity within the TMTV.55 Demonstrating the concept of combination of radiomic features to predict and prognose, Mayerhoefer and colleagues, used entropy (heterogeneity of glucose metabolism) as well as TMTV and SUVmax for the prediction and prognosis of PFS in mantle cell lymphoma patients with 0.72 AUC.55
In this scoping review, we found 3 studies that performed the statistical classification of lymphoma (see Table 1. Pinochet and colleagues, Annunziata and colleagues, Mayerhoefer and colleagues22,43,55)
Discussion
The major objective of this study is to review recent papers in the field of artificial intelligence-based PET medical imaging in lymphoma. According to our findings, the most prevalent uses of artificial intelligence in lymphoma PET imaging are presently focused on tumor burden evaluation (detection, segmentation, and advanced quantification of lesions). In our discussion, we focus on two key themes derived from our research findings. First, we review the implications for the clinical transition of AI-based applications in lymphoma patient care. Next, we cover some critical concepts that clinicians should consider when evaluating and validating AI algorithms. In addition, we offer our thoughts on the field’s future directions.
Clinical Implementation of Artificial Intelligence in the Management of Lymphoma
Currently, a prominent objective in lymphoma PET image quantification is the evaluation of disease burden by TLG and TMTV, which requires the detection and segmentation of all lesions.67,68 In this workflow, a major bottleneck toward improved prognostic pipelines and treatment planning is the segmentation.51,69 This is in part due to the time-consuming task of manual segmentation and a high degree of intra- and interobserver variability.70,71
AI approaches can help by: (1) Automated detection and segmentation (fully-automated) and32,58,72(2) user detection/selection of the lesion followed by AI-based segmentation (semi-automated). A clear advantage of a fully automated AI model is that it can further enhance the workflow without requiring the nuclear radiologist to identify each lesion separately. Given the extent of lymphomatous involvement, individual identification of each lesion could be time-consuming, and unlikely feasible in the routine workflow given clinical demands and traditionally available resources. A few studies carried out a fully automatic disease burden assessment of lymphoma on PET images (see Table 1- Pinochet and colleagues,43 Bi and colleagues,24 Li and colleagues,51 Blanc-Durand and colleagues47). For example, Grossiord and colleagues used RF classifier and morphologic hierarchy to first extract PET and CT image features to classify lesions into 3 categories: organ, tumor, and nonrelevant. Then, they automatically segment lesions into the tumor category.58 Although fully automated models can remove a layer of physician lesion identification from the workflow, semiautomated models may provide other benefits in terms of accuracy or precision. Human involvement in semiautomatic methods manifests in 2 ways: (1) Presegmentation human-based lesion detection and identification + automatic segmentation of the identified lesion (2) Automatic detection and segmentation of lesions with high false-positive rate and reliance on human agents to select the real lesions and discard the false-positives. As an example of the human selection of real lesions and deletion of false-positives, Yu and colleagues57 manually identified lesions of true lymphoma after all possible detected lesions were automatically segmented.57
The biodistribution of 18F-FDG creates regions with HiNA (for example in kidneys, bladder, brain, and heart) which can cause inaccurate AI-based identification and segmentation of lymphoma lesions. However, to improve the performance of a model one may attempt to exclude HiNA regions from the scene before the process of lesion detection. This may be conducted manually or automatically as a pre or postprocessing step.51,73 By removing the HiNA regions from the training data, the performance of automated AI techniques for lesion detection and segmentation can be improved. For example, Yu and colleagues used a semiautomatic approach to identify and remove HiNA regions followed by an AI algorithm to automatically detect lymphoma lesions in patients.57 For the purpose of improving workflow and integration into clinical practice, an end-to-end lymphoma detection AI algorithm can be trained to combine the task of HiNA region identification and detection of lymphoma regions.
We previously discussed methods of lymphoma segmentation on 18F-FDG PET/CT studies. Along with simplifying the clinician’s workflow, automatic segmentation of lymphoma lesions can enable end-to-end prognostication and radiomic analysis of the studies to gain valuable insight into therapy augmentation, remission planning, and recurrence prediction. There are 2 avenues for downstream prediction and prognosis of lymphoma: (1) an end-to-end prognostication/prediction task (whereby the image input is processed to output a prognostication/risk stratification category directly, with no reporting of the intermediate steps; for example Sadik and colleagues),44 or (2) a radiomic analysis based prognostication, which is carried out as a secondary step after explicit segmentation. For example, Guo and colleagues11 employ deep radiomics by using a neural network to derive deep features. Deep features, in contrast to handcrafted features of classical radiomics, are not predefined; a CNN learns features of interest depending on the task at hand and the input images. Automatically segmented lesions can also be entered into a classic radiomics pipeline to derive handcrafted features such as SUV, volume, or entropy. These features are predefined by formulas and can be statistically analyzed to derive prognoses or a risk stratification schema. A range of studies showcasing classification, prognostication, and prediction based on 18F-FDG-PET radiomic features is presented in Table 3.
Table 3.
Survey of classification, prognostication, and prediction methods based on 18F-FDG-PET radiomics
| Authors (Year) | Lymphoma Subtypes | Aim of Study | Radiomic Feature Information | Discriminator Used | Figures of Merit | |||
|---|---|---|---|---|---|---|---|---|
| Input | Goal | Extraction Method | Features Used | Notable Radiomic Features | ||||
| Studies in which radiomic features were utilized for the classification of lymphoma | ||||||||
| Ou et al,74 2020 | Breast lymphoma | Segmented breast tumor VOIs on 18F-FDG-PET/CT | Differentiation (breast lymphoma vs breast carcinoma) | LifeX | First and second-order radiomic features | PETa and CTa models demonstrated great potential to differentiate in training and validation group | LDA | Not given for testing datasets |
| Xu et al,75 2019 | Hepatic lymphoma | Segmented hepatic tumor VOIs on 18F-FDG-PET | Differentiation (hepatic lymphoma vs HCC) | LifeX | 6 image-based parameters and 39 texture features | Combination model of texture and image features had greater diagnostic capability | ROC analysis | AUC = 0.898 |
| Ou et al,76 2019 | Breast lymphoma | Segmented breast tumor VOIs on 18F-FDG-PET/CT | Differentiation (breast lymphoma vs breast carcinoma) | LifeX | First and second-order radiomic features | Combination model of PET and CT features had greater diagnostic capability | Binary logistic regression | PET: AUC = 0.751. CT: AUC = 0.729; PET + CT: AUC = 0.771 |
| Aide et al,77 2018 | DLBCL | Axial skeleton segmented on 18F-FDG-PET | Identify bone marrow involvement in DLBCL based on radiomic features from 18F-FDG-PET | LifeX | 4 first-order, 6 second-order and 11 third-order texture features | SkewnessH was most predicitive of lymphoma | Linear regression, ROC analysis | AUC = 0.820 |
| Lartizien et al,60 2014 | All types | Segmented suspicious regions of interest on18-F-FDG-PET/CT | Lymphoma vs HiNA | Not reported | 105 features (GLDM, GLCM, GLISZ, GLRLM, and first order) | Combination model of PET and CT features had greater diagnostic capability | SVM | Combination of CT and PET: AUC = 0.910 |
| Studies in which radiomic features were used for the prognosis/prediction of lymphoma | ||||||||
| Rodriguez Taroco et al,78 2021 | HL | Segmented tumor VOIs on 18F-FDG-PET | Prediction of PFS from 18F-FDG-PET radiomic features in HL and DLBCL | Not specified | 8 first-order features, 23 features from GLCM, 11 features from GLRLM, 5 features from NGLM, 3 features from the neighborhood grey-tone difference | PFS in patients with Deauville scores of 1, 2, 3, and X at initial PET was higher than that in patients with a Deauville score of 4 | Univariate and multivariate Cox regression analysis | Average PFS, for patients with Deauville 4 score, of 1120 d (95% CI, 229–672) |
| Eertink et al,79 2021 | DLBCL | Segmented tumor VOIs on 18F-FDG-PET | Prediction of treatment outcome with first-line treatment of DLBCL from baseline 18F-FDG-PET radiomic features | RaCat | Large number of morphologic and texture features were extracted | Five models were created based on radiomic features as well as clinical predictors; combination of clinical and radiomics predictors was best | ROC analysis | Combined model: HR = 4.6 (95% CI, 2.6–7.9) |
| Wang et al,80 2020 | ENKTL | Segmented tumor VOIs on 18F-FDG-PET | Identify a 18F-FDG-PET radiomics-based model for predicting PFS and OS in ENKTL | LifeX | 41 features | Radiomics and metabolism-based models were combined to predict both PFS and OS | Univariate and multivariate Cox regression analysis | PFS: 0.788 (95% CI = 0.682–0.895) and 0.473 (P = .803) OS: 0.637 (95% CI = 0.488–0.786) and 0.730 (95% CI = 0.548–0.912) |
| Sun et al,81 2020 | Primary gastric DLBCL | Segmented tumor VOIs on 18F-FDG-PET | Texture analysis of 18F-PET-CT scans to predict interim response after 3–4 rounds of chemotherapy in primary gastric DLBCL | In-house software | First and second-order features | Combination of SUVmax, volume, and entropy in one model best predicted treatment response | Mann-Whitney U | AUC = 0.915 |
| Aide et al,82 2020 | DLBCL | Segmented tumor VOIs on 18F-FDG-PET | Prognosticate DLBCL treated with first-line immunotherapy using radiomic features from baseline 18F-FDG-PET | LifeX | 19 features | 18F-FDG-PET heterogeneity of the largest lymphoma lesion is associated with 2y-event free survival (EFS) | Univariate and multivariate Cox regression analysis | EFS: HR = 7.47 (95% CI = 0.83–66.99) |
| Wu et al,83 2019 | DLBCL | 18F-FDG-PET/CT pre and posttreatment | Radiomics-based treatment outcome prediction model | MATLAB | GLCM, GLRLM, GLSZM | Belief-function theory-based outcome prediction outperformed than other studies | EK-NN and SVM | Therapy response: NS |
| Tatsumi et al,84 2019 | FL | Segmented tumor VOIs on 18F-FDG-PET | Predict response and recurrence after therapy in FL | PETSTAT | 6 texture features | low gray-level zone emphasis (LGZE) in texture features predicted complete response | Logistic regression | Therapy response: AUC = 0.720; PFS: NS |
| Lue et al,85 2019 | HL | Segmented tumor VOIs on 18F-FDG-PET | 18F-FDG-PET was analyzed using radiomics to predict/prognose HL | OsiriX, CGITA, MATLAB | 11 first-order, 39 higher-order, 400 wavelet features | Ann Arbor stage, GLRLM and SUV kurtosis were associated with PFS | Univariate and multivariate Cox regression analysis | PFS: HR = 6.640 (95% CI, 1.261–34.96; P = .026); OS: HR = 14.54 (95% CI, 1.808–117.0; P = .012) |
| Lue et al,86 2019 | HL | Segmented tumor VOIs on 18F-FDG-PET | Radiomic intratumor heterogeneity in 18F-FDG-PET to predict treatment response and survival outcomes in patients with HL | OsiriX, CGITA, MATLAB | 7 SUV and HU, 78 second- and higher-order, 624 wavelet features | Treatment response was associated with high-intensity run emphasis (HIR) was performed on PET images and run-length nonuniformity (RLNU) of CT extracted from gray-level run-length matrix (GLRM) in high-frequency wavelets PFS was independently associate with intensity nonuniformity (INU) of PET and wavelet short-run emphasis (SRE) of CT from GLRM and Ann Arbor stage. OS was associated with zone-size nonuniformity (ZSNU) of PET from gray-level size zone matrix (GLSZM) |
Cox proportional hazards model, ROC curve, logistic regression | PET: Therapy response: OR = 36.4 (95% CI, 2.060–642.0, P = .014); PFS: HR = 9.286 (95% CI, 1.341–66.28; P = .023); OS: HR = 41.02 (95% CI, 4.206–400.1; P = .001) |
| CT:Therapy response: OR = 30.4 (95% CI, 1.700–545.0; P = .014); PFS: HR = 18.480 (95% CI, 1.918–178.1; P = .012); OS: NS | ||||||||
| Zhou et al,87 2019 | Primary gastric DLBCL | Segmented tumor VOIs on 18F-FDG-PET | Prediction of OS and PFS from 18F-FDG-PET radiomic features in primary gastric DLBCL | LifeX | 44 texture features | Kurtosis, TMTV, GLNU, and HGZE were identified as independent prognostic factors | Univariate and multivariate Cox regression analysis | PET: PFS: HR = 14.642 (95% CI, 2.661–80.549; P = .002); OS: HR = 28.685 (95% CI, 2.067–398.152; P = .012) |
| CT: PFS: HR = 11.504 (95% CI, 1.921–68.888; P = .007); OS: HR = 11.791 (95% CI, 1.583–87.808; P = .016) | ||||||||
| Milgrom et al,88 2019 | Mediastinal HL | Segmented nodal disease on 18F-FDG-PET/CT | Predict response to therapy in mediastinal HL | MIM, IBEX | GLCM, intensity histogram, shape | A combination model of 5 most predictive features accomplished the highest AUC (SUVmax, TMTV, inverse variance, and 2 measures of tumor heterogeneity) | ROC analysis | AUC = 0.952 |
| Wang et al,89 2019 | Renal/adrenal lymphoma | Segmented tumor VOIs on 18F-FDG-PET | Prognose patients with primary renal lymphoma and primary adrenal lymphoma using texture features | LifeX | 37 texture features | GLRLM_RLNU (gray-level co-occurrence matrix run-length nonuniformity) was most predictive of OS. | Univariate and multivariate Cox regression analysis | OS: HR = 9.016 (95% CI, 1.041–78.112; P = .046) |
| Parvez et al,90 2018 | NHL | TMTV using thresholding and radiomic features | Predict response to therapy and outcome in NHL using radiomic features extracted from 18F-FDG-PET/CT | LifeX | GLCM, NGLDM, GLRLM, GLZLM, indices from sphericity and histogram | GLNU correlated to DFS, and kurtosis correlated with OS | Univariate Cox regression analysis | Therapy response: NS; DFS: P = .013; OS: P = .035 |
| Aide et al,77 2018 | DLBCL | Axial skeleton segmented on 18F-FDG-PET | Determine prognostic value of skeletal textural features in DLBCL | LifeX | 4 first-order, 6 second-order and 11 third-order texture features | The only independent predictor of PFS was SkewnessH | ROC analysis | PFS: HR = 3.17 (95% CI, 1.00–10.04; P = .032) |
| Ben Bouallègue et al,91 2017 | Bulky HL and NHL | Segmented tumor VOIs on 18F-FDG-PET | Predict response to therapy in bulky HL and NHL | In-house software | Shape, texture features | SVM accounting for both texture and shape features achieved the highest ROC AUC | ROC analysis | AUC = 0.820 |
Abbreviations: 18F-FDG-PET, 18F-fluorodeoxyglucose-positron emission tomography; ACC, accuracy; AUC, area under the curve, DFS, disease-free survival; DLBCL, diffuse large B cell lymphoma; EFS, event-free survival; EK-NN, evidential k-NN; FL, follicular lymphoma; GLCM, grey-level co-occurrence matrix; GLRLM, grey-level run-length matrix; GLSZM, grey-level size-zone matrix; GLZLM, grey-level zone length matrix; HL, Hodgkin’s lymphomas; HR, hazard ratio; LDA, linear discriminant analysis; MCL, mantle cell lymphoma; NGLDM, neighborhood grey-level different matrix; NHL, non-Hodgkin’s lymphomas; NS, not significant; OR, odds ratio; OS, overall survival; PFS, progression-free survival; ROC, receiver operating characteristic; RUN, run-length matrix; SEN, sensitivity; SPE, specificity; SVM, support vector machine; TF, texture features; VOI, volume of interest.
Important Considerations for Evaluating and Validating Artificial Intelligence in Lymphoma Positron Emission Tomography
The transition of AI-based technologies to patient care requires important considerations that are both general for any AI algorithm and specific to lymphoma. In addition to the accuracy of the lesion detection and segmentation, it is very important to report the amount of time clinicians should dedicate to verify and correct model outcomes. Lumped sensitivity and specificity for lesion detection are not reflective of clinical significance as some lesions are much more important than others and “critical misses” are not tolerable. In both detection and segmentation tasks, missed lesions occur particularly for the interim scans with shrunk tumors or patients with smaller lesions. TMTV, as the only figure of merit for the assessment of detection/segmentation, may undermine the smaller missed lesions. To address this shortcoming, additional performance evaluation measures such as the Dmax and MH should be used to define a task-specific composite FoM. Weisman and colleagues characterized the results of their end-to-end segmentation using SUVmax, MTV, TLG, SA/MTV, Dmax to depict a well-rounded assessment of the CNN performance.48
As demonstrated in Table 1, the studies use a range of different methods to determine their GT based on which the AI algorithm is tested. Therefore, a degree of uncertainty is expected due to this nonuniformity.92 A standardized approach for GT determination must be sought as it can be less user-dependent and capable of constructing generalizable algorithms.49
Future Directions
After a review of current trends in AI applications in the PET imaging of lymphoma, we aim to place a special focus on describing likely future directions in this field. The discussion related to the future directions is primarily aimed at depicting the potential clinical utility of AI in the management of lymphoma with 18F-FDG-PET imaging.
From spatial domain to spatiotemporal realm
Current AI-based methods rarely use prior images in their models. This is counterintuitive for clinicians who consider prior images as one of the most important sources of information.93, 94, 95 In clinical practice, temporal changes in a lesion may provide much more useful information than imaging features of a lesion in one study. As an example, conventional PET radiomics methods usually use a single-time-point for analysis, which does not take into account the interim change of the lesions throughout the treatment process.96 However, “Delta-radiomics”95 appraise the change in radiomic features during or after treatment to enhance information extraction.95,97 Several studies have shown that CT-based Delta-radiomics can be used for the prediction of lung cancer, gastric cancer, and also the detection of side effects to radiation therapy.94,98, 99, 100 Therefore, this method will be better suited to evaluate tumor response of treatment.56 Creation of the Delta image permits identifying any posttreatment tumor transformations.94,101 The difference between the baseline and interim parameters may be measured (ΔSUV, ΔMTV, ΔTLG, and ΔADC) for this purpose.102 Delta radiomics as a quantitative assessment can be used for the evaluation of the changes over time and for the prediction of treatment response earlier in the treatment course.101 Time-interval changes in the features such as SUVmax, TMTV, MH, and Dmax and the other features can be considered as the complementary important features that have not been considered before, especially for analysis of lymphoma data.
Using AI capabilities to generate and visualize Delta images will provide insight into the intralesional tumor heterogeneity, such as when a piece of a bulky tumor shrinks/improves while the other portion of the lesion grows. This capacity will be revolutionary for the detection and evaluation of tumor heterogeneity and heterogeneous response of tumor colonies to therapies. Future studies should also determine how to appropriately visualize the Delta image for better interpretability. Furthermore, the quantification of Delta images and identification of nonresponders at an earlier stage is a key direction AI-based algorithms can move toward.103 This utility allows the optimization of treatment or biopsy of the non-responding lesion (or portion of a lesion) for new mutations. By identifying the nonresponder region of the tumor clinicians will also be able to use external beam radiation or percutaneous ablation sooner during lymphoma treatment. Recommendations for posttreatment lymphoma recurrence surveillance using AI-based PET imaging can be included in guidelines. Particularly in those with cancer remission, the Delta image could allow important insights on early and accurate detection of potential recurrence.
From data silos to large shared databases
Accessible, high-quality, and diverse imaging datasets are essential for accelerating the development of AI algorithms in lymphoma and successful transition of these technologies into routine clinical practice.104 These repositories can bypass many barriers for researchers and diversify the patient population and their lymphoma subtypes, therefore, improving the generalizability of the algorithms. Especially when these datasets include multi-centric GT data and were generated by expert with varying levels of experience to recreate the heterogeneities that exist in real clinical practice rather than a controlled setting for biomedical research.
Currently, there is no centralized publicly available medical imaging data repository for lymphoma. There have, however, been minor initiatives to establish open access data sets. The Cancer Imaging Archive (TCIA) contains almost 31 million de-identified cancer medical images that are accessible to the public105 which includes CT and 18F-FDG PET studies of 155 DLBCL patients.106 Additionally, the National Institutes of Health (NIH) has made available over 10,500 labeled CT imaging studies of 4400 different patients (DeepLesion dataset) with lung nodules, liver tumors, enlarged lymph nodes, and other critical findings throughout the body.107 With the rapid growth of AI algorithms in lymphoma PET imaging, there is an increasing demand for institutional collaboration to enable the gathering and curation of both large data sets and labeled images necessary for the establishment of centralized yet diverse open access data repositories.
Summary
Almost 35 years after the first utilization of 18F-FDG PET in lymphoma,108,109 this modality has proven its value in a wide spectrum of management from diagnosis and staging to treatment response assessment and prognostication. Alongside the significant improvement in treatment measures, from biological agents110 to CAR-T cell therapy,111 there have been substantial efforts to improve the quantitative aspect of 18F-FDG PET to produce robust, reliable, and feasible metabolic imaging biomarkers. Establishment of PERCIST was a monumental event,112 2 years after universal acceptance of PET imaging for lymphoma response assessment by International Working Group (IWG).113 18F-FDG PET was extremely successful in the elimination of “complete remission unconfirmed” (CRu) category in treatment response assessment. CRu was assigned to patients with a residual mass detected by CT after treatment that was unlikely to be malignant. Lugano classification,1 a consensus document of the 11th International Conference on Malignant Lymphoma, reemphasized the role of 18F-FDG-PET. Advancement of biological agents underscored the importance of the “tumor flare” phenomenon (pseudo-progression) and resulted in the refinement of Lugano classification by the introduction of immune-related criteria (IRC)114 emphasizing the importance of clinical context in the process of image interpretation and quantification.
Treatment failure is the major problem in the management of patients with lymphoma and 18F-FDG-PET has been providing valuable information to predict this event115,116 and guide the treatment (response-adapted therapy).117,118 Furthermore, the importance of biologic heterogeneity in treatment failure119 motivated the molecular imaging community to detect and quantify this heterogeneity using Radiomics.119
In spite of all these achievements, the clinical adaptation of quantitative PET-based imaging biomarkers has been limited so far. Workflow integration barriers are one of the major contributing factors. Deep learning has the potential to make this process more efficient and more precise at the same time and this will open the door for all the subsequent utilization of PET-based imaging biomarkers. But the utility of AI is not limited to lesion detection and segmentation. Almost 20 years after wide utilization of 18F-FDG-PET in lymphoma, we are experiencing a major transformation powered by AI affecting the entire imaging lifecycle: from scheduling and operational tasks [Beegle and colleagues’ article “Artificial Intelligence and Positron Emission Tomography Imaging Workflow: Technologists’ Perspective,” in this issue], to image acquisition optimization [Muhammad Nasir Ullah and Craig S. Levin’s article, “Application of Artificial Intelligence (AI) in Positron Tomography (PET) instrumentation,” in this issue], enhancement of image reconstruction120 and harmonization of the images121 toward high-throughput imaging biomarkers122 and multi-omics data integration for prediction and prognosis [Yousefirizi and colleagues’ article, “Artificial Intelligence-Based Detection, Classification and Prediction/Prognosis in PET Imaging: Towards Radiophenomics,” in this issue].123
Key points.
One of the most serious issues in the management of lymphoma patients is treatment failure.
Accurate quantification of tumor burden using 18F-FDG-PET is an important method for therapy response assessment and prediction.
Artificial Intelligence (AI)-based PET approaches could make this process more efficient, precise, and pave the way for future PET-based imaging biomarker applications.
In addition to streamlining the workflow, AI can enable segmentation and radiomic analysis to acquire prognostic information regarding therapy augmentation, remission planning, and recurrence prediction.
Clinics care points.
In order to efficiently extract valuable information about tumor biology from 18F-FDG-PET, we need to move beyond SUV measurement. The first step in this path is detection and delineation of hypermetabolic lesions.
AI based methods have the potential to provide clinicians with a high throughput platform to perform these steps efficiently and accurately.
Clinicians should be aware of pearls and pitfalls of AI algorithms. Deep learning is very efficient when utilized in the correct setting and could be prone to bias and aberrant performance if used out of scope of training and testing. It is ultimately the responsibility of physicians and the healthcare system to verify the trustworthiness and reliability of AI as Medical Devices (AIMDs).
Despite their significant advances, PET-based AI applications have had limited clinical implementation. Immaturity of PACS architecture is among the important reasons. AI orchestrators will play an important role in future of imaging workflow.
Disclosure
This research was supported by the Intramural Research Program of the NIH, Clinical Center and NIDCR. The opinions expressed in this publication are the author’s own and do not reflect the view of the National Institutes of Health, the Department of Health and Human Services, or the United States government.
References
- 1.Cheson BD, Fisher RI, Barrington SF, et al. Recommendations for initial evaluation, staging, and response assessment of Hodgkin and non Hodgkin lymphoma: the Lugano classification J Clin Oncol, 32 (27) (2014), pp. 3059–3068 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.El-Galaly TC, Hutchings M, Mylam KJ, et al. Impact of18F-fluorodeoxyglucose positron emission tomography/computed tomography staging in newly diagnosed classical Hodgkin lymphoma: fewer cases with stage I disease and more with skeletal involvement Leuk Lymphoma, 55 (10) (2014), pp. 2349–2355 [DOI] [PubMed] [Google Scholar]
- 3.Hutchings M, Barrington SF PET/CT for therapy response assessment in lymphoma J Nucl Med, 50 (Suppl 1) (2009), pp. 21S–30S [DOI] [PubMed] [Google Scholar]
- 4.Baba S, Abe K, Isoda T, et al. Impact of FDG-PET/CT in the management of lymphoma Ann Nucl Med, 25 (10) (2011), pp. 701–716 [DOI] [PubMed] [Google Scholar]
- 5.Meignan M, Itti E, Gallamini A, et al. FDG PET/CT imaging as a biomarker in lymphoma Eur J Nucl Med Mol Imaging, 42 (4) (2015), pp. 623–633 [DOI] [PubMed] [Google Scholar]
- 6.Alavi A, Newberg AB, Souder E, et al. Quantitative analysis of PET and MRI data in normal aging and Alzheimer’s disease: atrophy weighted total brain metabolism and absolute whole brain metabolism as reliable discriminators J Nucl Med, 34 (10) (1993), pp. 1681–1687 [PubMed] [Google Scholar]
- 7.Saboury B, Salavati A, Brothers A, et al. FDG PET/CT in Crohn’s disease: correlation of quantitative FDG PET/CT parameters with clinical and endoscopic surrogate markers of disease activity Eur J Nucl Med Mol Imaging, 41 (4) (2014), pp. 605–614 [DOI] [PubMed] [Google Scholar]
- 8.Saboury B, Parsons MA, Moghbel M, et al. Quantification of aging effects upon global knee inflammation by 18F-FDG-PET Nucl Med Commun, 37 (3) (2016), pp. 254–258 [DOI] [PubMed] [Google Scholar]
- 9.Basu S, Zaidi H, Salavati A, et al. FDG PET/CT methodology for evaluation of treatment response in lymphoma: from “graded visual analysis” and “semiquantitative SUVmax” to global disease burden assessment Eur J Nucl Med Mol Imaging, 41 (11) (2014), pp. 2158–2160 [DOI] [PubMed] [Google Scholar]
- 10.Basu S, Saboury B, Torigian DA, et al. Current evidence base of FDG-PET/CT imaging in the clinical management of malignant pleural mesothelioma: emerging significance of image segmentation and global disease assessment Mol Imaging Biol, 13 (5) (2011), pp. 801–811 [DOI] [PubMed] [Google Scholar]
- 11.Guo B, Tan X, Ke Q, et al. Prognostic value of baseline metabolic tumor volume and total lesion glycolysis in patients with lymphoma: a meta-analysis PLoS One, 14 (1) (2019), p. e0210224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Saboury B, Moghbel M, Basu S, et al. Modern Quantitative Techniques for PET/CT/MR Hybrid Imaging. In: Schaller B, ed. Molecular Imaging. IntechOpen; (2012) [Google Scholar]
- 13.Ceriani L, Milan L, Martelli M, et al. Metabolic heterogeneity on baseline 18FDG-PET/CT scan is a predictor of outcome in primary mediastinal B-cell lymphoma Blood, 132 (2) (2018), pp. 179–186 [DOI] [PubMed] [Google Scholar]
- 14.Ceriani L, Gritti G, Cascione L, et al. SAKK38/07 study: integration of baseline metabolic heterogeneity and metabolic tumor volume in DLBCL prognostic model. Blood Adv. 2020;4(6):1082–1092 Blood Adv, 4 (10) (2020), p. 2135 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cottereau A-S, Nioche C, Dirand A-S, et al. 18F-FDG PET dissemination features in diffuse large B-cell lymphoma are predictive of outcome J Nucl Med, 61 (1) (2020), pp. 40–45 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cottereau A-S, Meignan M, Nioche C, et al. New approaches in characterization of lesions dissemination in DLBCL patients on baseline PET/CT Cancers, 13 (16) (2021), 10.3390/cancers13163998 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Foster B, Bagci U, Mansoor A, et al. A review on segmentation of positron emission tomography images Comput Biol Med, 50 (2014), pp. 76–96 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dewalle-Vignion A-S, Yeni N, Petyt G, et al. Evaluation of PET volume segmentation methods: comparisons with expert manual delineations Nucl Med Commun, 33 (1) (2012), pp. 34–42 [DOI] [PubMed] [Google Scholar]
- 19.Burggraaff CN, Rahman F, Kaßner I, et al. Optimizing workflows for fast and reliable metabolic tumor volume measurements in diffuse large B cell lymphoma Mol Imaging Biol, 22 (4) (2020), pp. 1102–1110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zijlstra JM, Comans EF, van Lingen A, et al. FDG PET in lymphoma: the need for standardization of interpretation. An observer variation study Nucl Med Commun, 28 (10) (2007), pp. 798–803 [DOI] [PubMed] [Google Scholar]
- 21.Frood R, Burton C, Tsoumpas C, et al. Baseline PET/CT imaging parameters for prediction of treatment outcome in Hodgkin and diffuse large B cell lymphoma: a systematic review Eur J Nucl Med Mol Imaging (2021), 10.1007/s00259-021-05233-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Annunziata S, Pelliccioni A, Hohaus S, et al. The prognostic role of end-of-treatment FDG-PET/CT in diffuse large B cell lymphoma: a pilot study application of neural networks to predict time-to-event Ann Nucl Med, 35 (1) (2021), pp. 102–110 [DOI] [PubMed] [Google Scholar]
- 23.Whiting PF, Rutjes AWS, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies Ann Intern Med, 155 (8) (2011), pp. 529–536 [DOI] [PubMed] [Google Scholar]
- 24.Bi L, Kim J, Kumar A, et al. Automatic detection and classification of regions of FDG uptake in whole-body PET-CT lymphoma studies Comput Med Imaging Graph, 60 (2017), pp. 3–10 [DOI] [PubMed] [Google Scholar]
- 25.Barrington SF, Meignan M Time to prepare for risk adaptation in lymphoma by standardizing measurement of metabolic tumor burden J Nucl Med, 60 (8) (2019), pp. 1096–1102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Berkowitz A, Basu S, Srinivas S, et al. Determination of whole-body metabolic burden as a quantitative measure of disease activity in lymphoma: a novel approach with fluorodeoxyglucose-PET Nucl Med Commun, 29 (6) (2008), pp. 521–526 [DOI] [PubMed] [Google Scholar]
- 27.Akhtari M, Milgrom SA, Pinnix CC, et al. Reclassifying patients with early-stage Hodgkin lymphoma based on functional radiographic markers at presentation Blood, 131 (1) (2018), pp. 84–94 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kostakoglu L, Goldsmith SJ 18F-FDG PET evaluation of the response to therapy for lymphoma and for breast, lung, and colorectal carcinoma J Nucl Med, 44 (2) (2003), pp. 224–239 [PubMed] [Google Scholar]
- 29.Gallamini A PET scan in Hodgkin lymphoma: role in diagnosis, prognosis, and treatment Springer, New York City, NY, USA: (2016) [Google Scholar]
- 30.Punwani S, Taylor SA, Saad ZZ, et al. Diffusion-weighted MRI of lymphoma: prognostic utility and implications for PET/MRI? Eur J Nucl Med Mol Imaging, 40 (3) (2013), pp. 373–385 [DOI] [PubMed] [Google Scholar]
- 31.Gull S, Akbar S Artificial intelligence in brain tumor detection through MRI Scans Artif Intelligence Internet Things (2021), pp. 241–276, 10.1201/9781003097204-10 [DOI] [Google Scholar]
- 32.Yousefirizi F, Jha AK, Brosch-Lenz J, et al. Towards high-throughput AI-based segmentation in oncological PET imaging. arXiv [physics.med-ph] Available at: http://arxiv.org/abs/2107.13661 (2021) Accessed September 10, 2021 [DOI] [PubMed]
- 33.Taghanaki SA, Abhishek K, Cohen JP, et al. Deep semantic segmentation of natural and medical images: a review Artif Intelligence Rev, 54 (1) (2021), pp. 137–178 [Google Scholar]
- 34.Hirata K, Manabe O, Magota K, et al. A preliminary study to use SUVmax of FDG PET-CT as an identifier of lesion for artificial intelligence Front Med, 8 (2021), p. 647562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Spatial and temporal image registration Yankeelov TE, Pickens DR, Price RR (Eds.), Quantitative MRI in cancer, CRC Press, Boca Raton, FL: (2011), pp. 256–269 [Google Scholar]
- 36.Jiao J, Searle GE, Tziortzi AC, et al. Spatio-temporal pharmacokinetic model based registration of 4D PET neuroimaging data. Neuroimage. 2014;84:225–235. [DOI] [PubMed] [Google Scholar]
- 37.Pereira G Deep Learning techniques for the evaluation of response to treatment in Hogdkin Lymphoma Available at: https://estudogeral.uc.pt/handle/10316/86276 (2018) Accessed September 10, 2021
- 38.Zhou Z, Jain P, Lu Y, et al. Computer-aided detection of mantle cell lymphoma on 18F-FDG PET/CT using a deep learning convolutional neural network Am J Nucl Med Mol Imaging, 11 (4) (2021), p. 260. [PMC free article] [PubMed] [Google Scholar]
- 39.Tricco AC, Lillie E, Zarin W, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation Ann Intern Med, 169 (7) (2018), pp. 467–473 [DOI] [PubMed] [Google Scholar]
- 40.Mayerhoefer ME, Umutlu L, Schöder H Functional imaging using radiomic features in assessment of lymphoma Methods, 188 (2021), pp. 105–111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Sheng VS, Provost F, Ipeirotis PG. Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ‘08. Association for Computing Machinery; Las Vegas, Nevada, USA: August 24–27, 2008:614–622. [Google Scholar]
- 42.Park SH, Han K Methodologic guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction Radiology, 286 (3) (2018), pp. 800–809 [DOI] [PubMed] [Google Scholar]
- 43.Pinochet P, Eude F, Becker S, et al. Evaluation of an automatic classification algorithm using convolutional neural networks in oncological positron emission tomography Front Med, 8 (2021), p. 628179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Sadik M, López-Urdaneta J, Ulén J, et al. Artificial intelligence could alert for focal skeleton/bone marrow uptake in Hodgkin’s lymphoma patients staged with FDG-PET/CT Sci Rep, 11 (1) (2021), p. 10382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Guo R, Hu X, Song H, et al. Weakly supervised deep learning for determining the prognostic value of 18F-FDG PET/CT in extranodal natural killer/T cell lymphoma, nasal type Eur J Nucl Med Mol Imaging (2021), 10.1007/s00259-021-05232-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Yuan C, Zhang M, Huang X, et al. Diffuse large B-cell lymphoma segmentation in PET-CT images via hybrid learning for feature fusion Med Phys (2021), 10.1002/mp.14847 mp.14847 [DOI] [PubMed] [Google Scholar]
- 47.Blanc-Durand P, Jégou S, Kanoun S, et al. Fully automatic segmentation of diffuse large B cell lymphoma lesions on 3D FDG-PET/CT for total metabolic tumour volume prediction using a convolutional neural network Eur J Nucl Med Mol Imaging, 48 (5) (2021), pp. 1362–1370 [DOI] [PubMed] [Google Scholar]
- 48.Weisman AJ, Kim J, Lee I, et al. Automated quantification of baseline imaging PET metrics on FDG PET/CT images of pediatric Hodgkin lymphoma patients EJNMMI Phys, 7 (1) (2020), p. 76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Weisman AJ, Kieler MW, Perlman SB, et al. Convolutional neural networks for automated PET/CT detection of diseased lymph node burden in patients with lymphoma Radiol Artif Intell, 2 (5) (2020), p. e200016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Sibille L, Seifert R, Avramovic N, et al. 18F-FDG PET/CT uptake classification in lymphoma and lung cancer by using deep convolutional neural networks Radiology, 294 (2) (2020), pp. 445–452 [DOI] [PubMed] [Google Scholar]
- 51.Li H, Jiang H, Li S, et al. DenseX-Net: an end-to-end model for lymphoma segmentation in whole-body PET/CT Images IEEE Access, 8 (2020), pp. 8004–8018 [Google Scholar]
- 52.Sadik M, Lind E, Polymeri E, et al. Automated quantification of reference levels in liver and mediastinal blood pool for the Deauville therapy response classification using FDG-PET/CT in Hodgkin and non-Hodgkin lymphomas Clin Physiol Funct Imaging, 39 (1) (2019), pp. 78–84 [DOI] [PubMed] [Google Scholar]
- 53.Goodfellow I, Bengio Y, Courville A Deep learning MIT Press, Cambridge, MA, USA: (2016) [Google Scholar]
- 54.Lippi M, Gianotti S, Fama A, et al. Texture analysis and multiple-instance learning for the classification of malignant lymphomas Comput Methods Programs Biomed, 185 (2020), p. 105153. [DOI] [PubMed] [Google Scholar]
- 55.Mayerhoefer ME, Riedl CC, Kumar A, et al. Radiomic features of glucose metabolism enable prediction of outcome in mantle cell lymphoma Eur J Nucl Med Mol Imaging, 46 (13) (2019), pp. 2760–2769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Hu H, Decazes P, Vera P, et al. Detection and segmentation of lymphomas in 3D PET images via clustering with entropy-based optimization strategy Int J Comput Assist Radiol Surg, 14 (10) (2019), pp. 1715–1724 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Yu Y, Decazes P, Lapuyade-Lahorgue J, et al. Semi-automatic lymphoma detection and segmentation using fully conditional random fields Comput Med Imaging Graph, 70 (2018), pp. 1–7 [DOI] [PubMed] [Google Scholar]
- 58.Grossiord É, Talbot H, Passat N, Meignan M, Najman L. Automated 3D lymphoma lesion segmentation from PET/CT characteristics. In: 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017). Melbourne, VIC, Australia; 18–21 April 2017:174–178. [Google Scholar]
- 59.Desbordes P, Petitjean C, Ruan S Segmentation of lymphoma tumor in PET images using cellular automata: a preliminary study IRBM, 37 (1) (2016), pp. 3–10 [Google Scholar]
- 60.Lartizien C, Rogez M, Niaf E, et al. Computer-aided staging of lymphoma patients with FDG PET/CT imaging based on textural information IEEE J Biomed Health Inform, 18 (3) (2014), pp. 946–955 [DOI] [PubMed] [Google Scholar]
- 61.Weisman A, Kim J, Lee I, et al. Automated deep learning-based quantification of baseline imaging PET metrics on FDG PET/CT images of pediatric lymphoma patients J Nucl Med, 61 (Suppl 1) (2020), p. 506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Puttagunta M, Ravi S Medical image analysis based on deep learning approach Multimed Tools Appl (2021), pp. 1–34 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kostakoglu L, Chauvie S PET-derived quantitative metrics for response and prognosis in lymphoma PET Clin, 14 (3) (2019), pp. 317–329 [DOI] [PubMed] [Google Scholar]
- 64.Lucignani SUV G and segmentation: pressing challenges in tumour assessment and treatment Eur J Nucl Med Mol Imaging, 36 (4) (2009), pp. 715–720 [DOI] [PubMed] [Google Scholar]
- 65.Yousefirizi F, Jha AK, Brosch-Lenz J, et al. Toward high-throughput artificial intelligence-based segmentation in oncological PET imaging PET Clin, 16 (4) (2021), pp. 577–596 [DOI] [PubMed] [Google Scholar]
- 66.Vercellino L, Cottereau A-S, Casasnovas O, et al. High total metabolic tumor volume at baseline predicts survival independent of response to therapy Blood, 135 (16) (2020), pp. 1396–1405 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Weisman AJ, Kieler MW, Perlman S, et al. Comparison of 11 automated PET segmentation methods in lymphoma Phys Med Biol, 65 (23) (2020), p. 235019. [DOI] [PubMed] [Google Scholar]
- 68.Rahim MK, Kim SE, So H, et al. Recent trends in PET image interpretations using volumetric and texture-based quantification methods in nuclear oncology Nucl Med Mol Imaging, 48 (1) (2014), pp. 1–15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Rizzo A, Triumbari EKA, Gatta R, et al. The role of 18F-FDG PET/CT radiomics in lymphoma Clin Translational Imaging (2021), 10.1007/s40336-021-00451-y [DOI] [Google Scholar]
- 70.Starmans MPA, van der Voort SR, Castillo Tovar JM, et al. Chapter 18 - radiomics: Data mining using quantitative medical image features Zhou SK, Rueckert D, Fichtinger G (Eds.), Handbook of medical image computing and computer assisted intervention, Academic Press, Cambridge, MA, US: (2020), pp. 429–456 [Google Scholar]
- 71.Hatt M, Le Rest CC, Tixier F, et al. Radiomics: data are also images J Nucl Med, 60 (Suppl 2) (2019), pp. 38S–44S [DOI] [PubMed] [Google Scholar]
- 72.Blanc-Durand P, Van Der Gucht A, Schaefer N, et al. Automatic lesion detection and segmentation of 18FET PET in gliomas : a full 3D U-Net convolutional neural network study J Nucl Med, 59 (Suppl 1) (2018), p. 330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Klyuzhin I, Xu Y, Harsini S, et al. Unsupervised background removal by dual-modality PET/CT guidance: application to PSMA imaging of metastases J Nucl Med, 62 (Suppl 1) (2021), p. 36 [Google Scholar]
- 74.Ou X, Zhang J, Wang J, et al. Radiomics based on 18 F-FDG PET/CT could differentiate breast carcinoma from breast lymphoma using machine-learning approach: a preliminary study Cancer Med, 9 (2) (2020), pp. 496–506 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Xu H, Guo W, Cui X, et al. Three-dimensional texture analysis based on PET/CT images to distinguish hepatocellular carcinoma and hepatic lymphoma Front Oncol, 9 (2019), p. 844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Ou X, Wang J, Zhou R, et al. Ability of 18F-FDG PET/CT radiomic features to distinguish breast carcinoma from breast lymphoma Contrast Media Mol Imaging, 2019 (2019), p. 4507694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Aide N, Talbot M, Fruchart C, et al. Diagnostic and prognostic value of baseline FDG PET/CT skeletal textural features in diffuse large B cell lymphoma Eur J Nucl Med Mol Imaging, 45 (5) (2018), pp. 699–711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Rodríguez Taroco MG, Cuña EG, Pages C, et al. Prognostic value of imaging markers from 18FDG-PET/CT in paediatric patients with Hodgkin lymphoma Nucl Med Commun, 42 (3) (2021), pp. 306–314 [DOI] [PubMed] [Google Scholar]
- 79.Eertink JJ, van de Brug T, Wiegers SE, et al. 18F-FDG PET baseline radiomics features improve the prediction of treatment outcome in diffuse large B-cell lymphoma Eur J Nucl Med Mol Imaging (2021), 10.1007/s00259-021-05480-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Wang H, Zhao S, Li L, et al. Development and validation of an 18F-FDG PET radiomic model for prognosis prediction in patients with nasal-type extranodal natural killer/T cell lymphoma Eur Radiol, 30 (10) (2020), pp. 5578–5587 [DOI] [PubMed] [Google Scholar]
- 81.Sun Y, Qiao X, Jiang C, et al. Texture analysis improves the value of pretreatment 18F-FDG PET/CT in predicting interim response of primary gastrointestinal diffuse large B-cell lymphoma Contrast Media Mol Imaging, 2020 (2020), p. 2981585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Aide N, Fruchart C, Nganoa C, et al. Baseline 18F-FDG PET radiomic features as predictors of 2-year event-free survival in diffuse large B cell lymphomas treated with immunochemotherapy Eur Radiol, 30 (8) (2020), pp. 4623–4632 [DOI] [PubMed] [Google Scholar]
- 83.Wu J, Lian C, Ruan S, et al. Treatment outcome prediction for cancer patients based on radiomics and belief function theory IEEE Trans Radiat Plasma Med Sci, 3 (2) (2019), pp. 216–224 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Tatsumi M, Isohashi K, Matsunaga K, et al. Volumetric and texture analysis on FDG PET in evaluating and predicting treatment response and recurrence after chemotherapy in follicular lymphoma Int J Clin Oncol, 24 (10) (2019), pp. 1292–1300 [DOI] [PubMed] [Google Scholar]
- 85.Lue K-H, Wu Y-F, Liu S-H, et al. Prognostic value of pretreatment radiomic features of 18F-FDG PET in patients with Hodgkin lymphoma Clin Nucl Med, 44 (10) (2019), pp. e559–e565 [DOI] [PubMed] [Google Scholar]
- 86.Lue K-H, Wu Y-F, Liu S-H, et al. Intratumor heterogeneity assessed by 18F-FDG PET/CT predicts treatment response and survival outcomes in patients with Hodgkin lymphoma Acad Radiol, 27 (8) (2020), pp. e183–e192 [DOI] [PubMed] [Google Scholar]
- 87.Zhou Y, Ma X-L, Pu L-T, et al. Prediction of Overall survival and progression-free survival by the 18F-FDG PET/CT radiomic features in patients with primary gastric diffuse large B-cell lymphoma Contrast Media Mol Imaging, 2019 (2019), p. 5963607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Milgrom SA, Elhalawani H, Lee J, et al. A PET radiomics model to predict refractory mediastinal Hodgkin lymphoma Sci Rep, 9 (1) (2019), 10.1038/s41598-018-37197-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Wang M, Xu H, Xiao L, et al. Prognostic value of functional parameters of 18F-FDG-PET images in patients with primary renal/adrenal lymphoma Contrast Media Mol Imaging, 2019 (2019), p. 2641627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Parvez A, Tau N, Hussey D, et al. Erratum to: 18F-FDG PET/CT metabolic tumor parameters and radiomics features in aggressive non Hodgkin’s lymphoma as predictors of treatment outcome and survival Ann Nucl Med, 32 (6) (2018), pp. 410–416 [DOI] [PubMed] [Google Scholar]
- 91.Ben Bouallègue F, Tabaa YA, Kafrouni M, et al. Association between textural and morphological tumor indices on baseline PET-CT and early metabolic response on interim PET-CT in bulky malignant lymphomas Med Phys, 44 (9) (2017), pp. 4608–4619 [DOI] [PubMed] [Google Scholar]
- 92.Pfaehler E, Burggraaff C, Kramer G, et al. PET segmentation of bulky tumors: strategies and workflows to improve inter-observer variability PLoS One, 15 (3) (2020), p. e0230901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Noortman WA, Vriens D, Slump CH, et al. Adding the temporal domain to PET radiomic features PLoS One, 15 (9) (2020), p. e0239438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Fave X, Zhang L, Yang J, et al. Delta-radiomics features for the prediction of patient outcomes in non–small cell lung cancer Sci Rep, 7 (1) (2017), pp. 1–11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Carvalho S, Leijenaar RTH, Troost EGC, et al. Early variation of FDG-PET radiomics features in NSCLC is related to overall survival - the “delta radiomics” concept Radiother Oncol, 118 (Suppl 1) (2016), pp. S20–S21 [Google Scholar]
- 96.Mayerhoefer ME, Materka A, Langs G, et al. Introduction to radiomics J Nucl Med, 61 (4) (2020), pp. 488–495 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Alahmari SS, Cherezov D, Goldgof D, et al. Delta radiomics improves pulmonary nodule malignancy prediction in lung cancer screening IEEE Access, 6 (2018), pp. 77796–77806 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Wang L, Gao Z, Li C, et al. Computed tomography--based delta-radiomics analysis for discriminating radiation pneumonitis in patients with esophageal cancer after radiation therapy Int J Radiat Oncol Biol Phys (2021) Available at: https://www.sciencedirect.com/science/article/pii/S0360301621004752?casa_token=klpUfNmgEhEAAAA:ZOAShjEEgzDXK6JxuvCpIWKcps-6o7x51hP4a952C9kqQMbH7zXrqgkjIhumgcLoWkrVJu8 [DOI] [PubMed] [Google Scholar]
- 99.Mazzei MA, Di Giacomo L, Bagnacci G, et al. Delta-radiomics and response to neoadjuvant treatment in locally advanced gastric cancer—a multicenter study of GIRCG (Italian Research Group for Gastric Cancer) Quantitative Imaging Med Surg, 11 (6) (2021), pp. 2376–2387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Liu Y, Shi H, Huang S, et al. Early prediction of acute xerostomia during radiation therapy for nasopharyngeal cancer based on delta radiomics from CT images Quant Imaging Med Surg, 9 (7) (2019), pp. 1288–1302 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Nasief H, Zheng C, Schott D, et al. A machine learning based delta-radiomics process for early prediction of treatment response of pancreatic cancer NPJ Precis Oncol, 3 (2019), p. 25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Fave X, Zhang L, Yang J, et al. Using pretreatment radiomics and delta-radiomics features to predict non–small cell lung cancer patient outcomes Int J Radiat Oncol Biol Phys, 98 (1) (2017), p. 249 [Google Scholar]
- 103.Bera K, Velcheti V, Madabhushi A Novel quantitative imaging for predicting response to therapy: techniques and clinical applications Am Soc Clin Oncol Educ Book, 38 (2018), pp. 1008–1018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Kohli MD, Summers RM, Geis JR Medical image data and datasets in the era of machine learning-whitepaper from the 2016 C-MIMI meeting dataset session J Digit Imaging, 30 (4) (2017), pp. 392–399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Prior F, Smith K, Sharma A, et al. The public cancer radiology imaging collections of the cancer imaging archive Sci Data, 4 (2017), p. 170124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Dose-adjusted EPOCH-R compared with R-CHOP as frontline therapy for diffuse large B-cell lymphoma (CALGB50303) - the cancer imaging archive (TCIA) public access - cancer imaging archive wiki Available at: https://wiki.cancerimagingarchive.net/pages/viewpage.action?pageId=70225094 Accessed October 1, 2021
- 107.Yan K, Wang X, Lu L, et al. DeepLesion: automated mining of large-scale lesion annotations and universal lesion detection with deep learning J Med Imaging (Bellingham), 5 (3) (2018), p. 036501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Kiyosawa M, Ohmura M, Mizuno K, et al. [18F-FDG positron emission tomography in orbital lymphoid tumor] Nihon Ganka Gakkai Zasshi, 89 (12) (1985), pp. 1329–1333 [PubMed] [Google Scholar]
- 109.Kuwabara Y, Ichiya Y, Otsuka M, et al. High [18F]FDG uptake in primary cerebral lymphoma: a PET study J Comput Assist Tomogr, 12 (1) (1988), pp. 47–48 [DOI] [PubMed] [Google Scholar]
- 110.Coiffier B, Lepage E, Briere J, et al. CHOP chemotherapy plus rituximab compared with CHOP alone in elderly patients with diffuse large B-cell lymphoma N Engl J Med, 346 (4) (2002), pp. 235–242 [DOI] [PubMed] [Google Scholar]
- 111.Schuster SJ, Bishop MR, Tam CS, et al. Tisagenlecleucel in adult relapsed or refractory diffuse large B-cell lymphoma N Engl J Med, 380 (1) (2019), pp. 45–56 [DOI] [PubMed] [Google Scholar]
- 112.Wahl RL, Jacene H, Kasamon Y, et al. From RECIST to PERCIST: evolving Considerations for PET response criteria in solid tumors J Nucl Med, 50 (Suppl 1) (2009), pp. 122S–150S [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Cheson BD, Pfistner B, Juweid ME, et al. Revised response criteria for malignant lymphoma J Clin Oncol, 25 (5) (2007), pp. 579–586 [DOI] [PubMed] [Google Scholar]
- 114.Cheson BD, Ansell S, Schwartz L, et al. Refinement of the Lugano Classification lymphoma response criteria in the era of immunomodulatory therapy Blood, 128 (21) (2016), pp. 2489–2496 [DOI] [PubMed] [Google Scholar]
- 115.Hutchings M, Loft A, Hansen M, et al. FDG-PET after two cycles of chemotherapy predicts treatment failure and progression-free survival in Hodgkin lymphoma Blood, 107 (1) (2006), pp. 52–59 [DOI] [PubMed] [Google Scholar]
- 116.Burggraaff CN, de Jong A, Hoekstra OS, et al. Predictive value of interim positron emission tomography in diffuse large B-cell lymphoma: a systematic review and meta-analysis Eur J Nucl Med Mol Imaging, 46 (1) (2019), pp. 65–79 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.André MPE, Girinsky T, Federico M, et al. Early positron emission tomography response-adapted treatment in stage I and II Hodgkin lymphoma: final results of the randomized EORTC/LYSA/FIL H10 trial J Clin Oncol, 35 (16) (2017), pp. 1786–1794 [DOI] [PubMed] [Google Scholar]
- 118.Borchmann P, Goergen H, Kobe C, et al. PET-guided treatment in patients with advanced-stage Hodgkin’s lymphoma (HD18): final results of an open-label, international, randomised phase 3 trial by the German Hodgkin Study Group Lancet, 390 (10114) (2017), pp. 2790–2802 [DOI] [PubMed] [Google Scholar]
- 119.Sehn LH, Gascoyne RD Diffuse large B-cell lymphoma: optimizing outcome in the context of clinical and biologic heterogeneity Blood, 125 (1) (2015), pp. 22–32 [DOI] [PubMed] [Google Scholar]
- 120.Gong K, Kim K, Cui J, Wu D, Li Q The Evolution of Image Reconstruction in PET: From Filtered Back-Projection to Artificial Intelligence PET Clin, 16 (4) (2021. October), pp. 533–542, 10.1016/j.cpet.2021.06.004 [DOI] [PubMed] [Google Scholar]
- 121.Liu J, Malekzadeh M, Mirian N, Song TA, Liu C, Dutta J Artificial Intelligence-Based Image Enhancement in PET Imaging: Noise Reduction and Resolution Enhancement PET Clin, 16 (4) (2021. October), pp. 553–576, 10.1016/j.cpet.2021.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Yousefirizi F, Jha AK, Brosch-Lenz J, Saboury B, Rahmim A Toward High-Throughput Artificial Intelligence-Based Segmentation in Oncological PET Imaging PET Clin, 16 (4) (2021. October), pp. 577–596, 10.1016/j.cpet.2021.06.001 [DOI] [PubMed] [Google Scholar]
- 123.Jha AK, Myers KJ, Obuchowski NA, et al. Objective Task-Based Evaluation of Artificial Intelligence-Based Medical Imaging Methods: Framework, Strategies, and Role of the Physician. PET Clin. 2021;16(4):493–511. [DOI] [PubMed] [Google Scholar]
