Abstract
Purpose
To develop quantitative molecular imaging biomarkers of immune-related adverse event (irAE) development in malignant melanoma (MM) patients receiving immune-checkpoint inhibitors (ICI) imaged with 18F-FDG PET/CT.
Methods
18F-FDG PET/CT images of 58 MM patients treated with anti-PD-1 or anti-CTLA-4 ICI were retrospectively analyzed for indication of irAE. Three target organs, most commonly affected by irAE, were considered: bowel, lung, and thyroid. Patient charts were reviewed to identify which patients experienced irAE, irAE grade, and time to irAE diagnosis. Target organs were segmented using a convolutional neural network (CNN), and novel quantitative imaging biomarkers — SUV percentiles (SUVX%) of 18F-FDG uptake within the target organs — were correlated with the clinical irAE status. Area under the receiver-operating characteristic curve (AUROC) was used to quantify irAE detection performance. Patients who did not experience irAE were used to establish normal ranges for target organ 18F-FDG uptake.
Results
A total of 31% (18/58) patients experienced irAE in the three target organs: bowel (n=6), lung (n=5), and thyroid (n=9). Optimal percentiles for identifying irAE were bowel (SUV95%, AUROC=0.79), lung (SUV95%, AUROC=0.98), and thyroid (SUV75%, AUROC=0.88). Optimal cut-offs for irAE detection were bowel (SUV95%>2.7 g/mL), lung (SUV95%>1.7 g/mL), and thyroid (SUV75%>2.1 g/mL). Normal ranges (95% confidence interval) for the SUV percentiles in patients without irAE were bowel [1.74, 2.86 g/mL], lung [0.73, 1.46 g/mL], and thyroid [0.86, 1.99 g/mL].
Conclusions
Increased 18F-FDG uptake within irAE-affected organs provides predictive information about the development of irAE in MM patients receiving ICI and represents a potential quantitative imaging biomarker for irAE. Some irAE can be detected on 18F-FDG PET/CT well before clinical symptoms appear.
Keywords: Immune-checkpoint inhibitors, Immune-related adverse effects, 18F-FDG PET/CT, Quantitative imaging biomarkers
Introduction
New cancer treatments with ICI have improved outcomes and altered management strategies for patients with melanoma and a variety of other malignancies. Many trials have shown the safety and efficacy of ICI targeting programmed death-1 (PD-1), programmed death ligand-1 (PD-L1), and cytotoxic T-lymphocyte antigen-4 (CTLA-4) and these agents are now widely implemented treatments for melanoma in both metastatic and adjuvant settings. None of the anti-PDL-1 agents has been approved for treatment of melanoma as single agent or in combination with other ICIs [1–5].
ICI treatments might cause immune-related toxicities, called irAE, where an immune response is generated against healthy tissue. These irAEs can occur in any organ system [6]. Though the exact pathophysiology of irAE is not completely understood, it is believed that they are provoked by immune upregulation and inflammation. Depending on the treatment regimen, irAEs might occur less frequently when compared to cytotoxic chemotherapy-related toxicities [7]. However, some patients might experience higher grades of irAEs that require hospitalization or prolonged treatment and might be life-threatening [8, 9]. Due to the novel mechanism of action, unpredictable nature, and broad usage of ICIs, development of biomarkers capable of early detection and monitoring of irAEs is an area of urgent need [10].
Positron emission tomography/computed tomography with [18F]2fluoro-2-deoxy-D-glucose (18F-FDG PET/CT) is a sensitive and non-invasive test commonly used in diagnosis, staging, and treatment response evaluation in MM [9, 11, 12]. The combination of 18F-FDG PET and CT allows functional and morphological evaluation of the disease and guides clinical decision-making and treatment selection. It is also a very sensitive method for recognizing inflammatory processes that can be reflective of irAE [13–16]. Some preliminary analysis of irAE detection and monitoring via 18F-FDG PET/CT has been undertaken [17–19]. None of them perfomed a quantitative assessment of the observed organs.
For melanoma patients receiving ICI, 18F-FDG PET/CT is performed for the purpose of disease response assessment. However, incidental findings of increased tracer uptake in off-target organs are sometimes reported. The fact that 18F-FDG PET/CT is currently not used to detect irAE may be due in part to the impracticality of performing a manual whole-organ assessment of 18F-FDG uptake. Small organs such as the thyroid can be assessed manually with relative ease, but larger organs such as the bowel or lung would require hours of expert physician time to contour manually. We hypothesize that this is a contributing factor to why irAE assessment on 18F-FDG PET/CT has thus far been limited to the thyroid [17, 18], or to qualitative assessment only in the bowel [19]. To overcome this hurdle, automated image analysis techniques can be employed. In particular, the segmentation of organs on PET/CT via deep learning-based CNNs has recently become possible [20, 21].
The goal of this study was to identify quantitative imaging biomarkers of irAE development on 18F-FDG PET/CT in a cohort of patients with MM who were treated with ICI. MM is specifically selected for this pilot retrospective study as ICI regimens and response evaluation with 18F-FDG PET/CT are standard of care treatment for this disease and routinely performed. Other malignancies that utilize 18F-FDG PET/CT for response evaluation use ICI combined with other treatment modalities (e.g., cytotoxic chemotherapy or targeted therapy) which makes it difficult to tease out irAE from other treatment-related AEs. We hypothesized that an automated organ segmentation method would be capable of quantifying irAE-susceptible organ inflammation in 18F-FDG PET/CT images in metastatic melanoma patients treated with ICI.
Patients and methods
We conducted a retrospective pilot study, analyzing 18F-FDG PET/CT images from patients with metastatic melanoma who were treated per standard of care with ICI (anti-CTLA-4 or anti-PD1) at the Institute of Oncology Ljubljana (OIL), Slovenia (January 2016–January 2019) or at the University of Wisconsin Carbone Cancer Centre (UWCCC), Madison, WI, USA (June 2012–June 2019). None of the patients had an autoimmune disease. All available 18F-FDG PET/CT data acquired before and during ICI treatment was collected for review. We determined the date of clinical irAE detection via chart review. If the irAE grade was not explicitly documented in the chart, when possible, irAE grading was assigned retrospectively based on the available clinical course documentation following Common Terminology Criteria for Adverse Events (CTCAE, v.5.0) [22]. Clinical and demographic data were collected from both hospital databases. Clinical and imaging data were anonymized and stored in a secure LabKey database server [23].
The study was approved by the Institutional Review Board Committee of both Institutions (Approval number: 2016–0418 in Madison, USA; ERIDKE-0005/2020 in Ljubljana, Slovenia) and was conducted following the ethical standards defined by the Declaration of Helsinki. At OIL, patients have signed informed consent for treatment and consent allowing the usage of their data for scientific purposes. At UWCCC, the study was approved with a waiver of informed consent.
PET acquisition
PET scans were primarily performed for immunotherapy treatment response evaluation in melanoma patients. Images were acquired on five PET/CT scanners: GE Discovery 710, GE Discovery STE, GE Discovery IQ, GE Discovery MI (General Electric, Waukesha, WI), and mCT (Siemens, Knoxville, TN). In all cases, the imaging protocol required patients to fast for 6 h prior to injection of the radiotracer and have a blood glucose level below 200 mg/dL (UWCCC) or 6–10 mmol/L (OIL) at the time of the scan. Patients were required to hold all diabetic medication, including metformin, for 6 h prior to radiotracer injection. On the GE Discovery IQ, patients were injected with 259±52 MBq of 18F-FDG, while on other scanners, patients were injected with a weight-based dose of 5 (OIL) to 5.2 MBq per kilogram and a minimum 370 MBq (UWCCC) of 18F-FDG. Scans were acquired 60±10 min post-injection. For UWCCC patients, the CT used in segmentation was a low-dose CT acquired for attenuation correction. At OIL, CT that meets RECIST analysis needs was acquired according to adjusted protocol including SAFIR reconstruction to minimize dose. Following reconstruction, images were normalized by patient weight and injected dose to compute standardized uptake values (SUV). If available, TOF reconstruction was used.
18F-FDG PET/CT image analysis
To quantify organ 18F-FDG uptake, a CNN was trained to segment the thyroid, lungs, and bowel from the low-dose CT component of patients’ PET/CT imaging data. A CNN was chosen for segmentation for the ability to segment irregular and variable structures and for the ability to successfully segment multiple target structures with very different sizes (e.g., thyroid versus lung) [21]. The network architecture used was DeepMedic, a 3-D, patch-based CNN with multi-resolution pathways [24]. The loss function used was Dice similarity coefficient (DSC). The optimizer was RMSprop [25]. Sixty manual contours of the bowel, lung, and thyroid were produced using a public dataset of N=20 patients from the VISCERAL.eu Anatomy3 benchmark [26], and an additional private institutional dataset of N=40 patients by an experienced graduate student using 3D Slicer [27]. Labelled data were split 80%/20% (n=48/n=12) for CNN training/validation. Images were resampled to a cubic 2-mm grid and normalized to have a mean of 0 and variance of 1 within the patient. Data augmentation via histogram shifting, histogram scaling, and random rotation was used to increase the effective training dataset size. The CNN was trained using a workstation with one NVIDIA Titan Xp GPU with 12 GB of memory.
The trained CNN was used to perform inference on the CT component from the 18F-FDG PET scans and produce contours of the thyroid, lung, and bowel. The contours were then applied to the PET image to quantify 18F-FDG uptake within the three target organs. To determine the ability of PET to detect irAE, percentiles of the distribution of SUV from within each target organ (SUVX%) were extracted. Percentiles of the distribution of organ SUV were pursued as potential biomarkers of irAE due to their improved reliability as compared to SUVmax [28].
Receiver operating characteristic (ROC) analysis was performed to determine the value of organ SUV percentiles as potential quantitative imaging biomarkers of irAE development. This was done by comparing organ SUV percentiles with the clinical irAE status as determined by chart review. For patients who had multiple 18F-FDG PET/CT exams during ICI treatment, the maximum organ SUV percentile value was used as a predictor of irAE. The optimal organ SUV percentile (SUVOPT%) was defined to be the percentile that maximized the area under the ROC curve (AUROC) for predicting irAE status (Eq. 1).
(1) |
where SUVX% are the set of percentiles of the distribution of organ SUV. SUVOPT% was measured on all available 18F-FDG PET scans and tracked longitudinally to assess if changes in target organ 18F-FDG uptake may precede clinical irAE identification.
Target organ 18F-FDG uptake was also assessed in patients who did not experience irAE. This was done to establish normal ranges for organ 18F-FDG SUVOPT% values against which SUVOPT% values from patients with irAE can be compared. The 95% confidence interval for SUVOPT% was determined for each target organ using the baseline PET images of N=15 patients who did not experience any irAE (Eq. 2).
(2) |
where μ and σ are the mean and standard deviation of baseline SUVOPT% values of patients who did not experience irAE, respectively.
Statistical analysis
For each target organ (bowel, lung, thyroid), patients were divided into two groups: patients who experienced irAE and patients who did not experience irAE. Differences in SUV metric by irAE status were assessed with Wilcoxon rank-sum tests. p<0.05 was considered to be statistically significant. ROC analysis was performed to determine the ability of SUV metrics to detect irAE. Optimal cutoff values for detecting irAE were assigned to maximize the Youden’s index (sensitivity+specificity-1). Image analysis and statistical testing was done using MATLAB R2020b (The MathWorks, Inc., Natick, MA, USA).
Results
Patient characteristics
Out of 58 MM patients treated with anti-PD-1 or anti-CTLA-4 ICI, 20 irAE of interest (colitis, pneumonitis, and thyroiditis) were observed on 18F-FDG PET/CT scans in 18 (31%) patients: the bowel N=6 (11%), lung N=5 (9%), and thyroid N=9 (16%). Two patients had multiple target organ irAE (one had colitis and pneumonitis, one had thyroiditis and pneumonitis). 18F-FDG PET of 15 (26%) patients without irAE were used to establish normal ranges. The remaining 25 (43%) patients had irAE involving organs other than irAE of interest and were considered jointly with the previous group as negative for the purpose of organ-specific irAE identification.
Patient demographics for all 58 patients are summarized in Table 1.
Table 1.
Patients | Total cohort | UWCCC | OIL | p * |
---|---|---|---|---|
N = 58 (%) | N = 29 (%) | N = 29 (%) | ||
Age; mean (±sd) (yr) | 61 (14) | 57 (15) | 65 (13) | 0.02 |
Gender | 1 | |||
Male | 32 (55%) | 16 (55%) | 16 (55%) | |
Actionable mutation | 0.4 | |||
BRAF wild type | 38 (66%) | 18 (62%) | 20 (69%) | |
BRAF mutated | 10 (17%) | 5 (17%) | 5 (17%) | |
Mutations other than BRAF | 7 (12%) | 6 (21%) | 1 (4%) | |
Unknown | 3 (5%) | 0 (0%) | 3 (10%) | |
ECOG performance status | 0.001 | |||
0 | 30 (51%) | 17 (59%) | 13 (45%) | |
1 | 15 (26%) | 3 (10%) | 12 (41%) | |
2 | 4 (7%) | 0 (0%) | 4 (14%) | |
Unknown | 9 (16%) | 9 (31%) | 0 (0%) | |
Anatomic site of primary | 0.6 | |||
Cutaneous | 48 (83%) | 21 (73%) | 27 (94%) | |
Mucosal | 4 (7%) | 3 (10%) | 1 (3%) | |
Ocular | 2 (3%) | 2 (7%) | 0 (0%) | |
Unknown primary | 4 (7%) | 3 (10%) | 1 (3%) | |
Treatment setting | 0.001 | |||
Adjuvant setting | 10 (17%) | 10 (34%) | 0 (0%) | |
Metastatic setting | 48 (83%) | 19 (66%) | 29 (100%) | |
Best response — only for metastatic disease (N=48) | 0.8 | |||
Complete response | 17 (35%) | 8 (42%) | 9 (31%) | |
Progressive disease | 10 (21%) | 5 (26%) | 5 (17%) | |
Radiographic response | 14 (29%) | 3 (16%) | 11 (38%) | |
Mixed radiographic response | 6 (13%) | 2 (11%) | 4 (14%) | |
Other | 1 (2%) | 1 (5%) | 0 (0%) | |
Number of 18F-FDG PET/CT per patient, median (range) | 4 (2–16) | 3 (2–16) | 4 (2–11) | 0.5 |
Sd standard deviation, UWCCC University of Wisconsin Carbone Cancer Centre, OIL Institute of Oncology Ljubljana Slovenia,.
p value was calculated using Fisher’s exact test to compare the statistical difference in patients characteristics between UWCCC and OIL
A total of 261 18F-FDG PET/CT exams were collected. The median number of 18F-FDG PET/CT exams available per patient was 4 (range: 2, 16). The median time between baseline 18F-FDG PET/CT and ICI treatment start was 36 days (range: −119, 102). The median time from ICI treatment start to first follow-up 18F-FDG PET/CT was 113 days (range: 2, 245). All US patients had baseline scan. In the Slovenian patient group, one patient did not have a baseline 18F-FDG PET/CT (a CT scan was performed instead) and another patient had a baseline 18F-FDG PET/CT at a different institution. Neither of these two patients developed irAE of interest.
Median time from ICI treatment start to clinical identification of irAE was 195 days (range: 17, 310) for bowel (N =7), 187 days (range: 122, 299) for lung (N=3), and 53 days (range: 17, 637) for thyroid (N=10). The date of clinical diagnosis of irAE was not available for two irAE seen on PET scan, both irAE occurred in the same patients.
CNN organ segmentation
The performance of the CNN for segmentation of the three target organs was assessed with DSC and average symmetric surface distance (ASSD) [29]. Validation DSC for the bowel, lung, and thyroid were as follows: 0.87±0.06, 0.98±0.01, and 0.57±0.11, respectively (mean±sd). Corresponding values for ASSD were 2.6±1.2 mm, 0.8±0.3 mm, and 6.8±9.6 mm.
Quantification of irAE by 18F-FDG PET/CT
The optimal SUV percentiles (SUVOPT%) for the three target organs were as follows: bowel=SUV95%, lung=SUV95%, thyroid=SUV75%. AUROC as a function of SUV percentile is shown in Fig. 1. The AUROC for SUVOPT% to predict irAE status was 0.79 for bowel, 0.98 for lung, and 0.88 for thyroid. The optimal cutoff for SUVOPT% was 2.7 g/mL for bowel, 1.7 g/mL for lung, and 2.1 g/mL for thyroid. Sensitivity and specificity at the optimal cutoff were 1.00 and 0.49 for bowel, 1.00 and 0.96 for lung, and 0.89 and 0.81 for thyroid. The ROC curves for SUVOPT% are shown in Fig. 2. SUVOPT% of all target organs was significantly higher for patients with irAE in that organ than for patients without irAE (Wilcoxon rank-sum test, p<0.05) (Fig. 3).
irAE detection by 18F-FDG PET/CT versus clinical detection
Dates of clinical irAE detection were collected via medical records review. Longitudinal time series of SUVOPT% for patients with irAE are shown in Fig. 4 with dates of clinical irAE detection indicated with dashed vertical lines, when available. Multiple cases of target organ 18F-FDG uptake increasing outside of normal ranges prior to clinical diagnosis were observed, but due to the retrospective nature of the data collection, no formal statistical comparison of irAE detection timing was made. Images of three patients who experienced irAE of the lung, combination irAE of the lung and bowel, and thyroid are highlighted in Fig. 5, Fig. 6 and 7, respectively.
Normal 18F-FDG uptake in irAE-affected organs
To determine normal ranges for target organ SUVOPT% values, a subcohort of N=15 patients who did not experience any irAE was identified. Baseline 18F-FDG PET/CT of these patients were analyzed, and SUVOPT% values for each target organ were calculated. From these, a confidence interval for normal SUVOPT% was constructed as defined by Eq. 2. The 95% confidence intervals for SUVOPT% for patients without irAE in the three target organs were as follows: bowel CI95,SUV95%=[1.74 g/mL, 2.86 g/mL], lung CI95,SUV95%=[0.73 g/mL, 1.46 g/mL], and thyroid CI95,SUV75%=[0.86 g/mL, 1.99 g/mL]. Confidence intervals for normal SUVOPT% are shown as horizontal grey bands in Fig. 4.
Discussion
This is the first study to propose a quantitative imaging analysis of irAE development based on 18F-FDG PET/CT imaging. We hypothesized that patients with melanoma who were treated with ICIs and experience irAE would demonstrate increased 18F-FDG uptake in the involved organ at the time of clinical irAE diagnosis. In our cohort, patients who experienced immune-related thyroiditis, pneumonitis, or colitis demonstrated increased 18F-FDG uptake in the involved organs. This increased uptake could be quantified and monitored longitudinally utilizing an automated image analysis platform that employs CNN-based whole-organ segmentation. Furthermore, elevated 18F-FDG uptake preceded clinical detection of irAE in several cases, indicating that 18F-FDG PET/CT might provide valuable information in the early detection and management of irAE. This represents an automatic quantitative procedure that is reproducible, non-subjective, and provides information not available by visual analysis only. It provides additional quantitative information for both, radiologist/nuclear medicine to facilitate faster and more accurate informative reads, as well as to a treating physician about the extent of inflammation (e.g., pattern of inflammation, intensity of inflammation). As currently no clinical biomarkers for the development of irAE exists, clinicians must rely on laboratory tests and often perform invasive procedures such as biopsies; quantitative imaging biomarkers would therefore provide welcome additional tool for improved patient management.
We selected three ICI patients who were imaged regularly by 18F-FDG PET/CT to highlight as case studies of irAE detection by PET (Fig. 5, 6 and 7). Delay in irAE diagnosis might cause more severe symptoms, which are harder to reverse and could turn into life-threatening situations. Increased radiotracer uptake in the scans preceded the symptomatic clinical diagnosis. In the case of the patient highlighted in Fig. 5, the patient had increased 18F-FDG uptake in the lungs on day 245 scan (SUV95%=4.1 g/mL) while having minimal symptoms of pneumonitis. This radiographic finding allowed the patient to have all necessary diagnostic procedures performed to confirm irAE, including bronchoscopy. For the patient highlighted in Fig. 6, increasing bowel 18F-FDG uptake is seen on the day 84 scan (SUV95%=3.1 g/mL) and day 173 scan (SUV95%=4.0 g/mL). Immune-related colitis was not clinically diagnosed until 3 weeks later when the patient was hospitalized and underwent colonoscopy with biopsy. This patient also experienced immune-related pneumonitis on day 273, which can be seen as bilateral, diffuse, elevated 18F-FDG uptake in the lungs (SUV95%=2.2 g/mL). Both patients had a complete resolution of their irAE after receiving systemic corticosteroids, while they continued to have a favorable treatment response in their melanoma.
For identifying irAE, SUV95% was the optimal SUV percentile for the bowel and lung; however, for the thyroid, SUV75% performed best. This difference in the optimal SUV percentile is likely due to the difference in inflammation patterns in different organs. For example, inflammation in thyroid is often relatively uniform throughout the organ volume, and so the SUV histogram is shifted up uniformly in patients with thyroid AE. This is further supported by the relatively small difference in AUROC for thyroid AE classification in the range of SUV65% to SUV95% seen in Fig. 1. In contrast, irAE findings in the lung and bowel are often localized to a small fraction of the organ volume (e.g., a single segment of bowel), so elevated uptake is only reflected in the top few percent of the organ volume. Additionally, the lower segmentation performance of thyroid, being a smaller organ, may also be a contributing factor for reducing the optimal SUV percentile, as mis-segmentation can impact SUV percentile quantification [28]. To further justify our methodology, the metric we used for irAE identification — percentiles of the distribution of organ SUV — is less sensitive to segmentation error than other non-histogram SUV metrics, where only a few mislabelled out of distribution pixels can significantly change the final value. The diversity of scanner and treatment setting in our multicenter study improves the robustness of this analysis by including factors that could potentially impact the SUV percentile calculations.
There are several previous case reports of radiographic findings associated with irAE on 18F-FDG PET/CT [30]. In the case of thyroiditis, intense diffuse radiotracer uptake in the thyroid was reported in previous studies [31]. In a small retrospective study of lung cancer patients, increased radiotracer uptake in 18F-FDG PET was found to predict thyroiditis even before laboratory findings indicating changes in the thyroid function [17]. Both of these studies relied on manual segmentation of the thyroid for quantification. In our pilot study, we also observed elevated thyroid 18F-FDG uptake as quantified by SUV75% in patients who experienced thyroiditis. For colitis, 18F-FDG PET was prospectively studied in a cohort of 100 metastatic melanoma patients for colitis/diarrhea [19]. In this study, the determination of PET-colitis was made based on the presence of diffuse, clearly elevated tracer uptake in the colon as interpreted by a radiologist. However, no quantification of colonic 18F-FDG uptake was performed. They found a significant correlation between PET-colitis and clinical presentation of diarrhea. In contrast to these studies, our method for irAE detection does not require manual organ segmentation, nor does it rely on radiologist interpretation. Additionally, our study uses optimized metric, percentiles of the SUV distribution, to detect irAE. Our study is also the first to perform a quantitative analysis of 18F-FDG uptake of the lung and bowel and evaluate its association with the timing of clinical presentation of irAE in those organs.
For our proposed imaging biomarkers of irAE, the sensitivity and specificity reported were for the cutoff which maximized the sum of sensitivity and specificity. However, depending on clinical need, a different operating point may be more appropriate. The high sensitivity may be traded for increased specificity. This is especially salient in the case of the bowel, where the reported operating point has 100% sensitivity, but only 49% specificity. Instead, operating points of 83% sensitivity/64% specificity or 66% sensitivity/81% specificity could be used for bowel as seen on the ROC curve in Fig. 2.
Another important possible confounding factor in the bowel is the antidiabetic medication metformin that can cause diffuse radiotracer uptake in the bowel. Withholding metformin for 6 h prior to the exam in order to avoid hypoglycemia is not sufficient to reduce the influence of metformin on radiopharmaceutical biodistribution in the bowel. In our cohort, 5/58 (9%) patients had metformin listed in their medication list in the electronic health record (EHR) at the time of 18F-FDG PET/CT imaging. None of the patients receiving metformin experienced irColitis; however, 3/5 (60%) demonstrated elevated bowel uptake above the established normal range on at least one 18F-FDG PET/CT scan. This is one factor which contributed to decreasing the specificity of SUV95% as a biomarker of irColitis. Additionally, high variation in physiological 18F-FDG uptake not related to metformin usage may also contribute to the reduced specificity.
Quantification of 18F-FDG uptake in relevant organs for patients undergoing ICI could improve personalized management of these patients. The use of imaging biomarkers could help with proactively managing the treatment-related AE and direct when to hold or discontinue treatment before severe clinically noticeable symptoms appear. The use of accurate quantification could help to detect subclinical inflammation and guide the clinicians to be more vigilant in cases with evidence of high 18F-FDG uptake scores. There is a possibility of a subclinical increased inflammation in the target organs which might increase the 18F-FDG uptake in patients who did not experience clinically diagnosed irAE. This would have increased the mean baseline tracer uptake. However, this subclinical increase in tracer uptake does not impact the treatment course and was therefore not considered irAE.
Many unanswered questions need to be addressed in future studies. The optimal timing of 18F-FDG PET during ICI treatment for early detection of irAEs is currently unknown. The timing of irAE development varies widely, and an irAE can even occur after discontinuation or completion of ICI therapy [32]. Perhaps a quantitative analysis of an early timepoint 18F-FDG PET/CT (e.g., at 1 month after ICI treatment initiation) could identify patients who are at greater risk for irAE, as 18F-FDG PET/CT after 1 month has been shown to be predictive of patient response to ICI [11]. Furthermore, quantitative analysis of irAE severity and the extent of organ involvement could help to decide if ICI rechallenge could be considered with a reasonably low risk of irAE recurrence. A recent retrospective study reported a 29% recurrence rate of the same irAE after ICI rechallenge [33]. Possibly, 18F-FDG PET/CT quantification could help us predict which patients are at higher risk for recurrence of irAE.
In this study, we quantified irAE of the lung, bowel, and thyroid. However, it is known that irAE can affect practically any organ, some of them being very rare but often life threatening [6, 8, 9]. Due to their rarity, a large study cohort would be required to identify imaging biomarkers of irAE in these organs. On the other hand, some irAE are common and can be readily diagnosed clinically, for example, cutaneous irAE (rash, pruritis), which are readily apparent without imaging.
There are several limitations of this study which should be discussed. First, this study was conducted in a retrospective cohort of patients. Due to retrospective nature of the study, some data on irAE was collected by reviewing the available medical documentation and was limited for some patients. Also, there was a variability of timing of 18F-FDG PET/CT assessments based on the clinical course of the disease, clinicians’ decision, and mainly for disease status and response assessment and not focused on the irAE assessment. Prospective trials with larger groups of patients, more regimented 18F-FDG PET/CT imaging and data collection, and prospective irAE grading and tumor response assessment are needed in the future [34]. Despite these limitations, we provided preliminary data on the process of development of an AI-supported platform capable of quantitative assessment of irAE using 18F-FDG PET/CT scans.
Conclusions
Molecular imaging with 18F-FDG PET/CT is a useful tool for detecting and monitoring irAE in melanoma patients receiving ICI. Elevated 18F-FDG uptake in the lung, bowel, and thyroid is indicative of the development of irAE and may precede clinical detection of irAE. Early detection of irAE makes it possible to manage irAEs proactively and before development of moderate or severe clinical symptoms and avoid potentially life-threatening treatment-related complications. In the future, it is important to establish algorithms for implementing quantitative analysis of 18F-FDG PET/CT into daily clinical use.
Funding
This work is supported by the UWCCC (P30 CA014520) and by the National Cancer Institute of the National Institutes of Health under Award Number T32 CA009206. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The research was financially supported by The Slovenian Research Agency (ARRS), grant number P3-0321.
Footnotes
This article is part of the Topical Collection on Advanced Image Analyses (Radiomics and Artificial Intelligence)
Ethics approval The study was approved by the Institutional Review Board Committee of both Institutions (Approval number: 2016–0418 in Madison, USA; ERIDKE-0005/2020 in Ljubljana, Slovenia) and was conducted following the ethical standards defined by the Declaration of Helsinki.
Informed consent The study was conducted with acknowledgement and consent of the subjects. At OIL, patients have signed informed consent for treatment and consent allowing the usage of their data for scientific purposes. At the UWCCC, the study was approved with a waiver of informed consent.
Conflict of interest Robert Jeraj is a co-founder and CSO of AIQ Solutions, Madison, WI, USA. No potential conflicts of interest were disclosed for other co-authors.
References
- 1.Hodi FS, O’Day SJ, McDermott DF, Weber RW, Sosman JA, Haanen JB, et al. Improved survival with ipilimumab in patients with metastatic melanoma. N Engl J Med. 2010;363(8):711–23 https://www.nejm.org/doi/full/10.1056/NEJMoa1003466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hamid O, Robert C, Daud A, Hodi FS, Hwu WJ, et al. Five-year survival outcomes for patients with advanced melanoma treated with pembrolizumab in KEYNOTE-001. Ann Oncol. 2019;30(4):582–8. 10.1093/annonc/mdz011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Robert C, Ribas A, Schachter J, Arance A, Grob JJ, Mortier L, et al. Pembrolizumab versus ipilimumab in advanced melanoma (KEYNOTE-006): post-hoc 5-year results from an open-label, multicentre, randomised, controlled, phase 3 study. Lancet Oncol. 2019;20(9):1239–51. 10.1016/S1470-2045(19)30388-2. [DOI] [PubMed] [Google Scholar]
- 4.Ascierto PA, Long GV, Robert C, Brady B, Dutriaux C, Di Giacomo AM, et al. Survival outcomes in patients with previously untreated BRAF wild-type advanced melanoma treated with nivolumab therapy: three-year follow-up of a randomized phase 3 trial. JAMA Oncol. 2019;5(2):187194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Larkin J, Chiarion-Sileni V, Gonzalez R, Grob JJ, Rutkowski P, Lao CD, et al. Five-year survival with combined nivolumab and ipilimumab in advanced melanoma. N Engl J Med. 2019;381(16):1535–46 https://www.nejm.org/doi/10.1056/NEJMoa1910836. [DOI] [PubMed] [Google Scholar]
- 6.Postow MA, Sidlow R, Hellmann MD. Immune-related adverse events associated with immune checkpoint blockade. N Engl J Med. 2018;378(2):158–68 https://www.nejm.org/doi/10.1056/NEJMra1703481. [DOI] [PubMed] [Google Scholar]
- 7.Magee DE, Hird AE, Klaassen Z, Sridhar SS, Nam RK, Wallis CJD, et al. Adverse event profile for immunotherapy agents compared with chemotherapy in solid organ tumors: a systematic review and meta-analysis of randomized clinical trials. Ann Oncol. 2020;31(1):50–60. 10.1016/j.annonc.2019.10.008. [DOI] [PubMed] [Google Scholar]
- 8.Wang DY, Salem JE, Cohen JV, Chandra S, Menzer C, Ye F, et al. Fatal toxic effects associated with immune checkpoint inhibitors: a systematic review and meta-analysis. JAMA Oncol. 2018;4(12):1721–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Balaji A, Zhang J, Wills B, Marrone KA, Elmariah H, Yarchoan M, et al. Immune-related adverse events requiring hospitalization: spectrum of toxicity, treatment, and outcomes. J Oncol Pract. 2019;15(9):e825–34 https://ascopubs.org/doi/10.1200/JOP.18.00703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Spain L, Diem S, Larkin J. Management of toxicities of immune checkpoint inhibitors. Cancer Treat Rev. 2016;44:51–60. 10.1016/j.ctrv.2016.02.001. [DOI] [PubMed] [Google Scholar]
- 11.Cho SY, Lipson EJ, Im HJ, Rowe SP, Gonzalez EM, Blackford A, et al. Prediction of response to immune checkpoint inhibitor therapy using early-time-point 18F-FDG PET/CT imaging in patients with advanced melanoma. J Nucl Med. 2017;58(9):1421–8. 10.2967/jnumed.116.188839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Jiménez-Requena F, Delgado-Bolton RC, Fernández-Pérez C, Gambhir SS, Schwimmer J, Pérez-Vázquez JM, et al. Meta-analysis of the performance of (18)F-FDG PET in cutaneous melanoma. Eur J Nucl Med Mol Imaging. 2010;37(2):284–300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Dimitrakopoulou-Strauss A Monitoring of patients with metastatic melanoma treated with immune checkpoint inhibitors using PET-CT. Cancer Immunol Immunother. 2019;68(5):813–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tan AC, Emmett L, Lo S, Liu V, Kapoor R, Carlino MS, et al. FDG-PET response and outcome from anti-PD-1 therapy in metastatic melanoma. Ann Oncol. 2018;29(10):2115–20. 10.1093/annonc/mdy330. [DOI] [PubMed] [Google Scholar]
- 15.Aide N, Hicks RJ, Le Tourneau C, Lheureux S, Fanti S, Lopci E. FDG PET/CT for assessing tumour response to immunotherapy: report on the EANM symposium on immune modulation and recent review of the literature. Eur J Nucl Med Mol Imaging. 2019;46(1):238–50. 10.1007/s00259-018-4171-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Nobashi T, Baratto L, Reddy SA, Srinivas S, Toriihara A, Hatami N, et al. Predicting response to immunotherapy by evaluating tumors, lymphoid cell-rich organs, and immune-related adverse events using FDG-PET/CT. Clin Nucl Med. 2019;44(4):e272–9. [DOI] [PubMed] [Google Scholar]
- 17.Eshghi N, Garland LL, Nia E, Betancourt R, Krupinski E, Kuo PH. 18F-FDG PET/CT can predict development of thyroiditis due to immunotherapy for lung cancer. J Nucl Med Technol. 2018;46(3):260–4. 10.2967/jnmt.117.204933. [DOI] [PubMed] [Google Scholar]
- 18.Frelau A, Palard-Novello X, Jali E, Boussemart L, Dupuy A, James P, et al. Increased thyroid uptake on18F-FDG PET/CT is associated with the development of permanent hypothyroidism in stage IV melanoma patients treated with anti-PD-1 antibodies. Cancer Immunol Immunother. 2021;70(3):679–87. 10.1007/s00262-020-02712-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lang N, Dick J, Slynko A, Schulz C, Dimitrakopoulou-Strauss A, Sachpekidis C, et al. Clinical significance of signs of autoimmune colitis in 18F-fluorodeoxyglucose positron emission tomography-computed tomography of 100 stage-IV melanoma patients. Immunotherapy. 2019;11(8):667–76. 10.2217/imt-2018-0146. [DOI] [PubMed] [Google Scholar]
- 20.Wang H, Zhang N, Huo L, Zhang B. Dual-modality multi-atlas segmentation of torso organs from [18F]FDG-PET/CT images. Int J Comput Assist Radiol Surg. 2019;14(3):473–82. 10.1007/s11548-018-1879-3. [DOI] [PubMed] [Google Scholar]
- 21.Taghanaki SA, Zheng Y, Kevin Zhou S, Georgescu B, Sharma P, Xu D, Comaniciu D, et al. Combo loss: Handling input and output imbalance in multi-organ segmentation. Comput Med Imaging Graph. 2019;75:24–33. 10.1016/j.compmedimag.2019.04.005. [DOI] [PubMed] [Google Scholar]
- 22.National Cancer Institute (NCI). NCI Common Terminology Criteria for Adverse Events (CTCAE). Version 5.0. 2021. https://ctep.cancer.gov/protocoldevelopment/electronic_applications/docs/ctcae_v5_quick_reference_8.5×11.pdf (Accessed on May 26, 2021).
- 23.Nelson EK, Piehler B, Eckels J, Rauch A, Bellew M, Hussey P, et al. LabKey Server: an open source platform for scientific data integration, analysis and collaboration. BMC Bioinform. 2011;12:71. 10.1186/1471-2105-12-71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kamnitsas K, Ledig C, Newcombe VFJ, Simpson JP, Kane AD, Menon DK, et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med Image Anal. 2017;36:61–78. 10.1016/j.media.2016.10.004. [DOI] [PubMed] [Google Scholar]
- 25.Tieleman T, Hinton G. Lecture 6.5-rmsprop: divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for. Mach Learn. 2012;4:26–31. [Google Scholar]
- 26.Jimenez-Del-Toro O, Muller H, Krenn M, Gruenberg K, Taha AA, Winterstein M, et al. Cloud-based evaluation of anatomical structure segmentation and landmark detection algorithms: VISCERAL anatomy benchmarks. IEEE Trans Med Imaging. 2016;35(11):2459–75. [DOI] [PubMed] [Google Scholar]
- 27.Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn Reson Imaging. 2012;30(9):1323–41. 10.1016/j.mri.2012.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Huff DT, Ferjancic P, Namías M, Emamekhoo H, Perlman SB, Jeraj R. Image intensity histograms as imaging biomarkers: application to immune-related colitis. Biomed Phys Eng Express. 2021;7(6):065019. 10.1088/2057-1976/ac27c3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Heimann T, van Ginneken B, Styner MA, Arzhaeva Y, Aurich V, Bauer C, et al. Comparison and evaluation of methods for liver segmentation from CT datasets. IEEE Trans Med Imaging. 2009;28(8):1251–65. [DOI] [PubMed] [Google Scholar]
- 30.Schierz JH, Sarikaya I, Wollina U, Unger L, Sarikaya A. Immune checkpoint inhibitor related adverse effects and FDG PET/CT findings. J Nucl Med Technol. 2021:jnmt.121.262151. 10.2967/jnmt.121.262151. [DOI] [PubMed] [Google Scholar]
- 31.Mekki A, Dercle L, Lichtenstein P, Marabelle A, Michot JM, Lambotte O, et al. Detection of immune-related adverse events by medical imaging in patients treated with anti-programmed cell death 1. Eur J Cancer. 2018;96:91–104. 10.1016/j.ejca.2018.03.006. [DOI] [PubMed] [Google Scholar]
- 32.Ghisoni E, Wicky A, Bouchaab H, Imbimbo M, Delyon J, Gautron Moura B, et al. Late-onset and long-lasting immune-related adverse events from immune checkpoint-inhibitors: an overlooked aspect in immunotherapy. Eur J Cancer. 2021;149:153–64. 10.1016/j.ejca.2021.03.010. [DOI] [PubMed] [Google Scholar]
- 33.Dolladille C, Ederhy S, Sassier M, Cautela J, Thuny F, Cohen AA, et al. Immune checkpoint inhibitor rechallenge after immune-related adverse events in patients with cancer. JAMA Oncol. 2020;6(6):865–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ayati N, Sadeghi R, Kiamanesh Z, Lee ST, Zakavi SR, Scott AM. The value of 18F-FDG PET/CT for predicting or monitoring immunotherapy response in patients with metastatic melanoma: a systematic review and meta-analysis. Eur J Nucl Med Mol Imaging. 2021;48(2):428–48 https://link.springer.com/article/10.1007%2Fs00259-020-04967-9. [DOI] [PubMed] [Google Scholar]