Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jan 1.
Published in final edited form as: J Nucl Med. 2012 Jan;53(1):4–11. doi: 10.2967/jnumed.111.093443

Impact of the Definition of Peak Standardized Uptake Value on Quantification of Treatment Response

Matt Vanderhoek 1, Scott B Perlman 2, Robert Jeraj 1
PMCID: PMC3308343  NIHMSID: NIHMS360995  PMID: 22213818

Abstract

PET-based treatment response assessment typically measures the change in maximum standardized uptake value (SUVmax), which is adversely affected by noise. Peak SUV (SUVpeak) has been recommended as a more robust alternative, but its associated region of interest (ROIpeak) is not uniquely defined. We investigated the impact of different ROIpeak definitions on quantification of SUVpeak and tumor response.

Methods

Seventeen patients with solid malignancies were treated with a multitargeted receptor tyrosine kinase inhibitor resulting in a variety of responses. Using the cellular proliferation marker 3′-deoxy-3′-18F-fluorothymidine (18F-FLT), whole-body PET/CT scans were acquired at baseline and during treatment. 18F-FLT–avid lesions (~2/patient) were segmented on PET images, and tumor response was assessed via the relative change in SUVpeak. For each tumor, 24 different SUVpeaks were determined by changing ROIpeak shape (circles vs. spheres), size (7.5–20 mm), and location (centered on SUVmax vs. placed in highest-uptake region), encompassing different definitions from the literature. Within each tumor, variations in the 24 SUVpeaks and tumor responses were measured using coefficient of variation (CV), standardized deviation (SD), and range. For each ROIpeak definition, a population average SUVpeak and tumor response were determined over all tumors.

Results

A substantial variation in both SUVpeak and tumor response resulted from changing the ROIpeak definition. The variable ROIpeak definition led to an intratumor SUVpeak variation ranging from 49% above to 46% below the mean (CV, 17%) and an intratumor SUVpeak response variation ranging from 49% above to 35% below the mean (SD, 9%). The variable ROIpeak definition led to a population average SUVpeak variation ranging from 24% above to 28% below the mean (CV, 14%) and a population average SUVpeak response variation ranging from only 3% above to 3% below the mean (SD, 2%). The size of ROIpeak caused more variation in intratumor response than did the location or shape of ROIpeak. Population average tumor response was independent of size, shape, and location of ROIpeak.

Conclusion

Quantification of individual tumor response using SUVpeak is highly sensitive to the ROIpeak definition, which can significantly affect the use of SUVpeak for assessment of treatment response. Clinical trials are necessary to compare the efficacy of SUVpeak and SUVmax for quantification of response to therapy.

Keywords: PET, SUVpeak, treatment response


PET continues to gain importance as a tool to assess response to therapy. Typically, the change in standardized uptake value (SUV) is measured to quantify treatment response (1). Patients are then classified into different response categories based on the relative change in SUV. These categories include complete response, partial response, stable disease, and progressive disease. Such response classifications are often used to guide subsequent treatment decisions and can be predictive of clinical out-come (2-4).

Most response assessment studies measure the change in maximum SUV (SUVmax), a single-pixel value that is adversely affected by noise (5-8), which leads to uncertainty in the quantification of treatment response. Consequently, peak SUV (SUVpeak) has been suggested as a more robust alternative (9), defined as the average SUV within a small, fixed-size region of interest (ROIpeak) centered on a high-uptake part of the tumor (9). SUVpeak is illustrated in Figure 1. Because of its larger volume, SUVpeak is less affected by image noise than SUVmax (6,7,10) and therefore is expected to reduce uncertainties in the quantification of response to therapy.

FIGURE 1.

FIGURE 1

Multiple definitions of ROIpeak. In this schematic PET image of radiotracer uptake in tumor (purple outline), ROIpeaks of different sizes and in different locations are shown. Shape of ROIpeak can vary too (only circles are shown).

There is a wide variety of SUVpeak definitions in the literature, and they differ in the shape, size, and location of the ROIpeak (Fig. 1). Shapes and sizes include square and cuboidal regions with side lengths ranging from 7 to 15 mm (5,8,11-13), as well as circular, cylindric, and spheric regions with diameters ranging from 9 to 17 mm (6,7,9,14-16). Locations include the tumor region with the highest radiotracer uptake, the tumor region yielding the greatest SUVpeak, and the tumor region containing the voxel of maximum uptake.

The definition of SUVpeak could significantly affect the quantification of treatment response. Altering the size, shape, or location of ROIpeak may affect the relative change in SUVpeak and ultimately the classification of response. Uncertainties in the quantification of response could have significant implications regarding treatment decisions and clinical prognoses. Furthermore, these uncertainties could influence the recommendation to use SUVpeak rather than SUVmax for response assessment. Consequently, we investigated the impact of different ROIpeak definitions on the quantification of SUVpeak and tumor response to therapy.

MATERIALS AND METHODS

Treatment and Imaging

Seventeen patients with advanced solid malignancies were treated with a multitargeted receptor tyrosine kinase inhibitor with antiproliferative and antiangiogenic effects. Malignancies included a diverse range of tumor types: renal cell carcinoma (n = 7), esophagus (n = 2), hepatocellular (n = 2), prostate (n = 1), sarcoma (n = 1), small cell lung (n = 2), thymus (n = 1), and uterine carcinosarcoma (n = 1). Response to therapy was measured using the PET radiotracer 3′-deoxy-3′-18F-fluorothymidine (18F-FLT). As a surrogate of cellular proliferation, 18F-FLT is emerging as a promising candidate for chemotherapy response assessment as demonstrated in patients with lymphoma, breast cancer, and glioma (17-23). Patients were injected intravenously with 240 MBq (6.5 mCi) of 18F-FLT and underwent whole-body PET/CT at baseline (pretreatment) and during treatment using a Discovery LS PET/CT scanner (GE Healthcare). 18F-FLT was synthesized following the method described by Martin et al. with slight modifications (24). PET/CT began 47 ± 4 min after injection and extended inferiorly from the base of the skull to the distal femora. Acquisition time was 10 min per bed position. PET images were reconstructed on a 128 × 128 grid over a 50-cm field of view using the ordered-subset expectation maximization algorithm with 2 iterations, 28 subsets, a 5-mm gaussian loop (interiteration) filter, a 3-mm gaussian postprocessing filter, and CT attenuation correction. On average, patient weight changed only 1.5% between the 2 PET scans.

The study protocol was approved by the University of Wisconsin Health Sciences Institutional Review Board, the Scientific Review Board of the University of Wisconsin Carbone Comprehensive Cancer Center, and the University of Wisconsin Radiation Drug Research Committee. Written informed consent was obtained from each patient before enrollment in the study.

Quantification of SUVpeak and Tumor Response

PET activity concentrations (MBq/cm3) were converted to standardized uptake values by dividing by the injected activity per patient mass. 18F-FLT-avid lesions (~2/patient) were segmented on PET images by an experienced nuclear medicine physician. The location and number of lesions were as follows: lung, 14; mediastinum, 5; liver, 4; abdomen, 3; adrenal, 2; gastrointestinal tract, 2; pelvis, 1; gluteus, 1; uterus, 1; arm, 1; bone, 1. Tumor volumes ranged from 1 cm3 to 530 cm3, with an average of 66 cm3.

For each tumor, 24 different SUVpeaks were determined by changing the region of interest (ROIpeak) used to measure SUVpeak (Fig. 2). The shape, size, and location of ROIpeak were varied as follows: for shape, circular (2-dimensional) or spheric (3-dimensional) ROIs were used; for size, ROI diameters of 7.5, 10, 12.5, 15, 17.5, or 20 mm, encompassing the range of fixed ROI lengths in the literature, were used; for location, the ROI was centered on SUVmax or was placed in the highest-uptake region. An example ROI is shown in Supplemental Figure 1, where a 12.5-mm-diameter circular ROI was placed in the highest-uptake region of a lung lesion (supplemental materials are available online only at http://jnm.snmjournals.org).

FIGURE 2.

FIGURE 2

Varying ROIpeak. Shape, location, and size of ROIpeak were varied as shown to yield 24 different SUVpeaks for each tumor.

SUVpeak was determined automatically. First, an ROI (circular or spheric) was centered on each tumor voxel and the average SUV within the ROI was determined by weighting each voxel uptake value by the percentage of its volume contained within the ROI. The ROI location yielding the greatest average SUV was defined as the highest-uptake region of the tumor (Supplemental Fig. 1). In addition, an ROI (circular or spheric) was centered on SUVmax and the average SUV within the ROI was determined.

For each tumor, the 24 SUVpeaks were normalized to the mean intratumor SUVpeak (Eq. 1) and their variation was measured using the coefficient of variation (CV) and range.

SUVpeakn¯=i=1i=24(SUVpeakn)i24 Eq. 1

Here, SUVpeakn¯ is the mean intratumor SUVpeak for an individual tumor n, and (SUVpeakn)i corresponds to an SUVpeak determined using 1 of the 24 ROIpeak definitions (the ith definition, e.g., 12.5-mm-diameter spheric ROI placed in the highest-uptake region).

For each ROIpeak definition, a population average SUVpeak (Eq. 2) was determined over all tumors.

(SUVpeak)i¯=n=1N(SUVpeakn)iN. Eq. 2

Here, (SUVpeak)i¯ SUVpeak is the population average SUVpeak using the ith ROIpeak definition, n is an individual tumor, N is the total number of tumors, and (SUVpeakn)i corresponds to an SUVpeak of an individual tumor determined using 1 of the 24 ROIpeak definitions (the ith definition). The variation of the 24 population average SUVpeak was measured using coefficient of variation and range.

Tumor response (R) during treatment was defined as the relative change in SUVpeak normalized to the baseline SUVpeak (Eq. 3).

Rin=(SUVpeakn,duringtx)i(SUVpeakn,baseline)i(SUVpeakn,baseline)i×100%. Eq. 3

Here, Rin is the response of an individual tumor (n) using 1 of the 24 ROIpeak definition (the ith definition).

For each tumor, the 24 different SUVpeaks gave rise to 24 different responses whose variation was measured using the standardized deviation (SD) and range. For each ROIpeak definition, a population average response (Eq. 4) was determined over all tumors.

Ri¯=n=1NRinN. Eq. 4

Here, Ri¯ is the population average response using the ith ROIpeak definition, n is an individual tumor, and N is the total number of tumors. The variation of the 24 population average responses was measured using SD and range.

Tumor response was also determined using SUVmax for comparison with response measured using SUVpeak. SUVmax can be considered as a special case of SUVpeak in the limit of a very small ROIpeak (single-voxel ROIpeak). For SUVmax, an equivalent diameter of 5 mm was derived by calculating the diameter of a sphere whose volume equaled the volume of the single voxel (65 mm3) represented by SUVmax.

One-way ANOVA was used to test whether the ROIpeak definition resulted in statistically significant differences in SUVpeak and tumor response. The Levene test for equal variance was used, and means were compared with the Bonferroni test. Differences were considered statistically significant at an a-level of less than 0.05/24. Correlations between the variation in SUVpeak and tumor response and other tumor characteristics were tested using the Pearson correlation coefficient (r) and considered statistically significant at an a-level of less than 0.05.

RESULTS

Individual Tumors

SUVpeak

Within individual tumors, considerable variation in SUVpeak resulted from changing the ROIpeak definition. The variable ROIpeak definition led to an intratumor SUVpeak variation ranging from 49% above to 46% below the mean, resulting in a 17% CV. These intratumor variations in SUVpeak are highlighted for a retroperitoneal lesion in Supplemental Figure 2 and for all lesions in Supplemental Figure 3.

The size of ROIpeak caused more variation in intratumor SUVpeak than did the location or shape of ROIpeak (Supplemental Fig. 3). Within individual tumors, varying ROIpeak diameter resulted on average in a 14% CV associated with SUVpeak, compared with a CV of 9% and 5% when the location or shape, respectively, of ROIpeak was varied. In general, intratumor SUVpeak tended to decrease, but its variation tended to increase as the size of ROIpeak increased (Supplemental Fig. 2).

There was no significant correlation between tumor size and the variation in intratumor SUVpeak (Supplemental Fig. 3, tumors ordered by size). Furthermore, there was no significant correlation between intratumor uptake heterogeneity (measured by CV of tumor uptake) and the variation in intratumor SUVpeak.

Tumor Response

Within individual tumors, a substantial variation in tumor response resulted from changing the ROIpeak definition. Intratumor response ranged from 49% above to 35% below the mean, resulting in a 9% SD. These intratumor variations in response are highlighted for a retroperitoneal lesion in Figure 3 and for all lesions in Figure 4. Responses determined using SUVmax were within the range of responses quantified with SUVpeak in almost 70% of all tumors (Fig. 4).

FIGURE 3.

FIGURE 3

Variation in tumor response for retroperitoneal lesion (lesion 11). (Left) 24 different SUVpeak tumor responses (mean response, dashed line) arising from 24 different ROIpeak definitions. Response was ambiguously classified as either stable disease (above −30%, green line) or partial response (below −30%). SUVmax response is also shown. (Right) Box represents SD, whiskers show range, and solid line depicts median of SUVpeak response.

FIGURE 4.

FIGURE 4

Intratumor variation in SUVpeak responses. (Left) Responses of 31 tumors (ordered from smallest to largest) arising from different ROIpeak definitions. Responses of 12 tumors (in blue) were ambiguously classified as either progressive disease/stable disease (130%, solid red line) or stable disease/partial response (-30%, solid green line). SUVmax response and SUVpeak PERCIST response (1.25-cm-diameter sphere in highest-uptake region) are also shown. (Right) Overall variation in intratumor response and variation associated with changing size, location, and shape of ROIpeak. Boxes represent SD, whiskers show range, and solid line depicts median of response values.

Variation in intratumor response resulted in the ambiguous classification of individual tumors into multiple response categories (Table 1; Figs. 3 and 4). Different response thresholds were applied to classify tumors into response categories (e.g., +30% and −30% for progressive disease/stable disease and stable disease/partial response thresholds, respectively, as recommended by PET Response Criteria in Solid Tumors [PERCIST]). When response thresholds of ±20%, ±30%, and ±40% were applied, 55%, 42%, and 32%, respectively, of all tumors suffered from an ambiguous response classification. In addition, response classifications using SUVpeak and SUVmax were compared (Table 1; Figs. 3 and 4).

TABLE 1.

Tumor Response Classification Using Different Response Thresholds

Response
thresholds
Response
metric
Total
tumors
Partial-response
tumors
Stable-disease
tumors
Progressive-
disease
tumors
Partial-
response/
stable-disease
tumors
Stable-disease/
progressive-
disease
tumors
Partial-response/
stable-disease/
progressive-disease
tumors
Ambiguous
response
(%)
±20% SUVpeak 31 12 1 1 10 6 1 55
SUVmax 31 18 9 4 NA NA NA NA

±30% SUVpeak 31 9 8 1 9 4 0 42
SUVmax 31 13 15 3 NA NA NA NA

±40% SUVpeak 31 7 14 0 7 3 0 32
SUVmax 31 9 20 2 NA NA NA NA

NA = not applicable.

The size, location, and shape of ROIpeak all caused similar variations in intratumor response (Fig. 4). Within individual tumors, varying ROIpeak size, location, and shape resulted on average in respective SDs of 5%, 7%, and 5% associated with tumor response.

In general, the magnitude of intratumor response was independent of the size of ROIpeak. However, the variation in intratumor response tended to increase as the size of ROIpeak increased (Fig. 3). The magnitude and variation of intratumor response were independent of ROIpeak shape and location.

A strong correlation was exhibited between tumor response (average of all 24 SUVpeak responses for each tumor) and variation in intratumor response (r = 0.81, P < 0.001, Figs. 4-6). Variation in intratumor response tended to increase as response increased (i.e., as response worsened from partial response to stable disease to progressive disease). There was no significant correlation between tumor size and variation in intratumor response (Fig. 4, tumors ordered by size). Furthermore, there was no significant correlation between intratumor uptake heterogeneity (measured by CV of tumor uptake) and variation in intratumor response.

FIGURE 6.

FIGURE 6

Small vs. large variation in intratumor SUVpeak response. 18F-FLT PET/CT images of periaortic lesion (left, lesion 24) and pelvic tumor (right, lesion 30) at baseline (top) and during treatment (bottom). Lesions are indicated by white circles. Periaortic lesion responded well, exhibiting fairly uniform reduction of 18F-FLT uptake in higher-uptake regions. Consequently, there was little variation in SUVpeak response. In contrast, pelvic tumor responded poorly, with heterogeneous response in higher-uptake regions, resulting in large variation in SUVpeak response.

Population Average

SUVpeak

Quantification of the population average SUVpeak was substantially affected by changing the ROIpeak definition. For different ROIpeak definitions, the population average SUVpeak ranged from 24% above to 28% below the mean, resulting in a 14% CV (Supplemental Fig. 4). Differences in SUVpeak (associated with the ROIpeak definitions) between the populations were statistically significant (P < 0.001).

The size of ROIpeak caused more variation in the population average SUVpeak than did the location or shape of ROIpeak (Supplemental Fig. 4). Varying ROIpeak diameter resulted in a 13% CV associated with the population average SUVpeak, compared with a CV of 7% and 4%, respectively, when ROIpeak location or shape was varied. Trends observed in the population average SUVpeak reflected trends associated with intratumor SUVpeak. As the size of ROIpeak increased, the population average SUVpeak tended to decrease and its variation increased (Supplemental Fig. 4).

Tumor Response

Tumor response during treatment averaged −21% but ranged as high as +116% and as low as 280% (Fig. 4). However, population average tumor response was not significantly affected by changing the ROIpeak definition. For different ROIpeak definitions, the population average tumor response ranged from only 3% above to 3% below the mean, resulting in a 2% SD (Fig. 7). Differences in response (associated with the ROIpeak definitions) between the populations were not statistically significant (P = 1.00).

FIGURE 7.

FIGURE 7

Variation in average tumor response in population. (Left and middle) For each ROIpeak definition, tumor response was averaged over all tumors, resulting in population average response (mean response, dashed line). SUVmax response is also shown. (Right) Overall variation in population average response and variation associated with changing size, location, and shape of ROIpeak. Boxes represent SD, whiskers show range, and solid line depicts median. Variation in population average response is extremely small (~2%).

Size, location, and shape of ROIpeak all caused minimal variations in population average tumor response (Fig. 7), as all SDs were less than 2%. The magnitude and variation of population average tumor response were independent of the size, shape, and location of ROIpeak (Fig. 7).

Tumor Subgroup Analysis

Variation in SUVpeak and tumor response was determined using all 35 tumors assessed in this study. In addition, the results were recalculated on 2 different tumor subgroups. The first subgroup, in which lesions were in regions without significant background activity (n = 23), was studied in order to reduce the chance that elevated background activity was incorrectly included in ROIpeak. Consequently, abdominal, hepatic, renal, and bone lesions were excluded from this group. The second subgroup was one in which lesions were larger than 20 mL (n = 12), which is approximately 5 times larger than the largest ROIpeak (4.2 mL, 20-mm diameter). Studying this subgroup ensured that ROIpeak was completely inside the tumor boundaries and that no background activity was incorrectly included in ROIpeak. Results for the 2 tumor subgroups were almost identical to results determined using all tumors (Table 2).

TABLE 2.

Comparison of All Tumors with Different Tumor Subgroups

Individual tumors
Population average
SUVpeak
Response
SUVpeak
Response
Tumor group n Range Average CV Range Average SD Range CV Range SD
All tumors 35 −46% to +49% 17% −35% to +49% 9% −28% to +24% 14% −3% to +3% 2%

Tumors in regions
 without significant
 background activity*
23 −46% to +49% 17% −35% to +49% 9% −29% to +26% 15% −4% to 13% 2%

Tumors > 20 cm3 12 −43% to +49% 16% −18% to +36% 8% −27% to +22% 14% −4% to +7% 3%
*

Tumors in abdominal, hepatic, renal, and bone regions were excluded because normal, elevated 18F-FLT uptake was present in these areas.

DISCUSSION

Individual Tumor Response Versus Population Average Response

The region of interest used to determine SUVpeak can have a profound effect on its quantification and on the response of individual tumors. On average, different ROIpeak definitions resulted in intratumor variations of approximately 17% and 9% for SUVpeak and tumor response, respectively, and these variations ranged as high as 50%. This degree of variation can lead to different categorizations of response (Figs. 3 and 4) using criteria such as PERCIST (9). With PERCIST, such ambiguous response categorizations arose in over 40% of the tumor responses assessed in this study (Fig. 4; Table 1). Ambiguous response categorization of tumors increased with narrower response criteria (e.g., ±20%) but was reduced using broader response criteria (e.g., ±40%), similar to that of the MUNICON phase II trial (Table 1) (23). The sensitivity of response quantification to the ROIpeak definition reveals the need to optimize PET metrics (such as SUVpeak) for quantitative response assessment in individual patients. An ambiguous response classification underscores the necessity for a unique, consistent, standard region of interest with associated criteria that can accurately assess response.

Unlike individual tumor responses, population average response was relatively insensitive to the definition of ROIpeak used to measure response (Fig. 7), as is consistent with the findings of Krak et al. (6). The small variation (only 2%) in population average response occurred because the magnitude of individual tumor responses was independent of the ROIpeak definition. Therefore, because of an averaging effect, variation was reduced when determining population average response and might be reduced even further as more tumors are included in the population average. This robustness of population average response points to the strength of PET for accurate quantification of the average response to therapy.

Effects of Different Factors on Variation in Intratumor Response

The variation in intratumor response correlated strongly with tumor response (Fig. 5). Tumors that responded well (i.e., partial response, tumor response < −30%) exhibited significantly less variation in intratumor SUVpeak response than did tumors that responded poorly (i.e., stable disease or progressive disease, tumor response > −30%). Well-responding tumors seemed to exhibit a response more uniform than the heterogeneous response of poorly responding tumors (Fig. 6). Thus, SUVpeak–based response was considerably more sensitive to the ROIpeak definition for poorly responding tumors than for well-responding tumors.

FIGURE 5.

FIGURE 5

Variation in intratumor response vs. response. Variation in intratumor response tended to increase as response increased (i.e., as response worsened from partial response to stable disease to progressive disease).

Surprisingly, neither tumor size nor tumor uptake heterogeneity had a significant effect on the variation in either intratumor response or SUVpeak. This finding suggests that the characteristics (size, heterogeneity, etc.) of only the high-uptake regions encompassed by ROIpeak, not those of the entire tumor, directly affect the variation in tumor response and SUVpeak. Though not investigated, partial-volume effects tend to reduce uptake heterogeneity and therefore are expected to reduce the variation in both SUVpeak and response. Thus, a greater variation in both SUVpeak and response should result from partial-volume correction of the PET data.

The variable ROIpeak definition led to a variation in intratumor response that was about half that of SUVpeak. Tumor response was determined via normalization by baseline SUVpeak, in effect canceling out some of the variation in SUVpeak, which may explain the reduced variation in intratumor response. Most of the variation in SUVpeak was due to the size of ROIpeak, as is consistent with the findings of Boellaard et al. (5). As expected, variation in both intratumor response and SUVpeak increased as the size of ROIpeak increased.

For each ROIpeak definition, population average SUVpeak preserved the trends caused by the size, shape, and location of ROIpeak. Consequently, the variation in population average SUVpeak was approximately equal to the variation in intratumor SUVpeak. This result is in contrast to tumor response, in which the variation in intratumor response for different ROIpeak definitions (9%) was much larger than the variation in population average response (2%). For tumor response, there were no significant trends caused by size, shape, or location of ROIpeak, resulting in very little variation in population average response due to an averaging effect.

The wide variation in both intratumor response and SUVpeak stemmed from changes to the size, shape, and location of ROIpeak, reflecting the range of different ROIpeak definitions found in the literature. Therefore, a wide variation in intratumor response is expected under normal, realistic conditions. It is likely that an even greater variation would occur because of errors during image analysis for response assessment. For example, improper localization of ROIpeak in an average- or low-uptake region of a tumor at baseline would result in a measured tumor response that is artificially large, leading to a more extreme variation in intratumor response.

18F-FLT, rather than 18F-FDG, was selected as a radio-tracer in this study because of the antiproliferative nature of the molecular targeted therapy. Furthermore, 18F-FLT may be more effective for assessment of treatment response than is 18F-FDG (21,25,26). However, imaging of tumors using both 18F-FLT and 18F-FDG has revealed a somewhat higher SUV and broader SUV range with 18F-FDG than with 18F-FLT (17,27,28). Thus, compared with 18F-FLT, 18F-FDG is expected to result in a greater variation in both SUVpeak and tumor response due to different ROIpeaks.

SUVpeak was determined using body weight (SUVpeakBW) and not lean body mass (SUVpeakLBM, as recommended by PERCIST). However, on average, patient weight changed only 1.5% between the 2 PET scans, and this weight change would result in an approximate difference of only 0.6% between response determined using SUVpeakBW and SUVpeakLBM. Consequently, in this study, approximately the same variation in SUVpeak and tumor response is expected using either SUVpeakBW or SUVpeakLBM.

Implications for Treatment Response Assessment

Currently, most response assessment studies use SUVmax, although recently SUVpeak has been recommended as a more robust alternative (9). Patient-specific response quantification is subject to significant uncertainty because of the different ROIpeak definitions, and therefore SUVpeak requires further study to optimize its use for quantification of response in individual patients. Though stemming from different causes, the uncertainties associated with SUVpeak and SUVmax are comparable (6). Moreover, the noise uncertainty associated with SUVmax continues to be reduced because of the increased counts with 3-dimensional PET acquisition, the current standard on most scanners. A correlation between SUVmax and SUVpeak responses has been demonstrated (6,29), and in this study, SUVmax response was within the range of responses quantified with SUVpeak in almost 70% of all tumors (Fig. 4). Nevertheless, despite this correlation, there can be substantial differences between SUVmax and SUVpeak responses in individual tumors. For example, response quantification using the PERCIST-recommended SUVpeak (1.25-cm-diameter sphere in highest-uptake region) was 45% smaller than that of SUVmax in tumor 9 (Fig. 4), resulting in different response categorizations. Such differences underscore the need to establish the relative predictive power of SUVpeak versus SUVmax for response assessment. Consequently, the recent recommendation in favor of SUVpeak over SUVmax should be approached with caution (9). It must first be determined whether SUVpeak or SUVmax is best suited for treatment response assessment.

Clinical trials are necessary to establish the superiority of SUVpeak or SUVmax for quantification of response to therapy. These trials should investigate the sensitivities of SUVpeak and SUVmax to a variety of factors, including image noise, scan acquisition and image reconstruction parameters, partial-volume effects, tumor motion, and others. Furthermore, the clinical utility of either SUVpeak or SUVmax for response quantification will strongly depend on its correlation with patients’ clinical outcomes. Ultimately, the most robust and predictive SUV measure should be selected for quantification of treatment response.

It is probably not feasible to compare all definitions of SUVpeak with SUVmax, within the context of a larger clinical trial. Rather, a standard ROIpeak should be carefully selected to determine SUVpeak. ROIpeak should be large enough to prevent SUVpeak from suffering from noise, partial-volume effects, and other sensitivities that plague SUVmax. However, ROIpeak should not be so large that it includes substantial uptake heterogeneity and voxels that lie outside the tumor. These considerations lend support to the 1.2-cm-diameter sphere recommended by PERCIST as a standard definition of ROIpeak (for 2-cm or larger diameter tumors). This size is in the middle of the range of ROIpeak definitions found in the literature.

Identification of a suitable SUV measure for response quantification requires clinical trials. After these trials, thresholds for the different response categories (complete response, partial response, stable disease, and progressive disease) can be established using population average response data in which the uncertainties are small. Unique thresholds may be established for specific diseases and their associated therapies. The size of the thresholds will need to exceed the overall uncertainty associated with the selected SUV measure (SUVpeak or SUVmax). Subsequently, the SUV measure could be quantified in individual patients to gauge their response to therapy using the established response thresholds as a guide.

CONCLUSION

Quantification of individual tumor response with SUVpeak is sensitive to the region of interest used to determine SUVpeak. Changes to the size, shape, and location of ROIpeak result in substantial variation (≤50%) in both SUVpeak and tumor response for individual tumors. These considerable uncertainties in SUVpeak and tumor response call into question recommendations favoring SUVpeak over SUVmax for quantification of treatment response. Clinical trials are necessary to compare the efficacy of SUVpeak and SUVmax for quantification of response to therapy.

Supplementary Material

Supplementary Data

ACKNOWLEDGMENTS

This work was financially supported by NIH grant R01 CA136927. We acknowledge the PET technologists Chris Jaskowiak and Mark McNall for scanning patients after hours as well as the University of Wisconsin Cyclotron Research Group for producing the 18F-FLT used in the study. No other potential conflict of interest relevant to this article was reported.

Footnotes

DISCLOSURE STATEMENT

The costs of publication of this article were defrayed in part by the payment of page charges. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

REFERENCES

  • 1.Weber WA. PET for response assessment in oncology: radiotherapy and chemotherapy. Br J Radiol. 2005;28(suppl):42–49. [Google Scholar]
  • 2.Hawkins DS, Schuetze SM, Butrynski JE, et al. [18F]fluorodeoxyglucose positron emission tomography predicts outcome for Ewing sarcoma family of tumors. J Clin Oncol. 2005;23:8828–8834. doi: 10.1200/JCO.2005.01.7079. [DOI] [PubMed] [Google Scholar]
  • 3.Ott K, Weber WA, Lordick F, et al. Metabolic imaging predicts response, survival, and recurrence in adenocarcinomas of the esophagogastric junction. J Clin Oncol. 2006;24:4692–4698. doi: 10.1200/JCO.2006.06.7801. [DOI] [PubMed] [Google Scholar]
  • 4.Stroobants S, Goeminne J, Seegers M, et al. 18FDG-positron emission tomography for the early prediction of response in advanced soft tissue sarcoma treated with imatinib mesylate (Glivec) Eur J Cancer. 2003;39:2012–2020. doi: 10.1016/s0959-8049(03)00073-x. [DOI] [PubMed] [Google Scholar]
  • 5.Boellaard R, Krak NC, Hoekstra OS, Lammertsma AA. Effects of noise, image resolution, and ROI definition on the accuracy of standard uptake values: a simulation study. J Nucl Med. 2004;45:1519–1527. [PubMed] [Google Scholar]
  • 6.Krak NC, Boellaard R, Hoekstra OS, Twisk JW, Hoekstra CJ, Lammertsma AA. Effects of ROI definition and reconstruction method on quantitative outcome and applicability in a response monitoring trial. Eur J Nucl Med Mol Imaging. 2005;32:294–301. doi: 10.1007/s00259-004-1566-1. [DOI] [PubMed] [Google Scholar]
  • 7.Nahmias C, Wahl LM. Reproducibility of standardized uptake value measurements determined by 18F-FDG PET in malignant tumors. J Nucl Med. 2008;49:1804–1808. doi: 10.2967/jnumed.108.054239. [DOI] [PubMed] [Google Scholar]
  • 8.Nakamoto Y, Zasadny KR, Minn H, Wahl RL. Reproducibility of common semi-quantitative parameters for evaluating lung cancer glucose metabolism with positron emission tomography using 2-deoxy-2-[18F]fluoro-D-glucose. Mol Imging Biol. 2002;4:171–178. doi: 10.1016/s1536-1632(01)00004-x. [DOI] [PubMed] [Google Scholar]
  • 9.Wahl RL, Jacene H, Kasamon Y, Lodge MA. From RECIST to PERCIST: evolving considerations for PET response criteria in solid tumors. J Nucl Med. 2009;50(suppl 1):122S–150S. doi: 10.2967/jnumed.108.057307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Westerterp M, Pruim J, Oyen W, et al. Quantification of FDG PET studies using standardised uptake values in multi-centre trials: effects of image reconstruction, resolution and ROI definition parameters. Eur J Nucl Med Mol Imaging. 2007;34:392–404. doi: 10.1007/s00259-006-0224-1. [DOI] [PubMed] [Google Scholar]
  • 11.Brun E, Kjellen E, Tennvall J, et al. FDG PET studies during treatment: preiction of therapy outcome in head and neck squamous cell carcinoma. Head Neck. 2002;24:127–135. doi: 10.1002/hed.10037. [DOI] [PubMed] [Google Scholar]
  • 12.Minn H, Zasadny KR, Quint LE, Wahl RL. Lung cancer: reproducibility of quantitative measurements for evaluating 2-[F-18]-fluoro-2-deoxy-D-glucose uptake at PET. Radiology. 1995;196:167–173. doi: 10.1148/radiology.196.1.7784562. [DOI] [PubMed] [Google Scholar]
  • 13.Wahl RL, Zasadny K, Helvie M, Hutchins GD, Weber B, Cody R. Metabolic monitoring of breast cancer chemohormonotherapy using positron emission tomography: initial evaluation. J Clin Oncol. 1993;11:2101–2111. doi: 10.1200/JCO.1993.11.11.2101. [DOI] [PubMed] [Google Scholar]
  • 14.Choi NC, Fischman AJ, Niemierko A, et al. Dose-response relationship between probability of pathologic tumor control and glucose metabolic rate measured with FDG PET after preoperative chemoradiotherapy in locally advanced non-small-cell lung cancer. Int J Radiat Oncol Biol Phys. 2002;54:1024–1035. doi: 10.1016/s0360-3016(02)03038-9. [DOI] [PubMed] [Google Scholar]
  • 15.Römer W, Hanauske AR, Ziegler S, et al. Positron emission tomography in non-Hodgkin’s lymphoma: assessment of chemotherapy with fluorodeoxyglucose. Blood. 1998;91:4464–4471. [PubMed] [Google Scholar]
  • 16.Schelling M, Avril N, Nahrig J, et al. Positron emission tomography using [18F] fluorodeoxyglucose for monitoring primary chemotherapy in breast cancer. J Clin Oncol. 2000;18:1689–1695. doi: 10.1200/JCO.2000.18.8.1689. [DOI] [PubMed] [Google Scholar]
  • 17.Buck AK, Halter G, Schirrmeister H, et al. Imaging proliferation in lung tumors with PET: 18F-FLT versus 18F-FDG. J Nucl Med. 2003;44:1426–1431. [PubMed] [Google Scholar]
  • 18.Chen W, Delaloye S, Silverman DH, et al. Predicting treatment response of malignant gliomas to bevacizumab and irinotecan by imaging proliferation with [18F] fluorothymidine positron emission tomography: a pilot study. J Clin Oncol. 2007;25:4714–4721. doi: 10.1200/JCO.2006.10.5825. [DOI] [PubMed] [Google Scholar]
  • 19.Herrmann K, Wieder HA, Buck AK, et al. Early response assessment using 39-deoxy-39-[18F]fluorothymidine-positron emission tomography in high-grade non-Hodgkin’s lymphoma. Clin Cancer Res. 2007;13:3552–3558. doi: 10.1158/1078-0432.CCR-06-3025. [DOI] [PubMed] [Google Scholar]
  • 20.Kenny L, Coombes RC, Vigushin DM, Al-Nahhas A, Shousha S, Aboagye EO. Imaging early changes in proliferation at 1 week post chemotherapy: a pilot study in breast cancer patients with 39-deoxy-39-[18F]fluorothymidine positron emission tomography. Eur J Nucl Med Mol Imaging. 2007;34:1339–1347. doi: 10.1007/s00259-007-0379-4. [DOI] [PubMed] [Google Scholar]
  • 21.Pio BS, Park CK, Pietras R, et al. Usefulness of 39-[F-18]fluoro-39-deoxythymidine with positron emission tomography in predicting breast cancer response to therapy. Mol Imaging Biol. 2006;8:36–42. doi: 10.1007/s11307-005-0029-9. [DOI] [PubMed] [Google Scholar]
  • 22.Buck AK, Kratochwil C, Glatting G, et al. Early assessment of therapy response in malignant lymphoma with the thymidine analogue [18F]FLT. Eur J Nucl Med Mol Imaging. 2007;34:1775–1782. doi: 10.1007/s00259-007-0452-z. [DOI] [PubMed] [Google Scholar]
  • 23.Lordick F, Ott K, Krause BJ, et al. PET to assess early metabolic response and to guide treatment of adenocarcinoma of the oesophagogastric junction: the MUNICON phase II trial. Lancet Oncol. 2007;8:797–805. doi: 10.1016/S1470-2045(07)70244-9. [DOI] [PubMed] [Google Scholar]
  • 24.Martin SJ, Eisenbarth JA, Wagner-Utermann U, et al. A new precursor for the radiosynthesis of [18F]FLT. Nucl Med Biol. 2002;29:263–273. doi: 10.1016/s0969-8051(01)00289-x. [DOI] [PubMed] [Google Scholar]
  • 25.Dittmann H, Dohmen BM, Kehlbach R, et al. Early changes in [18F]FLT uptake after chemotherapy: an experimental study. Eur J Nucl Med Mol Imaging. 2002;29:1462–1469. doi: 10.1007/s00259-002-0925-z. [DOI] [PubMed] [Google Scholar]
  • 26.Been LB, Suurmeijer AJ, Cobben DC, Jager PL, Hoekstra HJ, Elsinga PH. [18F] FLT-PET in oncology: current status and opportunities. Eur J Nucl Med Mol Imaging. 2004;31:1659–1672. doi: 10.1007/s00259-004-1687-6. [DOI] [PubMed] [Google Scholar]
  • 27.Yap CS, Czernin J, Fishbein MC, et al. Evaluation of thoracic tumors with 18F-fluorothymidine and 18F-fluorodeoxyglucose-positron emission tomography. Chest. 2006;129:393–401. doi: 10.1378/chest.129.2.393. [DOI] [PubMed] [Google Scholar]
  • 28.Kasper B, Egerer G, Gronkowski M, et al. Functional diagnosis of residual lymphomas after radiochemotherapy with positron emission tomography comparing FDG- and FLT-PET. Leuk Lymphoma. 2007;48:746–753. doi: 10.1080/10428190601113568. [DOI] [PubMed] [Google Scholar]
  • 29.Yap J, Locascio T, Tanaka Y, Syrkin L, Van Den Abbeele A. Impact of variations in SUV methods for assessing cancer response using FDG-PET [abstract] J Nucl Med. 2011;52(suppl 1):392P. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

RESOURCES