Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Jan 21.
Published in final edited form as: Clin Cancer Res. 2010 Jul 19;16(18):4493–4495. doi: 10.1158/1078-0432.CCR-10-1548

Volumes to learn: advancing therapeutics with innovative CT image data analysis

Michael L Maitland 1
PMCID: PMC3263311  NIHMSID: NIHMS220673  PMID: 20643780

Summary

Semi-automated methods for calculating tumor volumes from computed tomography images are a new tool for advancing the development of cancer therapeutics. Volumetric measurements, relying on already widely available standard clinical imaging techniques could shorten the observation intervals needed to identify cohorts of patients sensitive or resistant to treatment.


In this issue of Clinical Cancer Research, Zhao, et al.(1) demonstrate that calculating tumor volumes from computed tomography (CT) images could help developers of new cancer therapeutics to learn early which patients have disease that responds to the treatment. This paper culminates 3 years of publications that suggest effective cancer therapeutics development could be accelerated if we use the CT imaging data routinely collected in clinical care more efficiently and intelligently.

CT imaging measurements are the basis for the assessment of new anticancer agents using the Response Evaluation Criteria in Solid Tumors (RECIST) in clinical trials(2). To maximize the inter-rater reliability of treatment response evaluations, RECIST categorizes the patient as having: complete response, partial response, stable disease, or disease progression. A standardized categorization of disease assessment was important for advancing therapeutics decades ago when disease response evaluations depended on physical exam and plain film radiography, but in the current era of digital imaging has lost its utility.

Few RECIST users are aware of the original study on which some reference cut-points in the categorization are based(3). Moertel and Hanley asked each enrolled oncologist to use “the usual technique and equipment (ruler or caliper) he employed in clinical practice” to measure the sizes of 12 …solid spheres ranging from 1.8 to 14.5 cm in diameter. “These [simulated] masses were … arranged…on a soft mattress and covered with a layer of foam rubber [that] measured 0.5 inches in thickness for the six smaller masses to approximate skin and subcutaneous tissue and 1.5 inches for the six larger masses to approximate abdominal wall.” At the time, this was a thoughtful assessment of the clinical measurement “noise” that interfered with investigators' detection in single arm studies of the signal of disease response to an active treatment. The authors suggested ineffective agents would have a placebo-like objective response rate of 5-10%. They also recommended that the threshold for declaring partial response should be a 50% decrease in bidimensional measurements. The mathematical counterpart of this 50% decline criterion for unidimensional measurements is the 30% employed by RECIST in the current system.

The latest RECIST version acknowledges its limitations but contends there is no validated improvement available for routine use. Reliance on categorical systems has stifled implementation of innovative approaches to exploit digital imaging and better measurement of solid tumors. Shortly after Moertel and Hanley's study of spheres under foam rubber, Lavin proposed quantitative assessment of change in tumor size between randomly assigned groups of patients to assess potential treatments more efficiently than non-randomized studies using categorical assessments(4). Modeling Lavin's study design for a trial of erlotinib and sorafenib in the second-line treatment of non-small cell lung cancer (NSCLC)(5), Karrison, et al. suggested the value of the combination could be determined with greater certainty than a single-arm trial without requiring substantially more patients. The design's chief advantage was that the assessment required each patient to remain on her/his assigned treatment arm for only 8 weeks. Patient accrual would be quicker and the total time to completion shorter than trials using alternative end-points such as progression-free survival.

The change in tumor size at 8 weeks endpoint for NSCLC treatment was validated in 3,398 NSCLC patients enrolled in 4 phase III trials(6). The investigators determined the range of tumor growth patterns over time. They then developed a predictive model for each patient and the relationship of this pattern to overall survival. The best predictors for survival in this quantitative model were ECOG performance status, baseline tumor size, and percentage tumor size reduction from the baseline measurement to 8 weeks post treatment. Similarly, in a study of advanced colorectal cancer patients, baseline tumor size and the change in that measurement 7 weeks post-treatment were significant predictors of overall survival(7).

Routine measurement variance makes the 7-8 week tumor size change the earliest for predicting treatment effects on overall survival. Patient position in the scanner and physiologic fluctuations in tumor shape and size over short intervals confound reproducible measurement of treatment effects. The modern equivalent of the spheres-under-foam study- radiology scientists have performed “coffee break” studies(8). Patients undergo a CT scan, get off the imager table, and then undergo a second scan. Radiologists measure lesions in both image sets and the variance is determined. Prior to the current study, coffee break investigations suggested that the ratio of the variance in unidimensional or bidimensional to total measurements was greater than for volume(9). This implied that a major advantage of volume measurements would be the routine capacity to detect treatment effects earlier in the course of therapy. This also predicted that volume could more readily detect treatment effects in indolent neoplasms where the changes in tumor size over time are subtle and require prolonged investigations to establish therapeutic benefit. Two independent studies established the therapeutic benefit of sirolimus in indolent angiomylipomata and tuberous sclerosis with volume measurements(10, 11).

In a clinical trial of 3-weeks gefitinib therapy before curative lobectomy, NSCLC patients had tumor epidermal growth factor receptor genotyping and pre and post-treatment CT imaging. Zhao, et al. used thin slice (1.25mm sections) CT exams and in addition to determining the single longest dimension of each tumor, they calculated tumor volumes with a semi-automated software application. Using genotype as the gold standard to categorize gefitinib sensitivity and resistance, the investigators demonstrated that changes in tumor volume at 3 weeks had better sensitivity and specificity for identifying the tumor genotype than unidimensional measurements. They concluded that early changes in tumor volume measurements could identify patients who will and will not benefit from a novel therapeutic and that these data could be used in efforts to discover or validate predictive tumor biomarkers.

As a pilot study, there are limitations to applying these results to therapeutics development. The optimized cutpoint for tumor volume changes was specific to this one cohort and had 11% false positive and 10% false negative assignments, unacceptable for determining an individual patient's decision to remain on or withdraw from further treatment. As these were early stage NSCLC patients it could not be determined if this error rate was due to imperfections in tumor genotyping as the “gold standard” or if the assignment accuracy could be improved by observing tumors for a longer timeframe. Although prior studies suggested bidimensional measurements would not be significantly better than unidimensional, this was not addressed in this study. Finally, the sharp contrast in pixel intensity between lung tumors and the adjacent, air-filled parenchyma leads to easier discernment of tumor boundaries and volume calculations than for tumors in other visceral sites.

The biggest limitation to advancing this research to benefit patients is not technical but operational. After plain films, CT is the diagnostic imaging test most commonly used in the care of patients with solid tumors(12). The popularity of these scans is due to their accessibility, speed, and ease of performance. Although imaging methods that employ new radiotracers or spectroscopic methods offer more detailed functional information, these studies are performed orders of magnitude less frequently than CT scans. Similar to the relationship of tumor radius to volume, modest investment in the infrastructure to share, develop, and implement quantitative data from CT imaging should yield exponential gains in our capacity to develop and deliver personalized cancer therapeutics.

Figure 1. Tumor volume change as an early response metric.

Figure 1

Early response assessment is limited by smaller magnitude changes than seen with conventionally timed response assessment (top). Volume measurement may allow detection of antitumor effect when it is not apparent with conventional diameter measurement (bottom).

References

  • 1.Zhao B, Oxnard GR, Moskowitz CS, et al. A Pilot Study of Volume Measurement as a Method of Tumor Response Evaluation to Aid Biomarker Development. Clin Cancer Res. 2010 June 9; doi: 10.1158/1078-0432.CCR-10-0125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Eisenhauer EA, Therasse P, Bogaerts J, et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1) Eur J Cancer. 2009;45:228–47. doi: 10.1016/j.ejca.2008.10.026. [DOI] [PubMed] [Google Scholar]
  • 3.Moertel CG, Hanley JA. The effect of measuring error on the results of therapeutic trials in advanced cancer. Cancer. 1976;38:388–94. doi: 10.1002/1097-0142(197607)38:1<388::aid-cncr2820380156>3.0.co;2-a. [DOI] [PubMed] [Google Scholar]
  • 4.Lavin PT. An alternative model for the evaluation of antitumor activity. Cancer Clin Trials. 1981;4:451–7. [PubMed] [Google Scholar]
  • 5.Karrison TG, Maitland ML, Stadler WM, Ratain MJ. Design of phase II cancer trials using a continuous endpoint of change in tumor size: application to a study of sorafenib and erlotinib in non small-cell lung cancer. J Natl Cancer Inst. 2007;99:1455–61. doi: 10.1093/jnci/djm158. [DOI] [PubMed] [Google Scholar]
  • 6.Wang Y, Sung C, Dartois C, et al. Elucidation of relationship between tumor size and survival in non-small-cell lung cancer patients can aid early decision making in clinical drug development. Clin Pharmacol Ther. 2009;86:167–74. doi: 10.1038/clpt.2009.64. [DOI] [PubMed] [Google Scholar]
  • 7.Claret L, Girard P, Hoff PM, et al. Model-based prediction of phase III overall survival in colorectal cancer on the basis of phase II tumor dynamics. J Clin Oncol. 2009;27:4103–8. doi: 10.1200/JCO.2008.21.0807. [DOI] [PubMed] [Google Scholar]
  • 8.Zhao B, Schwartz LH, Moskowitz CS, Ginsberg MS, Rizvi NA, Kris MG. Lung cancer: computerized quantification of tumor response--initial results. Radiology. 2006;241:892–8. doi: 10.1148/radiol.2413051887. [DOI] [PubMed] [Google Scholar]
  • 9.Mozley PD, Schwartz LH, Bendtsen C, Zhao B, Petrick N, Buckler AJ. Change in lung tumor volume as a biomarker of treatment response: a critical review of the evidence. Ann Oncol. doi: 10.1093/annonc/mdq051. [DOI] [PubMed] [Google Scholar]
  • 10.Bissler JJ, McCormack FX, Young LR, et al. Sirolimus for angiomyolipoma in tuberous sclerosis complex or lymphangioleiomyomatosis. New Engl J Med. 2008;358:140–51. doi: 10.1056/NEJMoa063564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Davies DM, Johnson SR, Tattersfield AE, et al. Sirolimus therapy in tuberous sclerosis or sporadic lymphangioleiomyomatosis. New Engl J Med. 2008;358:200–3. doi: 10.1056/NEJMc072500. [DOI] [PubMed] [Google Scholar]
  • 12.Dinan MA, Curtis LH, Hammill BG, et al. Changes in the use and costs of diagnostic imaging among Medicare beneficiaries with cancer, 1999-2006. JAMA. 2010;303:1625–31. doi: 10.1001/jama.2010.460. [DOI] [PubMed] [Google Scholar]

RESOURCES