Abstract
The purpose was to study correlations amongst IMRT DVH evaluation points and how their relaxation impacts the overall plan. 100 head‐and‐neck cancer cases, using the Eclipse treatment planning system with the same protocol, are statistically analyzed for PTV, brainstem, and spinal cord. To measure variations amongst the plans, we use (i) interquartile range (IQR) of volume as a function of dose, (ii) interquartile range of dose as a function of volume, and (iii) dose falloff. To determine correlations for institutional and ICRU goals, conditional probabilities and medians are computed. We observe that most plans exceed the median PTV dose (average prescribed dose). Furthermore, satisfying reduced the probability of also satisfying , constituting a negative correlation of these goals. On the other hand, satisfying increased the probability of satisfying , suggesting a positive correlation. A positive correlation is also observed between the PTV and . Similarly, a positive correlation between the brainstem and is measured by an increase in the conditional median of , when is violated. Despite the imposed institutional and international recommendations, significant variations amongst DVH points can occur. Even though DVH aims are evaluated independently, sizable correlations amongst them are possible, indicating that some goals cannot be satisfied concurrently, calling for unbiased plan criteria.
PACS number(s): 87.55.dk, 87.53.Bn, 87.55.Qr, 87.55.de.
Keywords: IMRT plan evaluation, correlations, planning aims, statistical analysis, DVH
I. INTRODUCTION
Treatment plan optimization in intensity‐modulated radiation therapy (IMRT) often leads to unsatisfactory solutions due to constraints imposed to achieve clinical goals. To overcome infeasibilities, constraints are relaxed and competing objectives between tumor volume(s) and surrounding structures are formed. Competing objectives lead to trade‐offs for clinical use that are visually inspected and approved. The subjectivity of a visual inspection for IMRT planning decisions introduces biases and tendencies, consequently resulting in plans to become nonreproducible. In fact, Das et al. (1) observed high variability among treatment planners and institutions. They reported that the median dose delivered to the tumor varied between ± 10% of the prescribed dose for 96% of the patient population. Any tendencies in treatment planning also lead to plan degeneracy, where multiple competing plans are produced, all satisfying institutional criteria (typically 95% volume receiving 95% dose to target). Even though all these guidelines are typically met, the significant personal preferences in plans raise concerns for effective and reproducible methods for comparing treatment plans among patients and institutions.
In general, such preferences may be introduced from a variety of sources, including delineation of anatomical structures, treatment planning system (TPS) variability, and institutional guidelines, but most importantly treatment planner's preferences. Accurately delineating tumors and organs is heavily dependent on both the quality of CT images and the experience of clinicians. Cazzaniga et al. (2) compared structure delineation for three patients by six radiologists and observed standard deviations ranging from 0.35 to 2.64 cm2 of overlapping area. This reveals the need for more detailed constraints in addition to clinical experience in order to restrict such preferences. Fogliata et al. (3) observed tendencies introduced through the choice of TPS by comparing eight TPSs on four patients using scoring indices. They reported varying scores for organs at risk (OAR) and target depending on the employed TSP. Eriguchi et al. (4) observed variations due to institutional protocols by comparing treatment plans executed at multiple institutions using the same TPS on the same patient scans. They reported a to PTV that ranged from 43.6 to 51.2 Gy, and the volume that received 20 Gy or higher, , ranged from 12.2% to 18.9% over the four institutions. is the minimum dose to the hottest x% of the volume and is the fractional volume that receives at least x% of the prescription dose.
In principle, clinical dose and volume constraints and guidelines are used to overcome the aforementioned sources of variations, as well as to assess the quality of plans. 5 , 6 However, despite rigorously imposing these constraints, Das et al. (7) observed substantial differences in the DVH shapes when varying patients, planners, and TPS. An additional factor is the allocation of time to generate a plan, as typically very little time is devoted for optimization. (8) Dosimetric uncertainties can also alter the quality of plans. (9)
The DVH constraints are often imposed through specific dose‐volume points as a comparison metric, such as , , , and . They may also directly serve as DVH constraints in plan optimization, as suggested by Cambria et al. (5) In order to adhere to these constraints, treatment plans are often iteratively adjusted by relaxing one or more constraints to achieve the desired result. This iterative process is heavily dependent on the planner's skill and experience, as demonstrated by Nelms et al., (10) who suggested the need for best‐practices criteria as part of a continuous improvement strategy to reduce significant plan variations. Subjective preferences in trading off DVH constraints during the planning are reflected in the final plan's DVH control points and their correlations.
In this work, we study the dependence amongst these DVH control points. Since we cannot assume the DVH variations amongst cases to follow a Gaussian distribution, nonparametric statistical tools are employed in two main steps:
Variations amongst plans, measured by median and interquartile range.
Correlations between planning aims, computed via conditional probability and median.
We observe significant variations surrounding the DVH constraints even though most of the clinical goals were satisfied. It is shown that the current clinical DVH constraints require prioritizing amongst them, which limits the reproducibility and standardized plan quality. Furthermore, we demonstrate trends in increased dose delivery, suggesting dissension in evaluating treatment plans. The details of materials used in the study are discussed in the Materials & Methods section. Three statistical measures are detailed in subsection A.1 to demonstrate the variation in dose, volume, and DVH falloff along the DVH. To show the efficacy of clinical goals, statistical comparisons between plans that violate and those that satisfy the constraints are presented in subsection B.
II. MATERIALS AND METHODS
One hundred head‐and‐neck cancer cases from Indiana University, Melvin and Bren Simon Cancer Center are selected based on their anatomical comparability. The corresponding treatments for all cases are planned using the same TPS (Eclipse, Varian Medical System, Palo Alto, CA), following identical institutional dosimetric criteria. Besides anatomical differences, possible variations stem from plans that were produced by different planners. Based on clinical advice, brainstem and spinal cord are identified as the highest priority OAR to be spared. For brevity, we shall refer to spinal cord as spine. In many cases when the lesion was at or adjacent to the skin, the patient was scanned with bolus. Consequently, the PTV is deliniated to be within 2–3 mm of the skin surface. (11) For the cases with multiple PTVs, the one with the largest prescribed dose is identified as the primary target to be irradiated. We analyze all targets and report the primaries as the main PTV for this study to maximize the contrast between the competing objectives of tumor irradiation and OAR sparing.
These plans were optimized based on dosimetric constraints for PTV of and of the prescribed dose, for spine , and for brainstem , following the institutional guidelines. All DVHs are collected from the TPS with a resolution of 0.1 cGy dose. Furthermore, the DVHs for PTV are collected using relative volume and dose points, allowing effective comparisons. Note that the median absolute prescribed dose over all patients is 70 Gy, with an interquartile range of 10.3 Gy. Moreover, in order to compare specific dose‐volume points, DVH values are linearly extrapolated by binning remaining doses with 0% volume.
The quantitative analysis is divided into two parts: variations amongst plans and correlations between DVH control points. For the first part, we use three statistical metrics that are well suited for such nonsymmetrically distributed data, namely the interquartile range of dose for changing volume, interquartile range of volume for changing dose, and the approximate DVH gradient to study variation in the falloff. For the second part, we use conditional measures for institutional and for internationally recommended DVH goals. Note that the methods and the results section will follow this two‐part structure. The details of our analytical method are as follows.
A. Variations in plans
The interquartile range is an estimator of spread, computed as the difference of the upper and the lower quartile of a distribution. It is inherently robust to changes in scale, hence uninfluenced by outliers. (12) Robustness is important, because an otherwise optimal solution (statistical estimator) may render suboptimal and unreliable when uncertainties occur. 13 , 14 To study the spectrum of relevant variations amongst plans, we employ: (i) the IQR of volume as a function of dose, (ii) the IQR of dose as a function of volume, and (iii) the median DVH falloff and its IQR. The details are as follows.
A.1 Interquartile range of volume
is a statistical estimator of distribution at a dose value D and quantifies directly the spread. In this study, is recorded over the entire dose range in increments of 0.1% relative dose (tumor) or 0.1Gy (OAR).
A.2 Interquartile range of dose
measures the spread of distribution and is recorded over the entire volume range at increments. Note that and are estimators of two different and orthogonal distributions and, hence, independent of each other.
A.3 Dose falloff
∇ serves as a measure for the DVH falloff. It is determined at two neighboring DVH point pairs and via finite differences
where . V2 and V1 are the corresponding volumes. The schematic setting is illustrated in Fig. 1. The objective of this analysis is to identify whether a rapid falloff or a slow falloff (blue or red DVH in Fig. 1) was preferred for a specific plan.
Figure 1.

Schematic of two plans with a magnification of the crossing points.
In order to observe such variations and identify trends amongst past decisions, we compute the median DVH falloff, , for all patients as function of dose. Additionally, the interquartile range of the falloff, , is determined in order to observe potential dissension in decision‐making favoring a slow or rather a rapid DVH falloff.
B. Correlations in plans
The iterative treatment planning process aims to satisfy DVH constraints determined by both institutional and internationally recommended aims. In practice, however, it is often not feasible to satisfy all constraints, which in part justifies the existence of institutional guidelines which, in some cases, deviates from the recommended aims. As a result, treatment planners choose to trade off some constraints. The underlying assumption of this pursuit is that it is admissible to violate one or more of the aims, while still providing an overall satisfactory and clinically acceptable plan. To analyze the dependence amongst the constraints, we determine statistical correlations for PTV, brainstem, and spine constraints.
For each DVH control point i, we divide the set of the plans into those that satisfy the constraint i (subset ) and those that violate it (subset ) with . To establish correlations, we use conditioning for two differing control points i and j as
where denotes the conditional probability. In other words, if violating the constraint j reduces the probability of constraint i to be met, these two constraints are positively correlated and negative otherwise. Here, probability reflects the relative number of plans in the subset.
Additionally, we quantify the rate of change by computing conditional medians and derive conclusions analogous to Eqs. (2) and (3).
In addition to these imposed goals, we also consider empirical control points to measure the correlations of the imposed goals to their surrounding DVH sections. To this end, when the plans exhibit unanimous agreement at a particular high dose by displaying a marginal spread, defined as , we denote the corresponding (constrained) dose as . Conversely, neighboring unconstrained DVH points (within ) with significant deviation (i.e., ) are denoted as (unconstrained dose). Such constrained and unconstrained points subsequently serve to qualify additional subsets. The analysis is divided for DVH constraints that stem from institutional protocols and those recommended by the ICRU‐83. (15) The control points are as follows
B.1 Institutional aims
The institutional dosimetric constraints for PTV are and of the prescribed dose, for spine , and for brainstem . These control points are used to establish subsets of satisfying or violating plans, along with their neighboring points. The corresponding and are data‐specific and will be discussed in the Results section below.
B.2 ICRU‐83 aims
ICRU guidelines recommend for head‐and‐neck cases, near‐max dose , median dose , and near‐min dose constraints for PTV, and for the surrounding OARs. (15) In order to sizably distinguish violations from satisfying the goal, we extend the definition to encompass .
III. RESULTS
Despite consistent planning protocols and the normalizing nature of DVH, sizable variations amongst the 100 head‐and‐neck cases are observed for PTV, brainstem, and spine in Figs. 2(a), (b), and (c), respectively. Maximum PTV dose ranges 104% and 120% of the prescribed dose. While the DVHs for most cases overlap for PTV, we observe a wider spread for brainstem and spine (see Figs. 2(a) to (c)). For brainstem, a rapid decrease is apparent in low dose areas and a plateau in the mid‐volume range for most cases. For spine, we observe an extended plateau at high‐volume range with a decrease for higher dose for most cases. To quantify these variations, we employ the estimators introduced in Material and Methods section 7.
Figure 2.

Dose‐volume histograms of 100 head‐and‐neck cancer patients.
A. Variation estimators
As observed in Figs. 2(a) to (c), the distribution amongst plans is not symmetric. This is expected, given the nonincreasing nature of cumulative DVH. Therefore, IQR is well suited to characterize the spread of the asymmetrical DVH distributions. We consider an at a DVH goal to indicate a significant variation amongst plans.
A.1 Interquartile range of volume
In clinical practice, volume constraints are often imposed at specific points, to achieve desired dose distributions (e.g., on tumor). (16) In order to meet these DVH constraints, the weights on other clinical objectives are adjusted and at times completely relaxed. These decisions are case‐specific and depend on the discretion of the oncologist and/or planner, leading to variations of over the patients. The spread of this distribution can be measured by and is illustrated in Fig. 3.
Figure 3.

Interquartile range of volume as a function of dose: .
Figure 3(a) shows unanimous agreement of PTV's until 90% prescribed dose . The maximum deviation is reached at , and vanishes again at , depicting the DVH falloff region. The observed peak at indicates an increased spread at , which can stem from relaxed PTV constraints or how the objectives are specified, amongst others. Note that the recommendations of the Radiation Therapy Oncology Group allow for up to 20% of the tumor volume to receive more than 110% prescribed dose, which supports the observed tendency to tumor excess dose. (17) The overall narrower and lower of PTV reflects the higher importance assigned to PTV in comparison to brainstem and spine.
Figure 3(b) illustrates that the volume variation for brainstem peaks at , followed by a second peak at with a subsequent steady decrease and vanishes at . One possible explanation would be that treatment planners prefer to relax constraints on brainstem at lower dose values, but tend to much tighter control at higher dose values, supported by the observed narrower spread. For spine, Fig. 3(c) reveals an increase of to , followed by a sharp decrease to . This result indicates tight constraints on higher doses for spine, whereas the large spread for suggests relaxation. However, a tighter control is not necessary because the probability of myelopathy vanishes for to the cord. (18) These observations may be considered as an indirect effect of the predominant constraints on brainstems and spines, as common in head‐and‐neck cases. 19 , 20 , 21 , 22 Note that thresholds lead to at , resulting in narrower .
A.2 Interquartile range of dose
In addition to the volume constraints, a set of constraints are imposed at specific dose values. Similarly, they may lead to variations in the surrounding DVH regions which we measure with . For PTV, Fig. 4(a) illustrates negligible for , with a steep increase to . This suggests that tight constraints are imposed on PTVs until followed by relaxed control for higher volumes, which is in agreement with the recommendations. (23)
Figure 4.

Interquartile range of dose as a function of volume: IQRdose(V).
Figure 4(b) shows reduced at higher brainstem volumes. This is in agreement with the recommendations that strict constraints are essential to maintain functionality of such critical organs. 19 , 20 , 21 , 22 However, Fig. 4(c) for spine depicts significant spread around , indicating a lack of constraint and a need for tighter control at higher doses.
A.3 Dose falloff
To achieve the goal of conformal target dose, intensity modulation is leveraged to deliver heterogeneous distributions, allowing for a sharp dose falloff on DVHs. (24) A rapid dose falloff at the prescribed dose allows to reduce risks of complications. (25) A vertical falloff at the prescribed dose constitutes an ideal target DVH. To achieve this goal, near‐minimum dose and near‐maximum dose constraints are imposed. (26) However, due to competing objectives, a sharp and perpendicular falloff is usually infeasible. Therefore, some constraints are relaxed, leading to V variations amongst plans. The median ∇ and are shown in Figs. 5(a) to (c) for PTV, brainstem, and spine.
Figure 5.

DVH falloff: blue line marks the median and red the interquartile range .
Figures 5(a) to (c) show a marginal planning variation for PTV, in comparison to brainstem and spine. This is seen by narrow surrounding the prescribed dose, which is typically a result of employing multiple constraints. For brainstem and spine, however, the significant is spread over multiple regions, demonstrating variabilities in decision‐making as a result of employing single maximum dose constraints. More specifically, Fig. 5(a) shows a vanishing PTV median for , followed by a drop to median . This implies that a dose change of 1% results in a volume change. In other words, the falloff region extends over a median dose interval of , exhibiting a narrow falloff region through tighter control. This is followed by a rise to . This change in median for coincides with the region surrounding the prescribed dose. Noticeably, the sharpest falloff occurs at instead of , further suggesting planners’ preference towards higher dose, as also reported by Eisbruch et al. (17)
Figure 5(b) illustrates that the sharpest falloff for brainstem occurs at very low‐dose values , demonstrating tight dose constraints. Unlike brainstem, the median for spine in Fig. 5(c) shows two distinct turning points at and at . The first DVH falloff is attributed to an artifact of the patient geometry (for some cases, lower doses to significant portions of organs is common, but not for all). On the other hand, the turning point at is a direct result of maximum dose constraints for brainstem and spine, which are typically imposed in head‐and‐neck cancer cases. 19 , 20 , 21 , 22
The spread around the median is shown via in Figs 5(a) to (c). For PTV, negligible spread is observed for , followed by narrow spread for region, which is typically a result of strict constraints. (23) However, significant spread occurs around the DVH falloff region, revealing variability in decision‐making over this range, followed by the overall agreement beyond . For brainstem, the largest is recorded at lower dose values, followed by a narrow deviation at higher dose points. This further suggests multiple constraints imposed at higher dose for this critical organ. Similarly for spine, beyond the low‐dose spread, a significant spread is observable only between and . This demonstrates both the impact of maximum dose constraints and a lack of median dose constraints. Since the maximum dose constraints are typically rigorously enforced on both PTV and critical organs, competing dose‐volume constraints are relaxed to generate feasible plans.
Note that the use of statistical estimators for symmetric distributions would misrepresent the actual distribution here. For example, the use of mean and standard deviation for ∇ reveals a substantial ripple at (shown in Appendix A), which is not observable with median ∇ and in Fig. 5(a). However, this ripple is caused by four outliers, while the median and interquartile ranges are insensitive to outliers and, hence, appropriately describe the observed asymmetric distribution. While such outliers may reveal additional insights, their analysis is beyond the scope of this work.
B. Correlations in plans
Since all the studied plans were clinically approved, it is justified to expect broad agreement at dose‐volume constraints enforced in the study, while variations may exist at other constraints which were relaxed. To establish possible consequences of such goal relaxation, we included empirical constraints in the vicinity of the DVH goal points. This allows for a guide to gauge potential risks for the overall plan that are associated with violating some constraints. We first focused the analysis on the institutional DVH goals that were enforced for the studied cases. We then analyze the cases based on additional constraints recommended by the ICRU‐83, (15) since a subset of them overlaps with the institutional constraints.
B.1 Institutional aims
When the plans exhibit unanimous agreement at a particular high dose by displaying a marginal spread (i.e., ), we denote the corresponding (constrained) dose as . We observed a PTV constrained dose point with at . For brainstem, and for spine, . This corresponds to maximum dose constraints that are usually imposed on all structures, often being the only constraint for brainstem or spine. 19 , 20 , 21 , 22 When , , or are negligible , we hypothesize them to be a consequence of maximum dose constraints, and otherwise independent. On the other hand, we observe a neighboring (within ) unconstrained point with at for PTV, for brainstem, and for spine. Note that, due to the nonincreasing nature of cumulative DVHs, , , and for PTV, brainstem, and spine, respectively. This implies that an equal or larger spread (from objective) is expected at when compared to , since the constraints are found at higher volumes. To establish deviations, we observed two clusters of plans, namely those satisfying , , or by being less or equal to 2% and those which were significantly different at a neighboring point . The two groups maximally separate, when the volumes at exceeded 45%. Therefore, the hypothesis is that such violations are related to meeting other DVH constraints, and otherwise independent. To summarize these relationships, Table 1 serves as an overview on institutional planning aims. Note that each table element can serve as a subset, as defined in Materials and Methods section B.
Table 1.
Summary of planning aims: left) when constrained DVH points are violated or satisfied; right) when significant deviations or independent points are observed
|
Constrained Dose
|
Neighboring Dose
|
|||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Structure | Planning Aims | Criteria Violated (%) | Criteria Satisfied (%) | Significant Deviation (%) | Independent of D con (%) | |||||
| PTV |
|
|
|
|
|
|||||
|
|
|
|
|
|
||||||
| Brainstem |
|
|
|
|
|
|||||
| Spine |
|
|
|
|
|
|||||
| Constrained volume () | Neighboring volume () | |||||||||
| PTV |
|
|
|
|
|
|||||
Significant overdose tendencies are observed for PTV. For brainstem, minimal deviations in comparison to spine is apparent, which has higher percentage of constraint violations. This demonstrates the increased planning importance for brainstem. Specifically, Fig. 6(a) for PTV shows substantial violations at ( for some patients) in comparison to , where violations are negligible (on average ). This implies substantial volume portions received , supporting aforementioned observation of tendencies to deliver higher doses to PTV. More importantly, this result shows larger violations at for cases that also have slight violations at . This result is more evident when evaluating the conditional median. In particular, the conditional median of given a violation at is Median compared to Median for PTV.
Figure 6.

Impact of DVH volume constraints. Large volume deviations are observed at (red) compared to (blue).
Similarly, constraints (denoted as ) were imposed on PTV, which is a common institutional choice. (27) The neighboring volumes are considered (e.g., at ). Patients with significant deviations at also show deviations from the established constraints (not shown). The conditional Median versus Median shows improvement in neighboring areas when constraints are satisfied.
Figure 6(b) illustrates negligible disagreement at for brainstem. Even though violations are observed, they are not due to decision tendencies but rather due to the nonincreasing nature of cumulative DVHs and the already existing violations at . When comparing the conditional medians for brainstem, we observe a difference between Median and Median. For spine, Fig. 6(c) shows significant disagreement at propagated with violations at . There is also a sizable difference between Median and Median( for spine. This result indicates that marginal deviations from DVH constraints can (in median) lead to significant degradation in neighboring unconstrained points and, hence, limit the plan quality. Overall, Fig. 6 shows the case‐by‐case impact of constraint violation for all cases. Beyond the conditional changes, we next provide a quantitative guide for the associated risk for a plan when constraints are violated.
In practice, not all clinical goals can be satisfied concurrently, resulting in constraint relaxation. To measure the risk (probability) that violating some constraints may have on other goals, we evaluate the plans at for PTV and , and 45 Gy for brainstem, and spine, respectively. We hypothesize that satisfaction is correlated to constraints, or otherwise independent. Additionally for PTV, we evaluate , for which we hypothesize that if satisfied, or if violated. For correlation analysis, we divide all plans into subsets defined in Table 1 and apply the conditional measures of Materials and Methods section B:
Subset A: plans satisfying goal, leading to at ;
Subset B: plans satisfying goal, leading to at ;
Subset : plans violating low volume at high‐dose goal, leading to at .
Figure 7 summarizes the DVHs of plans belonging to the subset A (B) or its compliment . For PTV, violation (subset ) leads to negligible excess dose. However, violation (subset ) has a greater impact through significant underdose, as observed by the broader IQR that is asymmetric towards lower volumes and dose. Note that the IQR limits depict plans that significantly degrade due to violation .
Figure 7.

DVH for subsets of plans: A satisfy the constraint and B meet goal.
For brainstem, a substantial improvement is observed for plans that satisfy (subset A). In fact, the IQR of A and do not overlap, suggesting that significant improvements are possible, if the constraints are satisfied. For spine, moderate reduction in dose is observed for plans that satisfy the constraint (subset A), suggesting a similar potential for improvements. Figure 7 shows that violating constraints goes beyond the common notion of “trade‐off” and actually affects the entire DVH. It allows one to determine the impact of violating one of these constraints on the overall plan quality. Therefore, this analysis can inform clinical planning decisions to quantitatively gauge potential risks of constraint relaxations.
This impact can also be accessed by comparing the probabilities of delivering high doses to larger volume portions in neighboring and unconstrained DVH points when conditioned on violating (or meeting) the goals at . In other words, we compare the conditional probability ). Therefore, the impact of not meeting PTV's constraints is observed through the comparison of
and
This demonstrates a positive correlation between and for the PTV, namely violating one increases the probability of violating the other as well.
Similarly for brainstem, the probabilities for degrading the plans are and for spine . In comparison when are met, and . The two proportion z‐tests, reveals significantly larger than for all structures with a p‐value less than 0.0001.
Additionally for PTV, is significantly larger than . Furthermore, the two proportions z‐tests confirm this significant difference . Therefore, all these observations support the conclusion that significantly higher risks for degrading outcomes are expected when DVH constraints are violated. These results can serve to quantitatively inform the decision‐making process in treatment planning.
B.2 ICRU aims
Similarly, we analyze the correlations amongst the noninstitutional aims, as defined in section B of Material and Methods. Table 2 summarizes the relevant internationally recommended planning aims by ICRU‐83. (15)
Table 2.
Summary of ICRU aims for PTV and the criteria for violation or satisfaction
| Planning Aims | Criteria Violated (%) | Criteria Satisfied (%) | |||
|---|---|---|---|---|---|
|
|
|
|
|||
|
|
|
|
|||
|
|
|
|
These aims may also become competing (i.e., they cannot be met concurrently), hence, compelling planners to relax or even ignore some of them. To measure correlations among the ICRU constraints, we similarly group patients into subsets as discussed in Materials and Methods section B:
Subset E: plans that satisfy ;
Subset F: plans that satisfy ;
Subset G: plans that satisfy .
The impact of violating the ICRU‐83 recommendations is shown in Fig. 8. The excess dose violation of subset and the underdose violation of subset in Fig. 8(a) can be considered as a result of the definition of these constraints. For the subset in Fig. 8(b), the violation is attributed to an increased dose, confirming the overall overdose tendency observed through this analysis. This is also reflected in an elevated dose to larger volumes, hence satisfying , as demonstrated in Fig. 8(c) for the subset G. Conversely for F, is satisfied by reduced dose to larger volumes, which leads to violating and constituting . In other words, plans in subset F have an increased likelihood to also reside in . Similarly, those in are more probable to also be in G. Therefore, the set of analyzed plans clearly suggests a negative correlation between the recommended aims of and , indicating competing goals that are unlikely to be met simultaneously.
Figure 8.

Analysis of the ICRU‐83 recommendations for PTV: (a) satisfied (E) or violated , (b) satisfied (F) or violated , (c) satisfied (G) or violated .
The pairwise conditional probabilities for all goals are summarized in Table 3. Given that 86% of the cases violate (subset ), the probabilities of F conditioned on E or G are marginal. However, when conditioning on F, a sizable positive correlation is observed for the subset E, since the corresponding probability decreases upon complementing F. In other words, these plans suggest that satisfying increases the probability of satisfying as well. On the other hand, a negative correlation is observed when G is conditioned on F, since the probability increases when conditioning on . This means that satisfying reduces the probability of satisfying . This observed competing nature of these two recommended aims demonstrates that the likelihood of meeting them simultaneously is low.
Table 3.
Probabilities of subset in row i is conditioned on in column j. For sizable differences, a positive correlation is marked in blue and a negative correlation in red
|
Subsets
|
|
|||||||
|---|---|---|---|---|---|---|---|---|
| E |
|
F |
|
G |
|
|||
| E | 0.31 | 0.64 | 0.26 | 0.30 | 0.32 | |||
|
|
0.69 | 0.36 | 0.74 | 0.70 | 0.68 | |||
| F | 0.29 | 0 | 0.14 | 0.07 | 0 | |||
|
|
0.71 | 1.0 | 0.86 | 0.93 | 1.07 | |||
| G | 0.68 | 0.70 | 0.36 | 0.74 | 0.69 | |||
|
|
0.32 | 0.30 | 0.64 | 0.26 | 0.31 | |||
Since these plans were designed and delivered to the best ability and knowledge of experts, the result that not all ICRU‐83 recommended aims could be satisfied supports the clinical need for institutional guidelines, such as , as prevalent amongst institutions. At the same time, we observe that these institutional goals lead to trends, such as excess dose tendencies, that potentially jeopardize both plan quality and their comparability across institutions.
IV. DISCUSSION
In treatment planning, point‐wise dose‐volume goals are not only used to enforce clinical objectives, or to directly optimize treatment plans, but also to choose between alternative plans. The analysis presented in this work seeks to identify areas of plan variation and constraint correlation. Therefore, a possible extension of the presented study includes developing and integrating unbiased constraints and estimators directly into treatment planning to mitigate variability.
To limit dissension in decision‐making, this study reveals the need for constraints or comparison estimators that can be enforced for extended DVH regions. The incorporation of an unbiased estimator for treatment plan evaluation was suggested by Loveless et al. (28) A deviation metric was designed to capture the weighted difference between the realized and ideal DVH. The weights were extracted from past treated cases. Alternatively, benchmark DVHs were used for treatment comparisons for prostate cancer cases. (29)
It needs to be noted that this analysis was confined to head‐and‐neck cases only, warranting comparable anatomies. Unique geometries for some patients still may contribute to deviations in received dose on certain structures. These variations are patient‐specific and can occur in any treatment. Therefore, we chose robust estimators to immunize the conclusions against potential uncertainties. Furthermore, since the main goal of this study was to establish correlations, the resulting probabilities and risk measures are indented to inform future treatment planning, but cannot yield causations. The understanding of the nature of these effects, potentially confounded by the geometry, treatment planning system, dosimetric models, and other factors, is beyond the scope of this study. Nevertheless, these results have the potential to quantitatively augment the treatment planning, confining the trial‐and‐error nature of the decision‐making process and, hence, reducing time and ambiguity.
V. CONCLUSIONS
The impact of trading off clinically employed DVH goals for treatment planning is demonstrated using three robust statistical estimators on 100 past head‐and‐neck cases. Significant plan deviations are observed on the interquartile ranges of dose and volume, as well as DVH falloff, especially on areas neighboring the constraints. This analysis shows that, when these constraints are even marginally violated, larger deviations are expected across the entire DVH. These results extend conventional notions of a “trade‐off” between clinical goals to a quantitative risk measure that relates each constraint violation to degradation of the overall plan. In fact, some of the internationally recommended aims were identified to be negatively correlated, hence, competing. This study also identifies DVH goals of unanimous agreement amongst planners, as well as areas of dissension and deviations, calling for novel clinical constraints that go beyond the established dose‐volume constraints and address extended areas of the DVH. Since these results relied on standard dosimetric criteria, they are of general nature and, hence, applicable to a wide range of IMRT planning settings.
ACKNOWLEDGMENTS
We would like to thank S. Srivastava for technical support in data extraction and M. Langer for insightful discussions. O. Nohadani thanks the support from the National Science Foundation under Grant No. CMMI 1463489.
COPYRIGHT
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
APPENDICES
Apendix A: Supplementary Materials
1. Conventional statistical estimators
When the distribution of data is symmetric and in the absence of outliers, conventional statistical estimators, such as sample mean and standard deviation, provide reliable analytical tools for the observations. However, when the distribution is not symmetric and outliers are prevalent, this is no longer the case. Figure A.1 demonstrates the use of mean and standard deviation for the dose falloff ∇ of all three structures. The observed ripple at for PTV suggests an overall phenomenon for this control point. A further analysis of the data reveals that the standard deviation was impacted by only four outlier plans for which specific tumor geometries required deviating arrangements. This ripple, however, does not speak for the cohort of the plans. Similar ripples are observable for brainstem and spine, all representing nonglobal uncertainties. On the other hand, robust estimators, such as the median and IQR, are immune to these pertubations, as exhibited in Fig. 5 of the manuscript.
Figure A.1.

DVH falloff. The blue line marks the mean , and the red line the standard deviation.
2. Institutional control points
Figure A.2(left) shows that patients with sizable violations at also show some deviations from the established constraint. This analysis demonstrates that when the institutional constraint at is relaxed, the dose point is sizably degraded for most cases. On the other hand, when is met, the value at is also satisfactory.
Figure A.2.

(left) : large deviations at (red) compared to (blue); (right) EUD to PTV for all patients. Black stars mark the prescribed dose Ø.
3. Equivalent uniform dose
Equivalent uniform dose (EUD) is computed as , where is the i‐th partial volume receiving the dose and an exponent of for the tumor. (1) The EUD was computed for equal dose per fraction schedules based on the biologically effective dose method. (1) Figure A.2(right) shows the EUD to PTV for all patients. For almost all plans, the EUDs are near the prescribed dose, showing negligible variations.
Supporting information
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
REFERENCES
- 1. Das IJ, Cheng CW, Chopra KL, Mitra RK, Srivastava SP, Glatstein E. Intensity–modulated radiation therapy dose prescription, recording, and delivery: patterns of variability among institutions and treatment planning systems. J Nat Cancer Inst. 2008;100(5):300–07. [DOI] [PubMed] [Google Scholar]
- 2. Cazzaniga LF, Marinoni MA, Bossi A, et al. Interphysician variability in defining the planning target volume in the irradiation of prostate and seminal vesicles. Radiother Oncol. 1998;47(3):293–96. [DOI] [PubMed] [Google Scholar]
- 3. Fogliata A, Nicolini G, Alber M, et al. On the performances of different IMRT treatment planning systems for selected paediatric cases. Radiat Oncol. 2007;2(1):7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Eriguchi T, Takeda A, Oku Y, et al. Multi–institutional comparison of treatment planning using stereotactic ablative body radiotherapy for hepatocellular carcinoma–benchmark for a prospective multi–institutional study. Radiat Oncol. 2013;8(1):113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Cambria R, Cattani F, Jereczek‐Fossa BA, et al. Planning study to compare dynamic and rapid arc techniques for postprostatectomy radiotherapy of prostate cancer. Strahlenther Onkol. 2014;190(6):569–74. [DOI] [PubMed] [Google Scholar]
- 6. Dogan N, Mihaylov I, Wu Y, Keall PJ, Siebers JV, Hagan MP. Monte Carlo dose verification of prostate patients treated with simultaneous integrated boost intensity modulated radiation therapy. Radiat Oncol. 2009;4:18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Das IJ, Desrosiers CM, Srivastava SP, et al. Dosimetric variability in lung cancer IMRT/SBRT among institutions with identical treatment planning systems. Int J Radiat Oncol Biol Phys. 2008;72(1):S604–S605. [Google Scholar]
- 8. Das IJ, Moskvin V, Johnstone PA. Analysis of treatment planning time among systems and planners for intensity–modulated radiation therapy. J Am Coll Radiol. 2009;6(7):514–17. [DOI] [PubMed] [Google Scholar]
- 9. Nohadani O, Seco J, Martin B, Bortfeld T. Dosimetry robustness with stochastic optimization. Phys Med Biol. 2009;54:3421–32. [DOI] [PubMed] [Google Scholar]
- 10. Nelms BE, Robinson G, Markham J, et al. Variation in external beam treatment plan quality: an inter–institutional study of planners and planning systems. Pract Radiat Oncol. 2012;2(4):296–305. [DOI] [PubMed] [Google Scholar]
- 11. Lee N, Chuang C, Quivey JM, et al. Skin toxicity due to intensity‐modulated radiotherapy for head‐and‐neck carcinoma. Int J Radiat Oncol Biol Phys. 2002;53(3):630–37. [DOI] [PubMed] [Google Scholar]
- 12. Huber PJ. Robust statistics. Berlin: Springer; 2011. [Google Scholar]
- 13. Berstimas D, Teo KM, Nohadani O. Robust optimization for unconstrained simulation‐based problems. Operations Res. 2010;58(1):161–78. [Google Scholar]
- 14. Berstimas D, Nohadani O, Teo KM. Nonconvex robust optimization for problems with constraints. INFORMS J Comput. 2009;22(1):44–58. [Google Scholar]
- 15. Planning aims, prescription, and technical data. ICRU 83. J ICRU. 2010;10(1):55–59. doi:10.1093/jicru/ndq010 [DOI] [PubMed] [Google Scholar]
- 16. Cozzi L, Fogliata A, Bolsi A, Nicolini G, Bernier J. Three–dimensional conformal vs. intensity‐modulated radiotherapy in head–and–neck cancer patients: comparative analysis of dosimetric and technical parameters. Int J Radiat Oncol Biol Phys. 2004;58(2):617–24. [DOI] [PubMed] [Google Scholar]
- 17. Eisbruch A, Chao KSC, Garden AS. Phase I/II study of conformal and intensity modulated irradiation for oropharyngeal cancer. RTOG 0022. 2001.
- 18. Kirkpatrick JP, van der Kogel AJ, Schultheiss TE. Radiation dose–volume effects in the spinal cord. Int J Radiat Oncol Biol Phys. 2010;76(3 Suppl):S42–S49. [DOI] [PubMed] [Google Scholar]
- 19. Claus F, De Gersem W, De Wagter C, et al. An implementation strategy for IMRT of ethmoid sinus cancer with bilateral sparing of the optic pathways. Int J Radiat Oncol Biol Phys. 2001;51(2):318–31. [DOI] [PubMed] [Google Scholar]
- 20. Fiorino C, Dell'Oca I, Pierelli A, et al. Significant improvement in normal tissue sparing and target coverage for head and neck cancer by means of helical tomotherapy. Radiother Oncol. 2006;78(3):276–82. [DOI] [PubMed] [Google Scholar]
- 21. Kam MK, Chau R, Suen J, Choi PH, Teo PM. Intensity–modulated radiotherapy in nasopharyngeal carcinoma: dosimetric advantage over conventional plans and feasibility of dose escalation. Int J Radiat Oncol Biol Phys. 2003;56(1):145–57. [DOI] [PubMed] [Google Scholar]
- 22. Martel MK, Eisbruch A, Lawrence TS, Fraass BA, Ten Haken RK, Lichter AS. Spinal cord dose from standard head and neck irradiation: implications for three–dimensional treatment planning. Radiother Oncol. 1998;47(2):185–89. [DOI] [PubMed] [Google Scholar]
- 23. Verbakel WF, Cuijpers JP, Hoffmans D, Bieker M, Slotman BJ, Senan S. Volumetric intensity–modulated arc therapy vs. conventional IMRT in head–and–neck cancer: a comparative planning and dosimetric study. Int J Radiat Oncol Biol Phys. 2009;74(1):252–59. [DOI] [PubMed] [Google Scholar]
- 24. Grills IS, Yan D, Martinez AA, Vicini FA, Wong JW, Kestin HL. Potential for reduced toxicity and dose escalation in the treatment of inoperable non–small–cell lung cancer: a comparison of intensity–modulated radiation therapy (IMRT), 3D conformal radiation, and elective nodal irradiation. Int J Radiat Oncol Biol Phys. 2003;57(3):875–90. [DOI] [PubMed] [Google Scholar]
- 25. Pirzkall A, Debus J, Haering P, et al. Intensity modulated radiotherapy (IMRT) for recurrent, residual, or untreated skull–base meningiomas: preliminary clinical experience. Int J Radiat Oncol Biol Phys. 2003;55(2):362–72. [DOI] [PubMed] [Google Scholar]
- 26. Wu Q, Mohan R, Morris M, Lauve A, Schmidt‐Ullrich R. Simultaneous integrated boost intensity–modulated radiotherapy for locally advanced head–and–neck squamous cell carcinomas. 1: dosimetric results. Int J Radiat Oncol Biol Phys. 2003;56(2):573–85. [DOI] [PubMed] [Google Scholar]
- 27. Ahn PH, Chen CC, Ahn AI, et al. Adaptive planning in intensity–modulated radiation therapy for head and neck cancers: single–institution experience and clinical implications. Int J Radiat Oncol Biol Phys. 2011;80(3):677–85. [DOI] [PubMed] [Google Scholar]
- 28. Loveless A, Roy A, Das IJ, Nohadani O. On augmented DVH analysis. Med Phys. 2013;40(6):261–61. [Google Scholar]
- 29. Luo C, Yang CC, Narayan S, et al. Use of benchmark dose–volume histograms for selection of the optimal technique between three–dimensional conformal radiation therapy and intensity–modulated radiation therapy in prostate cancer. Int J Radiat Oncol Biol Phys. 2006;66(4):1253–62. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
