Skip to main content
Medical Physics logoLink to Medical Physics
. 2018 Mar 1;45(4):1338–1350. doi: 10.1002/mp.12788

Integrated beam orientation and scanning‐spot optimization in intensity‐modulated proton therapy for brain and unilateral head and neck tumors

Wenbo Gu 1, Daniel O'Connor 1, Dan Nguyen 1,2, Victoria Y Yu 1, Dan Ruan 1, Lei Dong 3, Ke Sheng 1,
PMCID: PMC5904040  NIHMSID: NIHMS939122  PMID: 29394454

Abstract

Purpose

Intensity‐Modulated Proton Therapy (IMPT) is the state‐of‐the‐art method of delivering proton radiotherapy. Previous research has been mainly focused on optimization of scanning spots with manually selected beam angles. Due to the computational complexity, the potential benefit of simultaneously optimizing beam orientations and spot pattern could not be realized. In this study, we developed a novel integrated beam orientation optimization (BOO) and scanning‐spot optimization algorithm for intensity‐modulated proton therapy (IMPT).

Methods

A brain chordoma and three unilateral head‐and‐neck patients with a maximal target size of 112.49 cm3 were included in this study. A total number of 1162 noncoplanar candidate beams evenly distributed across 4π steradians were included in the optimization. For each candidate beam, the pencil‐beam doses of all scanning spots covering the PTV and a margin were calculated. The beam angle selection and spot intensity optimization problem was formulated to include three terms: a dose fidelity term to penalize the deviation of PTV and OAR doses from ideal dose distribution; an L1‐norm sparsity term to reduce the number of active spots and improve delivery efficiency; a group sparsity term to control the number of active beams between 2 and 4. For the group sparsity term, convex L2,1‐norm and nonconvex L2,1/2‐norm were tested. For the dose fidelity term, both quadratic function and linearized equivalent uniform dose (LEUD) cost function were implemented. The optimization problem was solved using the Fast Iterative Shrinkage‐Thresholding Algorithm (FISTA). The IMPT BOO method was tested on three head‐and‐neck patients and one skull base chordoma patient. The results were compared with IMPT plans created using column generation selected beams or manually selected beams.

Results

The L2,1‐norm plan selected spatially aggregated beams, indicating potential degeneracy using this norm. L2,1/2‐norm was able to select spatially separated beams and achieve smaller deviation from the ideal dose. In the L2,1/2‐norm plans, the [mean dose, maximum dose] of OAR were reduced by an average of [2.38%, 4.24%] and[2.32%, 3.76%] of the prescription dose for the quadratic and LEUD cost function, respectively, compared with the IMPT plan using manual beam selection while maintaining the same PTV coverage. The L2,1/2 group sparsity plans were dosimetrically superior to the column generation plans as well. Besides beam orientation selection, spot sparsification was observed. Generally, with the quadratic cost function, 30%~60% spots in the selected beams remained active. With the LEUD cost function, the percentages of active spots were in the range of 35%~85%.The BOO‐IMPT run time was approximately 20 min.

Conclusion

This work shows the first IMPT approach integrating noncoplanar BOO and scanning‐spot optimization in a single mathematical framework. This method is computationally efficient, dosimetrically superior and produces delivery‐friendly IMPT plans.

Keywords: Beam orientation optimization, integrated scanning spot optimization, proton therapy

1. Introduction

Intensity‐Modulated Proton Therapy (IMPT) is an advanced planning technique that fully takes advantage of the freedom of placing the characteristic proton Bragg peaks,1 which are referred to as spots in IMPT planning, in the 3D space. With spot scanning technique,2 the energy and intensity of the Bragg peaks of well‐defined narrow pencil beams can be modulated to generate a sophisticated dose distribution for tumor coverage and normal tissue sparing.3, 4 There are several variations of IMPT, including 2D modulation, distal edge tracking, 2.5D modulation, and 3D modulation.3 Among them, multifield optimization (MFO)‐IMPT3 utilizing all available degrees of freedom has the greatest versatility for normal organ sparing. This study focuses on MFO‐IMPT, which will be simply termed IMPT in the rest of paper.

A typical treatment planning process of IMPT involves three steps: first, a number of proton beams are manually selected by the dosimetrist. For complicated cases, multiple trials are needed to decide the number and orientations of beams; second, proton pencil‐beam dose calculation is performed for these beams; finally, inverse planning is carried out to obtain the intensity map and final dose distribution.4 Because of the unique proton beam physics, the need to further reduce body dose, the limited beam time, and to relieve the patient‐specific QA efforts, 2~4 proton fields are typically utilized in an IMPT session,4, 5 in contrast to the larger number of beams and arcs used in Intensity‐Modulated Photon Therapy (referred to as IMXT). Because of the small beam number, each beam of the IMPT plan heavily influences the final quality. The importance of beam orientation was highlighted in several studies.6, 7, 8 However, due to the vast search space, it is essentially impossible for a human operator to test all combinations and find the optimal set of beams. Therefore, there arises a need for beam orientation optimization (BOO) for IMPT.

For IMXT, BOO is viewed as a combinatorial problem, which by its nature is mathematically intractable for realistic BOO problems. The challenge is greater with the additional depth dimension in IMPT optimization. While methods for IMXT BOO have been developed using heuristic and stochastic algorithms to overcome the mathematical challenge,9, 10, 11, 12, 13, 14, 15 the BOO problem is rarely touched in IMPT. To our knowledge, the only work that attempts to explicitly attack the IMPT BOO problem was by Cao et al.,16 in which the coplanar BOO problem was treated as a combinatorial problem. To reduce the problem size, starting from a set of initial beams, a local search was performed to identify beams that improve dosimetric quality and robustness. This method is clearly limited in its ability of performing a global search in a large solution space. Noncoplanar space is commonly used in proton therapy because of the modern robotic patient positioning device typically installed in a proton treatment room.

In this paper, we present a novel framework to efficiently integrate BOO and scanning‐spot optimization that allows a global search in all feasible beams. The problem is formulated to include a dose fidelity term and hierarchical sparsity penalty terms, which reduce the number of active spots using an L1‐norm sparsity term, and control the number of active beams using a group sparsity term. Group sparsity, also known as structured sparsity, was originally used in optimizing x‐ray beam orientations and showed the potential of reducing the number of beams yet maintaining dense beamlets.17

In this work, different regularization and dose fidelity terms are introduced as well as a solver that is capable of handling a larger problem than the original IMXT BOO problem owing to the additional depth dimension.

2. Materials and methods

The group sparsity‐based integrated BOO and scanning‐spot optimization problem is formulated under two different dose fidelity terms, with either convex or nonconvex group sparsity term. The details are described as follows.

2.A. Problem formulation

The simultaneous beam orientation and scanning‐spot intensity optimization problem is formulated under the following general framework:

minimizexΓ(Ax)+ηx1+bBαbxb2p,subject tox0 (1)

where the optimization variable x is a vector of the intensities of all scanning spots. A is the dose‐calculation matrix that transforms the spot intensities x to dose. A single column of matrix A contains the vectorized doses delivered to the voxels in the patient from one unit intensity spot; and A includes columns of all the candidate spots from all candidate beams. B is the set including all candidate beams, xb is a vector of spot intensities for the candidate beam b (so x is the concatenation of the vectors xb), and αb is the regularization parameter for beam b. ΓAx is a dose fidelity term, to penalize dose deviation from prescriptions, x1 provides sparsity regularization on the scanning spots, to reduce the number of active spots and improve delivery efficiency, and η is the weighting parameter for spot sparsity. bBαbxb2p is the group sparsity term to control the number of active beams to between 2 and 4. The L2,p‐norm (0<p1) encourages most candidate beams to be zero, resulting in a small number of beams being selected. A common choice for the exponent for the group sparsity term is p=1, which makes the objective function convex. However, in the BOO problem, the adjacent block columns of the dose‐calculation matrix (corresponding to adjacent candidate beams) can be correlated in homogeneous areas. Subsequently, the group restricted isometry property may not be well satisfied and the L2,1‐norm may lead to a degenerate solution, i.e., aggregated beams. To avoid degeneracy, the nonconvex group sparsity with p=1/2 is also explored in this study.

We choose the weighting parameter αb on the individual beam b to be:

αb=cAPTVb12nbp/2, (2)

where APTVb is the dose‐calculation matrix of planning target volume (PTV) for beam b, nb is the number of candidate spots in beam b, and c is a regularization parameter. The term APTVb12 in the numerator is used to ensure that beams penetrating different depths in the patient are unbiasedly weighted. Without this term, the group sparsity has a tendency toward only selecting beams passing through less tissue. The denominator nb prevents the group sparsity penalty from having a bias against the beams with more spots. By this weighting method, we are able to tune a single parameter c to control the number of active beams in the solution to problem (1).

In this work, PTV is designated to be the target volume for dose optimization. Although the concept of PTV originally from x‐ray therapy is not exactly applicable in IMPT to maintain tumor coverage,18, 19 without losing generality, the term PTV here is used to indicate the target volume without additional implication of plan robustness.

2.B. Dose fidelity

The function Γ can take different forms based on the dosimetric goals and whether a compatible solver exists. Two different choices of Γ are implemented and compared in this work.

The first is the quadratic loss function, which is a common choice for dose fidelity. The overall cost is written as:

ΓAx=sPTVswsAsxls22+rOARswrArxmr+22 (3)

where ls is the prescription dose to sth PTV and mr is the prescribed maximal allowed dose to rth OAR. The OAR penalty terms utilize the one‐sided quadratic functionz+22, where z+=maxz,0. This component‐wise maximum allows us to consider only the voxels with doses larger than mr in the rth OAR. mr can also be set to 0 to penalize any nonzero dose in an OAR. The weights ws and wr are the structure‐specific weighting parameters to emphasize the different importance of different structures.

The quadratic penalty is mathematically desirable for being convex and differentiable. It heavily penalizes the dose volumes that exceed the constraint. However, in radiation therapy, more controls on the dose volume behavior are often desirable. For serial organs, such as the spinal cord, hot spots need to be avoided. However, for parallel structures, such as the parotid glands, the mean dose can be more important than the maximum dose. Therefore, the second type of function Γ we use is a linear combination of structure mean dose and maximum dose penalty, which is referred as the linearized equivalent uniform dose (LEUD)20 cost function in this work. The LEUD penalty function is formulated as:

ΓAx=sPTVswsNsAsxls2+rOARswrγrmeanArx+1γrmaxArx+IqAx (4)

where Ns is the number of voxels in sth PTV, and the weighting factor γr0,1 balances the mean dose and maximum dose for different organs. The penalty on PTV is a voxel‐normalized L2‐norm. The L2‐norm is chosen because it has better PTV coverage than the L1‐norm (which is meanAsxls), and is consistent with the OAR cost with regard to the order (quadratic term is in the order of 2, while mean dose is linear). Because the L1‐norm on OARs (mean dose) is not as sensitive to outliers, it allows for hot spots in parallel OARs. Therefore, in order to eliminate any dose higher than the upper bound dose q, an upper bound constraint IqAx is added to the objective function. Iqz is defined as:

Iq(z)=0ifzqotherwise (5)

2.C. Accelerated proximal gradient method

Equation (1) is difficult to solve using conventional gradient‐based methods because of the nondifferentiable group sparsity term. Other methods such as the interior point method are not able to handle the problem due to the large A matrix size. The alternating direction method of multipliers (ADMM) method used in the previous study17 required longer computation time when solving a much smaller problem (72 candidate beams with two‐dimensional beamlets). In this study, an accelerated proximal gradient method known as Fast Iterative Shrinkage‐Thresholding Algorithm (FISTA)21 is adopted to minimize the proposed objective function. FISTA is a fast algorithm that has an optimal convergence rate of O1/k2 among first‐order optimization methods.21 To solve an optimization problem using FISTA, the problem needs to be formulated in the form:

minimizexf(x)+g(x) (6)

where f is a smooth convex function, which is continuously differentiable with Lipschitz continuous gradient (f); g is a function which is possibly nonsmooth, but has a proximal operator that can be evaluated efficiently. The proximal operator with step size > 0 for function g is defined by:

proxtg(x)=argminyg(y)+12tyx22 (7)

Once the optimization problem is formulated as in Eq. (6) and the conditions for fx and gx are satisfied, FISTA is relatively straightforward to implement as it only involves elementary matrix‐vector arithmetic operations and inexpensive proximal operator evaluations. FISTA with line search is used in this work, which follows the steps shown in Table 1.

Table 1.

Pseudo code for FISTA with line search

FISTA with line search

Initialize x0:=0, v0:=x0, t0>0 , 0<r<1

for k=1,2, do

t:=tk1/r

Repeat

θ:=1ifk=1positive root oftk1θ2=tθk121θifk>1
y:=1θxk1+θvk1
x:=proxtgytfy

break if fxfy+<fy,xy>+12txy22

t:=rt
tk:=t
θk:=θ
xk:=x
vk:=xk+1θk(xxk)
End
Return x

2.D. Solving the BOO problem with FISTA

In the group sparsity‐based BOO problem, the objective function can be rewritten in the following format:

f(x)=ΓAxg(x)=ηx1+bBαbxb2p+I0(x) (8)

where I0x is an indicator function, defined as:

I0(x)=0ifx0otherwise (9)

For the quadratic fidelity formulation, the gradient of f is given by:

f(x)=ATΓAx=sPTVswsAsTAsxls+rOARswrArTArxmr+ (10)

For the LEUD fidelity formulation, the L2‐norm on PTV, max dose of OAR and upper bound indicator function are not differentiable. In order to solve this problem, Moreau‐Yosida regularization is used to smooth the nondifferentiable function. Moreau‐Yosida regularization for function f with parameter μ>0 is given by:

f(μ)(x)=infyf(y)+12μxy22 (11)

f(μ)x is a convex and differentiable approximation to fx, and its gradient can be calculated as:

f(μ)(x)=1μxproxμf(x) (12)

Therefore, the problem of calculating the gradient of the function f(μ) is transformed into calculating the proximal operator of function f. The gradient of the LEUD penalty after Moreau‐Yosida smoothing is:

f(x)=sPTVswsNsAsTAsxlsminwsNsAsxls2,1μ1+rOARswrγrNrArT1+wr1γrArTPwr1γrBArxμ2+ATmaxAxq,0/μ3 (13)

where μ1, μ2 and μ3 are positive Moreau‐Yosida regularization parameters for the L2‐norm, L ‐norm, and indicator function, respectively. Nr is the number of voxels in rth OAR. B is the L1 unit ball and P is the projection operator. See Appendix A for a detailed derivation. Note that applying Moreau‐Yosida smoothing to the group sparsity term would be unacceptable because in order to turn off beams, the beam intensities need to be exactly zero.

After obtaining the gradient of the function f, the next step is to derive a formula for the proximal operator of the function g. In the BOO problem, gx is a separable sum: gx=b=1Bgbxb, where

gbxb=ηxb1+bBαbxb2p+I0xb (14)

It follows from the separable sum rule for proximal operators that the problem evaluating the proximal operator of gx reduces to independently evaluating the proximal operators of the functions gbxb. To simplify notation, we derive an expression for the proximal operator of the function hx=ηx1+αx2p+I0x. Evaluating the proximal operator of h requires solving the optimization problem:

minimizexηx1+αx2p+12tx^x22subjecttox0 (15)

Due to the non‐negativity constraint, we have that x1=x,1, which implies that:

ηx,1+12tx^x22=x,1+12tx^x22=12txx^ηt122+12tx^22x^ηt122 (16)

Since the second term does not depend on x, the problem (15) is reduced to

minimizexx2p+12tαxx^ηt122subjecttox0 (17)

which is equivalent to evaluating the proximal operator of the function x2p+I0x at the point xηt1, with a step size tα. Accordingly, the proximal operator of function h is:

proxth(x)=proxαt·2pmaxxηt1,0 (18)

There is a known form of proximal operator for both L2,1‐norm and L2,1/2‐norm22:

proxt·2(x)=xx·mintx2,1proxt·212(x)=0,iftx21.5>269x23sin13arccos334tx21.5+π2,otherwise (19)

By deriving the gradient of function f and proximal operator of function g, the BOO problem is then readily solved using FISTA.

2.E. Evaluations

Four cases, which included three unilateral head‐and‐neck (H&N) patients and one skull base chordoma (CHDM) patient with simultaneous‐integrated boost (SIB), were evaluated in this study. We chose these cases with relatively smaller tumors to reduce the size of dose matrix that can be fit into a personal computer with 64 GB memory. The candidate beams included 1162 noncoplanar beams that were evenly distributed across the 4π steradians with 6° separation. For each candidate beam, the doses of all scanning spots covering the PTV and a 2.5 mm margin were calculated using matRad,23, 24 a MATLAB‐based 3D treatment planning toolkit. The dose‐calculation resolution was 2.5 mm with a cut‐off of 5 × 10−5 of the maximal dose. The prescriptions, PTVs, and number of scanning spots are shown in Table 2. The magnitude of the optimization problem can be estimated based on the product of the spot number per beam and the total number of candidate beams (1162). The average number of beamlets per beam needed in IMXT for the same patient is also given in Table 2 as a comparison of the BOO problem size in IMPT and in IMXT. The IMXT multileaf collimator resolution is 5 mm.

Table 2.

Prescription doses, PTV volumes, and average spots number per beam

Case Prescription dose (GyRBE) PTV volume (cc) Average spots number per beam Average IMXT beamlets number per beam
H&N #1 40 23.76 906 85
H&N #2 40 32.29 1109 103
H&N #3 66 33.64 1589 111
CHDM PTV6300 63 86.07 3166 241
PTV7400 74 26.42

A greedy BOO approach, column generation,25, 26 was also applied for each patient, as a comparison for our group sparsity method. The detail of column generation algorithm is provided in Appendix B. The dose fidelity term used in column generation method is LEUD cost.

The BOO plans were evaluated against plans with manually selected beams. The manual beam orientations were selected to avoid OARs as much as possible. In total, seven plans were generated for each patient: three plans with quadratic dose fidelity: manual plan (Quad‐MAN), L2,1/2‐group sparsity (Quad‐L2,1/2‐GS) and L2,1‐group sparsity(Quad‐L2,1‐GS); and four plans with LEUD dose fidelity: manual plan (LEUD‐MAN), L2,1/2‐group sparsity(LEUD‐L2,1/2‐GS), and L2,1‐group sparsity (LEUD‐L2,1‐GS), and column generation (LEUD‐CG). All H&N plans were normalized, so that 100% of the prescription dose covers 95% of the PTV volume. The CHDM plan with a simultaneous boost volume was normalized to have 100% of the 63 Gy prescription dose covering 95% of the PTV6300 volume.

For evaluation, PTV homogeneity, D95, D98, D99, maximum dose, and mean dose were evaluated. PTV homogeneity is defined as D95/D5. The maximum dose is defined as the dose to 2% of the structure volume, D2, following the recommendation by IRCU‐83.27 The mean and maximum doses for OARs were also evaluated.

As an example to preliminarily illustrate the effect of beam orientation on the robustness, a robustness analysis was performed to the CHDM plans to compare the robustness between MAN plans and L2,1/2‐GS plans. Nine scenarios were incorporated to evaluate the plan robustness, including one nominal scenario (no uncertainties) and eight worst‐case scenarios, which consisted of (a) setup uncertainties, by shifting the isocenter of the CT image of ±3 mm along anteroposterior, superoinferior, and lateral directions; (b) range uncertainties, by scaling the CT number by ±3%.

3. Results

The matRad‐based dose calculation for all 1162 candidate beams using an i7 6‐core CPU and Matlab parallel computing toolbox took 30 min to 1 hr depending on the size of the tumor. The dose matrix size ranged from 20 to 70 MB per beam based on target size. The group sparsity‐based BOO process took 2–7 min and 3–20 min for the quadratic and LEUD cost to complete, respectively.

Three beams for each H&N case and four beams for the CHDM case were selected. Table 3 shows the beam arrangement for each patient and each plan. The couch and gantry angles follow IEC 61217 coordinate conventions. Figure 1 shows the beam arrangement of each plan for the CHDM patient. The beams selected by using the L2,1/2‐GS term were spatially well separated. In comparison, the L2,1‐GS term resulted in aggregated beams with both quadratic and LEUD cost terms, indicating a potential degeneracy issue with this group sparsity term. And the CG method tended to choose beams with short pathlengths to target. Interestingly, the beam orientations optimized by L2,1/2‐GS were similar to the actual angles selected by an experienced dosimetrist in this CHDM case.

Table 3.

Beam angles (gantry and couch angle) selected in each plan

Case MAN Quad LEUD
L2,1/2 L2,1 L2,1/2 L2,1 CG
H&N#1

(40, 0)

(180, 0)

(300, 0)

(38, 20)

(159, 35)

(282, 0)

(102, 272)

(39, 290)

(316, 18)

(78, 304)

(300, 0)

(315, 26)

(110, 328)

(130, 16)

(315, 26)

(89, 282)

(132, 8)

(73, 19)

H&N#2

(180, 0)

(45, 45)

(315, 315)

(145, 350)

(60, 28)

(315, 315)

(110, 282)

(225, 71)

(338, 35)

(134, 293)

(96, 6)

(320, 322)

(316, 46)

(148, 23)

(315, 315)

(296, 74)

(159, 35)

(37, 57)

H&N#3

(270, 0)

(90, 275)

(90, 0)

(36, 0)

(300, 346)

(114, 13)

(171, 315)

(342, 270)

(315, 314)

(36, 0)

(294, 354)

(113, 13)

(125, 345)

(54, 30)

(315, 314)

(57, 298)

(329, 53)

(21, 325)

CHDM

(60, 275)

(270, 0)

(90, 0)

(180, 0)

(100, 280)

(130, 16)

(265, 350)

(330, 290)

(105, 335)

(275, 12)

(240, 140)

(142, 20)

(60, 290)

(90, 354)

(270, 348)

(207, 332)

(84, 312)

(81, 317)

(210, 12)

(62, 20)

(54, 270)

(56, 323)

(338, 57)

(281, 341)

Figure 1.

Figure 1

The beam arrangement of each plan for the CHDM patient.

Besides beam orientation selection, spot sparsification was achieved. Table 4 shows the percentage of active scanning spots. Generally, with the quadratic cost function, 30%~60% spots in the selected beams remained active. With the LEUD cost function, the percentages of active spots were within 35%~85%.

Table 4.

The ratio of number of active spots (number before “/”) over the number of total candidate spots (number after “/”) in the selected beams

Cost Plan H&N #1 H&N #2 H&N #3 CHDM
Quad L2,1/2 967/2756 = 35.1% 912/3377 = 27.0% 2521/4759 = 53.0% 6532/12582 = 51.9%
L2,1 1282/2646 = 48.5% 926/3238 = 28.6% 2532/4574 = 55.4% 6130/12618 = 48.58%
MAN 1222/2725 = 44.8% 1028/3366 = 30.5% 2777/4835 = 57.4% 7072/12603 = 56.1%
LEUD L2,1/2 1401/2741 = 51.1% 1212/3396 = 35.7% 3295/4760 = 69.2% 10596/12648 = 83.8%
L2,1 1920/2773 = 69.2% 1315/2989 = 44.0% 2734/4582 = 60.0% 10689/12568 = 85.1%
MAN 1545/2725 = 56.7% 1469/3366 = 43.6% 4116/4835 = 85.1% 10634/12603 = 84.3%

Figure 2 shows the DVHs comparison between GS plans and MAN plan for each patient. Tables 5 and 6 show the PTV and OAR statistics for each patient with quadratic cost function, respectively. Tables 7 and 8 show the PTV and OAR statistics for each patient with LEUD cost function, respectively. On average, the L2,1/2‐GS plans reduced the OAR [mean dose, maximum dose] from MAN plans by [2.38%, 4.24%] and[2.32%, 3.76%] of the prescription dose for Quad and LEUD cost, respectively, while achieving comparable target coverage.

Figure 2.

Figure 2

DVH comparison of the MAN plan (solid line), L2,1/2‐GS plan (dotted line) and L2,1‐GS plan (dashed line) for each patient. The plans with quadratic cost are listed on the left column and the plans with LEUD cost are on the right column.

Table 5.

PTV statistics for all patients under Quad cost

Case Homogeneity D95 D98 D99 Dmax D95 D98 D99 Dmax
MAN L2,1/2 L2,1 L2,1/2 – MAN (GyRBE) L2,1 – MAN (GyRBE)
H&N#1 0.972 0.979 0.969 −0.003 −0.245 −0.542 −0.298 +0.017 −0.074 −0.180 +0.214
H&N#2 0.972 0.974 0.970 +0.018 +0.159 +0.313 −0.246 +0.017 +0.077 +0.149 +0.053
H&N#3 0.981 0.981 0.972 −0.006 −0.061 −0.173 +0.069 +0.013 −0.207 −0.368 +0.723
CHDM PTV6300 0.918 0.916 0.917 −0.003 −0.030 −0.060 +0.126 −0.012 −0.012 −0.050 −0.055
PTV7400 0.953 0.953 0.953 −0.012 −0.012 −0.050 −0.055 −0.028 −0.058 −0.042 −0.054

Table 6.

OAR statistics for all patients under Quad cost

Case L2,1/2 ‐ MAN (GyRBE) L2,1 ‐ MAN (GyRBE)
Dmean Dmax Dmean Dmax
Largest value Average value Largest value Average value Largest value Average value Largest value Average value
H&N#1 −1.24 L Parotid −0.52 −8.05 Brainstem −2.78 −3.08 L Parotid −0.53 −10.38 L Parotid −2.55
H&N#2 −3.75 Carotid −1.03 −2.13 Pharynx −1.04 −0.23 Larynx +0.78 −1.45 Larynx +0.88
H&N#3 −20.14 TMJ −2.96 −34.60 R Opt Nrv −3.86 −26.25 TMJ +1.75 −35.11 TMJ +3.72
CHDM −6.99 L Cochlea −0.72 −16.40 L Cochlea −0.99 −4.49 L Cochlea +0.36 −11.20 L Cochlea −1.16

Table 7.

PTV statistics for all patients under LEUD cost

Case Homogeneity D95 D98 D99 Dmax D95 D98 D99 Dmax
MAN L2,1/2 L2,1 L2,1/2 – MAN (GyRBE) L2,1 – MAN (GyRBE)
H&N#1 0.945 0.961 0.964 −0.010 +0.190 +0.291 −0.714 −0.021 +0.006 −0.227 −0.827
H&N#2 0.966 0.972 0.958 +0.009 −0.200 −0.276 −0.207 +0.013 −0.237 −0.526 +0.731
H&N#3 0.967 0.967 0.965 −0.002 −0.181 +0.069 +0.057 +0.007 −0.333 −0.288 +0.200
CHDM PTV6300 0.885 0.886 0.881 +0.002 −0.037 +0.022 −0.058 +0.002 −0.198 −0.304 +0.242
PTV7400 0.951 0.947 0.945 −0.352 −0.748 −0.957 −0.005 −0.264 −0.680 −1.044 +0.232

Table 8.

OAR statistics for all patients under LEUD cost

Cases L2,1/2 – MAN (GyRBE) L2,1 – MAN (GyRBE)
Dmean Dmax Dmean Dmax
Largest value Average value Largest value Average value Largest value Average value Largest value Average value
H&N#1 −1.49 L Parotid −1.00 −6.75 Pharynx −2.87 −0.98 Pharynx −0.03 −8.96 Pharynx −1.93
H&N#2 −0.76 Carotid −0.270 −3.75 Larynx −1.11 0.04 Mandible +2.22 0.97 Mandible +4.08
H&N#3 −16.15 TMJ −2.51 −28.83 R Opt Nrv −2.29 −19.52 TMJ +0.76 −21.0 R Opt Nrv +1.03
CHDM −8.78 R Cochlea −1.44 −10.99 R Cochlea −1.01 −8.89 R Opt Nrv −1.27 −9.59 R cochlea −1.27

The L2,1‐GS method produced competitive plan for H&N#1 and CHDM case, but led to worse and OAR dose over MAN plan for H&N#2 and H&N#3 case with similar PTV coverage. The average decrease in OAR [mean dose, maximum dose] from MAN plans are [−0.96%, 0.09%] and [−1.15%, 1.23%] of the prescription dose for Quad and LEUD cost, respectively.

The limitation of using L2,1‐norm can be observed from the final value of dose fidelity cost. Figure 3 compares the convergence between L2,1/2‐norm and L2,1‐norm group sparsity for the H&N#3 case. For the quadratic and LEUD cost functions, the L2,1/2‐norm group sparsity method converged after about 800 iterations and 1800 iterations, respectively. In contrast, with an L2,1‐norm group sparsity term, the problem quickly converged within 30 iterations and 80 iterations under the quadratic and LEUD cost functions, respectively. However, the converged dose fidelity function values using the L2,1‐norm are substantially greater than the corresponding values of using the L2,1/2‐norm (~ 105 for L2,1 vs. ~102 for L2,1/2), showing a larger deviation from the prescription dose using L2,1‐norm in this case.

Figure 3.

Figure 3

The convergence comparison between L2,1/2‐norm (solid) and L2,1‐norm (dotted) group sparsity for case H&N#3. Left is Quad‐GS and right is LEUD‐GS. The blue curve shows the value of entire cost function and the red curve shows the value of dose fidelity term.

The DVHs comparison between L2,1/2‐GS plans and CG plans is shown in Fig. 4. In all H&N cases, the CG method produced plans with similar PTV coverage as L2,1/2‐GS method, but the OAR dosimetry was inferior to the group sparsity algorithm. In the CHDM case, the CG plan performed comparably with the L2,1/2‐GS plan with regard to PTV and OAR dose. The CG methods reduced the mean dose to brainstem by 9.8 Gy over L2,1/2‐GS plan by forcing all beams entering from the anterior direction, increasing the risk of exposing the eyes to high dose with slight positioning error.

Figure 4.

Figure 4

DVH comparison between L2,1/2‐GS plan (solid) and the CG plan (dotted) for each patient.

A robustness analysis for CHDM patient is shown in Fig. 5.The solid lines in each plot are the DVHs for the nominal case, and the bands bound the worst‐case distributions. The narrower the bands are, the less sensitive the plan is to uncertainties.19 The spread of the target DVHs at 95% coverage under given uncertainties is also measured. Under Quad cost, the range of spread for [PTV6300, PTV 7400] is [91.4% ~101.7%, 90.0%~99.9%] and [91.9% ~101.6%, 90.8%~100.0%] of the prescription dose for MAN and L2,1/2‐GS plan, respectively. Under LEUD cost, the range of spread for [PTV6300, PTV 7400] is [92.4% ~103.3%, 92.7%~100.6%] and [92.7% ~102.2%, 92.3%~101.6%] of the prescription dose for MAN and L2,1/2‐GS plan, respectively. The target sensitivity to uncertainties between MAN plans and BOO plans is comparable, and the MAN plans have greater uncertainties in left optic nerve, while Quad L2,1/2‐GS plan introduces larger uncertainties to left cochlea.

Figure 5.

Figure 5

DVH bands of CHDM patient including all setup and range uncertainty distributions. The top rows are in Quad cost and bottom row in LEUD cost; the left columns are MAN plans and right column are L2,1/2‐GS plans.

4. Discussion

This work introduces a group sparsity‐based IMPT optimization method that simultaneously selects beams and optimizes the scanning‐spot intensity. In addition to the superior dosimetry compared with plans using manually selected beams, the planning method using optimized beams reduces the dependence on the individual operators who select beams based on experience and intuition.

Compared to the earlier coplanar x‐ray optimization study using group sparsity,17 the current work expands its scope in several ways. This is a new application to the IMPT problem, which is intrinsically a higher dimensional optimization problem than IMXT due to the additional modulation in the depth direction. The number of candidate beams in this study is an order of magnitude larger than in the original IMXT BOO study that used only 72 candidate beams. The optimization solver used in the original study was based on ADMM and is not well suited to the current much larger IMPT BOO problem, because the linear systems that ADMM requires to be solved at each iteration would be intractably large. Therefore, we developed an approach based on FISTA, which requires only matrix‐vector multiplications involving the dose‐calculation matrix. To enable the use of FISTA, we derived several key proximal operators. FISTA has been shown to have an optimal convergence rate of O1/k2 among first‐order methods,21 and the numerical results demonstrate that FISTA is able to solve the large‐scale optimization problem in a clinically acceptable time. Although standard convergence results for FISTA assume that both f and g are convex,21 we have found that FISTA converged to a good solution even in the case of using the nonconvex L2,1/2‐norm group sparsity penalty.

In this study, two different group sparsity terms, L2,1 and L2,1/2 norms, were compared for the IMPT BOO problem. Although the L2,1‐norm is convex and offers certain computational advantages, it leads to suboptimal dosimetry and tends to converge to a worse fidelity value when compared against the nonconvex L2,1/2‐norm group sparsity term. Therefore, in this case, the ability to avoid degeneracy and select spatially separate beams appears to be dosimetrically advantageous. This finding shows that while the group sparsity method is well suited to solve the BOO problem, the selection of the sparsity function is a subtle point that can make a critical difference.

The group sparsity beam orientation optimization method was tested on two different dose fidelity functions: quadratic and LEUD terms. Compared to the quadratic dose fidelity term, the LEUD cost function with a combination of mean and maximum dose constraints is easier to tune to achieve desired DVHs. This is reflected in the process of creating plans for the four cases in this study: the quadratic L2,1/2‐GS method took on average 9–10 rounds of parameter tuning and the LEUD method took 4–5 rounds. The advantage of quadratic dose fidelity function is that while the structure parameters need to be tuned, the group sparsity regularization weighting parameter remains constant for a specific number of beams. In comparison, with LEUD dose fidelity, the group sparsity regularization weight is sensitive to the structure weighting parameter changes, requiring additional adjustment to maintain the desired number of beams. Quadratic dose fidelity also resulted in on average more sparse scanning spots.

The group sparsity BOO method was also compared against the greedy column generation BOO algorithm. The results show the GS method produces comparable or superior plans over CG. Specifically, the CG method tends to select aggregated beams with short pathlengths to the target such as the anterior beams in the CHDM case. The result indicates degeneracy that did not present in the IMXT BOO solution possibly due to the substantial difference in dose distribution between the first proton beam and photon beam. Additional heuristics such as minimal separating between selected beams may be enforced to ameliorate the problem.

The results show that nonconvex L2,1/2‐GS method creates more dosimetrically desirable plans. However, robustness is another important consideration in proton planning. In this manuscript, we provide a general and mathematically tractable framework for beam orientation optimization without explicitly considering robustness. The framework will allow us to more quantitatively describe the robustness by adding a regularization term. In a straightforward fashion, the uncertainties can be included as the stopping power (or density for further simplification) variation. Penalizing this term will push the beams toward more homogeneous paths. More sophisticated regularization terms can be constructed as dosimetric endpoints for a given shift or range uncertainty. That being said, the robustness and new regularization terms deserve a separate study. Without complicating the current manuscript too much, we performed a simple robustness analysis on one of the tested patients by simulating shifts and range uncertainties and the results showed no substantial difference between BOO plans and plans using manually selected beams.

The dose matrix size is determined by the target, which is limited to be approximately 110 cm3 to fit the calculation into a desktop with 64GB memory. For more general IMPT cases with larger tumor targets, either workstations with substantially larger memory, or methods to intelligently reduce the dose matrix size without impacting plan quality are needed. These methods include (a) nonuniform sampling resolution with a higher resolution in the target and nearby organs, and lower resolution in the volumes that are considered less critical and faraway from the target, (b) clustering of the proton pencil‐beam dose matrices, and (c) using heuristics to reduce the number of candidate beams. We will investigate these directions for more general integrated BOO and scanning‐spot optimization problems.

5. Conclusions

This work shows the first IMPT planning approach that integrates noncoplanar beam orientation and scanning‐spot optimization in a single mathematical framework, which was further formulated to have a computationally efficient solution despite its large problem size. This method resulted in dosimetrically competitive plans compared with the manual planning method for a brain and three unilateral head‐and‐neck cases and is less operator‐dependent.

Acknowledgment

This research is supported by DOE Grants No. DE‐SC0017057 and DE‐SC0017687, NIH Grants Nos. R44CA183390, R01CA188300, and R43CA183390.

Appendix A.

Calculate the Gradient of LEUD Cost

Equation (4) can also be written as the following format:

ΓAx=sPTVswsNsAsxls2+rOARswrγrNr1TArx+wr1γrArx+IqAx (A1)

where Nr is the number of voxels in rth OAR.

Therefore, function Γz can be formed as the summation of four subfunctions:

Γz=sPTVsf1zs+rOARsf2zr+f3zr+f4zf1z=wNzl2f2z=wγN1Tzf3z=w1γzf4z=Iqz (A2)

f2z is differentiable and the gradient is:

f2z=wγN1 (A3)

Moreau‐Yosida regularization is applied to f1, f3 and f4. The proximal operators for these functions are:

proxμf1z=zzl·minμw/Nzp2,1proxμf3z=zPw1γμBzproxμf4z=minz,q (A4)

where B is the L1 unit ball, P is the projection operator.

Then based on Eq. (12), the gradients of f1, f3 and f4 after Moreau‐Yosida smoothing are:

f1(μ)x=zlminμw/Nzl2,1/μf3(μ)x=Pw1γBzμf4(μ)x=maxzq,0/μ (A5)

Hence, the gradient of LEUD cost after approximation is written as:

f(μ)x=ATΓAx=sPTVswsNsAsTAsxlsminwsNAsxls2,1μ1+rOARswrγrNrArT1+ArTwr1γrPwr1γrArxμ2+ATmaxAxq,0/μ3 (A6)

where μ1, μ2, and μ3 are positive Moreau‐Yosida regularization parameters for L2‐norm, L ‐norm, and indicator function, respectively.

Appendix B.

Column Generation‐based BOO

First, we define the master problem, which includes the whole candidate beam set B, as:

minimized,xGdsubjecttod=bBAbxbxb0,forbB

where

G(d)=sPTVswsNsdsls2+rOARswrγrmeandr+1γrmaxdr+Iqd (B1)

The variables have the same meaning as 2.1. In addition, here Ab is the dose‐calculation matrix for beam b. It is obvious that the objective function is equivalent to the LEUB dose fidelity term in Eq. (4).

The master problem is designed to optimize the intensities of the scanning spots from all beams. The goal for BAO is to select a small number of beams. The beam not selected should have zero spot intensity. Then, the subproblem with selected beam set Bselect is defined as:

minimized,xGdsubjecttod=bBselectAbxbxb0,forbBselect (B2)

The Lagrange function for the master problem is:

L(x,d,u,v)=Gd+u(dbBAbxb)bBvbxb (B3)

where u, v are Lagrange multipliers associated with the constrains in the master problem.

The Karush‐Kuhn‐Tucker (KKT) conditions28 are evaluated:

d=Ax,dq,xb0,forbBvb0u=G(d)vb=AbTu,forbB (B4)

Using the same smoothing approach as Section 2.D and FISTA, the subproblem can be solved to obtain optimal primal variable xb,bBselect, and optimal dual variables u, and vb,bBselect. Then, all vb,bB\Bselect are calculated by vb=AbTu. The vb in the selected set meets the non‐negative condition, but the wb in the rest set may be negative. In B\Bselect, the negative elements in vector wb are summed to produce a single value to evaluate this beam b. The beam with the most negative sum is supposed to help to optimize the master problem most, so this beam is added to Bselect and a new subproblem is formed and solved again. By this method, we can start from an empty selected set, add one beam per iteration, reach the desired number of beam, and then exit the iteration.

References

  • 1. Wilson RR. Radiological use of fast protons. Radiology. 1946;47:487–491. [DOI] [PubMed] [Google Scholar]
  • 2. Kanai T, Kawachi K, Kumamoto Y, et al. Spot scanning system for proton radiotherapy. Med Phys. 1980;7:365–369. [DOI] [PubMed] [Google Scholar]
  • 3. Lomax AJ. Intensity modulation methods for proton radiotherapy. Phys Med Biol. 1999;44:185–205. [DOI] [PubMed] [Google Scholar]
  • 4. Lomax AJ, Böhringer T, Bolsi A, et al. Treatment planning and verification of proton therapy using spot scanning: initial experiences. Med Phys. 2004;31:3150–3157. [DOI] [PubMed] [Google Scholar]
  • 5. Steneker M, Lomax A, Schneider U. Intensity modulated photon and proton therapy for the treatment of head and neck tumors. Radiother Oncol. 2006;80:263–267. [DOI] [PubMed] [Google Scholar]
  • 6. Jäkel O, Debus J. Selection of beam angles for radiotherapy of skull base tumours using charged particles. Phys Med Biol. 2000;45:1229–1241. [DOI] [PubMed] [Google Scholar]
  • 7. Trofimov A, Nguyen PL, Coen JJ, et al. Radiotherapy treatment of early‐stage prostate cancer with IMRT and protons: a treatment planning comparison. Int J Radiat Oncol Biol Phys. 2007;69:444–453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Kase Y, Yamashita H, Fuji H, et al. A treatment planning comparison of passive‐scattering and intensity‐modulated proton therapy for typical tumor sites. J Radiat Res. 2012;53:272–280. [DOI] [PubMed] [Google Scholar]
  • 9. Bortfeld T, Schlegel W. Optimization of beam orientations in radiation therapy: some theoretical considerations. Phys Med Biol. 1993;38:291–304. [DOI] [PubMed] [Google Scholar]
  • 10. Li Y, Yao J, Yao D. Automatic beam angle selection in IMRT planning using genetic algorithm. Phys Med Biol. 2004;49:1915–1932. [DOI] [PubMed] [Google Scholar]
  • 11. Pugachev A, Li JG, Boyer AL, et al. Role of beam orientation optimization in intensity‐modulated radiation therapy. Int J Radiat Oncol Biol Phys. 2001;50:551–560. [DOI] [PubMed] [Google Scholar]
  • 12. Li Y, Yao D, Yao J, Chen W. A particle swarm optimization algorithm for beam angle selection in intensity‐modulated radiotherapy planning. Phys Med Biol. 2005;50:3491–3514. [DOI] [PubMed] [Google Scholar]
  • 13. Djajaputra D, Wu Q, Wu Y, Mohan R. Algorithm and performance of a clinical IMRT beam‐angle optimization system. Phys Med Biol. 2003;48:3191–3212. [DOI] [PubMed] [Google Scholar]
  • 14. Wang X, Zhang X, Dong L, Liu H, Wu Q, Mohan R. Development of methods for beam angle optimization for IMRT using an accelerated exhaustive search strategy. Int J Radiat Oncol Biol Phys. 2004;60:1325–1337. [DOI] [PubMed] [Google Scholar]
  • 15. Dias J, Rocha H, Ferreira B, Lopes MC. IMRT beam angle optimization using dynamically dimensioned search. IFMBE Proceedings. 2014;42:1–4. [Google Scholar]
  • 16. Cao W, Lim GJ, Lee A, et al. Uncertainty incorporated beam angle optimization for IMPT treatment planning. Med Phys. 2012;39:5248–5256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Jia X, Men C, Lou Y, Jiang SB. Beam orientation optimization for intensity modulated radiation therapy using adaptive l(2,1)‐minimization. Phys Med Biol. 2011;56:6205–6222. [DOI] [PubMed] [Google Scholar]
  • 18. Albertini F, Hug EB, Lomax AJ. Is it necessary to plan with safety margins for actively scanned proton therapy? Phys Med Biol. 2011;56:4399–4413. [DOI] [PubMed] [Google Scholar]
  • 19. Liu W, Zhang X, Li Y, Mohan R. Robust optimization of intensity modulated proton therapy. Med Phys. 2012;39:1079–1091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Thieke C, Bortfeld T, Küfer K‐H. Characterization of dose distributions through the max and mean dose concept. Acta Oncol. 2002;41:158–161. [DOI] [PubMed] [Google Scholar]
  • 21. Beck A, Teboulle M. A Fast Iterative Shrinkage‐Thresholding Algorithm for Linear Inverse Problems. SIAM J Imaging Sci. 2009;2:183–202. [Google Scholar]
  • 22. Moellenhoff T, Strekalovskiy E, Möller M, Cremers D. Low rank priors for color image regularization. Submitted. 2014:1–14.
  • 23. Cisternas E, Mairani A, Ziegenhein P, Jäkel O, Bangert M. MatRad – a multi‐modality open source 3D treatment planning toolkit. IFMBE Proceedings. 2015;51:1608–1611. [Google Scholar]
  • 24. Wieser HP, Cisternas E, Wahl N, et al. Development of the open‐source dose calculation and optimization toolkit matRad. Med Phys. 2017;44:2556–2568. [DOI] [PubMed] [Google Scholar]
  • 25. Romeijn HE, Ahuja RK, Dempsey JF, Kumar A. A column generation approach to radiation therapy treatment planning using aperture modulation. SIAM J Optim. 2005;15:838–862. [Google Scholar]
  • 26. Nguyen D, Thomas D, Cao M, O'Connor D, Lamb J, Sheng K. Computerized triplet beam orientation optimization for MRI‐guided Co‐60 radiotherapy. Med Phys. 2016;43:5667–5675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Grégoire V, Mackie TR. State of the art on dose prescription, reporting and recording in intensity‐modulated radiation therapy (ICRU report No. 83). Cancer/Radiotherapie. 2011;7:555–559. [DOI] [PubMed] [Google Scholar]
  • 28. Rockafellar RT. Convex Analysis. Princeton, NJ: Princeton University Press; 1970. [Google Scholar]

Articles from Medical Physics are provided here courtesy of American Association of Physicists in Medicine

RESOURCES