Abstract
Radiotherapy treatment planning currently requires many trial-and-error iterations between the planner and treatment planning system, as well as between the planner and physician for discussion/consultation. The physician’s preferences for a particular patient cannot be easily quantified and precisely conveyed to the planner. In this study we present a real-time volumetric Pareto surface dose generation deep learning neural network that can be used after segmentation by the physician, adding a tangible and quantifiable endpoint to portray to the planner. From 70 prostate patients, we first generated 84,000 intensity modulated radiation therapy plans (1,200 plans per patient) sampling the Pareto surface, representing various tradeoffs between the planning target volume (PTV) and the organs-at-risk (OAR), including bladder, rectum, left femur, right femur, and body. We divided the data to 10 test patients and 60 training/validation patients. We then trained a hierarchically densely connected convolutional U-net (HD U-net), to take the PTV and avoidance map representing OARs masks and weights, and predict the optimized plan. The HD U-net is capable of accurately predicting the 3D Pareto optimal dose distributions, with average [mean, max] dose errors of [3.4%, 7.7%](PTV), [1.6%, 5.6%](bladder), [3.7%, 4.2%](rectum), [3.2%, 8.0%](left femur), [2.9%, 7.7%](right femur), and [0.04%, 5.4%](body) of the prescription dose. The PTV dose coverage prediction was also very similar, with errors of 1.3% (D98) and 2.0% (D99). Homogeneity was also similar, differing by 0.06 on average. The neural network can predict the dose within 1.7 s. Clinically, the optimization and dose calculation is much slower, taking 5–10 min.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
- Radiation therapy treatment planning
- Intensity modulation
- Pareto surface
- Dose distribution
- Deep learning
- U-net
- Neural network
1 Introduction
Radiation therapy is one of the major cancer therapy modalities, accounting for two-thirds of cancer patients in the US, either standalone or in conjunction with surgery, chemotherapy, immunotherapy, etc. In the typical current treatment planning workflow, a treatment planner interacts with a commercial treatment planning system to solve an inverse optimization problem, either in an intensity modulated radiation therapy (IMRT) [1,2,3] or volumetric modulated arc therapy (VMAT) [4,5,6,7] setting. The planner manually tunes many hyperparameters, such as dose-volume constraints and weightings, to control the tradeoff between multiple clinical objectives. These hyperparameters are meticulously tuned in a time-consuming trial-and-error fashion to reach a suitable clinical solution. In addition, many rounds feedback from the physician is needed for the physician to discuss the plan quality with the planner and to properly portray their desired tradeoffs. This is largely due to the fact that the physician’s preferences for a particular patient cannot be fully quantified and precisely conveyed to the planner. This trial-and-error process results in hours of planning time, and the many iterations of physician feedback may extend the time to several days until the plan is accepted.
Recently, deep learning with multi-layered neural networks has exploded in progress, particularly in computer vision. We realize that these new developments can be utilized to solve aspects of the treatment planning problem. Specifically, deep learning can be utilized to quickly realize the physician’s preferences in a tangible and quantifiable manner that can be presented to the treatment planner prior to treatment planning. In this study we present a real-time Pareto surface dose generation deep learning neural network that can be used immediately after segmentation by the physician. Pareto optimal plans are the solutions to a multicriteria problem with various tradeoffs. In particular, the tradeoff lies with the dose coverage of the tumor and the dose sparing of the various critical structures. The benefit of such a model is two-fold. First, the physician can interact with the model to immediately view a dose distribution, and then adjust some parameters to push the dose towards their desired tradeoff in real time. This also allows for the physician to quickly comprehend the kinds of the tradeoffs that are feasible for the patient. Second, the treatment planner, upon receiving the physician’s desired dose distribution, can quickly generate a fully deliverable plan that matches this dose distribution, saving time in tuning the optimization hyperparameters and discussing with the physician. We developed, trained, and tested the feasibility of the model on prostate cancer patients planned with 7 beam IMRT.
2 Methods
2.1 Prostate Patient Data and Pareto Plan Generation
We acquired the anatomical data for 70 prostate patients, in terms of the segmentation of the planning target volume (PTV) and the organs-at-risk, including bladder, rectum, left femur, right femur, and body. Ring and skin structures were added as tuning structures. The patient contours and dose arrays were formatted into 192 × 192 × 64 arrays at 2.5 mm3 voxel size. We then calculated the dose influence arrays for these 70 patients, for a 7 equidistant coplanar beam plan IMRT, with 2.5 mm2 beamlets at 100 cm isocenter—a typical setup for prostate IMRT. Using this dose calculation data, we generated IMRT plans that sampled the Pareto surface, representing various tradeoffs between the PTV and OARs. The multicriteria objective can be written as
where \( x \) is the fluence map intensities to be optimized. There exists individual objectives, \( f_{s} \left( x \right) \forall s \in PTV,OAR \), for the PTV and each of the OARs. Typically, the objective function is designed such that the goal is to deliver the prescribed dose to the PTV, while minimizing the dose to each OAR. Due to the physical aspects of external beam radiation, it is impossible to give the PTV exactly the prescription dose without irradiating normal tissue. Thus, we arrive at a multicriteria objective, where there does not exist a single optimal \( x^{*} \) that would minimize all \( f_{s} \left( x \right) \forall s \in PTV,OAR \). For a proof of concept in this study, we choose to use the L2-norm to represent the objective, \( f_{s} \left( x \right) = \frac{1}{2}\left\| {A_{s} x - p_{s} } \right\|_{2}^{2} \). Here, \( A_{s} \) is the dose influence matrix for a given structure, and \( p_{s} \) is the desired dose for a given structure, assigned as the prescription dose if \( s \) is the PTV, and 0 otherwise. This allows for us to linearly scalarize [8] the multicriteria optimization problem into a single-objective, convex optimization problem,
The key to scalarizing the problem is the addition of \( w_{s} \), which are the tradeoff weights for each objective function, \( f_{s} \left( x \right) \forall s \in PTV,OAR \). With different values of \( w_{s} \), different Pareto optimal solutions are generated. Using an in-house GPU-based proximal-class first-order primal-dual algorithm, Chambolle-Pock [9], we generated 1,200 pseudo-random plans per patient, totaling to 84,000 plans.
The generation of each plan entailed assigning pseudo-random weights to the organs-at-risk. The weight for the PTV was kept at 1. The weight assignment fell into 1 of 6 categories as shown in Table 1. For each patient, 100 plans for each organ-at-risk used the single organ spare category (bladder, rectum, left femoral head, right femoral head, shell, skin), totaling to 600 single organ spare plans for each patient. To ensure a larger sampling of weights, another 100 plans of the high, medium, low, and extra low weights were generated, as well as 200 plans of the controlled weights category. The bounds for the controlled weights were chosen through trial and error such that the final plan generated had a high likelihood of being in acceptable clinical bounds for an inexperienced human operator, but not necessarily acceptable for an experienced physician. In total 1,200 plans were generated per patient. With 70 patients, the total number of plans generated was 84,000 plans.
2.2 Deep Learning Architecture
We utilized a volumetric Hierarchically Dense U-net (HD U-net) architecture [10], as shown in Fig. 1, which adds in the densely connected convolutional layers [11] into the U-net architecture [12]. The HD U-net was trained to take as input the PTV contour, the body contour, and an avoidance map representing OARs masks assigned their respective \( w_{s} \), and to predict the optimized 3D dose distribution.
Specific HD U-net architecture used in this study. Black numbers to the left of the model represents the current dimensions of the 3D data at each hierarchy. -The red numbers indicate the number of feature maps present at the current layer in the neural network. The large number of features maps are due to the densely connected convolutional layers. (Color figure online)
Specifically, our HD U-net architecture has 5 max pooling and 5 upsampling operations, ultimately reducing our image size from 192 × 192 × 64 voxels to 6 × 6 × 4 voxels (the lowest level max pooling/upsampling layer reduces/expands leaves the slice dimension untouched), and back to 192 × 192 × 64 voxels. Skip connections are added between the first half and second half of the network, to allow for the propagation of local information with the global information. Densely connected convolutional connections are added in each block of the network, allowing for efficient information flow of features. The non-linearity used after each convolution was the scaled exponential linear unit (SELU) as presented by Klambauer et al. for self-normalizing neural networks [13]. The study proved, using the Banach fixed-point theorem, that by having the SELU nonlinear activation, the neuron activations automatically converge towards zero mean and unit variance. Also, by the paper suggestion, we did not include batch normalization, as that disrupts the self-normalizing property of SELU-based networks. Since the densely connection convolutional layers allows for less trainable parameters to be used, instead of doubling the number of kernels after every max pooling, we increased number of kernels by 1.25 fold, to the nearest integer. We chose our final activation layer as the softplus activation, as our output data is non-negative and we had found that it is much more stable for training than linear and the rectified linear unit (ReLU) when using SELU as the hidden layer activation.
2.3 Training and Evaluation
We randomly divided the data to 10 test patients and 60 model development (training and validation) patients. The 10 test patients were held out during the entire model development phase, and only used during evaluation. Five instances of the model were trained and validated, using 54 training patients and 6 validation patients, according the schematic outlined in Fig. 2.
At each training iteration, first a patient is selected and then set of \( w_{s} \) is selected from one of the 1,200 plans. These set of weights are then converted into an avoidance map, which is a single channel of the input that represents the OAR masks assigned their corresponding \( w_{s} \). In addition, the binary mask of the PTV and body are included as input. The HD U-net then makes a prediction using these inputs. The optimized dose, that was generated using the dose influence array and Chambolle-Pock algorithm, is used to minimize against the predicted dose distribution with a mean squared error loss. Alternatively, a plan can be generated on the fly from a given set of \( w_{s} \), but is less efficient for training on a single GPU. During training the model was assessed on the validation data every 200 iterations of training. Each instance of the model used a different set of validation patients for determining which iteration the lowest validation score was obtained. Using all 1,200 plans per training patient—64,800 training plans total—we trained the model for 100,000 iterations using the Adam optimizer, with a learning rate of \( 1 \times 10^{ - 4} \), using an NVIDIA V100 GPU. The 10 test patients were then evaluated using the trained models.
To equally compare across patients, the test plans were first normalized such that the dose to 95% of the PTV (D95) was equal to the prescription dose. For evaluation criteria, the PTV coverage (D98, D99), PTV max dose (defined as D2 by the ICRU-83 report [14]), homogeneity \( \left( {\frac{D2 - D98}{D50}} \right) \), and the structure max and mean doses (Dmax and Dmean) were evaluated.
3 Results
The HD U-net is capable of accurately predicting the Pareto optimal 3D dose distributions, with average mean dose errors of 3.4% (PTV), 1.6% (bladder), 3.7% (rectum), 3.2% (left femur), 2.9% (right femur), and 0.04% (body) of the prescription dose, as compared to the optimized plans. In addition, the HD U-net maintains the average max dose error of 7.7% (PTV), 5.6% (bladder), 4.2% (rectum), 8.0% (left femoral head), 7.7% (right femoral head), and 5.4% (body) of the prescription dose. The PTV dose coverage prediction was also very similar, with errors of 1.3% (D98) and 2.0% (D99) of the prescription dose. On average, the PTV homogeneity between the optimized reference dose and the prediction differed by 0.06. Figure 3 shows the avoidance map, optimized dose and prediction, and Fig. 4 shows the dose volume histogram for a test patient.
It took approximately 15 days to train each instance of the model for 100,000 iterations. Figure 5 represents the mean training and validation loss for the HD U-net over the 100,000 iterations of training. The validation curve begins to flatten out at around 80,000 iterations while the training loss continues to decrease. The small standard deviation in validation loss between the model instances indicate the stability and reproducibility of the overall model framework and choice of hyperparameters.
Given any structure weights set and anatomy, the neural network is capable of predicting the dose distribution in 1.7 s. Clinically, the optimization and dose calculation for IMRT takes approximately 5–10 min to complete. This makes it feasible for the model to be used in a real-time setting with a human operator.
4 Discussion and Conclusion
While other deep learning models designed to learn and predict the dose distribution of a patient plans, based either on historical clinical data or optimized plans to meet standardized clinical criteria, were developed in recent years [10, 15,16,17,18,19,20], this Pareto dose distribution model, to our knowledge, is the first deep learning model to able to generate any optimized plan from just the anatomy and structure weights. Although the model does not generate the final plan in terms of deliverability, its real-time prediction capabilities allow for it to be used as a tool for the physician quickly generate a dose distribution with realistic tradeoffs between the PTV and various OARs. This can then be given to the planner as an endpoint, alongside the other typical planning information provided by the physician. The treatment planner now has a tangible, physician-preferred endpoint to meet, and the physician gets an initial understanding of what is physically achievable. To further improve the automation, we plan to implement a robust dose mimicking optimization, such as TORA [21], which will automatically generate a deliverable plan given a dose distribution or constraints. We expect that the implementation of such a framework would drastically reduce the number of feedback loops between the planner and physician, and potentially fully automate the treatment planning for simple cases. The valuable time that is saved would allow for the physician and planner to focus on more challenging cases to produce the best achievable plan.
References
Brahme, A.: Optimization of stationary and moving beam radiation therapy techniques. Radiother. Oncol. 12, 129–140 (1988)
Convery, D., Rosenbloom, M.: The generation of intensity-modulated fields for conformal radiotherapy by dynamic collimation. Phys. Med. Biol. 37, 1359 (1992)
Bortfeld, T.R., Kahler, D.L., Waldron, T.J., Boyer, A.L.: X-ray field compensation with multileaf collimators. Int. J. Radiation Oncol. Biol. Phys. 28, 723–730 (1994)
Yu, C.X.: Intensity-modulated arc therapy with dynamic multileaf collimation: an alternative to tomotherapy. Phys. Med. Biol. 40, 1435 (1995)
Crooks, S.M., Wu, X., Takita, C., Watzich, M., Xing, L.: Aperture modulated arc therapy. Phys. Med. Biol. 48, 1333 (2003)
Earl, M., Shepard, D., Naqvi, S., Li, X., Yu, C.: Inverse planning for intensity-modulated arc therapy using direct aperture optimization. Phys. Med. Biol. 48, 1075 (2003)
Otto, K.: Volumetric modulated arc therapy: IMRT in a single gantry arc. Med. Phys. 35, 310–317 (2008)
Jahn, J.: Scalarization in multi objective optimization. In: Serafini, P. (ed.) Mathematics of Multi Objective Optimization. ICMS, vol. 289, pp. 45–88. Springer, Vienna (1985). https://doi.org/10.1007/978-3-7091-2822-0_3
Chambolle, A., Pock, T.: A first-order primal-dual algorithm for convex problems with applications to imaging. J. Math. Imaging Vis. 40, 120–145 (2011)
Nguyen, D., et al.: 3D radiotherapy dose prediction on head and neck cancer patients with a hierarchically densely connected U-net deep learning architecture. Phys. Med. Biol. 64, 065020 (2019)
Huang, G., Liu, Z., van der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: 30th IEEE Conference on Computer Vision and Pattern Recognition, (CVPR 2017), vol. 1, pp. 2261–2269 (2017)
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Klambauer, G., Unterthiner, T., Mayr, A., Hochreiter, S.: Self-normalizing neural networks. In: Advances in Neural Information Processing Systems, pp. 971–980. (2017)
Grégoire, V., Mackie, T.R.: State of the art on dose prescription, reporting and recording in Intensity-Modulated Radiation Therapy (ICRU report No. 83). Cancer/Radiothérapie 15, 555–559 (2011)
Nguyen, D., et al.: A feasibility study for predicting optimal radiation therapy dose distributions of prostate cancer patients from patient anatomy using deep learning. Sci. Rep. 9, 1076 (2019)
Chen, X., Men, K., Li, Y., Yi, J., Dai, J.: A feasibility study on an automated method to generate patient-specific dose distributions for radiotherapy using deep learning. Med. Phys. 46, 56–64 (2019)
Fan, J., Wang, J., Chen, Z., Hu, C., Zhang, Z., Hu, W.: Automatic treatment planning based on three-dimensional dose distribution predicted from deep learning technique. Med. Phys. 46, 370–381 (2019)
Shiraishi, S., Moore, K.L.: Knowledge-based prediction of three-dimensional dose distributions for external beam radiotherapy. Med. Phys. 43, 378–387 (2016)
Mahmood, R., Babier, A., McNiven, A., Diamant, A., Chan, T.C.: Automated treatment planning in radiation therapy using generative adversarial networks. arXiv preprint arXiv:1807.06489 (2018)
Babier, A., Mahmood, R., McNiven, A.L., Diamant, A., Chan, T.C.: Knowledge-based automated planning with three-dimensional generative adversarial networks. arXiv preprint arXiv:1812.09309 (2018)
Long, T., Chen, M., Jiang, S.B., Lu, W.: Threshold-driven optimization for reference-based auto-planning. Phys. Med. Biol. 63, 04NT01 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Nguyen, D., Barkousaraie, A.S., Shen, C., Jia, X., Jiang, S. (2019). Generating Pareto Optimal Dose Distributions for Radiation Therapy Treatment Planning. In: Shen, D., et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2019. MICCAI 2019. Lecture Notes in Computer Science(), vol 11769. Springer, Cham. https://doi.org/10.1007/978-3-030-32226-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-32226-7_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32225-0
Online ISBN: 978-3-030-32226-7
eBook Packages: Computer ScienceComputer Science (R0)