Improved Outcome Models with Denoising Diffusion

D Dudas; T J Dilling; I El Naqa

doi:10.1016/j.ejmp.2024.103307

. Author manuscript; available in PMC: 2025 Mar 1.

Published in final edited form as: Phys Med. 2024 Feb 6;119:103307. doi: 10.1016/j.ejmp.2024.103307

Improved Outcome Models with Denoising Diffusion

D Dudas ^1,³, T J Dilling ², I El Naqa ^1,²

PMCID: PMC10939775 NIHMSID: NIHMS1964973 PMID: 38325221

Abstract

Purpose:

Radiotherapy outcome modelling often suffers from class imbalance in the modelled endpoints. One of the main options to address this issue is by introducing new synthetically generated datapoints, using generative models, such as Denoising Diffusion Probabilistic Models (DDPM). In this study, we implemented DDPM to improve performance of a tumor local control model, trained on imbalanced dataset, and compare this approach with other common techniques.

Methods:

A dataset of 535 NSCLC patients treated with SBRT (50 Gy/5 fractions) was used to train a deep learning outcome model for tumor local control prediction. The dataset included complete treatment planning data (planning CT images, 3D planning dose distribution and patient demographics) with sparsely distributed endpoints (6–7 % experiencing local failure). Consequently, we trained a novel conditional 3D DDPM model to generate synthetic treatment planning data. Synthetically generated treatment planning datapoints were used to supplement the real training dataset and the improvement in the model’s performance was studied. Obtained results were also compared to other common techniques for class imbalanced training, such as Oversampling, Undersampling, Augmentation, Class Weights, SMOTE and ADASYN.

Results:

Synthetic DDPM-generated data were visually trustworthy, with Fréchet inception distance (FID) below 50. Extending the training dataset with the synthetic data improved the model’s performance by more than 10%, while other techniques exhibited only about 4% improvement.

Conclusions:

DDPM introduces a novel approach to class-imbalanced outcome modelling problems. The model generates realistic synthetic radiotherapy planning data, with a strong potential to increase performance and robustness of outcome models.

Keywords: Outcome modelling, lung cancer, event imbalance Deep Learning, Denoising Diffusion Probabilistic Models

1. Introduction

Over the last few years, machine learning models have dominated in the area of treatment outcome modeling thanks to their superior performance, interpretation, and possibility to combine a variety of input data (e.g., imaging data, treatment planning data, multi-omics data, patient demographics) [1–7]. Unfortunately, many modeled endpoints (e.g., local control, regional/distant recurrence or radiation toxicities) are often distributed sparsely, with resultant class imbalance in the dataset [8–12].

There are various techniques to overcome the class imbalance problem. Traditional approaches may include Undersampling, Oversampling, Augmentation, Class Weights, Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling Approach (ADASYN) and others [13–23]. More advanced techniques utilize a combination of traditional approaches with certain deep learning architectures, such as deepSMOTE [24] or SMOTified-GAN[25].

In recent years, there has been a rapid growth of denoising diffusion probabilistic models (DDPM). Since its introduction by Ho, et al. in 2020 [26], it has been applied in various applications, including generation of synthetic 2D medical images [27–31] and 3D ones [32]. Such synthetic medical images can be used to enhance classification models. As demonstrated in various studies [33–35], DDPM can effectively diminish problems arising from class imbalance or general lack of sufficient training data.

This study introduces a novel conditional 3D DDPM for multi-modal radiotherapy planning data, including patient demographics. We specifically investigate its potential to improve the performance of deep learning outcome models and compare it with other techniques to solve the class imbalance. Having a generative model of reliable and complete radiotherapy planning data might considerably increase performance and robustness of outcome models, and thus, help translating them into clinics, where they can contribute to more personalized treatment.

2. Material and Methods

2.1. Local control outcome model

A dataset of 535 non-small cell lung cancer (NSCLC) patients undergoing stereotactic body radiotherapy (SBRT) was used to develop a deep learning model for time-to-local recurrence prediction (DL-surv) [36]. The patients were treated between 2009 and 2017. Most of the patients were stage I (82 %), 16 % were stage II and 2 % were stage III. More dataset details are provided in Table 1. The maximum follow-up time was 5 years, after which tumors were considered locally controlled. The mean follow-up time was 28 months.

Table 1.

Clinical characteristics of the dataset

	Total number of patients (n = 535)	Number of LR patients (n = 31)
Gender
Male	279 (52.1 %)	20 (64.5 %)
Female	256 (47.9 %)	11 (35.5 %)
Age [years]
< 60	34 (6.40 %)	0 (0 %)
60 – 70	146 (27.3 %)	8 (25.8 %)
70 – 80	222 (41.5 %)	16 (51.6 %)
> 80	133 (24.9 %)	7 (22.6 %)
PTV volume [cm³]
< 25	185 (34.6 %)	6 (19.4 %)
25 – 50	187 (35.0 %)	9 (29.0 %)
50 – 100	115 (21.5 %)	10 (32.3 %)
> 100	48 (9.0 %)	6 (19.4 %)
Clinical maximum tumor diameter [mm]
≤ 10	53 (34.6 %)	1 (3.2 %)
11–20	240 (35.0 %)	12 (38.7 %)
21 – 30	149 (21.5 %)	11 (35.5 %)
≥ 31	93 (9.0 %)	7 (22.6 %)
Lobe
LLL	80 (15.0 %)	6 (19.4 %)
LUL	150 (28.0 %)	5 (16.1 %)
RLL	110 (20.6 %)	8 (25.8 %)
RML	5 (0.9 %)	0 (0 %)
RUL	190 (35.5 %)	12 (38.7 %)

Open in a new tab

The prescribed dose was 50 Gy in 5 fractions, i.e., 100 Gy of biologically effective dose (BED [α/β=10]). Planning requirements were 95 % of PTV and 100 % of IGTV to be covered by the prescribed dose. Treatments were delivered on Varian TrueBeam or Trilogy linear accelerators, using IMRT (32% of patients) or VMAT (68% of patients).

The architecture of the DL algorithm is depicted in Figure 1. The model predicts conditional probability of local control in discrete time intervals [6,37], using multi-modal planning data (CT images, 3D planning dose) and patient demographics (PTV size, clinical maximum tumor diameter, gender, age, and lung lobe tumor location). There are 3 parallel neural networks (NN). One 3D convolutional neural network (3D CNN) for feature extraction from planning dose distribution (Dose-CNN), one 3D CNN for feature extraction from planning CT images, and one variational autoencoder (VAE) for encoding patient demographic features (Demo-VAE). Extracted features from all 3 NN are then concatenated and fed into the discrete-time survival NN (Surv-net) [6,37], which predicts probability of local control in specified time intervals. In this study, the time intervals were 0–1, 1–2, 2–3 and 3–5 years. The optimized hyper parameters were learning rate=5e-4, batch size=32, weight decay=0.05, dropout=0.2.

The model was cross-validated (CV) on 80% of the data, using stratified 5-fold approach with 50 iterations in total. The remaining 20% of data was withheld for independent testing. The data split was done according to TRIPOD criteria 2b [38]. The CV Harrell’s concordance index (c-index) was 0.72 with 95% confidence interval (CI) 0.68–0.75, and the testing c-index was 0.69. One of the main drawbacks, holding back the model’s performance, was severe class imbalance. There were only 31 patients (5.8 %) experiencing local recurrence (LR), while the rest was either LR-free or censored before the maximum follow-up point. Therefore, we implemented and compared various techniques to solve the class imbalance problem.

2.2. Traditional approaches to solve the class imbalance

Multiple traditional techniques [39] and their combinations were implemented and evaluated. Among the most basic traditional techniques were: Undersampling, Oversampling, Augmentation (rotation, resizing, flipping) and Class Weights. With respect to the Class Weights technique, the most promising ratio of class weights was found to be 1:5 (w_major=1, w_minor=5). We then applied more advanced techniques, such as SMOTE and ADASYN.

2.3. Denoising Diffusion Probabilistic Model

The conditional DDPM for generation of synthetic NSCLC SBRT data (DDPM_NSCLC) was developed in accordance with the original paper by Ho, et al. [26] and its improved version published by Nichol, et al. [40]. Figure 2 shows diagram of the model. It was trained on the same dataset as the DL-surv model, keeping the withheld testing subset untouched. The input of the model consists of 3D treatment planning data (CT images, planning dose and PTV mask) and 3 embedded conditions - sex, class and time-to-event bin. Consequently, it is possible to generate synthetic samples specifically according to the class and time-to-event, which is necessary for generation of new samples to balance the original dataset. The remaining demographic details were calculated from generated samples (PTV volume, clinical maximum tumor diameter), and the lung lobe was imputed according to the distribution in the original dataset.

Figure 2: — Diagram of the DDPM model used for the generation of synthetic radiotherapy planning data.

Synthetic samples were evaluated utilizing Fréchet Inception Distance (FID) [41], which is a common technique to evaluate the quality of images from generative neural networks, such as VAE, GAN or DDPM. It is a measure that can compare feature representations of real and generated images. The features are extracted from a pre-trained InceptionV3 NN and their distributions are compared using Fréchet Distance. Lower FID values indicate higher feature similarity between real and synthetic images, thus, more realistic image generation.

The model was built in PyTorch 1.13.1 [42]. The optimization algorithm used in this study was Adam [43], and the loss function was mean squared error. The model was optimized for batch size (bs), learning rate (lr), weight decay (w_d) and variance schedule in the diffusion process (β_min, β_max, n_steps). The optimized values were bs=16, lr=0.0005, w_d=0, β_min=0.0001, β_max=0.02 and n_steps=1000.

Subsequently, 300 synthetically generated NSCLC SBRT data supplemented the real training data, which improved the class imbalance ratio from about 1:16 to 2:3.

We also tried building conditional Generative Adversarial Network (GAN) for the same purpose. None of the implemented GAN models (DCGAN, WGAN, WGAN-GP) provided acceptable results, as all of them failed exhibiting a mode collapse. This was presumably caused by the limited number of local failures datapoints, i.e., limited feature space for conditional GAN training.

3. Results

The comparison of real and DDPM generated data is provided in Figure 3. It shows examples of 4 real and 4 synthetic data samples, including CT images, planning dose distributions and PTV masks, which qualitatively appear to be satisfactory. The FID values of synthetic CT, Dose and PTV samples were 41.7, 27.0 and 59.4, respectively. The FIDs were calculated as mean values from all inceptionV3 feature layers. The reason for not using only the last feature layer is the low number of samples in the dataset, which limits the use of FID, as it strongly depends on the number of samples.

Table 2 provides a comparison of DL-surv performance for 8 different class imbalance solutions. The DDPM approach outperformed the other competing methods, with a testing c-index of 0.75. We used this approach to generate 300 additional synthetic patients to reduce the class imbalance for training the DL-surv model.

Table 2:

Cross-validation and testing performance of DL-surv model for different techniques to solve the class imbalance.

Class-balancing technique	CV c-index (95% CI)	Testing c-index
Original dataset	0.72 (0.68–0.75)	0.66
Undersampling	0.70 (0.66–0.74)	0.66
Oversampling	0.66 (0.62–0.70)	0.67
Augmentation	0.70 (0.67–0.74)	0.67
Class weights (1:5)	0.72 (0.69–0.74)	0.69
Augmentation + class weights	0.68 (0.64–0.72)	0.67
SMOTE	0.66 (0.63–0.69)	0.69
ADASYN	0.67 (0.64–0.70)	0.69
DDPM_NSCLC	0.74 (0.71–0.77)	0.75

Open in a new tab

4. Discussion

This work has implemented a conditional DDPM for simultaneous generation of realistic CT images, 3D dose distributions, and PTV masks, along with selected patient demographic details. As can be seen from Figure 3 the images are trustworthy and the FID values for CT, Dose and PTV mask are within acceptable range and are consistent with previous publications [27,44].

Traditional approaches used to solve the class imbalance were mostly unsuccessful (see Table 2). Undersampling and Oversampling reached testing c-indexes of 0.66 and 0.67 respectively, which is about the same performance compared to the original dataset. The reason is data scarcity in the minority class. The whole dataset contains only 31 patients with LR; thus, there are only 25 (80%) of them in the CV subset. While Undersampling and Oversampling help to balance training data batches, they do not necessarily help with the model performance, as the training data might still be inadequately sparse for reasonable training [45]. One must confront this problem in highly imbalanced datasets with a low number of minority class samples, such as this. Similarly, the Class Weights technique exhibited only slightly better testing c-index (0.69) than the original dataset. Class Weights aims to adapt the loss function according to the severity of the class imbalance. It can reduce class imbalance but does not tackle the problem of insufficient training datapoints. On the other hand, augmentation is promising not only for the class imbalance issue, but also for the lack of minority class samples, as it generates new data points. However, it did not improve the testing c-index (0.69) by much. The reason for the poor performance here is presumably due to the homogeneity of the data. All patients were positioned identically (headfirst - supine). Therefore, rotating or flipping data would not improve the model’s performance, since it only introduces features that are not normally present in the data.

ADASYN and SMOTE showed similar results. The testing c-index was 0.69, which is only a slight improvement. This is probably due to the inappropriateness of these techniques for complex imaging multi-modal data. Even though there were previously successful attempts to apply these techniques in imaging data [46–48], they were never applied in more complex data, such as radiotherapy planning data. Originally, SMOTE/ADASYN were designed for tabular data [13,14]. Hence, they might underperform when using complex multi-modal imaging data.

As shown in Table 2, DL-surv performance was considerably enhanced after supplementing the original training dataset by synthetic DDPM_NSCLC generated data. The testing c-index increased from 0.66 to 0.75, which is more than 10% improvement. DDPM is generating synthetic data, which are visually entirely new, although they have the same distribution of underlying features. Consequently, DDPM generated data do not necessarily extend the feature space covered by the training samples but do significantly increase its sampling. This approach is helpful in this particular class imbalance problem and makes the DDPM approach superior to other implemented class imbalance techniques. However, the reason(s) for limited performance in any outcome modelling should always be carefully assessed. Usually, there is an interplay of multiple factors, and the main limiting factor can often be different than the class imbalance issue.

In conclusion, denoising diffusion probabilistic models are highly promising generative models for complex multi-modal imaging data. This work presents DDPM_NCSLC, which is a novel generative model that can be used to generate complex synthetic radiotherapy planning data, including all necessary components. The generated data look very realistic and show potential to increase performance of multi-modality outcome models trained on data with a high event sparsity. In the presented tumor local control model, with the class imbalance ratio of about 1:16, the performance improved by more than 10%. Other implemented techniques (Undersampling, Oversampling, Augmentation, Class Weights, SMOTE, ADASYN) did not provide more than ~4% improvement.

DDPM generating treatment planning data was developed and verified (FID<50)
Synthetic data helped to improve performance of SBRT outcome model by >10%
Other techniques for imbalanced training improved performance by less than 4%
DDPM model can diminish the negative effect of event imbalance in outcome models.
Introduced approach can help in clinical translation of radiotherapy outcome models

Acknowledgements

This work was supported by National Institute of Health (NIH) grant R01-CA233487 and Department of Defense (DoD) Congressional Directed Med Res Prog (CDMRP) W81XWH-22-1-0277.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Previous presentation of the study

AAPM 2023 65^th Annual Meeting

References

[1].Niraula D, Cui S, Pakela J, Wei L, Luo Y, Haken RKT, et al. Current status and future developments in predicting outcomes in radiation oncology. British Journal of Radiology 2022;95. 10.1259/bjr.20220239. [DOI] [PMC free article] [PubMed] [Google Scholar]
[2].Cui S, Hope A, Dilling TJ, Dawson LA, Ten Haken R, El Naqa I. Artificial Intelligence for Outcome Modeling in Radiotherapy. Semin Radiat Oncol 2022;32:351–64. 10.1016/j.semradonc.2022.06.005. [DOI] [PubMed] [Google Scholar]
[3].Luo Y, Tseng H-H, Cui S, Wei L, Ten Haken RK, El Naqa I. Balancing accuracy and interpretability of machine learning approaches for radiation treatment outcomes modeling. BJR|Open 2019;1:20190021. 10.1259/bjro.20190021. [DOI] [PMC free article] [PubMed] [Google Scholar]
[4].Wei L, Owen D, Rosen B, Guo X, Cuneo K, Lawrence TS, et al. A deep survival interpretable radiomics model of hepatocellular carcinoma patients. Physica Medica 2021;82:295–305. https://doi.org/ 10.1016/j.ejmp.2021.02.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[5].El Naqa I, Johansson A, Owen D, Cuneo K, Cao Y, Matuszak M, et al. Modeling of Normal Tissue Complications Using Imaging and Biomarkers After Radiation Therapy for Hepatocellular Carcinoma. International Journal of Radiation Oncology*Biology*Physics 2018;100:335–43. https://doi.org/ 10.1016/j.ijrobp.2017.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
[6].Cui S, Ten Haken RK, El Naqa I. Integrating Multiomics Information in Deep Learning Architectures for Joint Actuarial Outcome Prediction in Non-Small Cell Lung Cancer Patients After Radiation Therapy. Int J Radiat Oncol Biol Phys 2021;110:893–904. 10.1016/j.ijrobp.2021.01.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
[7].Pang S, Field M, Dowling J, Vinod S, Holloway L, Sowmya A. Training radiomics-based CNNs for clinical outcome prediction: Challenges, strategies and findings. Artif Intell Med 2022;123:102230. https://doi.org/ 10.1016/j.artmed.2021.102230. [DOI] [PubMed] [Google Scholar]
[8].Cui S, Luo Y, Tseng H-H, Ten Haken RK, El Naqa I. Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage. Med Phys 2019;46:2497–511. https://doi.org/ 10.1002/mp.13497. [DOI] [PMC free article] [PubMed] [Google Scholar]
[9].Aldraimli M, Soria D, Grishchuck D, Ingram S, Lyon R, Mistry A, et al. A data science approach for early-stage prediction of Patient’s susceptibility to acute side effects of advanced radiotherapy. Comput Biol Med 2021;135:104624. https://doi.org/ 10.1016/j.compbiomed.2021.104624. [DOI] [PubMed] [Google Scholar]
[10].El Naqa I, Pandey G, Aerts H, Chien J-T, Andreassen CN, Niemierko A, et al. Radiation Therapy Outcomes Models in the Era of Radiomics and Radiogenomics: Uncertainties and Validation. Int J Radiat Oncol Biol Phys 2018;102:1070–3. 10.1016/j.ijrobp.2018.08.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
[11].Welch ML, McIntosh C, McNiven A, Huang SH, Zhang B-B, Wee L, et al. User-controlled pipelines for feature integration and head and neck radiation therapy outcome predictions. Physica Medica 2020;70:145–52. https://doi.org/ 10.1016/j.ejmp.2020.01.027. [DOI] [PubMed] [Google Scholar]
[12].Gangil T, Shahabuddin AB, Dinesh Rao B, Palanisamy K, Chakrabarti B, Sharan K. Predicting clinical outcomes of radiotherapy for head and neck squamous cell carcinoma patients using machine learning algorithms. J Big Data 2022;9:25. 10.1186/s40537-022-00578-3. [DOI] [Google Scholar]
[13].He H, Bai Y, Garcia EA, Li S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), 2008, p. 1322–8. 10.1109/IJCNN.2008.4633969. [DOI] [Google Scholar]
[14].Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 2002;16:321–57. [Google Scholar]
[15].Fernando KRM, Tsokos CP. Dynamically Weighted Balanced Loss: Class Imbalanced Learning and Confidence Calibration of Deep Neural Networks. IEEE Trans Neural Netw Learn Syst 2022;33:2940–51. 10.1109/TNNLS.2020.3047335. [DOI] [PubMed] [Google Scholar]
[16].Shamsolmoali P, Zareapoor M, Shen L, Sadka AH, Yang J. Imbalanced data learning by minority class augmentation using capsule adversarial networks. Neurocomputing 2021;459:481–93. https://doi.org/ 10.1016/j.neucom.2020.01.119. [DOI] [Google Scholar]
[17].Gosain A, Sardana S. Handling class imbalance problem using oversampling techniques: A review. 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2017, p. 79–85. 10.1109/ICACCI.2017.8125820. [DOI] [Google Scholar]
[18].Devi D, Biswas SK, Purkayastha B. A Review on Solution to Class Imbalance Problem: Undersampling Approaches. 2020 International Conference on Computational Performance Evaluation (ComPE), 2020, p. 626–31. 10.1109/ComPE49325.2020.9200087. [DOI] [Google Scholar]
[19].Liu X-Y, Wu J, Zhou Z-H. Exploratory Undersampling for Class-Imbalance Learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 2009;39:539–50. 10.1109/TSMCB.2008.2007853. [DOI] [PubMed] [Google Scholar]
[20].Amin A, Anwar S, Adnan A, Nawaz M, Howard N, Qadir J, et al. Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study. IEEE Access 2016;4:7940–57. 10.1109/ACCESS.2016.2619719. [DOI] [Google Scholar]
[21].Temraz M, Keane MT. Solving the class imbalance problem using a counterfactual method for data augmentation. Machine Learning with Applications 2022;9:100375. https://doi.org/ 10.1016/j.mlwa.2022.100375. [DOI] [Google Scholar]
[22].Gameng HA, Gerardo BB, Medina RP. Modified Adaptive Synthetic SMOTE to Improve Classification Performance in Imbalanced Datasets. 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS), 2019, p. 1–5. 10.1109/ICETAS48360.2019.9117287. [DOI] [Google Scholar]
[23].Saini M, Susan S. Deep transfer with minority data augmentation for imbalanced breast cancer dataset. Appl Soft Comput 2020;97:106759. https://doi.org/ 10.1016/j.asoc.2020.106759. [DOI] [Google Scholar]
[24].Dablain D, Krawczyk B, Chawla NV. DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data. IEEE Trans Neural Netw Learn Syst 2023;34:6390–404. 10.1109/TNNLS.2021.3136503. [DOI] [PubMed] [Google Scholar]
[25].Sharma A, Singh PK, Chandra R. SMOTified-GAN for Class Imbalanced Pattern Classification Problems. IEEE Access 2022;10:30655–65. 10.1109/ACCESS.2022.3158977. [DOI] [Google Scholar]
[26].Ho J, Jain A, Abbeel P. Denoising Diffusion Probabilistic Models 2020.
[27].Müller-Franzes G, Niehues JM, Khader F, Arasteh ST, Haarburger C, Kuhl C, et al. Diffusion Probabilistic Models beat GANs on Medical Images 2022.
[28].Kazerouni A, Aghdam EK, Heidari M, Azad R, Fayyaz M, Hacihaliloglu I, et al. Diffusion Models for Medical Image Analysis: A Comprehensive Survey 2022. [DOI] [PubMed]
[29].Kim B, Ye JC. Diffusion Deformable Model for 4D Temporal Medical Image Generation 2022.
[30].Rombach R, Blattmann A, Lorenz D, Esser P, Ommer B. High-Resolution Image Synthesis with Latent Diffusion Models 2021.
[31].Pinaya WHL, Tudosiu P-D, Dafflon J, da Costa PF, Fernandez V, Nachev P, et al. Brain Imaging Generation with Latent Diffusion Models 2022.
[32].Khader F, Mueller-Franzes G, Arasteh ST, Han T, Haarburger C, Schulze-Hagen M, et al. Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation 2022. [DOI] [PMC free article] [PubMed]
[33].Sagers LW, Diao JA, Melas-Kyriazi L, Groh M, Rajpurkar P, Adamson AS, et al. Augmenting medical image classifiers with synthetic data from latent diffusion models. ArXiv Preprint ArXiv:230812453 2023. [Google Scholar]
[34].Akrout M, Gyepesi B, Holló P, Poór A, Kincső B, Solis S, et al. Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images. ArXiv Preprint ArXiv:230104802 2023. [Google Scholar]
[35].Sagers LW, Diao JA, Groh M, Rajpurkar P, Adamson AS, Manrai AK. Improving dermatology classifiers across populations using images generated by large diffusion models 2022.
[36].Dudas D, Saghad PG, Dilling TJ, Perez BA, Rosenberg SA, Naqa I El. Deep Learning-Guided Dosimetry for Mitigating Local Failure of Non-Small Cell Lung Cancer Patients Receiving SBRT. Int J Radiat Oncol Biol Phys 2024. 10.1016/j.ijrobp.2023.11.059. [DOI] [PubMed] [Google Scholar]
[37].Gensheimer MF, Narasimhan B. A scalable discrete-time survival model for neural networks. PeerJ 2019;7:e6257. 10.7717/peerj.6257. [DOI] [PMC free article] [PubMed] [Google Scholar]
[38].Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD Statement. BMC Med 2015;13. 10.1186/s12916-014-0241-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
[39].He H, Garcia EA. Learning from Imbalanced Data. IEEE Trans Knowl Data Eng 2009;21:1263–84. 10.1109/TKDE.2008.239. [DOI] [Google Scholar]
[40].Nichol A, Dhariwal P. Improved Denoising Diffusion Probabilistic Models 2021.
[41].Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA: Curran Associates Inc.; 2017, p. 6629–40. [Google Scholar]
[42].Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al. Pytorch: An imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 2019;32. https://doi.org/ 10.48550/arXiv.1912.01703. [DOI] [Google Scholar]
[43].Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. CoRR 2014;abs/1412.6980. [Google Scholar]
[44].Skandarani Y, Jodoin P-M, Lalande A. GANs for Medical Image Synthesis: An Empirical Study. J Imaging 2023;9. 10.3390/jimaging9030069. [DOI] [PMC free article] [PubMed] [Google Scholar]
[45].Ling CX, Sheng VS. Cost-Sensitive Learning. In: Sammut C, Webb GI, editors. Encyclopedia of Machine Learning, Boston, MA: Springer US; 2010, p. 231–5. 10.1007/978-0-387-30164-8_181. [DOI] [Google Scholar]
[46].Feng W, Huang W, Bao W. Imbalanced Hyperspectral Image Classification With an Adaptive Ensemble Method Based on SMOTE and Rotation Forest With Differentiated Sampling Rates. IEEE Geoscience and Remote Sensing Letters 2019;16:1879–83. 10.1109/LGRS.2019.2913387. [DOI] [Google Scholar]
[47].Chamseddine E, Mansouri N, Soui M, Abed M. Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss. Appl Soft Comput 2022;129:109588. https://doi.org/ 10.1016/j.asoc.2022.109588. [DOI] [PMC free article] [PubMed] [Google Scholar]
[48].Reza MS, Ma J. Imbalanced Histopathological Breast Cancer Image Classification with Convolutional Neural Network. 2018 14th IEEE International Conference on Signal Processing (ICSP), 2018, p. 619–24. 10.1109/ICSP.2018.8652304. [DOI] [Google Scholar]

[R1] [1].Niraula D, Cui S, Pakela J, Wei L, Luo Y, Haken RKT, et al. Current status and future developments in predicting outcomes in radiation oncology. British Journal of Radiology 2022;95. 10.1259/bjr.20220239. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] [2].Cui S, Hope A, Dilling TJ, Dawson LA, Ten Haken R, El Naqa I. Artificial Intelligence for Outcome Modeling in Radiotherapy. Semin Radiat Oncol 2022;32:351–64. 10.1016/j.semradonc.2022.06.005. [DOI] [PubMed] [Google Scholar]

[R3] [3].Luo Y, Tseng H-H, Cui S, Wei L, Ten Haken RK, El Naqa I. Balancing accuracy and interpretability of machine learning approaches for radiation treatment outcomes modeling. BJR|Open 2019;1:20190021. 10.1259/bjro.20190021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] [4].Wei L, Owen D, Rosen B, Guo X, Cuneo K, Lawrence TS, et al. A deep survival interpretable radiomics model of hepatocellular carcinoma patients. Physica Medica 2021;82:295–305. https://doi.org/ 10.1016/j.ejmp.2021.02.013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] [5].El Naqa I, Johansson A, Owen D, Cuneo K, Cao Y, Matuszak M, et al. Modeling of Normal Tissue Complications Using Imaging and Biomarkers After Radiation Therapy for Hepatocellular Carcinoma. International Journal of Radiation Oncology*Biology*Physics 2018;100:335–43. https://doi.org/ 10.1016/j.ijrobp.2017.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] [6].Cui S, Ten Haken RK, El Naqa I. Integrating Multiomics Information in Deep Learning Architectures for Joint Actuarial Outcome Prediction in Non-Small Cell Lung Cancer Patients After Radiation Therapy. Int J Radiat Oncol Biol Phys 2021;110:893–904. 10.1016/j.ijrobp.2021.01.042. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] [7].Pang S, Field M, Dowling J, Vinod S, Holloway L, Sowmya A. Training radiomics-based CNNs for clinical outcome prediction: Challenges, strategies and findings. Artif Intell Med 2022;123:102230. https://doi.org/ 10.1016/j.artmed.2021.102230. [DOI] [PubMed] [Google Scholar]

[R8] [8].Cui S, Luo Y, Tseng H-H, Ten Haken RK, El Naqa I. Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage. Med Phys 2019;46:2497–511. https://doi.org/ 10.1002/mp.13497. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] [9].Aldraimli M, Soria D, Grishchuck D, Ingram S, Lyon R, Mistry A, et al. A data science approach for early-stage prediction of Patient’s susceptibility to acute side effects of advanced radiotherapy. Comput Biol Med 2021;135:104624. https://doi.org/ 10.1016/j.compbiomed.2021.104624. [DOI] [PubMed] [Google Scholar]

[R10] [10].El Naqa I, Pandey G, Aerts H, Chien J-T, Andreassen CN, Niemierko A, et al. Radiation Therapy Outcomes Models in the Era of Radiomics and Radiogenomics: Uncertainties and Validation. Int J Radiat Oncol Biol Phys 2018;102:1070–3. 10.1016/j.ijrobp.2018.08.022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] [11].Welch ML, McIntosh C, McNiven A, Huang SH, Zhang B-B, Wee L, et al. User-controlled pipelines for feature integration and head and neck radiation therapy outcome predictions. Physica Medica 2020;70:145–52. https://doi.org/ 10.1016/j.ejmp.2020.01.027. [DOI] [PubMed] [Google Scholar]

[R12] [12].Gangil T, Shahabuddin AB, Dinesh Rao B, Palanisamy K, Chakrabarti B, Sharan K. Predicting clinical outcomes of radiotherapy for head and neck squamous cell carcinoma patients using machine learning algorithms. J Big Data 2022;9:25. 10.1186/s40537-022-00578-3. [DOI] [Google Scholar]

[R13] [13].He H, Bai Y, Garcia EA, Li S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), 2008, p. 1322–8. 10.1109/IJCNN.2008.4633969. [DOI] [Google Scholar]

[R14] [14].Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 2002;16:321–57. [Google Scholar]

[R15] [15].Fernando KRM, Tsokos CP. Dynamically Weighted Balanced Loss: Class Imbalanced Learning and Confidence Calibration of Deep Neural Networks. IEEE Trans Neural Netw Learn Syst 2022;33:2940–51. 10.1109/TNNLS.2020.3047335. [DOI] [PubMed] [Google Scholar]

[R16] [16].Shamsolmoali P, Zareapoor M, Shen L, Sadka AH, Yang J. Imbalanced data learning by minority class augmentation using capsule adversarial networks. Neurocomputing 2021;459:481–93. https://doi.org/ 10.1016/j.neucom.2020.01.119. [DOI] [Google Scholar]

[R17] [17].Gosain A, Sardana S. Handling class imbalance problem using oversampling techniques: A review. 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2017, p. 79–85. 10.1109/ICACCI.2017.8125820. [DOI] [Google Scholar]

[R18] [18].Devi D, Biswas SK, Purkayastha B. A Review on Solution to Class Imbalance Problem: Undersampling Approaches. 2020 International Conference on Computational Performance Evaluation (ComPE), 2020, p. 626–31. 10.1109/ComPE49325.2020.9200087. [DOI] [Google Scholar]

[R19] [19].Liu X-Y, Wu J, Zhou Z-H. Exploratory Undersampling for Class-Imbalance Learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 2009;39:539–50. 10.1109/TSMCB.2008.2007853. [DOI] [PubMed] [Google Scholar]

[R20] [20].Amin A, Anwar S, Adnan A, Nawaz M, Howard N, Qadir J, et al. Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study. IEEE Access 2016;4:7940–57. 10.1109/ACCESS.2016.2619719. [DOI] [Google Scholar]

[R21] [21].Temraz M, Keane MT. Solving the class imbalance problem using a counterfactual method for data augmentation. Machine Learning with Applications 2022;9:100375. https://doi.org/ 10.1016/j.mlwa.2022.100375. [DOI] [Google Scholar]

[R22] [22].Gameng HA, Gerardo BB, Medina RP. Modified Adaptive Synthetic SMOTE to Improve Classification Performance in Imbalanced Datasets. 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS), 2019, p. 1–5. 10.1109/ICETAS48360.2019.9117287. [DOI] [Google Scholar]

[R23] [23].Saini M, Susan S. Deep transfer with minority data augmentation for imbalanced breast cancer dataset. Appl Soft Comput 2020;97:106759. https://doi.org/ 10.1016/j.asoc.2020.106759. [DOI] [Google Scholar]

[R24] [24].Dablain D, Krawczyk B, Chawla NV. DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data. IEEE Trans Neural Netw Learn Syst 2023;34:6390–404. 10.1109/TNNLS.2021.3136503. [DOI] [PubMed] [Google Scholar]

[R25] [25].Sharma A, Singh PK, Chandra R. SMOTified-GAN for Class Imbalanced Pattern Classification Problems. IEEE Access 2022;10:30655–65. 10.1109/ACCESS.2022.3158977. [DOI] [Google Scholar]

[R26] [26].Ho J, Jain A, Abbeel P. Denoising Diffusion Probabilistic Models 2020.

[R27] [27].Müller-Franzes G, Niehues JM, Khader F, Arasteh ST, Haarburger C, Kuhl C, et al. Diffusion Probabilistic Models beat GANs on Medical Images 2022.

[R28] [28].Kazerouni A, Aghdam EK, Heidari M, Azad R, Fayyaz M, Hacihaliloglu I, et al. Diffusion Models for Medical Image Analysis: A Comprehensive Survey 2022. [DOI] [PubMed]

[R29] [29].Kim B, Ye JC. Diffusion Deformable Model for 4D Temporal Medical Image Generation 2022.

[R30] [30].Rombach R, Blattmann A, Lorenz D, Esser P, Ommer B. High-Resolution Image Synthesis with Latent Diffusion Models 2021.

[R31] [31].Pinaya WHL, Tudosiu P-D, Dafflon J, da Costa PF, Fernandez V, Nachev P, et al. Brain Imaging Generation with Latent Diffusion Models 2022.

[R32] [32].Khader F, Mueller-Franzes G, Arasteh ST, Han T, Haarburger C, Schulze-Hagen M, et al. Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation 2022. [DOI] [PMC free article] [PubMed]

[R33] [33].Sagers LW, Diao JA, Melas-Kyriazi L, Groh M, Rajpurkar P, Adamson AS, et al. Augmenting medical image classifiers with synthetic data from latent diffusion models. ArXiv Preprint ArXiv:230812453 2023. [Google Scholar]

[R34] [34].Akrout M, Gyepesi B, Holló P, Poór A, Kincső B, Solis S, et al. Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images. ArXiv Preprint ArXiv:230104802 2023. [Google Scholar]

[R35] [35].Sagers LW, Diao JA, Groh M, Rajpurkar P, Adamson AS, Manrai AK. Improving dermatology classifiers across populations using images generated by large diffusion models 2022.

[R36] [36].Dudas D, Saghad PG, Dilling TJ, Perez BA, Rosenberg SA, Naqa I El. Deep Learning-Guided Dosimetry for Mitigating Local Failure of Non-Small Cell Lung Cancer Patients Receiving SBRT. Int J Radiat Oncol Biol Phys 2024. 10.1016/j.ijrobp.2023.11.059. [DOI] [PubMed] [Google Scholar]

[R37] [37].Gensheimer MF, Narasimhan B. A scalable discrete-time survival model for neural networks. PeerJ 2019;7:e6257. 10.7717/peerj.6257. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] [38].Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD Statement. BMC Med 2015;13. 10.1186/s12916-014-0241-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] [39].He H, Garcia EA. Learning from Imbalanced Data. IEEE Trans Knowl Data Eng 2009;21:1263–84. 10.1109/TKDE.2008.239. [DOI] [Google Scholar]

[R40] [40].Nichol A, Dhariwal P. Improved Denoising Diffusion Probabilistic Models 2021.

[R41] [41].Heusel M, Ramsauer H, Unterthiner T, Nessler B, Hochreiter S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA: Curran Associates Inc.; 2017, p. 6629–40. [Google Scholar]

[R42] [42].Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al. Pytorch: An imperative style, high-performance deep learning library. Adv Neural Inf Process Syst 2019;32. https://doi.org/ 10.48550/arXiv.1912.01703. [DOI] [Google Scholar]

[R43] [43].Kingma DP, Ba J. Adam: A Method for Stochastic Optimization. CoRR 2014;abs/1412.6980. [Google Scholar]

[R44] [44].Skandarani Y, Jodoin P-M, Lalande A. GANs for Medical Image Synthesis: An Empirical Study. J Imaging 2023;9. 10.3390/jimaging9030069. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] [45].Ling CX, Sheng VS. Cost-Sensitive Learning. In: Sammut C, Webb GI, editors. Encyclopedia of Machine Learning, Boston, MA: Springer US; 2010, p. 231–5. 10.1007/978-0-387-30164-8_181. [DOI] [Google Scholar]

[R46] [46].Feng W, Huang W, Bao W. Imbalanced Hyperspectral Image Classification With an Adaptive Ensemble Method Based on SMOTE and Rotation Forest With Differentiated Sampling Rates. IEEE Geoscience and Remote Sensing Letters 2019;16:1879–83. 10.1109/LGRS.2019.2913387. [DOI] [Google Scholar]

[R47] [47].Chamseddine E, Mansouri N, Soui M, Abed M. Handling class imbalance in COVID-19 chest X-ray images classification: Using SMOTE and weighted loss. Appl Soft Comput 2022;129:109588. https://doi.org/ 10.1016/j.asoc.2022.109588. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] [48].Reza MS, Ma J. Imbalanced Histopathological Breast Cancer Image Classification with Convolutional Neural Network. 2018 14th IEEE International Conference on Signal Processing (ICSP), 2018, p. 619–24. 10.1109/ICSP.2018.8652304. [DOI] [Google Scholar]

PERMALINK

Improved Outcome Models with Denoising Diffusion

D Dudas

T J Dilling

I El Naqa