Table 1.
Variable | Variable model | |
---|---|---|
Aggressiveness | ||
Fitness | ||
Heterogeneity | ||
Size | ||
Treatment | ||
Survival |
For each observation , an image is drawn from the total pool of images with the closest and . This ensures the required association between factors of variation in the image and the simulated outcome data. The parametric equations follow the DAG presented in Fig. 1: are continuous independent noise variables. The collider is the difference between and , with a small amount of Gaussian noise (standard deviation of noise ). and have a standard deviation of to ensure that has a standard deviation of . Treatment is modeled as a Bernoulli variable with a logistic link function, where increased increases the probability of being treated. is subtracted to assure that ~ of patients are treated. Gaussian noise of standard deviation is added to the inverse log-odds of being treated to assure that every patient has some probability of being treated with the more intense treatment. This reflects the clinical world better as some patients may have strong preferences regarding their treatment, regardless of their underlying health status. Overall survival () increases with treatment (the true treatment effect is ) and decreases with heterogeneity in radiodensity and tumor aggressiveness. Again, Gaussian noise of standard deviation is added to introduce some uncertainty in the data