Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Feb 7.
Published in final edited form as: Epidemiology. 2025 Aug 19;37(1):e1–e2. doi: 10.1097/EDE.0000000000001909

Re. Prediagnostic Exposures and Cancer Survival

Can a Meaningful Causal Estimand be Specified?

Matthew M Coates 1,2, Charles J Wolock 3, Onyebuchi A Arah 4,5,6,7
PMCID: PMC12880854  NIHMSID: NIHMS2133669  PMID: 40827761

To the Editor:

Albers and colleagues highlighted the challenges of defining meaningful and useful estimands for effects of pre-cancer-diagnosis exposures on post-cancer-diagnosis outcomes.1 We aim to contribute to this conversation by discussing (i) truncation and (ii) specification of causal estimands for cancer-relevant target populations.

First, we must contend with truncation: potential outcomes may be undefined under treatment levels that prevent cancer (or its diagnosis) if the outcome is defined relative to diagnosis (e.g., survival after diagnosis).2 The causal estimands examined by Albers and colleagues handle truncation differently.1 The target population for the survivor average causal effect (SACE) comprises individuals who would be diagnosed regardless of their exposure level. The controlled direct effect directly intervenes on diagnosis. Conditional separable effects consider distinct components of treatment that affect the outcome through separate pathways. The total effect example given by Albers et al. measures cancer mortality from the time of exposure, not diagnosis.

Second, it is crucial to specify causal estimands for cancer-relevant target populations here. Of the above estimands, only the SACE and conditional separable effect specify target populations of people diagnosed with cancer. Estimands (including controlled direct effects and total effects) can be defined among individuals diagnosed with cancer (C=1), those who would be diagnosed with cancer under specified treatment(s) (e.g., CX=1=1 or CX=0=1), or combinations of these.3,4

We could consider other estimands that avoid truncation and also target people diagnosed with cancer. Redefining Y as survival time from exposure (or all-cause mortality some specified time after exposure), the total effect of a prediagnosis exposure among people diagnosed with cancer after the time of exposure assessment could be defined as E[YX=1YX=0C=1] (Table). Here, outcomes under observed (factual) treatment are “post-diagnosis,” but outcomes under counterfactual treatment may occur before, and possibly preclude, diagnosis. Identification of such effects is theoretically possible but requires conditions that may only be plausible in particular scenarios.57 Others have also defined the net treatment difference – a contrast of potential outcomes that allows the population in each part of the contrast to vary based on the exposure: EYX=1CX=1=1EYX=0CX=0=1.8,9

Table.

Selected Example Estimands Among People Diagnosed With Cancer or People Who Would be Diagnosed With Cancer Under Certain Levels of Treatment

Estimand Description
Y = Years of Survival Since Exposurea Y = Years of Survival Since Cancer Diagnosisb

EYX=1YX=0C=1 Among people diagnosed with cancer after an exposure, what is the average difference in number of years of life lived since exposure assessment had they been exposed to X=1 versus X=0 Possibly ill-defined: YX=1 or YX=0 can be undefined for at least some individuals if X affects C
EYX=1YX=0C=1,X=0
or
EYX=1YX=0CX=0=1,X=0
Among people who were exposed to X=0 and subsequently diagnosed with cancer, what is the average difference in number of years of life lived since exposure assessment had they been exposed to X=1 versus X=0 Possibly ill-defined: YX=1 can be undefined for at least some individuals if X affects C
EYX=1YX=0C=1,X=1
or
EYX=1YX=0CX=1=1,X=1
Among people who were exposed to X=1 and subsequently diagnosed with cancer, what is the average difference in number of years of life lived since exposure assessment had they been exposed to X=1 versus X=0 Possibly ill-defined: YX=0 can be undefined for at least some individuals if X affects C
EYX=1YX=0CX=0=1 Among people who would be diagnosed with cancer under exposure X=0, what is the average difference in number of years of life lived since exposure assessment had they been exposed to X=1 versus X=0 Possibly ill-defined: YX=1 can be undefined for at least some individuals if X affects C
EYX=1YX=0CX=1=1,CX=0=1 Survivor average causal effect (SACE): among people who would be diagnosed with cancer after either level of exposure, what is the average difference in number of years of life lived since exposure assessment had they been exposed to X=1 versus X=0 Survivor average causal effect (SACE): among people who would be diagnosed with cancer under either level of exposure, what is the average difference in number of years of life lived since diagnosis had they been exposed to X=1 versus X=0

X, prediagnosis exposure (binary); C, cancer diagnosis after exposure assessment (binary).

a

Y could also be defined as the probability of survival a specified number of years after exposure.

b

Y could also be defined as the probability of survival (or of cancer survival) a specified number of years after diagnosis.

In all cases, research questions and estimands should concur on and be explicit about the target population(s) of interest. The bias of a statistical procedure is then defined relative to a specified estimand. For prediagnosis exposures, some estimands with relevant target populations may be ill-defined because of truncation; others are well defined but may be challenging to identify. There may still be interest in the effects of these prediagnosis exposures or their use as proxies for the effects of exposures at other time points. Alternatively, prediagnosis exposures may be of interest as modifiers of postdiagnosis exposure effects, or, as Albers and colleagues suggest, for description and prediction.

Acknowledgments

M.M.C. was supported by the National Cancer Institute, grant number NIH/NCI T32 CA009142. Work on this project was also supported in part by the UCLA Jonsson Comprehensive Cancer Center’s Office of Cancer Training and Education.

Footnotes

The authors report no conflicts of interest.

Contributor Information

Matthew M. Coates, Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles (UCLA), Los Angeles, CA Practical Causal Inference Lab, UCLA, Los Angeles, CA.

Charles J. Wolock, Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA

Onyebuchi A. Arah, Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles (UCLA), Los Angeles, CA Practical Causal Inference Lab, UCLA, Los Angeles, CA; Department of Statistics and Data Science, Division of Physical Sciences, College of Letters and Science, UCLA, Los Angeles, CA; Department of Public Health, Research Unit for Epidemiology, Aarhus, University, Aarhus, Denmark.

REFERENCES

  • 1.Albers FEM, Moreno-Betancur M, Milne RL, English DR, Lynch BM, Dashti SG. Prediagnostic exposures and cancer survival: can a meaningful causal estimand be specified? Epidemiology. 2025;36:408–412. [DOI] [PubMed] [Google Scholar]
  • 2.Young JG, Stensrud MJ. Identified versus interesting causal effects in fertility trials and other settings with competing or truncation events. Epidemiology. 2021;32:569–572. [DOI] [PubMed] [Google Scholar]
  • 3.Pearl J, Glymour M, Jewell NP. Section 4.3.1 in Causal Inference in Statistics: A Primer. John Wiley & Sons; 2016. [Google Scholar]
  • 4.Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2004;58:21–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Flanders WD, Eldridge RC, McClellan W. A nearly unavoidable mechanism for collider bias with index-event studies. Epidemiology. 2014;25:762–764. [DOI] [PubMed] [Google Scholar]
  • 6.Pearl J Generalizing experimental findings. J Causal Inference. 2015;3:259–266. [Google Scholar]
  • 7.Hazlett C, McMurry N, Shinkre T. Post-treatment problems: what can we say about the effect of a treatment among sub-groups who (would) respond in some way? [published online ahead of print May 10, 2025]. doi: 10.48550/arXiv.2505.06754. [DOI] [Google Scholar]
  • 8.Rosenbaum PR. The consequences of adjustment for a concomitant variable that has been affected by the treatment. J R Stat Soc Ser A (Gen). 1984;147:656–666. [Google Scholar]
  • 9.Mathur MB, Shpitser I. Simple graphical rules for assessing selection bias in general-population and selected-sample treatment effects. Am J Epidemiol. 2024;194:267–277. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES