AN AGE-OLD CONTROVERSY: WHICH TIMESCALE TO CHOOSE
The timescale debate on using “time” versus “age” in longitudinal analyses goes back decades,1–5 alongside Age–Period–Cohort considerations.6–8 While attention is often given to which timescale might be preferred, considering them as complementary can be beneficial. The work by Hayes-Larson and colleagues in this issue,9 is a tour de force of exploring timescale options in multi-cohort analyses. The authors examined relationships between memory decline and two primary predictors, on each of 10 studies, across 15 “time” and 12 “age” model formulations (540 different sets of estimates). The authors conclude that “timescale choice is not necessarily a ‘one or the other’ decision.” We agree with their take-home-message that standard practice should entail examining both timescales, with efforts to understand why they diverge when they do.
A TIMELESS TRADEOFF: BIAS VERSUS PRECISION/VARIANCE
The well-known “bias–variance tradeoff” describes how, as a model’s complexity increases, bias is expected to decrease, while variance is expected to increase. Figure 1 shows the common U-shaped complexity curve. Optimal model complexity balances bias–variance components. Hayes-Larson et al.9 offered empirical evidence on how estimates of APOE ε4 genotype (APOE4) and diabetes effects on cognitive decline might differ by timescale specification. Relating timescale choices to model complexity and bias–variance tradeoff decisions can further illuminate selection implications. Our aims with this commentary are to (1) help clarify relationships between “time” and “age” axes in longitudinal models, and (2) connect research goals, timescale choices, and bias–variance tradeoffs.
FIGURE 1.
The Bias–variance complexity curve. Bias, variance, and noise all contribute to overall error. Suppose Y=f(X)+ϵ with ϵ∼N(0,σ2); expected squared prediction error at a point x is: i.e. Bias2 + Variance + Irreducible error. As parameters are added, models generally become more flexible and fit the data better, reducing bias. However, more flexible models tend to have more uncertainty in their parameters and higher variance.
IN GOD WE TRUST, ALL OTHERS BRING DATA
Three cohorts described in Hayes-Larson et al.9 (Atherosclerosis Risk in Communities Study,10 Baltimore Longitudinal Study of Aging,11 and Mayo Clinic Study of Aging12), generously provided data access for illustration; all were approved by their institutional review boards. Briefly, we followed Hayes-Larson and colleagues using Memory z-scores as the outcome, APOE4 positivity as an “exposure,” and sex and education as adjustors. With access to individual data, our pooled cohort included 80,956 observations on 21,913 participants, baseline ages 50–92 with up to 44 years of follow-up. See eAppendix1; https://links.lww.com/EDE/C240 for individual study contributions, study characteristics (eTable1; https://links.lww.com/EDE/C240), and other details (eFigure1; https://links.lww.com/EDE/C240).
TIME TO CONNECT ACROSS THE AGES
Hayes-Larson et al.9 specified the difference between cognitive decline trajectories (changes in memory z-scores) for a defined binary “exposure” as the estimand of interest. Figure 2 compares three linear trajectory exploratory plots representing potential estimates for “any APOE4 positivity” effects using different timescales. We show estimates using the pooled cohort and highlight the contrasts each model is based on.
FIGURE 2.
Three common timescales. Linear trajectory estimates by APOE4 status and their differences for three estimation approaches using the pooled data. Each panel shows data, slopes, slope estimates (per 10 years), standard errors (SE), and examples of information used across participants when estimating the slopes. Dashed lines represent between-person age comparisons and solid lines represent within-person aging. A, “Age0,” shows baseline memory z-scores versus baseline age, regression slopes for all APOE4± participants, and scores from six fictitious example participants who are 20 years different in age, 3 for each group. B, “Time,” shows repeated-measure scores versus years spent on the study, regression slopes from mixed models (see eAppendices), and the observed trajectories of four participants from the six in (A) who each aged 20 years in the study; the two participants enrolled at age 90 did not contribute longitudinal information. Note that within-person slopes in (B) appear much shallower over time than the between-person slopes in A. C, “Age,” shows repeated-measure scores versus observed age, mixed model regression slopes, and scores for the same 6 participants. Figure 2 shows visually and numerically that using Age as the timescale axis induces a bias–variance type tradeoff by combining information from both within-person and between-person contrasts, leading to (in this simple example) a more precise (21% smaller SE) estimate of the trajectory difference between APOE4 groups that is still similar to the “time” axis estimate. While we specify linear relationships here to clarify relations between the timescale axes; adding smoothing curves (splines, fractional polynomials, other) to these plots can help diagnose nonlinearities for each axis (see eFigure 2).
Longitudinal studies are able to separate cross-sectional, between-person age effects from within-person aging effects. In Figure 2A, the “Age0” (baseline age) axis uses only between-person information to estimate APOE4 associations with cross-sectional age, comparing different people who enrolled at different ages. Figure 2B, the “time” axis, uses only the within-person information on changes, contrasting for example, changes from e4- Jim and Gary, against changes from e4+ Todd and Josh, each of whom has aged 15–20 years while participating in the study. Although the average within-person slopes in Figure 2B are much shallower than the between-person slopes in Figure 2A for both APOE4 negative (−0.38 vs. −0.62) and positive (−0.53 vs. −0.72) individuals, the estimated APOE4 positivity effect (difference in slopes) is larger (−0.15 vs. −0.09 additional standard deviation [SD] decline per decade). Figure 2C, the “current age” axis, pools information from both within-person (Jim’s aging over 20 years) and between-person (Josh who is 70 vs. Elmer who is 90) contrasts, imposing an assumption that they have the same magnitude (shown below). While this can increase precision (e.g., smaller standard errors [SE] using “age” vs. “time”), it does so by potentially increasing biases. Luckily, we can examine these assumptions and implications.
YOU KNOW WHAT I “MEAN.” COMMON MODELS FOR AGE AND TIME
Throughout, we simplify notation to ease exposition and remove details to eAppendices; https://links.lww.com/EDE/C240. Here, we focus on parameters in the mean function E(Y), similar to the regression models used by Hayes-Larson et al.9 We use the common decomposition of “current age” into: age = (age0 + time), where “age0” is a person’s age at an index such as their first assessment (“baseline age”), again representing between-person (cross-sectional) age differences, and “time” is the time since their index measure, capturing within-person longitudinal changes. Other time-invariant ages can be used for the index instead of baseline age; the multilevel modeling literature often uses each person’s mean age as the index.
Age-time connections in longitudinal analyses are commonly introduced as follows:
Consider a simple, “current age” model, specifying a linear trajectory:
| (1a) |
| (1b) |
This shows that using current age in the model alone (Figure 2C, equation 1a) forces the longitudinal, within-person aging association for time, , to be identical to the between-person, cross-sectional baseline age0 coefficient (also ). This formalizes how “current age” models use both between-person and within-person information to estimate trajectories, as visualized in Figure 1. If between- and within-person associations are indeed similar and both information sources are pooled, estimation variance may decrease. Further, as in Figure 2C, sometimes biases can cancel each other when comparing trajectories for etiologic questions, (more on this below). However, if the between- and within-person associations diverge, is a biased estimate of either. Thankfully the magnitude of this bias can generally be checked:
-
2.
Consider a model using two terms to define age; “age0+time”:
| (2) |
This model is often presented as examining within-person aging effects over time, , while “adjusting for baseline age differences,” . Comparing equation (2) versus (1b) reveals that support for the simple “current age” timescale in (1), which assumes , can be examined using the linear contrast () after estimating (2). If () is negligible, the “current age” model and its implicit equality assumption are supported.
When (), choosing the best model requires understanding why the coefficients diverge. The time-coefficient () may be distorted compared with typical within-person aging by (a) test-retest/learning/practice effects (common with cognitive assessments), (b) study protocol changes, or (c) differential survivorship or attrition after enrollment. Participants can “learn” the brief cognitive tests used in cohort studies and perform better upon successive encounters compared with the first. Similarly, cognitive aging studies often alter their assessment protocols over time, for example, remote assessments were necessary during the COVID pandemic. If remote assessments routinely understate an older adult’s cognition compared with in-person assessments, time coefficients, including pandemic data will appear distorted. Additionally, survivorship/attrition differences in longitudinal cohort studies are also common; participants with cognitive measurements over long study times have inherently escaped potential death, dementia, and other missing not at random biasing events.
In contrast, the age0-coefficient () may deviate from the rate of aging due to between-person confounding phenomena such as (a) cohorts born in different periods averaging different cognitive scores for reasons unrelated to aging (b) rates of aging differing across birth cohorts or (c) differential survivorship before enrollment. For example, different life experiences for people born in different eras, (e.g., education quality in 1950 versus 1980, pre–post personal computing, etc.), could distort cross-sectional age representations of aging. Similarly, cumulative life experiences that affect rates of aging (1960s Vietnam war, 1970s disco, 1980s stock market crash, etc.) may bias the age0-coefficient away from the rate of change individuals in any given generation experience. Survivorship bias may occur because a new enrollee who is 90 years old may have selection influences making them intrinsically different than a new enrollee who is 50 years old. Whenever the within-person “time” () and between-person “age0” () coefficients diverge substantially, additional care should be taken, as neither may be the obvious best estimate for changes in cognitive aging.
Additional variables may be included in the models above without affecting generality, (with care taken in interpretations when using interactions with age, age0, or time terms). Using standard exposure (e4+=0/1) interaction terms to study APOE4 trajectory differences:
-
3.
“age” timescale:
| (3a) |
| (3b) |
-
4.
“age0+time” timescale:
| (4) |
Commonly, from (3a) and from (4) are taken as describing the differences in group trajectories of etiologic interest. (3b) shows the underlying within–between equality pooling assumptions of the “age” model, contrasted against the less restricted “age0+time” model. Hayes-Larson and colleagues empirically evaluate whether ≈ . We note that even when the between-versus-within-person effects differ in (4) if the differences are the same for both groups (e4+ vs e4−), then the differences will cancel out and the estimated trajectory contrasts across APOE4 positivity will indeed be equivalent in the two models (; see eAppendix2; https://links.lww.com/EDE/C240). This is a key concept for evaluating bias–variance trade-off decisions for etiologic investigations. Sometimes (as in Figure 1) despite different trajectories using within-only (time) versus between-only (age0) information, the forces driving these differences operate similarly across the groups of interest (e4+ vs e4−). When this occurs the “age” and “age0+time” models give similar trajectory contrasts, and although group trajectories in the “age” model would be biased, potentially more precise trajectory comparisons would not be.
These formulations all make an implicit assumption that effects across different cohorts are similar, that is, trajectories are assumed to be the same for a cohort starting in 1980 compared with one starting in 2000, as discussed in Age–Period–Cohort literature.
WHAT ELSE COULD YOU MEAN? INTERACTIONS AND NONLINEARITIES
Hayes-Larson and colleagues consider both interactions of baseline age0 with time/age axes and nonlinear mean trajectories. The main points above hold for mean functions like these as well; that is, one can expand “current age” into its two components of “age0” and “time” to better understand within-versus-between-person contributions and related bias–variance tradeoffs. The mean trajectory formulations in Hayes-Larson et al.9 (even those using “current age”) can be thought of as inducing alternative assumptions about the mean outcome surface over the two-dimensional predictor space of age0 and time; they consider generalized additive models (GAM)13 of the form:
where the trajectory functions fk(·) are parameterized using linear and/or restricted cubic spline formulations. Figure 3 shows selected 3-dimensional surface fits of induced E(Y) = f(age0, time) for common f(age) formulations, (eAppendix3; https://links.lww.com/EDE/C240 gives details and additional examples). The linear “age” model (equation 1a, Figure 3A) assumes that the outcome surface E(Y)=f(age0, time) is a plane defined by two identical slopes (for age0 and time) defining the plane. Specifying a linear interaction between age0 and age (Figure 3B) is equivalent to assuming an interaction between “age0” and within-person “time,” with certain coefficients restricted to be equal; this model also imposes an additional “hidden” assumption of a quadratic association with cross-sectional age0 that is identical to the age0-by-time interaction. Similarly, if quadratic(age) is specified, (Figure 3C), it induces equivalent quadratic(age0) and quadratic (time) associations, alongside a linear age0-by-time interaction term with a restricted coefficient. These restricted functions of f(age0, time), induced by the “age” timescale, would often not be the initial choice; for example, if a quadratic cross-sectional age0 association and simultaneous linear age0-by-time interaction are of interest, the less restricted comparison model in 3B would be a natural candidate. Figure 3 and eAppendix3; https://links.lww.com/EDE/C240 give suggested corresponding nonrestricted E(Y) = f(age0, time) models that can be compared with assumed E(Y) = f(age) models to examine degrees of potential bias induced by “age” timescale restricting assumptions.
FIGURE 3.
Selected E(Y)= f(age) models and corresponding induced three-dimensional surface fits for E(Y)=f(age0, time). In general, every chosen “current age” model, (linear, quadratic, interactions, etc.), induces assumed forms on both axes of the (age0, time) space, with additional parameter equivalences and potential interactions. Links for online rotatable versions are in eAppendix3.
TURNING TIME INTO AGE: MARGINAL STANDARDIZATION
Three-dimensional surfaces can be arduous to interpret. Figure 4 shows surface slices at selected baseline ages using pooled data fits of the most flexible mean surfaces Hayes-Larson et al.9 consider: Restricted cubic spline (RCS) functions for age0 interacted with those functions for “time” and/or “current age,” (adjusted for sex and education). Briefly, Figure 4A,B) show “time” axis estimates, for:
FIGURE 4.
Estimated trajectories for APOE4 groups (e4−, e4+) and related differences from mean models using restricted cubic splines (RCS) for baseline age0, time since baseline and/or current age, and interactions of RCS’s for age0 with RCS’s for time and/or current age. Akaike\Bayesian Information Criterion slightly favored the “time” model in panels C\D (AIC=179,027; BIC=179,369) over the “age” model in panels A\B (AIC =179,038; BIC=179,380), (lower is better). However, APOE4 difference estimates from the “age” model are nearly identical to the “time” model, and appear more precise when there is less information, e.g. the small group of 80–90-year-olds who entered early (age0 = 50) and were observed for 30–40 years. See eAppendix4 for details.
while Figure 4C,D show “age” axis estimates, for:
All Figure 4 estimated trajectories and 10-year differences between APOE4 groups are constructed using postestimation marginal standardization.14–16 This approach allows projections of the estimates onto the “current age” dimension for both models. Similar to the models in Figure 3, the f(age0, age) model induces specific forms for the RCS(age0) & RCS(time) parameters, while E(Y) = f(age0, time) does not. Even so, the model fits and APOE4 trajectory differences were nearly identical, potentially due to the similarity of between-person and within-person effects, or to the flexibility of restricted cubic spline interactions. Model fit statistics showed little difference, slightly favoring f(age0, time) over f(age0, age). However, while most APOE4 differences were very similar between the models in Figure 4 (eAppendix4; https://links.lww.com/EDE/C240, eTables 2 and 3; https://links.lww.com/EDE/C240), estimates for the few participants who entered the study frame in their 50s–60s and were observed into their 80–90s (i.e., over 30–40 years), are more precise in the “age” model than the “time” model. Simple exploratory data analyses (Figure 1, eFigure2; https://links.lww.com/EDE/C240), reveal that only a handful of participants contributing 30+ years of within-person information for the “time” model, whereas many 80–90 years old participants contributed between-person information. Although the age timescale pooling assumptions do not always decrease variance, (estimates in time models may be more precise than in age models), in many settings precision is gained by leveraging the cross-sectional information. In this case, the Akaike\Bayesian information criterion (AIC\BIC) slightly favors the “time” model, but given better precision, the “age” model may be more valuable. Again, considering “age” and “time” models together provides a useful understanding of why the estimates perform as they do and helps with bias–variance tradeoff choices.
MIXING IT UP: VARIANCE MODELS
Connections above for mean functions hold for both generalized estimating equation and linear mixed model estimation approaches. Hayes-Larson et al.9 use linear mixed models with random intercepts, -slopes and -splines for time and age. Here, we make three comments. First, when using random slopes/functions on age, centering can be important for interpretation (and sometimes convergence). If “current age” is used without centering, random intercepts correspond with an extrapolated age = 0, and their variance and covariance with the random slopes can be artificially inflated. Using noncentered random age slopes in the pooled data led to random intercept variances over 10 times greater compared with centering age at 65 and a nearly perfect inverse correlation between random intercepts and slopes. We centered both age and age0 for fixed and random terms across models (detailed descriptions in eAppendix4, eTable4; https://links.lww.com/EDE/C240).
The second point is that expanding age = ” age0 + time” shows that using random “age” slope models induces equivalence assumptions about variance contributions from between-person age0 and within-person time terms, similar to the equivalence assumptions on mean structures shown earlier. As before:
| (5a) |
| (5b) |
So the random-slopes assumption induces both a , and a random-coefficients with identical variance terms. Although is constant within-person, the random term can be thought of as introducing heteroscedastic variance across age0. To allow both within-person slopes over time and differential overall variance across baseline age0, one could examine a less restricted “age0+time” model:
| (6) |
where and and their variance terms are allowed to differ. This shows again that using “current age” for random slopes in (5a) induces additional equality pooling assumptions. As before, if the “age” model assumptions hold, (, , and in (6)), then pooling the information and using in (5a) is supported and may provide more precision. The commonly used “time” random-slopes variance model:
| (7) |
simply assumes there is no additional heteroscedastic variance in outcomes across baseline age0 () after accounting for the fixed effects, random intercepts and random-slopes over time. Variance assumptions in (5a) and (7) can be examined with the alternative variance model (6) and its (age0,time) sources. The authors and others5 purport that random slopes in “age” versus “time” models cannot be made mathematically equivalent. This is true if you compare basic models (5a) and (7) above, ignoring the additional age0 term in the variance model. Equation (6) above shows that you can make a random-slopes “time” model equivalent to a random-slopes “age” model by including the random “age0” term in the “time” model with the identical random effect (5b); this can be fit using constraints in standard software packages. Note that while the models can be made mathematically equivalent, model convergence depends on both the model and data. Impossibilities of fitting a model offer evidence that it is inconsistent with the data or over-specified/overly complex. eTable5; https://links.lww.com/EDE/C240 gives results from the pooled data for all variance models (5)–(7). Although variance terms for random age0 and random time slopes in model (6) show 95% confidence intervals that support the inclusion of both. Model (5a), random age slopes assuming () shows better AIC/BIC fit statistics compared with (6) or (7), Model (5b) was fit using constraints () and has equivalent estimates, log-likelihood and AIC\BIC to (5a). Additional useful work related to both variance points 1 & 2 (centering and random-slopes equivalence) has appeared in the multilevel model literature for disaggregating “conflated” within-cluster and between-cluster fixed and random effects.17
Our third point on variance models is that we used semi-robust “sandwich” variance estimates18,19 throughout and suggest others do so whenever possible. Semi-robust variance estimates help protect against mis-specifications when we have guessed a useful but incorrect variance model, and per the famous George Box quote, all models are wrong.
ALL MODELS ARE WRONG, SOME ARE USEFUL… AND MULTIPLE USEFUL MODELS ARE ILLUMINATING
The old joke goes: “That person uses statistics like a drunk uses a lamppost, more for support than illumination.” Examining multiple approaches to scientific queries can offer deeper insights than using a single model and simply stating a number for support (P value, AIC, BIC, etc.). We have attempted to illuminate technical aspects underlying the rich set of example timescale models examined by Hayes-Larson and colleagues. Many discussions regarding longitudinal timescale choice try to answer the question “which timescale should you use?” We take Box’s position that they are all wrong. However, recognizing how “age” and “time” are connected can help lead to using more useful models. Comparing timescales across multiple mean and variance structures helps to better understand within-person and between-person contributions to the estimates, even though they may cancel out when comparing groups. When contributions diverge, examining factors such as learning/practice effects,20–22 survivorship/attrition/selection effects,23–25 and Age–Period–Cohort effects,26,27 can help illuminate optimal estimates and truths about the world.
ACKNOWLEDGMENTS
We thank the ARIC, BLSA, and MCSA studies for making their data available. We thank Gwen Windham for constructive editorial comments and Terry Therneau for invigorating discussions on timescales, centering and general modeling approaches.
ABOUT THE AUTHORS
MICHAEL GRISWOLD is Professor of Biostatistics and Data Science in the MIND Center at the University of Mississippi Medical Center and adjunct faculty in the Department of Biostatistics at the Johns Hopkins University. His work involves translational biostatistics, data harmonization, and longitudinal models for complex data with a substantive emphasis on healthy aging and Alzheimer’s Disease and Related Dementias.
MARIA GLYMOUR is Chair of the Department of Epidemiology at Boston University School of Public Health. Her work evaluates social, behavioral, and clinical factors that influence Alzheimer’s Disease and Related Dementias, with attention to methodologic challenges in dementia research.
Supplementary Material
Footnotes
Editor’s Note: A related article appears on page 560.
This paper utilized data from three established cohort studies, with funding support as follows:
• The Atherosclerosis Risk in Communities Study (ARIC) is supported by National Heart, Lung, and Blood Institute contracts (75N92022D00001, 75N92022D00002, 75N92022D00003, 75N92022D00004, 75N92022D00005). The ARIC Neurocognitive Study is supported by grants U01HL096812, U01HL096814, U01HL096899, U01HL096902, and U01HL096917 from the NIH (NHLBI, NINDS, NIA and NIDCD).
• The Baltimore Longitudinal Study of Aging (BLSA) is supported by the Intramural Research Program, National Institute on Aging.
• The Mayo Clinic Study of Aging (MCSA) is supported by the National Institutes of Health grant U01AG06786.
• The authors thank the investigators, staff and participants of the ARIC, BLSA and MCSA studies for their important contributions.
The authors report no conflicts of interest.
SDC Supplemental digital content is available through direct URL citations in the HTML and PDF versions of this article (www.epidem.com).
ARIC data can be made available to interested researchers through established study protocols (https://sites.cscc.unc.edu/aric/). BLSA data can be made available to interested researchers through established study protocols (https://www.blsa.nih.gov/). MCSA data can be made available to interested researchers through established study protocols (https://mayo.edu/). Code can be made available to interested researchers by contacting the corresponding author.
REFERENCES
- 1.Diggle P, Heagerty P, Liang K-Y, Zeger SL. Analysis of longitudinal data. Oxford University Press; 2002. [Google Scholar]
- 2.Hedeker D, Gibbons RD. Longitudinal data analysis. John Wiley & Sons; 2006. [Google Scholar]
- 3.Mendes de Leon CF. Aging and the elapse of time: a comment on the analysis of change. J Gerontol B Psychol Sci Soc Sci. 2007;62:S198–S202. [DOI] [PubMed] [Google Scholar]
- 4.Morrell CH, Brant LJ, Ferrucci L. Model choice can obscure results in longitudinal studies. J Gerontol A Biol Sci Med Sci. 2009;64:215–222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hoffman L. Considering alternative metrics of time: does anybody really know what“ Time” is? In: Harring JR, Hancock GR, eds. Advances in Longitudinal Methods in the Social and Behavioral Sciences. Information Age Publishing; 2012;255–287. [Google Scholar]
- 6.Bell A. Age period cohort analysis: a review of what we should and shouldn’t do. Ann Hum Biol. 2020;47:208–217. [DOI] [PubMed] [Google Scholar]
- 7.Glenn ND. Cohort analysts futile quest - statistical attempts to separate age, period and cohort effects. Am Sociol Rev. 1976;41:900–904. [Google Scholar]
- 8.Goldstein H. Age, period and cohort effects—A confounded confusion. J Appl Statist. 1979;6:19–24. [Google Scholar]
- 9.Hayes-Larson E, Andrews RM, Kezios KL, et al. Approaches to timescale choice in cognitive aging research and potential implications for estimated exposure effects: coordinated analyses in ten cohorts of older adults. Epidemiology. 2025: In press. doi:10.1097/EDE.0000000000001859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.The ARIC Investigators. The atherosclerosis risk in communit (ARIC) study: design and objectives. Am J Epidemiol. 1989;129:687–702. [PubMed] [Google Scholar]
- 11.Ferrucci L. The Baltimore Longitudinal Study of Aging (BLSA): a 50-year-long journey and plans for the future. Vol. 63. Oxford University Press; 2008;1416–1419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Roberts RO, Geda YE, Knopman DS, et al. The Mayo Clinic Study of Aging: design and sampling, participation, baseline measures and sample characteristics. Neuroepidemiology. 2008;30:58–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hastie T, Tibshirani R. Generalized additive models. Stat Sci. 1986;1:297–310. [DOI] [PubMed] [Google Scholar]
- 14.Localio AR, Henegan JA, Chang S, et al. Standardization and prediction to control confounding: estimating risk differences and ratios for clinical interpretations and decision making. Ann Intern Med. 2025: In press. doi:10.7326/ANNALS-25-00082. [DOI] [PubMed] [Google Scholar]
- 15.Searle SR, Speed FM, Milliken GA. Population marginal means in the linear model: an alternative to least squares means. Am Stat. 1980;34:216–221. [Google Scholar]
- 16.Graubard BI, Korn EL. Predictive margins with survey data. Biometrics. 2004;55:652–659. [DOI] [PubMed] [Google Scholar]
- 17.Rights JD, Sterba SK. On the common but problematic specification of conflated random slopes in multilevel models. Multivariate Behav Res. 2023;58:1106–1133. [DOI] [PubMed] [Google Scholar]
- 18.Huber PJ. The behavior of maximum likelihood estimates under nonstandard conditions. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability. Vol 1. Statistics: University of California Press, 1967;221–234. [Google Scholar]
- 19.White H. A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econom J Econom Soc. 1980;48:817–838. [Google Scholar]
- 20.Hoffman L, Hofer SM, Sliwinski MJ. On the confounds among retest gains and age-cohort differences in the estimation of within-person change in longitudinal studies: a simulation study. Psychol Aging. 2011;26:778–791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Jones RN. Practice and retest effects in longitudinal studies of cognitive functioning. Alzheimers Dement (Amst). 2015;1:101–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Scharfen J, Jansen K, Holling H. Retest effects in working memory capacity tests: a meta-analysis. Psychon Bull Rev. 2018;25:2175–2199. [DOI] [PubMed] [Google Scholar]
- 23.Rouanet A, Helmer C, Dartigues JF, Jacqmin-Gadda H. Interpretation of mixed models and marginal models with cohort attrition due to death and drop-out. Stat Methods Med Res. 2019;28:343–356. [DOI] [PubMed] [Google Scholar]
- 24.Griswold ME, Talluri R, Zhu X, et al. Reflection on modern methods: shared-parameter models for longitudinal studies with missing data. Int J Epidemiol. 2021;50:1384–1393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Davis-Plourde KL, Mayeda ER, Lodi S, et al. Joint models for estimating determinants of cognitive decline in the presence of survival bias. Epidemiology. 2022;33:362–371. [DOI] [PubMed] [Google Scholar]
- 26.Bell A, Jones K. The impossibility of separating age, period and cohort effects. Soc Sci Med (1982). 2013;93:163–165. [DOI] [PubMed] [Google Scholar]
- 27.Sjölander A, Gabriel EE. A generalization of the mechanism-based approach for age–period–cohort models. Epidemiology. 2025;36:227–236. [DOI] [PubMed] [Google Scholar]




