Abstract
Background
Blood-based biomarkers for Alzheimer’s disease (AD) pathology are appealing options in large population-based studies due to their low cost, minimal invasiveness, and feasibility of collection in non-clinical settings. Despite these benefits, blood-based biomarkers have lower test-retest reliability than neuroimaging measures like amyloid positron emission tomography (amyloid-PET) Centiloids; trade-offs in power and bias remain unexplored.
Methods
We use data from Alzheimer’s Disease Neuroimaging Initiative (ADNI) and the Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease (A4) studies, which include both amyloid-PET and blood-based measures, to assess differences in statistical power, required sample size, and bias when replacing a neuroimaging measure with a blood-based measure. We use simulations parameterized based on these studies to show potential implications of using plasma p-tau 181 or p-tau 217, blood-based AD biomarkers, in place of Centiloids from amyloid-PET, when the biomarker is either the exposure or the outcome in an analysis of interest.
Results
We demonstrated that substituting amyloid-PET Centiloids with a blood-based measure of p-tau can substantially reduce power, requiring 1.5-6.5 times the sample size to achieve 80% power compared to amyloid-PET. In addition, using a blood-based biomarker as the exposure can introduce significant regression dilution bias, attenuating estimated associations.
Conclusions
While blood-based biomarkers are lower cost and easier to collect than neuroimaging measures, their use as proxies for AD pathology may introduce substantial methodological challenges, depending on the p-tau isoform. Consideration of the sample sizes they necessitate and their potential for bias is critical for the design and interpretation of studies employing these biomarkers.
Keywords: Amyloid-PET, Centiloids, Blood-based biomarkers, Measurement error, Regression dilution bias
Introduction
Given the growing disease burden associated with dementia and inequitable distribution across groups defined by social factors like race and education, scalable solutions for measuring disease pathology in research studies while facilitating greater inclusion and longitudinal follow-up are urgently needed.1,2 Collection of neuroimaging (eg, positron emission tomography with amyloid ligands or amyloid-PET), cerebrospinal fluid (CSF), and neuropathological measures of Alzheimer’s disease (AD) pathology in studies has historically been challenging. Neuroimaging and CSF sample collection are expensive and burdensome, and may require visits to clinics with neuroimaging facilities.3 Neuropathology is not feasible when potential study participants are uncomfortable with brain donation, and requires substantial logistics to obtain and preserve the brain quickly following death. In addition, neuropathology cannot capture how pathology changes over the life course, and distinguishing between changes that contribute to death from those associated with survival is challenging. As a result, blood-based biomarkers for AD pathology are appealing and increasingly available options in population-based studies due to their low cost, minimal invasiveness, and feasibility of collection in non-clinical settings.3,4 In addition, research using blood-based biomarkers will continue to be a priority because the biomarkers are increasingly available and used for clinical practice, including diagnosis and decision-making regarding amyloid-targeting drugs.5,6
Despite the benefits of blood-based biomarkers for inclusion in longitudinal research, there are several disadvantages compared with neuroimaging measures like amyloid-PET. Neuroimaging can provide more detailed information, including topographical data about the location of pathology or atrophy in the brain. Although documentation in the literature is limited, the summary numeric measures (eg, Centiloids, a standardized scale for brain amyloid burden designed to overcome differences across analysis techniques and tracers by anchoring scores to 0 for individuals who are confidently classified as amyloid-negative, and 100 for a typical Alzheimer’s disease patient7) are likely to have good reliability.7–13 In contrast, blood-based biomarkers typically only offer a single quantitative value with no topographical information, and are subject to measurement challenges, including pre-analytical factors, lower test-retest reliability, difficulty standardizing across laboratories and assays, and, because they are measured from blood, influence from peripheral factors, such as body mass index, liver function, and kidney function.14,15
Substantial attention in the AD biomarker literature has focused on validity and measurement challenges related to sample storage and transport, assay differences, and the impact of clinical comorbidities such as body mass index and kidney function on analyte levels.14,15 However, even in the absence of these issues, random measurement error (ie, “noise”) brings challenges, including reduced precision and potential bias.16,17 Notably, although direct head-to‐head comparisons remain limited, in models that include both imaging and plasma biomarkers, PET-derived measures (including those derived from amyloid- or tau-PET) show stronger associations with cognitive outcomes than blood-based measures alone.18,19 Extant literature comparing blood-based and PET measures typically focuses on individual-level diagnosis or progression. Among blood-based biomarkers for AD, phosphorylated tau (p-tau) is the most closely correlated with brain amyloid burden, possibly because it may reflect rate of tau accumulation, which is hypothesized to be driven by cumulative amyloid burden in the dominant model of AD progression).20 As a result, measures of plasma p-tau, particularly the 217 isoform, have emerged as a key, ubiquitously used blood-based marker, with growing evidence related to its use for screening and diagnosis.21,22
Despite clear epidemiologic evidence that noise in an outcome measure reduces power, and that noise in an exposure measure can induce regression dilution bias,16,17 we are not aware of evidence related to implications of measurement error in practice in AD biomarker research, such as an evaluation of the tradeoffs between the increased sample size that blood-based biomarkers facilitate with the increased precision associated with summary neuroimaging measures. For example, if a sample of 100 is needed for an amyloid-PET neuroimaging study, how many are required for similar power for a blood-based biomarker study that uses p-tau in lieu of Centiloids, and what bias could be expected if the biomarkers are an exposure variable? A clear understanding of these trade-offs is essential for study design and planning, as well as for the interpretation of secondary analyses of existing data.
To address this gap, we leverage data from Alzheimer’s Disease Neuroimaging Initiative (ADNI) and the Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease (A4) studies, which include both amyloid PET scans and blood-based measures. We focus on amyloid-PET Centiloids and phosphorylated tau (p-tau). We use simulations parameterized based on data from ADNI and A4 to show the potential implications of measurement error for the use of blood-based biomarkers in AD research, including for research questions in which the biomarker is either the exposure or outcome.
Methods
Overview
Our analysis had 3 main steps. First, we used empirical data to fit models for parameters of interest, including (a) the relationship between age and Centiloids, (b) the relationship between Centiloids and p-tau, and (c) memory score as a function of age and Centiloids. Second, we used estimates from the fitted models to generate simulated study data on age, Centiloids, p-tau, and memory score, varying sample sizes of the simulated studies and the amount of random error in both the Centiloids and p-tau measures. Finally, in the simulated data, we compared the power and bias of estimated associations. Below, we describe each step in more detail.
1. Empirical data and analysis
Data used in preparation of this article were from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database and the Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease (A4) study. We used two different cohorts with two different isoforms of p-tau to determine how robust the observed patterns were across biomarker assays and study populations. ADNI has measures of both p-tau 181 and p-tau 217, while A4 has only p-tau 217 measures.
All participants provided informed consent at the time of enrollment. Institutional review board (IRB) approval was obtained at each trial site.
Alzheimer’s Disease Neuroimaging Initiative
The ADNI was launched in 2003 as a public-private partnership, led by Principal Investigator Michael W. Weiner, MD. The original goal of ADNI was to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). The current goals include validating biomarkers for clinical trials, improving the generalizability of ADNI data by increasing diversity in the participant cohort, and to provide data concerning the diagnosis and progression of Alzheimer’s disease to the scientific community. For up-to-date information, see adni.loni.usc.edu. ADNI data was obtained via an application on the Laboratory of Neuro Imaging website (adni.loni.usc.edu).
We used data from ADNI participants who were cognitively normal or had mild cognitive impairment at baseline. For the blood-based biomarker, we used plasma p-tau 181 and 217 measures: p-tau 181 was analyzed using the Single Molecule array (Simoa) technique, using an in-house assay developed in the Clinical Neurochemistry Laboratory, University of Gothenburg, Sweden (Sept 2, 2022), while p-tau 217 was analyzed using the Fujirebio Lumipulse chemiluminescent enzyme immunoassay platform at the University of Pennsylvania (January 8, 2026). (The dates reported reflect ADNI data release and processing versions rather than the timing of participant-level data collection.) For each isoform, we selected the earliest plasma p-tau for each participant, and used amyloid PET and cognitive measures closest in date to time of plasma collection. Individuals with longer than a 2 year time interval between blood collection and amyloid-PET were dropped. The University of California, Berkeley processed amyloid-PET data, which was used to obtain Centiloids (6 mm, Nov 1, 2024). To assess cognition, we used memory scores obtained from the Center for Psychometric Analyses in Aging and Neurodegeneration (CPAAN) at the University of Washington (June 17, 2025). CPAAN is generating co-calibrated scores for memory, executive functioning, language, and visuospatial abilities using modern psychometric approaches as the Cognition Core for the Alzheimer’s Disease Sequencing Project Phenotype Harmonization Consortium (ADSP-PHC). These scores incorporate data from multiple cognitive tests23 and use modern psychometric approaches that result in co-calibrated and harmonized scores across datasets. Methods detailing this approach have been previously published.23,24
A4 study
The A4 study was a phase 3 randomized-controlled clinical trial of solanezumab among individuals with elevated amyloid assessed via amyloid-PET but without clinical symptoms. Enrollment began in 2014 and 1169 participants aged 65-86 were randomized to receive intravenous solanezumab or placebo. A4 study data were obtained from the A4/LEARN Study Data Package (A4studydata.org).
Our analysis uses data from visit 6 (time 0 during the placebo-controlled period), when plasma samples were collected, including p-tau 217 measured using Lilly Clinical Diagnostics Laboratory assay. Neuroimaging data were collected at visit 2 (screening visit), up to 90 days prior to visit 6. Amyloid-PET standardized uptake value ratios (SUVrs) were converted to Centiloids using the following formula: Centiloids = 183.07*(Florbetapir SUVr) − 177.26. Cognitive analyses focused on memory performance at visit 6 of the A4 study. Memory scores were co-calibrated and harmonized by the CPAAN with data from ADNI and several other aging cohorts,24 allowing findings related to memory to be directly compared across studies. Details of items used for score development are provided in the Supplement (Tables S1 and S2—see online supplementary material). We only use data up to the point of randomization, so we do not include study arm in our analyses.
Empirical analysis and parameter estimation
To parameterize simulations, we fit models separately to the ADNI p-tau 181, ADNI p-tau 217, and A4 samples. For all samples, we fit a gamma distribution to the empirical age distribution. For the remaining relationships, we used generalized additive models for location, scale, and shape (GAMLSS). GAMLSS allows for mean modeling and also modeling variance and skewness as functions of parameters. Mean and standard deviation of Centiloids were modeled as linear functions of age using a skew-normal distribution. Mean and shape parameters for p-tau were modeled as a linear function of Centiloids using a gamma distribution. The mean and variance of memory scores were modeled as functions of Centiloids and age using a normal distribution.
2. Simulation study
Data generation
We generated simulated data according to the diagram shown in Figure 1, using pre-fitted model parameters from ADNI or A4 (Tables S3-S5—see online supplementary material) for age, Centiloids, p-tau, and memory scores as described in the Empirical Analysis section. Age was simulated exogenously using a gamma distribution. True Centiloids, corresponding to unmeasured true amyloid burden, was simulated as a function of age using a skew-normal distribution. Measured Centiloids were simulated as a function of true Centiloids and Gaussian error. The blood-based biomarker p-tau (p-tau 181 for ADNI, p-tau 217 for ADNI and A4) was simulated as a gamma distribution with mean (for true p-tau measure) and variance (for measured p-tau) as functions of “true Centiloids.” Although a linear transformation of the “true” Centiloids measure was used, the “true” p-tau measure was needed to provide a comparison on the same scale as the measured p-tau to evaluate bias introduced by the variance in measured p-tau. Finally, memory scores were simulated from a normal distribution with mean and variance depending on both age and true Centiloids.
Figure 1.
Diagram showing data-generating structure (directed acyclic graph) for simulation study parameterized based on empirical data.
Scenarios
Scenarios were defined by combinations of three parameters. First, we used fitted models from ADNI p-tau 181, ADNI p-tau 217, or A4 to parameterize the data generation models. Second, we varied sample sizes generated from 10 to 10 000 participants. Finally, we varied the amount of Gaussian error added to the “true” Centiloid values to obtain the “measured” Centiloid variable; for the error, we used a mean of zero and standard deviations ranging from 0 to 20 Centiloids to account for a range of plausible measurement errors. Random measurement error for Centiloids is not precisely documented in the literature. Still, it is thought to be on the order of a few Centiloids (3-12, depending on the tracer and dataset),7,10,25,26 making our range of up to 20 Centiloids a realistic but somewhat pessimistic estimate. We evaluated a total of 165 scenarios, corresponding to parameters estimated from 3 samples times 11 sample sizes times 5 levels of noise (4 in Centiloids, and p-tau).
3. Quantifying power and bias
For each scenario, we ran 5000 data-generating iterations. In each iteration of data generation, models for the two associations of interest were estimated using standard modeling strategies. First, the relationship between age and biomarker outcome (Centiloids or p-tau) was estimated using simple linear regression. Second, the relationship between the biomarker and the memory score was estimated using linear regression adjusted for age.
Simulation results were summarized across the 5000 iterations by scenario (ie, study used to parameterize the simulations, p-tau isoform, sample size, and magnitude of added error). Power was calculated as the proportion of times p < .05 was achieved for the coefficient of interest (age, in age-biomarker models, and biomarker, in biomarker-memory score models). Confidence intervals for percent power were calculated using the exact binomial method. Differences in percent power and 95% confidence intervals were calculated using the Newcombe method based on Wilson score intervals. Bias was calculated for each iteration as the difference between a coefficient for the measured biomarker and the coefficient for the true biomarker (ie, coefficients for Centiloids with Gaussian error added vs. no error added, or coefficients for p-tau with variance added vs. p-tau that was a linear transformation of true Centiloids). Confidence intervals were calculated by multiplying the standard norm deviate (approximately 1.96) by the standard error, where the standard error is the standard deviation of the difference divided by the square root of 5000, the number of simulations.
All analyses were conducted using R version 4.5.1. ChatGPT (GPT-5.2, OpenAI) was used for code debugging assistance. All code was reviewed and verified by the authors, who retain full responsibility for the analysis and conclusions.
Results
Descriptive statistics for the ADNI and A4 datasets are presented in Table 1. Figure S1 (see online supplementary material for a color version of this figure) shows the distribution of Centiloid values for the three samples. The analytic samples included 925 individuals from ADNI for p-tau 181 and 658 individuals from ADNI for p-tau 217 and 1082 individuals from A4. The p-tau 181 and 217 samples in ADNI had overlap of only 143 individuals, and thus reflect largely distinct subsamples of the ADNI cohort. Figure S2 (see online supplementary material for a color version of this figure) shows the distribution of time differences between blood collection for plasma measures and amyloid PET scans. Mean age and co-calibrated and harmonized memory scores were comparable across the two cohorts, although memory scores were slightly higher in ADNI. Centiloids levels were higher in A4, reflecting that amyloid-PET positivity was an inclusion criterion for that study. Levels of p-tau are not directly comparable since platforms and assays differed. Tables S3-S5 (see online supplementary material for a color version of this figure) provide the parameters for models fit to these data and used for subsequent simulations.
Table 1.
Descriptives of variables used to parameterize simulations in the ADNI and A4 analytic samples.
| Variable, mean (SD) | ADNI: p-tau 181 sample | ADNI: p-tau 217 sample | A4 sample |
|---|---|---|---|
| N = 925 | N = 658 | N = 1082 | |
| Age | 72.00 (8.00) | 72.58 (7.87) | 71.92 (4.80) |
| Centiloids | 33.05 (45.42) | 30.84 (44.18) | 66.21 (32.79) |
| p-tau | 18.47 (20.09) | 0.24 (0.27) | 0.28 (0.16) |
| Co-calibrated and harmonized memory z-scores | 0.63 (0.97) | 0.81 (0.97) | 0.43 (0.40) |
Abbreviation: ADNI, Alzheimer’s Disease Neuroimaging Initiative; A4, Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease; p-tau, phosphorylated tau.
In simulations, we first evaluated bias and power for estimated associations between age (exposure/independent variable) and the biomarker (dependent variable). As shown in Figure 2A, additional measurement error in Centiloids up to 20 Centiloids did not have a large impact on power in either cohort. However, as also shown in Figure 2A, substituting plasma p-tau for Centiloids significantly reduced statistical power, particularly for p-tau 181. Figure 2B shows that the difference in power was largest in ADNI using p-tau 181, and smallest in ADNI using p-tau 217; power differences in A4 (p-tau 217) were intermediate. Figure 2C shows that sample sizes required for 80% power when using plasma p-tau 181 in ADNI were larger than when using p-tau 217 in ADNI or A4. Achieving 80% power required 1.5-6.5 times the sample size when using plasma p-tau measures versus the true Centiloids measure (tabular data in Table S6—see online supplementary material). Figure 3 and Figure S3 (see online supplementary material for a color version of this figure) show that there was no bias in the estimated coefficient for age in models with a biomarker outcome.
Figure 2.
Power curves and power curve differences for biomarker outcome models. (A) Percent power as a function of sample size for detecting a statistically significant correlation between a biomarker and age. Power curves for ADNI and A4 show that power is not substantially affected by added error in Centiloids, but replacing Centiloids with p-tau measures dramatically reduces power, especially for p-tau 181. (B) Power curve differences show the difference in power compared with using true Centiloids (no added error) as the outcome measure. Reductions in power when replacing Centiloids with p-tau are larger for p-tau 181 than for p-tau 217 in ADNI. (C) Number of participants needed to achieve 80% power for true Centiloids, Centiloids with added error, and p-tau in ADNI and A4 in biomarker outcome models. For models assessing the relationship between an exposure, age, and a biomarker, the sample size required to achieve 80% power is between 1.5 and 6.5 times larger when p-tau vs. true Centiloids is the outcome. 95% confidence intervals are calculated using the delta method. ADNI, Alzheimer’s Disease Neuroimaging Initiative; A4, Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease; p-tau, phosphorylated tau.
Figure 3.

Bias under each scenario for models with the biomarker-as-outcome (triangles) and biomarker-as-exposure (circles) in ADNI and A4. Bias is estimated using simulations with sample sizes of 10 000. When the biomarker is the outcome, no bias occurs when adding error to Centiloids or replacing Centiloids with p-tau. When the biomarker is the exposure, adding error to true Centiloids only modestly attenuates association estimates, whereas replacing Centiloids with p-tau 181 dramatically attenuates association estimates. Attenuations are larger in A4 than ADNI for p-tau 217. ADNI, Alzheimer’s Disease Neuroimaging Initiative; A4, Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease; p-tau, phosphorylated tau.
Simulation findings differed when estimating associations between biomarkers (exposure/independent variable) and memory score (dependent variable), adjusted for age. As shown in Figure 3, regression dilution bias attenuated estimated associations whenever measurement error was present, and increased with the magnitude of measurement error in Centiloids (eg, for 20 Centiloids of error, bias was −18% in ADNI for p-tau 181, −20% in ADNI for p-tau 217, and −28% in A4). Negative bias was particularly substantial when estimating associations using plasma p-tau (−88% for p-tau 181 in ADNI, −34% for p-tau 217 in ADNI, and −62% for p-tau 217 in A4). As shown in Figure 4, measurement error was also associated with substantial decreases in power to detect statistically significant biomarker coefficients, a consequence of the attenuation of the estimated association due to regression dilution bias.
Figure 4.
Power curves for biomarker-as-exposure models. In addition to inducing regression dilution bias, measurement error reduces power in biomarker-as-exposure models. Dramatic reductions in power are seen when using plasma p-tau, with larger reductions in ADNI. (A) Percent power as a function of sample size for detecting a statistically significant coefficient for the biomarker and memory score, adjusting for age. (B). Power curve differences show the difference in power compared with using true Centiloids (no added error) as the exposure measure. Reductions in power when replacing Centiloids with plasma p-tau is larger for p-tau 181 than p-tau 217 in ADNI and for p-tau 217 in A4 than p-tau 217 in A4. ADNI, Alzheimer’s Disease Neuroimaging Initiative; A4, — Anti-Amyloid Treatment in Asymptomatic Alzheimer’s Disease; p-tau, phosphorylated tau.
Discussion
The study aimed to quantify potential implications for power and bias of using blood-based AD biomarkers in place of neuroimaging measures in AD biomarker studies. We demonstrated that using blood-based p-tau in lieu of amyloid-PET Centiloids can substantially reduce power to detect associations when the biomarker is the outcome, and can introduce significant regression dilution bias, yielding attenuated association estimates and reduced power, when the biomarker is the exposure. Relationships between age and amyloid and amyloid and memory score depended significantly on the cohort. P-tau 217 (Lilly) in the A4 study and p-tau 217 in ADNI (Fujirebio) appeared to outperform p-tau 181 (in-house Simoa) in ADNI relative to Centiloids.
There is substantial enthusiasm surrounding AD blood-based biomarkers in both research and clinical practice. Epidemiologists have long called for more representative samples, including in AD research27; until recently,28 the majority of neuroimaging cohorts have comprised predominantly non-Latino White populations. Blood-based biomarkers have significant appeal for feasibly increasing the diversity and representativeness of samples and for longitudinal follow-up, as they are less expensive and do not require travel to a neuroimaging site, typically an academic medical center.3 There are additional geographic requirements for proximity to equipment for generating radioactive tracers used in PET imaging, making PET infeasible in many rural settings.29–31 In contrast, blood can be collected at a wider range of locations, including in-home collection or, as the field develops further, potentially as dried blood spots32; samples can then be shipped to and processed at a central laboratory.33 Clinically, provisional recommendations already exist for using blood-based biomarkers for diagnosis and to make prognoses about dementia risk.34,35 Following the approvals of the amyloid-targeting therapies lecanemab and donanemab, this enthusiasm has grown, as these biomarkers currently play and will continue to play a role in determining eligibility for treatment.6 Approaches to address measurement error and uncertainty in clinical use are currently being explored, but remain limited and incompletely validated (eg, a two-threshold approach with very low values ruling out amyloid positivity, very high values ruling it in, and follow-up imaging for those with intermediate values)36; ongoing research and integration with other biomarkers and diagnostics is going to remain a significant priority.
In the research setting, our results show that studies using p-tau measures may need to be substantially larger to achieve the same statistical power as a comparable amyloid-PET study (1.5 to 6.5 times larger for the relationships we examined in this analysis, based on ADNI and A4). Even allowing for pessimistic measurement error in Centiloids (ie, 20 Centiloids of error), sample sizes required for comparable 80% power analyses using plasma p-tau were larger. Differences across samples in our analysis in absolute power to detect age-amyloid relationships, and differences between power using p-tau versus centiloids measures likely reflect both the difference between p-tau 181 (ADNI) and p-tau 217 (ADNI/A4)37 and sample-specific age-Centiloid association strengths in the empirical data.
We also observed substantial reductions in power and large bias (attenuation up to 88%) for measures of p-tau versus Centiloids when the biomarker was the exposure of interest. Exact magnitudes of bias and reduced power in other settings likely depend on the sample composition, associations, biomarkers of interest, and isoform used. Still, it is possible that both reduced power and regression dilution bias may explain the mixed or null findings in several recent AD biomarker studies.38,39 As a result, researchers using existing biomarker data should consider quantitative bias analysis to evaluate the potential role of measurement error in their findings.40,41 Simulations showed p-tau 217 is preferable to p-tau 181in terms of bias and precision, however this may not be possible given legacy decisions in existing cohorts. In addition, using combinations of multiple biomarkers (eg, a factor score with better measurement precision) may reduce some of these concerns, and research on how best to integrate multiple measures is ongoing.42–44
Our findings also have several implications for planning future research using blood-based AD biomarkers. Investigators should consider tradeoffs between the ease and accessibility of blood-based biomarkers and the larger sample size required for adequate statistical power compared with neuroimaging. Given that a panel of AD blood tests currently costs a few hundred dollars, while the cost of an amyloid-PET scan can exceed several thousand dollars, cost-effectiveness may still favor blood-based biomarkers.45 However, recruitment and evaluation of additional study participants confer additional costs, particularly in longitudinal studies. In addition, increasing sample size does not mitigate regression dilution bias, and researchers may want to consider the collection of amyloid-PET imaging in a subset to develop quantitative methods to account for such bias. Careful consideration of such issues in the study design phase can ensure optimal use of study resources.46
We expect these findings to broadly extend to other blood-based biomarkers and analytic questions. Because p-tau 217 is more strongly correlated with amyloid-PET than other blood-based markers,37 including Aβ42/40, we would expect power loss and regression dilution to be at least as pronounced for alternative biomarkers. Our simulations show this principle: Although ADNI subsample differences likely play a role as well, loss of power and regression dilution bias were correspondingly much larger for p-tau 181 than p-tau 217 when parameterized using ADNI data, as p-tau 181 is less strongly correlated with amyloid-PET than p-tau 217. In addition, studies aiming to evaluate effect modification using blood-based biomarkers would also be expected to require considerably larger sample sizes than those focused on main effects, particularly when biomarkers are used as substitutes rather than complements to PET-based measures.
This study has several limitations. We used a simulation approach, which has the key advantage of allowing specification and comparison against a known “truth,” but requires simplifications to isolate the question of interest, in this case related to measurement error in biomarkers. For example, our approach assumed that Centiloids was an unbiased gold standard. To mitigate this, we included scenarios in which Centiloids are also measured with random error up to 20 Centiloids, which is larger (more pessimistic) than suggested in the literature7–13; other forms of systematic error, including atrophy and partial volume effects,47,48 were beyond the scope of this paper. In addition, we treated errors in Centiloids and variability in plasma p-tau relative to Centiloids as random, but heteroskedastic, measurement error. Some non-random measurement error may reasonably be modeled as random, such as error due to small differences in thaw time across samples that do not systematically differ by study site or participant characteristics. Systematic measurement error in either Centiloids or p-tau that is not reasonably modeled as random, which we do not address, may induce additional biases. We did not account for other factors that may be associated with p-tau levels such as BMI, liver function, or kidney function, either through adjustment or use of a p-tau to total tau ratio14,49; although these may account for some variability in p-tau relative to Centiloids, making our estimates of measurement error in p-tau potentially somewhat pessimistic, the field is still developing best practices and standardized approaches.
We also did not account for measurement error in other variables (eg, memory score); however, the co-calibrated memory score we used has been shown to have lower measurement error than single cognitive tests.50 We expect that absolute power would be lower in all scenarios for cognitive outcomes with lower measurement precision. Implications for differences in power between Centiloids and p-tau measures based on cognitive outcome measurement precision are harder to predict. Finally, we focused on isolating the impact of measurement error on estimated associations, rather than causal effects, and therefore included minimal confounder adjustment beyond age and did not evaluate potential selection biases.
Finally, our analysis was limited to demonstrating the potential impact of measurement error in blood-based biomarkers using parameters derived from specific empirical data; although the principles underlying the measurement error apply broadly, our specific findings may not generalize to other cohorts or measures. This is clearly demonstrated by differences in our results between the ADNI and A4 studies; these studies have different inclusion criteria, different Centiloids distributions, use different plasma p-tau measures (p-tau 181 and p-tau 217 in ADNI, p-tau 217 in A4), have varying strengths of association for variables of interest, and consequently show different sample size requirements and magnitude of bias. Because both ADNI and A4 lack substantial racial and ethnic diversity, our findings may not fully generalize to more diverse populations in which biomarker performance and variability differ. Unmeasured or uncorrected for variation in liver function, kidney function, and BMI, which are factors known to influence blood-based biomarker concentrations and that vary systematically across populations, may contribute to non-differential measurement error. This may introduce complex, potentially unpredictable bias, particularly in more demographically diverse cohorts.
In summary, despite their expanding role in clinical practice and growing interest in using blood-based biomarkers to assess AD pathology in large population-based research samples, we showed that plausible magnitudes of measurement error in blood-based biomarkers could substantially reduce power and introduce regression dilution bias, attenuating estimated effects. Awareness of these challenges can facilitate optimal design, analysis, and interpretation of studies employing these biomarkers.
Supplementary Material
Acknowledgments
Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. The A4 Study was a secondary prevention trial in preclinical Alzheimer’s disease, aiming to slow cognitive decline associated with brain amyloid accumulation in clinically normal older individuals. The A4 Study was funded by a public-private-philanthropic partnership, including funding from the National Institutes of Health-National Institute on Aging, Eli Lilly and Company, Alzheimer’s Association, Accelerating Medicines Partnership, GHR Foundation, an anonymous foundation, and additional private donors, with in-kind support from Avid Radiopharmaceuticals, Cogstate, Albert Einstein College of Medicine and the Foundation for Neurologic Diseases.The companion observational Longitudinal Evaluation of Amyloid Risk and Neurodegeneration (LEARN) Study was funded by the Alzheimer’s Association and GHR Foundation. The A4 and LEARN Studies were led by Dr. Reisa Sperling at Brigham and Women’s Hospital, Harvard Medical School, and Dr. Paul Aisen at the Alzheimer’s Therapeutic Research Institute (ATRI) at the University of Southern California. The A4 and LEARN Studies were coordinated by ATRI at the University of Southern California, and the data are made available under the auspices of Alzheimer’s Clinical Trial Consortium through the Global Research & Imaging Platform (GRIP). The complete A4 Study Team list is available on: https://www.actcinfo.org/a4-study-team-lists/. We would like to acknowledge the dedication of the study participants and their study partners who made the A4 and LEARN Studies possible. Calibration of memory scores for ADNI and for A4 were funded by U24 AG074855 from NIA. A prior version of this manuscript was posted as a preprint on medRxiv: medRxiv 2025.11.06.25339696; doi: https://doi.org/10.1101/2025.11.06.25339696.
Contributor Information
Sarah F Ackley, Department of Epidemiology, Brown University, Providence, Rhode Island, United States.
Renaud La Joie, Edward and Pearl Fein Memory and Aging Center, Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, California, United States.
Michelle Caunca, Neurovascular Division, Department of Neurology, University of California, San Francisco, San Francisco, California, United States.
Shubhabrata Mukherjee, Department of Medicine, University of Washington, Seattle, Washington, United States.
Seo-Eun Choi, Department of Medicine, University of Washington, Seattle, Washington, United States.
Emily H Trittschuh, GRECC, VA Puget Sound Health Care System, Seattle, Washington, United States; Department of Psychiatry, University of Washington, Seattle, Washington, United States.
Paul K Crane, Department of Medicine, University of Washington, Seattle, Washington, United States.
Eleanor Hayes-Larson, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, California, United States.
Roger Fielding, (Medical Sciences Section).
Supplementary material
Supplementary material is available at The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences online.
Funding
This project was supported by the National Institute on Aging grants: R00AG073454, R01AG082730, R00AG075317, R13AG064971, and R01AG072681. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Conflicts of interest
SFA receives funding from Sanofi for a Brown University student fellowship unrelated to Alzheimer's biomarkers or therapeutics.
Data availability
ADNI data are publicly available and can be obtained from https://adni.loni.usc.edu. A4 Study data are publicly available and can be obtained from https://www.a4studydata.org/.
Author contributions
Conceptualization: S.F.A., E.H.-L. Methodology: S.F.A., R.L.J., M.C., S.M., S.-E.C., E.H.T., P.K.C., E.H.-L. Data curation: E.H.-L., S.F.A., S.M., S.-E.C., E.H.T., P.K.C. Formal analysis: S.F.A., E.H.-L. Writing—original draft: S.F.A., E.H.-L. Writing—review & editing: S.F.A., R.L.J., M.C., S.M., S.-E.C., E.H.T., P.K.C., E.H.-L.
References
- 1. Raman R, Aisen PS, Carillo MC, et al. Tackling a major deficiency of diversity in Alzheimer’s disease therapeutic trials: an CTAD task force report. J Prev Alzheimers Dis. 2022;9:388-392. 10.14283/jpad.2022.50 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Gilmore-Bykovskyi AL, Jin Y, Gleason C, et al. Recruitment and retention of underrepresented populations in Alzheimer’s disease research: a systematic review. Alzheimers Dement (N Y). 2019;5:751-770. 10.1016/j.trci.2019.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Hayes-Larson E, Ackley SF, Turney IC, La Joie R, Mayeda ER, Glymour MM. Considerations for use of blood-based biomarkers in epidemiologic dementia research. Am J Epidemiol. 2024;193:527-535. 10.1093/aje/kwad197 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Delaby C, Hirtz C, Lehmann S. Overview of the blood biomarkers in Alzheimer’s disease: promises and challenges. Rev Neurol (Paris). 2023;179:161-172. 10.1016/j.neurol.2022.09.003 [DOI] [PubMed] [Google Scholar]
- 5. Burnham SC, Harper L, Locascio T, McPherson HE, Pontecorvo M, Wang J. Evaluating the interchangeability of blood-based biomarkers and amyloid-PET for identifying patients with Alzheimer’s pathology. Alzheimer’s & Dementia. 2024;20:e091465. 10.1002/dad2.70196 [DOI] [Google Scholar]
- 6. Hansson O, Blennow K, Zetterberg H, Dage J. Blood biomarkers for Alzheimer’s disease in clinical practice and trials. Nat Aging. 2023;May3:506-519. 10.1038/s43587-023-00403-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Klunk WE, Koeppe RA, Price JC, et al. The Centiloid Project: Standardizing quantitative amyloid plaque estimation by PET. Alzheimers Dement. 2015;11:1-15.e4. 10.1016/j.jalz.2014.07.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Rowe CC, Jones G, Doré V, et al. Standardized expression of 18F-NAV4694 and 11C-PiB β-amyloid PET results with the Centiloid scale. J Nucl Med. 2016;57:1233-1237. 10.2967/jnumed.115.171595 [DOI] [PubMed] [Google Scholar]
- 9. Iaccarino L, Burnham SC, Tunali I, et al. A practical overview of the use of amyloid-PET Centiloid values in clinical trials and research. Neuroimage Clin. 2025;46:103765. 10.1016/j.nicl.2025.103765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Kang YK, Min JW, Kwon SJ, Ha S. Reliability of automated amyloid PET quantification: real-world validation of commercial tools against Centiloid project method. Tomography. 2025;11:86. 10.3390/tomography11080086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Tolboom N, Yaqub M, Boellaard R, et al. Test-retest variability of quantitative [11C]PIB studies in Alzheimer’s disease. Eur J Nucl Med Mol Imaging. 2009;36:1629-1638. 10.1007/s00259-009-1129-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Devous MD, Joshi AD, Navitsky M, et al. Test–retest reproducibility for the tau PET imaging agent Flortaucipir F 18. J Nucl Med. 2018;59:937-943. 10.2967/jnumed.117.200691 [DOI] [PubMed] [Google Scholar]
- 13. Timmers T, Ossenkoppele R, Visser D, et al. Test–retest repeatability of [18F]Flortaucipir PET in Alzheimer’s disease and cognitively normal individuals. J Cereb Blood Flow Metab. 2020;40:2464-2474. 10.1177/0271678X19879226 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Zheng HT, Wu Z, Mielke MM, Murray AM, Ryan J. Plasma biomarkers of Alzheimer’s disease and neurodegeneration according to sociodemographic characteristics and chronic health conditions. J Prev Alzheimers Dis. 2024;11:1189-1197. 10.14283/jpad.2024.142 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Panikkar D, Vivek S, Crimmins E, Faul J, Langa KM, Thyagarajan B. Pre-analytical variables influencing stability of blood-based biomarkers of neuropathology. J Alzheimers Dis. 2023;95:735-748. 10.3233/JAD-230384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Hutcheon JA, Chiolero A, Hanley JA. Random measurement error and regression dilution bias. BMJ. 2010;340:c2289. 10.1136/bmj.c2289 [DOI] [PubMed] [Google Scholar]
- 17. Devine OJ, Smith JM. Estimating sample size for epidemiologic studies: the impact of ignoring exposure measurement uncertainty. Stat Med. 1998;17:1375-1389. 10.1002/(sici)1097-0258(19980630)17:12<1375::aid-sim857>3.0.co; 2-d [DOI] [PubMed] [Google Scholar]
- 18. Mundada NS, Rojas JC, Vandevrede L, et al. Head-to-head comparison between plasma p-tau217 and flortaucipir-PET in amyloid-positive patients with cognitive impairment. Alzheimers Res Ther. 2023;15:157. 10.1186/s13195-023-01302-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Jack CR, Wiste HJ, Algeciras-Schimnich A, et al. Comparison of plasma biomarkers and amyloid PET for predicting memory decline in cognitively unimpaired individuals. Alzheimers Dement. 2024;20:2143-2154. 10.1002/alz.13651 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Therriault J, Vermeiren M, Servaes S, et al. Association of phosphorylated tau biomarkers with amyloid positron emission tomography vs tau positron emission tomography. JAMA Neurol. 2023;80:188-199. 10.1001/jamaneurol.2022.4485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Collij LE, Mattsson‐Carlgren N, Janelidze S, Ossenkoppele R, Hansson O. Complementary utility of plasma biomarkers and Aβ‐PET for diagnosis, risk‐stratification, and treatment monitoring in Alzheimer’s disease. Alzheimers Dement. 2025;21:e70763. 10.1002/alz.70763 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Barthélemy NR, Salvadó G, Schindler SE, et al. Highly accurate blood test for Alzheimer’s disease is similar or superior to clinical cerebrospinal fluid tests. Nat Med. 2024;30:1085-1095. 10.1038/s41591-024-02869-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Hampton OL, Mukherjee S, Properzi MJ, et al. Harmonizing the preclinical Alzheimer cognitive composite for multicohort studies. Neuropsychology. 2023;37:436-449. 10.1037/neu0000833 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Mukherjee S, Choi S-E, Lee ML, et al. Cognitive domain harmonization and cocalibration in studies of older adults. Neuropsychology. 2023;37:409-423. 10.1037/neu0000835 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Rowe CC, Doré V, Jones G, et al. 18F-Florbetaben PET beta-amyloid binding expressed in Centiloids. Eur J Nucl Med Mol Imaging. 2017;44:2053-2059. 10.1007/s00259-017-3749-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Navitsky M, Joshi AD, Kennedy I, et al. Standardization of amyloid quantitation with florbetapir standardized uptake value ratios to the Centiloid scale. Alzheimer’s Dementia. 2018;14:1565-1571. 10.1016/j.jalz.2018.06.1353 [DOI] [Google Scholar]
- 27. Manly JJ, Glymour MM. What the aducanumab approval reveals about Alzheimer disease research. JAMA Neurol. 2021;78:1305-1306. 10.1001/jamaneurol.2021.3404 [DOI] [PubMed] [Google Scholar]
- 28. Petersen ME, Zhou Z, Hall JR, et al. Health and Aging Brain Study-Health Disparities (HABS-HD) methods and partner characteristics. Alzheimers Dement (N Y). 2025;11:e70140. 10.1002/trc2.70140 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Korn SM, Qian Z, Zurl H, et al. Geographic variability in contemporary utilization of PET imaging for prostate cancer: a medicare claims cohort study. Cancer Imaging. 2025;25:86. 10.1186/s40644-025-00898-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Ducharme J, Goertzen AL, Patterson J, Demeter S. Practical aspects of 18F-FDG PET when receiving 18F-FDG from a distant supplier. J Nucl Med Technol. 2009;37:164-169. 10.2967/jnmt.109.062950 [DOI] [PubMed] [Google Scholar]
- 31. Pichler V, Berroterán-Infante N, Philippe C, et al. An overview of PET radiochemistry, Part 1: the covalent labels 18F, 11C, and 13N. J Nucl Med. 2018;59:1350-1354. 10.2967/jnumed.117.190793 [DOI] [PubMed] [Google Scholar]
- 32. Huber H, Blennow K, Zetterberg H, et al. Biomarkers of Alzheimer’s disease and neurodegeneration in dried blood spots—a new collection method for remote settings. Alzheimers Dement. 2024;20:2340-2352. 10.1002/alz.13697 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Reed RG, Hillmann AR, Nation M, Braksator S, Sigler K. Remote dried blood spot collection for inflammatory markers in older adults is feasible, reliable, and valid. Brain Behav Immun. 2024;120:545-553. 10.1016/j.bbi.2024.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Jack CR, Andrews JS, Beach TG, et al. Revised criteria for diagnosis and staging of Alzheimer’s disease: Alzheimer’s Association Workgroup. Alzheimers Dement. 2024;20:5143-5169. 10.1002/alz.13859 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Feldman HH, Villain N, Frisoni GB, Dubois B. An International Working Group viewpoint on stratifying risk and prevention of the clinical expression of Alzheimer’s disease. Nat Aging. 2025;5:1930-1932. [DOI] [PubMed] [Google Scholar]
- 36. Palmqvist S, Whitson HE, Allen LA, et al. Alzheimer’s Association Clinical Practice Guideline on the use of blood-based biomarkers in the diagnostic workup of suspected Alzheimer’s disease within specialized care settings. Alzheimers Dement. 2025;21:e70535. 10.1038/s43587-025-00977-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Therriault J, Brum WS, Trudel L, et al. Blood phosphorylated tau for the diagnosis of Alzheimer’s disease: a systematic review and meta-analysis. Lancet Neurol. 2025;24:740-752. 10.1016/S1474-4422(25)00227-3 [DOI] [PubMed] [Google Scholar]
- 38. Lee JK, Johnson L, Hall JR, et al. Chronic stress, social support, and Alzheimer’s blood‐based biomarkers in the HABS‐HD study. Alzheimers Dement. 2025;21:e70043. 10.1002/alz.70043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Ramanan VK, Graff-Radford J, Syrjanen J, et al. Association of plasma biomarkers of Alzheimer disease with cognition and medical comorbidities in a biracial cohort. Neurology. 2023;101:e1402-e1411. 10.1212/WNL.0000000000207675 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Lash TL, Fox MP, MacLehose RF, Maldonado G, McCandless LC, Greenland S. Good practices for quantitative bias analysis. Int J Epidemiol. 2014;43:1969-1985. 10.1093/ije/dyu149 [DOI] [PubMed] [Google Scholar]
- 41. Banack HR, Hayes-Larson E, Mayeda ER. Monte Carlo simulation approaches for quantitative bias analysis: a tutorial. Epidemiol Rev. 2022;43:106-117. 10.1093/epirev/mxab012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Pan F, Huang Y, Cai X, et al. Integrated algorithm combining plasma biomarkers and cognitive assessments accurately predicts brain β-amyloid pathology. Commun Med (Lond). 2023;3:65. 10.1038/s43856-023-00295-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Wang Y, Li Y, Li Y, et al. A blood-based composite panel that screens Alzheimer’s disease. Biomark Res. 2023;11:53. 10.1186/s40364-023-00485-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Abbatecola AM, Giuliani A, Biscetti L, et al. Circulating biomarkers of inflammaging and Alzheimer’s disease to track age-related trajectories of dementia: can we develop a clinically relevant composite combination? Ageing Res Rev. 2024;96:102257. 10.1016/j.arr.2024.102257 [DOI] [PubMed] [Google Scholar]
- 45. Schindler SE, Li Y, Li M, et al. Using Alzheimer’s disease blood tests to accelerate clinical trial enrollment. Alzheimers Dement. 2023;19:1175-1183. 10.1002/alz.12754 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Fox MP, Lash TL. Quantitative bias analysis for study and grant planning. Ann Epidemiol. 2020;43:32-36. 10.1016/j.annepidem.2020.01.013 [DOI] [PubMed] [Google Scholar]
- 47. Vemuri P, Jack CR. Role of structural MRI in Alzheimer’s disease. Alzheimers Res Ther. 2010;2:23. 10.1186/alzrt47 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Thomas BA, Erlandsson K, Modat M, et al. The importance of appropriate partial volume correction for PET quantification in Alzheimer’s disease. Eur J Nucl Med Mol Imaging. 2011;38:1104-1119. 10.1007/s00259-011-1745-9 [DOI] [PubMed] [Google Scholar]
- 49. Jacobs T, Brien CO', Figueredo L, et al. Body mass index and blood volume influence plasma biomarkers and positron emission tomography classification in preclinical Alzheimer’s disease. Alzheimers Dement. 2025;21:e70765. 10.1002/alz.70765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Crane PK, Choi S-E, Lee M, et al. Measurement precision across cognitive domains in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset. Neuropsychology. 2023;37:373-382. 10.1037/neu0000901 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
ADNI data are publicly available and can be obtained from https://adni.loni.usc.edu. A4 Study data are publicly available and can be obtained from https://www.a4studydata.org/.



