Skip to main content
Alzheimer's & Dementia logoLink to Alzheimer's & Dementia
. 2025 Jan 29;21(1):e14467. doi: 10.1002/alz.14467

The higher benefit of lecanemab in males compared to females in CLARITY AD is probably due to a real sex effect

Daniel Andrews 1,2,, Simon Ducharme 3,4, Howard Chertkow 4,5,6,7,8,9, Maria Pia Sormani 10,11, D Louis Collins 1,2,4,; for the Alzheimer's Disease Neuroimaging Initiative
PMCID: PMC11779744  PMID: 39887549

Abstract

INTRODUCTION

The phase 3 trial CLARITY AD found lecanemab slowed cognitive decline by 27%. However, subgroup analyses indicated a significant 31% sex difference in the effect and suggested no or limited effectiveness in females. We used simulations constrained by the trial design to determine whether that difference reflects a pre‐existing sex difference in Alzheimer's disease progression or was a random event.

METHODS

Simulations were generated using linear mixed models of cognitive decline fit to data from Alzheimer's Disease Neuroimaging Initiative participants satisfying CLARITY AD inclusion criteria.

RESULTS

The statistically non‐significant 7.9% smaller cognitive decline rate in our cohort's males versus females does not explain CLARITY AD's 31% sex difference in lecanemab's effect. A ≥ 31% difference occurred randomly in only 12 of our 10,000 simulations (0.0012 probability).

DISCUSSION

CLARITY AD's sex difference was probably not random. Lecanemab is likely less effective in females than males, but we cannot conclude the drug is ineffective in females.

Highlights

  • Lecanemab is more clinically effective in males than in females.

  • Forest plots should only report subgroup‐specific effects in well‐powered subgroups.

  • Trial simulations based on real data enable investigation of subgroup drug effects.

  • We cannot conclude that lecanemab is clinically ineffective in females.

  • A sex difference in lecanemab's efficacy could be linked to its action mechanism.

Keywords: Alzheimer's disease, Alzheimer's disease treatment, clinical trial, cognitive decline, data analysis, dementia, drug efficacy, forest plot, lecanemab, longitudinal data, mild cognitive impairment, phase 3 clinical trial, sex differences, simulation, statistical power, subgroup analysis

1. BACKGROUND

Alzheimer's disease (AD) affects males and females at different rates. 1 Two thirds of patients are female, and lifetime risk in females is twice that of males. 2 , 3 , 4 Sex differences in genetics, hormones, and other biological mechanisms are hypothesized to contribute to sex differences in the disease progression and, possibly, drug efficacy. 2 Recent work has therefore called for clinical trials powered to include primary endpoints disaggregated by sex. 2 , 5 While no such trial has yet been done in AD, trials of new drugs do often run sex subgroup analyses. 6 , 7 , 8

The purpose of such analyses is to generate hypotheses for future research, not to evaluate efficacy within each subgroup. 9 Ideally, statistical interaction tests are used to determine whether one subgroup benefited more from a drug than a corresponding subgroup. 9 In some trials, however, subgroup results are only reported visually on a forest plot, where the primary endpoint and confidence interval are presented for each subgroup individually. 6 , 7 , 10 This format implies that subgroup‐specific effectiveness was evaluated, despite typical subgroup sizes not being powered to test the primary hypothesis. 6 , 7 , 8 , 9 Biostatisticians recommend that primary hypothesis tests not be run and reported separately for low‐powered subgroups. 9 , 10 , 11

These complications make forest plots hard to interpret clinically without strong statistical expertise, especially if there are striking differences between corresponding subgroups. That exact scenario may have contributed to important misinterpretations of the sex subgroup results in CLARITY AD, the phase 3 trial of the amyloid‐targeting drug lecanemab. 7

CLARITY AD showed a statistically significant 27% slowing of cognitive decline for lecanemab‐treated versus placebo participants, supporting lecanemab's US Food and Drug Administration approval. 7 , 12 , 13 , 14 However, a supplementary forest plot from CLARITY AD's results paper (Figure S 1‐B in the cited paper's appendix) indicated a statistically significant sex difference in primary clinical effect. 7 More specifically, in that plot the point estimate of the effect in females was outside the confidence interval of the male point estimate. The confidence interval for the female subgroup also crossed zero. Overall, the plot shows that males had a statistically significant 43% mean slowing of decline, while females had a non‐significant 12% mean slowing that was significantly different from the effect in males.

Several recent works have therefore concluded lecanemab might not effectively slow decline in females. 15 , 16 , 17 , 18 , 19 , 20 Others have emphasized the apparent sex difference in effectiveness. 2 , 5 The CLARITY AD paper's main text did not discuss the sex results; the authors ostensibly concluded there was no significant sex difference, as has another paper, possibly because sex was not significant as a covariate in a subgroup analysis model. 7 , 19 , 21 Also, the confidence intervals of the male and female subgroups in the aforementioned forest plot overlapped the 27% cohort mean. 7

These interpretations could influence decisions on whether lecanemab is prescribed to female patients. The question remains: How should clinicians interpret the CLARITY AD sex results?

Critically, CLARITY AD was not powered to evaluate efficacy separately for each sex. 7 , 19 A too‐small sample could explain the non‐significance of the effect in females, an effect that nonetheless trended toward favoring lecanemab. 7 However, low power does not address lecanemab's smaller clinical effect on females than males, which might be explained by AD‐related sex differences in cognitive decline. Females with prodromal AD have been shown to decline faster than corresponding males. 22 , 23 , 24 , 25

Alternatively, CLARITY AD's sex difference could be an artifact linked to randomization and participant heterogeneity in disease progression. Studies have shown that patients with similar AD severity at baseline can have very different cognitive decline rates. 26 , 27 , 28 , 29 AD trial simulations have shown that randomization can produce an imbalance of fast‐ and slow‐progressing patients between treatment groups, thus affecting observed drug effect sizes. 30

In this article, we use sex‐specific cognitive decline models and simulations constrained by CLARITY AD's sampling scheme and cohort composition to test the hypothesis that the trial's sex difference could be explained by the phenomena above.

The forest plot described earlier already implies that CLARITY AD's sex difference was probably not an artifact. However, that plot did not report a formal interaction test and does not provide information on whether natural AD sex differences explain the results. By empirically evaluating both possibilities, we aim to conclusively interpret the real trial's sex subgroup results.

2. METHODS

2.1. Experimental reasoning

To test our hypothesis, we ask two basic questions: (1) Could the CLARITY AD sex difference be explained by known pre‐existing sex differences in cognitive decline? (2) Could the sex difference be explained as a random artifactual difference between subgroups?

One question is answered in each of the two experiments. If the answer to either question is “yes,” then our original hypothesis is likely true, implying there was likely no genuine sex difference in lecanemab's clinical effect in CLARITY AD. If the answer to both questions is “no,” then lecanemab may have had a different effect on the trial's males and females.

2.2. Alzheimer's Disease Neuroimaging Initiative

Our analyses used participant data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (https://adni.loni.usc.edu), specifically from the ADNI‐1, ADNI‐GO, ADNI‐2, and ADNI‐3 cohorts. 31 , 32 , 33 , 34 The ADNI was launched in 2003 as a public–private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early AD. For up‐to‐date information, see www.adni‐info.org. ADNI received ethics approval from all participating institutions and received informed consent from all participants.

2.3. Simulation cohort participants

We selected ADNI participants who satisfied CLARITY AD inclusion criteria listed on ClinicalTrials.gov. 35 Data from the selected participants were used in our modeling and simulation experiments. These participants comprise our “simulation cohort.” At baseline, each participant met the clinical criteria for intermediate likelihood AD‐related MCI or probable mild AD dementia, was between 50 and 90 years old, and had Mini‐Mental State Examination (MMSE) scores ≥ 22 and ≤ 30. Participants with MCI had global Clinical Dementia Rating (CDR) scores of 0.5 and CDR Memory Box scores ≥ 0.5. Mild AD dementia participants had global CDR scores of 0.5 or 1.0 and CDR Memory Box scores ≥ 0.5. All participants had a body mass index > 17 and < 35 at baseline. Participants were apolipoprotein E (APOE) ε4 genotyped and were positive for abnormal amyloid levels at baseline. Amyloid status was determined using ADNI's reported baseline PET tracer measures of brain amyloid levels (thresholds: Pittsburgh compound B standardized uptake value ratio [SUVR] > 1.2 or AV45 SUVR > 1.11), 36 , 37 or baseline cerebrospinal fluid measures of amyloid beta 42 (thresholds: Aβ42 ≤ 980 pg/mL). 38 The left panel in Figure 1 shows how the simulation cohort is generated.

FIGURE 1.

FIGURE 1

Procedure for generating 10,000 clinical trial simulations constrained by the CLARITY AD design. First, ADNI participants who fit CLARITY AD inclusion criteria are selected to form our “simulation cohort.” We then fit the Equation 1 model to data from subsets of that cohort in Experiment 1 (only females and only males) and from the full cohort in Experiment 2. The model longitudinally tracks cohort‐ and participant‐level CDRSBΔbl scores. Next, to simulate a trial, we first randomly resample participants from our simulation cohort and then add noise to the baseline CDRSB score of each sampled participant. This process generates a cohort of synthetic participants—our “synthetic cohort”—for our simulated trial, for example where the sample size is equal to that specified in the CLARITY AD design. We then randomize those participants into “drug” and “placebo” groups. The synthetic cohort data are then input to the Equation 1 model that was originally fit to the simulation cohort. With noise added to the model parameters, Equation 1 is used to calculate CDRSBΔbl scores for each participant at a number of visits that matches the observation scheme of CLARITY AD, up to 18 months. A drug effect is simultaneously injected as a reduction in CDRSBΔbl slope for drug‐treated participants relative to placebo. Once the longitudinal CDRSBΔbl scores are generated (i.e., the trial data simulation is complete), the drug versus placebo group difference in CDRSBΔbl slope is extracted. The extracted slope difference is the observed drug effect size in the simulated trial. This process is repeated 10,000 times. Each simulated trial will have a different observed effect size because of the heterogeneity in natural cognitive decline trajectories in the trial's unique synthetic cohort. Finally, the observed effect sizes across the 10,000 simulations are plotted in a histogram, in which the mean observed drug versus placebo difference in slope is here indicated by the black dashed line. AD, Alzheimer's disease; ADNI, Alzheimer's Disease Neuroimaging Initiative; CDRSB, Clinical Dementia Rating Sum of Boxes; CDRSBΔbl, Clinical Dementia Rating Sum of Boxes change since baseline.

RESEARCH IN CONTEXT

  1. Systematic review: The sex subgroup analysis in CLARITY AD, the phase 3 trial of the now US Food and Drug Administration–approved anti‐amyloid drug lecanemab, showed a significant clinical effect in males only. Our literature review (e.g., on PubMed) found that recent work interpreted the sex result as implying that lecanemab has either no or limited effectiveness in females. We found no study had quantitatively and didactically evaluated whether the CLARITY AD result could be explained by pre‐existing sex differences or drug‐independent participant heterogeneity in cognitive decline rate.

  2. Interpretation: Our study empirically demonstrates that CLARITY AD's observed sex difference likely cannot be explained by drug‐independent participant heterogeneity in cognitive decline rate, nor by inherent sex differences in decline rate. Lecanemab is likely more effective in males than females but might yet be effective in females.

  3. Future directions: Future work should examine possible links between a drug's action mechanism and sex differences in amyloid clearing and clinical efficacy.

2.4. Experiment 1

In Experiment 1 we asked: Given the CLARITY AD trial design and cohort, could the observed sex difference in lecanemab's clinical effect be explained by lecanemab‐independent sex differences in cognitive decline trajectories? We answer this question in two parts.

In Part 1 we use a linear mixed effects model to track cognitive decline separately in the male and female participants in our selected ADNI cohort. Our goal is to determine whether there is an inherent average sex difference in cognitive decline rate.

In Part 2 we use the sex‐specific models from Part 1 to simulate trials adhering to CLARITY AD design parameters in the hypothetical scenarios of including only males or only females. In these scenarios, a simulated drug slows cognitive decline by 27% versus placebo (the overall lecanemab effect in CLARITY AD). 7 The sex‐specific placebo decline rate is defined by the estimated slope of the corresponding model from Part 1. If the placebo group's decline rate is sufficiently different between males and females, a sex difference in the observed drug effect size could result. Our goal here is to determine whether drug effect sizes observed in simulated female‐only trials would be different, on average, than effect sizes observed in simulated male‐only trials, despite there being no sex difference in the drug's percentage slowing of decline.

A sufficiently large sex difference in cognitive decline rates and trajectories in Part 1 or in possible observed drug effect sizes in Part 2 might explain the CLARITY AD sex difference result.

2.4.1. Part 1: sex‐specific modeling procedure

We fitted a continuous‐time linear mixed effects model (Equation 1) separately in the male and female subsets of our ADNI simulation cohort (see the left panel of Figure 1). These models longitudinally track group‐ and participant‐level trajectories in CDR Sum of Boxes change since baseline (CDRSBΔbl). Increasing CDRSB scores indicate worsening impairment. 39 All our analyses were done in R version 4.3.2 in macOS 14.2.1. 40 To fit the model, we used the function lmer in package lme4. 41

Our model is a continuous‐time analog of the categorical‐time primary analysis mixed model for repeated measures (MMRM) from CLARITY AD. 7 An MMRM would let us evaluate sex differences in the magnitude of cognitive decline at specific time points (e.g., relative to baseline) and could approximately replicate the CLARITY AD analysis. We selected a different model for two main reasons. First, our approach models change continuously, which lets us more intuitively communicate any sex difference in the overall trend and rate of cognitive decline compared to an MMRM. Second, continuously modeling the actual time of data acquisition in ADNI better leverages both time and cognitive test scores to fit trajectories for individual participants, which is critical for simulating clinical trials in Experiment 1 Part 2 and Experiment 2 (see the corresponding Methods for details). Equation 1 provides our model's syntax.

CDRSB_change_since_baseline0+CDRSB_baseline_median_centered+Diagnosis_AD+Years_since_baseline+APOE4_status+CDRSB_baseline_median_centered:Years_since_baseline+(0+Years_since_baseline|participant_ID) (1)

The dependent variable is CDRSB_change_since_baseline, which signifies the magnitude of a participant's change in CDRSB score relative to their score at baseline. This variable has the same units as the corresponding variable in CLARITY AD's MMRM.

The independent variables correspond to those in CLARITY AD's MMRM, with some key differences. 7 Median‐centered baseline CDRSB scores (CDRSB_baseline_median_centered) are used instead of baseline CDRSB scores to help ensure reliable parameter estimates. Median centering here is done by subtracting the median value of all baseline CDRSB scores across our selected cohort from each participant's baseline CDRSB. Years_since_baseline is the continuous time variable and indicates number of years elapsed since a participant's baseline visit in ADNI. In the fitted model, the coefficient of this variable signifies the cohort‐average rate of cognitive decline (units of change in CDRSBΔbl per year). This time variable is used in place of the categorical “Visit” term in CLARITY AD's MMRM. We also include an interaction term for median‐centered baseline CDRSB scores with years since baseline (CDRSB_baseline_median_centered:Years_since_baseline). This term corresponds to the CLARITY AD model's interaction term of baseline CDRSB with categorical “Visit.” Our model also includes two terms ported directly from CLARITY AD's MMRM: AD diagnosis (Diagnosis_AD, AD = 1 or MCI = 0), and APOE ε4 status (APOE4_status, homozygote or heterozygote = 1 or non‐carrier = 0).

There are four terms in CLARITY AD's model that we did not include here: geographic region (defined in the trial as North America, Europe, and Asia‐Pacific), trial group (i.e., lecanemab group or placebo group), trial group interaction with time (time being categorical “Visit” in CLARITY AD's model), and baseline use of medication for AD symptoms. ADNI collected data at different sites only in North America, so we are unable to replicate CLARITY AD's region term. There is no monolithic group‐partitioned drug effect like lecanemab's in ADNI, so we do not include trial group here. However, trial group does factor into the simulations in Experiment 1 Part 2 and Experiment 2 (see the corresponding Methods). Information on the use of medication for AD symptoms was not available in all our selected ADNI participants so we did not include a corresponding term in our model.

Another key difference of our model compared to CLARITY AD's is that ours encodes random effects by participant for “Years_since_baseline.” This addition lets our model estimate each participant's cognitive decline rate, quantified as a deviation from the cohort's average decline rate (same units as described above for the coefficient of “Years_since_baseline”).

No intercepts are encoded for the model because all participants’ CDRSBΔbl scores (CDRSB_change_since_baseline) at baseline are zero, that is, there is no change since baseline at baseline.

To complete Part 1 of Experiment 1, we compared the male and female models’ estimated coefficients on the “Years_since_baseline” term and their standard errors to evaluate sex differences in cognitive decline rate.

For Part 2, we used the sex‐specific models separately to generate synthetic trial data (i.e., synthetic CDRSB_change_since_baseline scores over time) in male‐only and female‐only simulated trials. This process is illustrated in the middle and right panels of Figure 1 and is described below.

2.4.2. Part 2: sex‐specific trial simulation procedure

Using our female‐only trial simulation as the example here, we first randomly sampled female participants from our simulation cohort, with replacement. This resampling procedure generated baseline data for 1800 synthetic female participants, that is, the approximate full sample size targeted in the CLARITY AD design. The resampling preserved the approximate baseline AD stage distribution of the real trial, in which ≈ 60% of participants had MCI and 40% had mild AD dementia at baseline. 7 For each sampled participant, random noise corresponding to measurement error was added to their baseline CDRSB score. 42 Noise was also added to the cohort‐level and participant‐specific model parameters to account for uncertainty and to further differentiate synthetic participants in our resulting “synthetic cohort.”

Next, we randomized these participants 1:1 to “drug” and “placebo” groups. We then used the participants’ baseline data and the estimated parameters from our female‐only model to calculate CDRSBΔbl scores (i.e., CDRSB_change_since_baseline values) for each participant at seven visits up to 18 months of follow‐up. Random noise was added to each participant's calculated CDRSBΔbl score at each visit to incorporate uncertainty and measurement error. Noise values were drawn from a zero mean normal distribution with a standard deviation equal to that of the observation‐level residuals in our female‐only simulation cohort model. Calculated CDRSBΔbl scores were then rounded to correspond to standard CDRSB 0.5 increments (e.g., 1.1 rounded to 1.0, and 1.4 rounded to 1.5). 39 Visits were defined to occur every 3 months, and each participant's specific visit time values had random noise added to simulate an approximate ± 1‐month window around each visit. This observation scheme approximates that of CLARITY AD. Note that ADNI did not observe CDRSB every 3 months. 31 , 32 , 33 , 34 By adding noise values in the manner described above to the model parameters and to each simulated CDRSBΔbl score, we aim to account for the uncertainty associated with calculating CDRSBΔbl scores for the defined observation scheme.

A drug effect was simultaneously injected as a 27% reduction in the linear slope of drug‐treated synthetic participants’ CDRSBΔbl trajectories compared to the placebo participants. The slope for the placebo group is equal to the β (i.e., the coefficient) of the “Years_since_baseline” term in the Equation 1 model fit to our female‐only simulation cohort. In this way, our female‐only trial simulation model encodes a female‐specific rate of cognitive decline for the placebo group, and thus a female‐specific decline rate for the drug group. We also encoded 20% participant dropout by the trial's final observation, as was assumed in the real CLARITY AD sample size calculation. 7

This overall procedure generates a simulated trial constrained by the CLARITY AD parameters specified in its analysis plan, as described in the main results paper. 7 The process was repeated 10,000 times to simulate 10,000 female‐only trials matching the CLARITY AD design parameters. Each simulation had the same cohort‐level trend but a different sampling of participant‐level trajectories, thus accounting for randomization and inter‐participant heterogeneity in cognitive decline within and across simulations. 30 Next, we extracted the drug versus placebo CDRSBΔbl linear slope difference (i.e., the observed drug effect size) from each simulation using the Equation 1 model, but here it contained an additional interaction term of trial group (drug = 1, placebo = 0) with Years_since_baseline. The estimated coefficient of this new term represents the observed drug effect size in the simulated trial. While a standalone trial group term was included in CLARITY AD's primary analysis model, we do not include that term here because our simulated drug effect is encoded only as a slope‐reducing effect, that is, a cognitive decline rate–reducing effect as reported in CLARITY AD's primary analysis. We do not encode any other offset in effect between trial groups. Overall, this methodology is conceptually analogous to simultaneously running 10,000 identically designed female‐only trials in the real world.

We also applied this procedure using the male‐only data and model to generate 10,000 male‐only trial simulations. Overall, we obtain a set of 10,000 observed effect sizes for the female‐only simulated trials, and 10,000 for the male‐only trials.

To finish Experiment 1 Part 2, we evaluated differences between our female and male effect size distributions using a t test to compare means and a Bartlett test to compare variances. 43 , 44 We also calculated the percent difference in those means.

While the 10,000 female‐only and 10,000 male‐only simulated trials all had the same cohort‐level drug effect, we hypothesize that the mean and variance of the effect size distributions would be different between the sexes. A difference in means could result because the placebo group's cognitive decline rate (parameterized as the β for the “Years_since_baseline” term from Equation 1) differed between female‐only and male‐only simulations. A difference in variance could result because the female‐only and male‐only cohorts contained different cognitive decline trajectories. Other parameters from Equation 1 (e.g., the estimated β values and standard errors) also differed between the female‐only and male‐only models and trial simulators.

Our analysis approach could also be applied to CLARITY AD's secondary outcome metrics (e.g., Alzheimer's Disease Assessment Scale–Cognitive subscale [ADAS‐Cog14]). We prioritized CDRSB because of its primary outcome status in CLARITY AD, which made CDRSB the focus of recent works that highlighted a sex difference in lecanemab's effectiveness or concluded the drug might be ineffective in females. 15 , 16 , 17 , 18 , 19 , 20 We aim to help resolve this ambiguity. CDRSB was also present in all our ADNI datasets, giving us the maximum sample possible. ADAS‐Cog14, for example, was not available in every ADNI dataset we used. 31 , 32 , 33 , 34

2.5. Experiment 2

In Experiment 2 we asked: Could CLARITY AD's observed sex difference have been a random artifactual difference between the male and female subgroups? An artifact here represents an observed sex difference in drug effect when there is no genuine sex difference in the effect (i.e., a Type 1 error).

We answer this question using simulations, posing it technically as: What is the probability of observing a ≥ 31% difference in a drug's effect between two equally sized subgroups containing participants selected totally at random? This 31% was the difference between the male and female effects from lecanemab in CLARITY AD. If the probability is sufficiently high (e.g., > 0.05), then the sex difference in CLARITY AD might have been an artifact.

2.5.1. Trial simulation procedure

Here we simulate trials using the same CLARITY AD design parameters as in Experiment 1. We use the same model as in Equation 1, but here we fit the model to data from all participants in our ADNI simulation cohort. The same trial simulation procedure as in Experiment 1 is then applied, but here using the pooled male and female data. However, this time we randomize 1800 synthetic participants 1:1 to “Subgroup 1” and “Subgroup 2,” signifying two subgroups of 900 participants selected totally at random with no consideration of sex. This subgroup sample size is approximately equal to that of the male and female subgroups in the CLARITY AD design. 7

After running 10,000 simulations, we extracted the Subgroup 1 versus Subgroup 2 CDRSBΔbl linear slope difference (i.e., the observed difference in drug effect between the subgroups) from each simulation. Next, we calculated the proportion of simulations with a between‐subgroup difference ≥ 31%, yielding the probability of at least that difference arising as a random artifact.

3. RESULTS

3.1. Participant demographics

Of the 2420 ADNI participants available, 644 met the specified CLARITY AD inclusion criteria, with 264 (41%) being female and 380 (59%) being male. The mean baseline ages (± standard deviations) for females and males, respectively, were 71.6 ± 7.3 years and 73.9 ± 6.8 years. The mean number of years of education was 15.3 ± 2.7 for females and 16.4 ± 2.8 for males. One hundred eighty‐three females (69%) and 250 males (66%) were APOE ε4 heterozygotic or homozygotic. At baseline, 76 females (29%) and 102 males (27%) were diagnosed with mild AD dementia, with the remaining participants having a baseline diagnosis of AD‐related MCI. See Table 1 for the demographic summary.

TABLE 1.

Demographics of our simulation cohort participants selected from ADNI.

Females (n = 264) Males (n = 380)
Age (mean years ± SD) 71.6 ± 7.3 73.9 ± 6.8
Education (mean years ± SD) 15.3 ± 2.7 16.4 ± 2.8
APOE ε4 positive 183 (69%) 250 (66%)
Mild AD dementia 76 (29%) 102 (27%)
Mild cognitive impairment 188 (71%) 278 (73%)

Abbreviations: AD, Alzheimer's disease; ADNI, Alzheimer's Disease Neuroimaging Initiative; APOE, apolipoprotein E; SD, standard deviation.

3.2. Experiment 1 results

3.2.1. Part 1: sex‐specific modeling results

Females and males who fit CLARITY AD inclusion criteria had similar CDRSBΔbl trajectories estimated by the Equation 1 model. We used the Nakagawa conditional R 2 for mixed models as an absolute value goodness‐of‐fit measure, yielding 0.931 and 0.928 for the male‐only and female‐only models, respectively, indicating well‐fitting models. 45 , 46 Key parameters for the male‐only and female‐only models are summarized in Table 2A and B, respectively. Figure 2A shows estimated cognitive decline trends by sex, as well as real trajectories for individual participants. There was no statistically significant sex difference in the estimated cognitive decline rate between the male and female models, as the confidence intervals on the “Years_since_baselineβ value of each sex overlaps the opposing sex β value on this term (see Table 2A,B).

TABLE 2.

Key estimated parameters from the Equation 1 model fit to data from (A) only male participants, (B) only female participants, and (C) male and female participants together in the simulation cohort.

(A) Male‐only model Nakagawa conditional R 2 = 0.931 𝛽 Standard error t value p value
Median‐centered baseline CDRSB −0.08145 0.03508 −2.322 0.020350
Baseline AD diagnosis status 0.33407 0.11964 2.792 0.005285
Years since baseline 0.77923 0.06486 12.014 < 2e−16
APOE ε4 status −0.18959 0.04935 −3.841 0.000127
Interaction of median‐centered baseline CDRSB with years since baseline 0.31434 0.04464 7.041 7.6e−12

(B) Female‐only model

Nakagawa conditional

R 2 = 0.928

𝛽 Standard error t value p value
Median‐centered baseline CDRSB −0.16455 0.04115 −3.999 6.76e−05
Baseline AD diagnosis status 0.53409 0.14458 3.694 0.00023
Years since baseline 0.84646 0.08339 10.151 < 2e−16
APOE ε4 status −0.32775 0.05849 −5.604 2.64e−08
Interaction of median‐centered baseline CDRSB with years since baseline 0.35413 0.05168 6.852 4.70e−11

(C) Male and female model

Nakagawa conditional

R 2 = 0.930

𝛽 Standard error t value p value
Median‐centered baseline CDRSB −0.11808 0.02668 −4.426 9.96e−06
Baseline AD diagnosis status 0.41625 0.09223 4.513 6.62e−06
Years since baseline 0.80663 0.05113 15.777 < 2e−16
APOE ε4 status −0.24694 0.03774 −6.543 7.12e−11
Interaction of median‐centered baseline CDRSB with years since baseline 0.33388 0.03365 9.923 < 2e−16

Abbreviations: AD, Alzheimer's disease; APOE, apolipoprotein E; CDRSB, Clinical Dementia Rating Sum of Boxes.

FIGURE 2.

FIGURE 2

CDRSBΔbl trajectories for our simulation cohort participants selected from ADNI. Scores are tracked over number of years elapsed since a participant's baseline ADNI observation. A, Real individual trajectories for all simulation cohort participants, with overlaid female‐ and male‐specific group‐level trends estimated by Equation 1 indicated in red and blue, respectively, with confidence intervals. These two trends nearly overlap for the 18‐month period that corresponds to the CLARITY AD trial duration. There is no statistically significant sex difference in the estimated cognitive decline rate, as the confidence intervals on the slope values overlap between the two models (see Table 2). The models describing these trends were used in generating the Experiment 1 simulations. B, Real individual trajectories for all simulation cohort participants with the cohort‐level trend line estimated by Equation 1 indicated in green, with confidence interval. The model describing this trend was used in generating the Experiment 2 simulations. ADNI, Alzheimer's Disease Neuroimaging Initiative; CDRSBΔbl, Clinical Dementia Rating Sum of Boxes change since baseline

3.2.2. Part 2: sex‐specific trial simulation results

Figure 3 shows histograms of simulated CDRSBΔbl slope differences (i.e., differences in cognitive decline rate) between drug and placebo groups, taken from within 10,000 male‐only and 10,000 female‐only simulated trials adhering to the CLARITY AD trial design parameters. The horizontal axis of the histogram signifies values of the difference in cognitive decline rate between a trial's drug and placebo groups. Each value is obtained from within one simulated trial instance and equals the drug‐treated group's slope on its CDRSBΔbl trend minus the placebo group's slope on its CDRSBΔbl trend. The female‐only simulations and the male‐only simulations each have their own histogram (red and blue, respectively, in Figure 3). Each histogram thus illustrates the distribution of observed drug effect sizes across the corresponding 10,000 sex‐specific simulations.

FIGURE 3.

FIGURE 3

Histograms of simulated CDRSBΔbl slope differences for drug‐treated versus placebo groups (i.e., observed drug effect sizes) for simulated trials adhering to the CLARITY AD design parameters, but including only males (blue) or only females (red). Each histogram represents 10,000 simulations. The mean value for each distribution is indicated by a color‐coded dashed line. The small difference in means is statistically significant, as is the difference in variances. CDRSBΔbl, Clinical Dementia Rating Sum of Boxes change since baseline

Every simulated trial had the same cohort‐level drug effect encoded as a 27% reduction in CDRSBΔbl slope for drug versus placebo participants. However, the decline rate for the placebo group differed between the female‐only and male‐only simulations. Within each trial, the placebo decline rate was defined as the β for the “Years_since_baseline” term in Table 2; the male‐only and female‐only models each have their own value for this β. Thus, as the drug effect in each trial was encoded as a percent reduction in the placebo decline rate, the 10,000 male‐only and 10,000 female‐only simulated trials had a different value on average for the observed drug effect. This on‐average difference explains the offset positioning of the histograms in Figure 3.

A Bartlett test of homogeneity of variances showed that the distributions’ variances were significantly different (K 2 = 7.4819, P = 0.006232). 43 , 44 A two‐sample t test showed a statistically significant difference in means for the distributions (t = 21.306, P < 2.2e−16). The mean effect sizes from the male and female simulations are indicated in Figure 3. The means are µ male = −0.211 and µ female = −0.229, and the standard deviations are σ male = 0.059 and σ female = 0.061. The mean from the male‐only simulations is ≈ 7.9% less than the mean from the female‐only simulations.

This difference in means is linked directly to the statistically non‐significant sex difference in cognitive decline rate in our ADNI simulation cohort, that is, the difference between the β values for “Years_since_baseline” in Table 2A and B. Concretely, the real male participants in our selected ADNI cohort had a numerically smaller cognitive decline rate than the real female participants. In a sex‐specific simulated trial, the sex‐specific decline rate from the corresponding real ADNI participants defined the placebo group's decline rate: The male trial simulations used the male ADNI decline rate for the placebo group, and similarly for the female simulated trials. In all sex‐specific trial simulations (male and female), the drug effect was defined as a 27% reduction in cognitive decline rate versus the placebo group's decline rate. The observed male drug effect size was smaller on average than the female one because the male placebo decline rate was numerically smaller than the female placebo decline rate.

When using only 1000 simulations of the male or female trial scenarios, the statistically significant difference in means persisted (t = 5.8396, P = 6.097e−09), but the t value decreased and the P value increased compared to using 10,000 simulations. The difference in statistical significance for 1000 versus 10,000 simulations is linked primarily to statistical power: The more sex‐specific trials we simulate (in which there is a true difference in mean drug effect between the male and female simulations), the higher the probability of obtaining a statistically significant estimate for that difference in means. The implication here is that even with a reduced number of sex‐specific trials simulated—1000 versus 10,000—we still obtained a statistically significant estimate for the difference in mean observed drug effect size between the male‐only and female‐only simulations. Thus, the statistically non‐significant sex difference in natural cognitive decline rate (summarized in Table 2A,B) was large enough to produce a sex difference in the mean simulated drug effect when using only 1000 simulations, a scenario in which statistical power to detect the difference in means was much lower compared to using 10,000 simulations.

3.3. Experiment 2 results

Figure 4 shows a histogram of 10,000 simulated CDRSBΔbl slope difference values, each signifying the difference in drug effect size between two subgroups of 900 randomly selected participants from within one simulated trial. Male and female participants were included at random in each simulation. The Equation 1 model, here fit to data from all our simulation cohort participants, was used to generate the simulations. Table 2 lists key model parameters, where a Nakagawa conditional R 2 of 0.930 indicates a well‐fitting model. Figure 2B shows the estimated overall cognitive decline trend and individual trajectories for all real participants in the simulation cohort.

FIGURE 4.

FIGURE 4

Histogram of differences in CDRSBΔbl slope (here signifying differences in drug effect size) between two subgroups of randomly selected participants in 10,000 trial simulations constrained by the CLARITY AD design. The red dashed line indicates the value corresponding to the 31% difference between sex subgroups reported in CLARITY AD. The black dashed line corresponds to the preloaded known sex difference in drug effect from Experiment 1. Only 12 of 10,000 simulations (indicated in purple to the left of the red line) had a subgroup difference ≥ 31%. Even when a large difference is preloaded between subgroups, there is an extremely low probability of randomly observing a ≥ 31% difference in drug effect between subgroups. CDRSBΔbl, Clinical Dementia Rating Sum‐of‐Boxes change since baseline

The red vertical line in Figure 4 indicates the difference in drug effect size between subgroups that corresponds to the CLARITY AD–reported 31% difference between its male and female subgroups. A ≥ 31% difference here implies that one subgroup's participants have at least a 31% greater reduction in cognitive decline rate compared to the other subgroup.

Additionally, in each simulation, we encoded a 7.9% between‐subgroup difference in drug effect. With this bias, one subgroup always declines 7.9% slower than the other, resulting in a corresponding on‐average difference in decline rate between subgroups across our 10,000 simulations (indicated by the black line in Figure 4). This 7.9% equals the percent difference in mean drug effect size between males and females in our Experiment 1. This bias effectively preloads the known sex difference in drug effect and thus biases the simulation in favor of CLARITY AD. Any further difference between subgroups is attributable to participant heterogeneity and randomization.

Only 12 of the 10,000 simulated trials had a subgroup difference in drug effect ≥ 31%, even with our known sex difference in drug effect preloaded. This result signifies a 0.0012 probability of observing a drug effect difference ≥ 31% between two subgroups, even when there is a known difference of 7.9% between the subgroups.

4. DISCUSSION

We examined whether the sex difference in lecanemab's clinical effect observed in CLARITY AD could be explained by inherent sex differences in cognitive decline or as an artifact linked to participant heterogeneity and randomization. Our selection of ADNI participants who met CLARITY AD's inclusion criteria resulted in a female cohort that was slightly younger and less educated (age: 71.6 ± 7.3 years, education: 15.3 ± 2.7 years) than the males (age: 73.9 ± 6.8 years, education: 16.4 ± 2.8 years); the groups had similar proportions of APOE ε4–positive individuals and a similar split of MCI and mild AD dementia participants.

The yearly change in CDRSB, estimated in Experiment 1 Part 1 from the β values of the sex‐specific models (Equation 1), was not statistically significantly different between males (β male = 0.77923 ± 0.06486) and females (β female = 0.84646 ± 0.08339). In other words, even though the graph in Figure 2A shows a slight sex‐related difference, the confidence intervals (in gray) overlap for the 10+ year period fitted, and the two regression lines overlap almost completely in the 18‐month period corresponding to the CLARITY AD trial duration.

Nonetheless, Experiment 1 Part 2 showed a statistically significant difference in mean observed drug effect size between male‐only and female‐only simulated trials constrained by CLARITY AD design parameters, in which we encoded the same drug effect in all simulations as a 27% reduction in decline rate for drug‐treated versus placebo participants. Males had a mean observed effect that was ≈ 7.9% smaller than females. This sex difference in mean effect occurred because the simulated male placebo groups had a smaller rate of cognitive decline than the simulated female placebo groups. Overall, this 7.9% sex difference does not explain the 31% difference in drug effect between CLARITY AD's male and female participants. Moreover, CLARITY AD reported a larger effect for males than females, while our outcome was the opposite. Given our results and the magnitude and direction of CLARITY AD's sex difference, it is unlikely that the trial's males and females experienced the same drug effect.

The answer to our Experiment 1 question is therefore: No, CLARITY AD's sex difference cannot be explained by lecanemab‐independent sex differences in cognitive decline. Our simulations provide empirical justification.

Experiment 2 showed that a ≥ 31% difference in drug effect between subgroups is extremely unlikely to occur as a random artifact, even when a known difference between subgroups is present. Based on our simulations, the estimated probability of that occurrence is 0.0012. Experiment 2 produced the same conclusion as an expert reading of the key CLARITY AD forest plot (Figure S 1‐B in the cited paper's appendix), in which the confidence intervals on the male and female mean effects do not overlap the mean effect of the opposing sex. 7 The answer to our Experiment 2 question is thus: No, the CLARITY AD sex difference cannot be explained as an artifactual difference between the male and female subgroups.

These results imply that the CLARITY AD sex difference is unlikely to be explained by the phenomena we investigated. CLARITY AD also showed statistically significant larger effects for lecanemab in males than in females in two of three non‐primary clinical endpoints (AD Composite Score and Alzheimer's Disease Cooperative Study MCI Activities of Daily Living). 7 The point estimate of lecanemab's effect was numerically larger in males than females for the third non‐primary endpoint, ADAS‐Cog14, but the difference was not statistically significant. 7 However, the sex subgroup analysis of four quality‐of‐life (QOL) metrics showed no statistically significant sex difference in QOL change since baseline at 18 months. 47 Nonetheless, point estimates on three of the metrics (Zarit Burden Interview total score and QOL in AD total score [A] by subject and [B] by subject by proxy) trended toward larger benefit in males than females. 48 , 49

Given these outcomes, CLARITY AD likely observed a genuine sex difference in lecanemab's clinical effect. Further research is needed to determine whether such a difference could be related to biological or gender differences, such as in hormone profiles, cognitive reserve, and education. 2 Our study points to a possible sex difference in the clinical efficacy of amyloid‐targeting drugs, specifically for lecanemab, and possibly for those with different mechanisms of action such as aducanumab and donanemab. 6 , 7 , 8

In aducanumab's phase 3 trial EMERGE, a statistically significant primary clinical effect was reported only for the male subgroup in the main results paper's supplementary Figure 3‐A forest plot (see citation). 6 However, there was no significant sex difference in that effect. Like lecanemab, aducanumab trended toward effectiveness in females, and the non‐significance of the effect could be explained by low power. Similarly, no significant sex difference in primary efficacy was observed in donanemab's positive phase 3 trial TRAILBLAZER‐ALZ 2. 8 However, the eFigure 9‐B forest plot in the results paper's supplement indicates a statistically non‐significant effect in males only (see citation). 8 This non‐significance could again be due to low power.

These outcomes suggest that CLARITY AD's sex difference could be linked to lecanemab's mechanism of action. Unfortunately, to the best of our knowledge, sex‐disaggregated data on amyloid clearance are not publicly available for the trials discussed. Sex analyses of CLARITY AD's amyloid PET data might reveal whether the sex difference in efficacy is accompanied by corresponding differences in amyloid clearing. A sex difference in the clinical impact of removing amyloid would fit conceptually with previously observed modulating effects of sex on the association between AD pathology and clinical progression. For example, amyloid plaques and neurofibrillary tangles have been observed to manifest clinically as dementia at higher rates in females than in males. 25 Alternatively, a sex difference in amyloid clearance alone could explain CLARITY AD's observed sex difference in lecanemab's efficacy. A comparison of CLARITY AD's sex‐disaggregated PET data with corresponding data from the phase 3 aducanumab and donanemab trials could highlight links between a drug's action mechanism and any sex differences in amyloid clearing and clinical efficacy.

While CLARITY AD had near‐equal sex representation in its lecanemab and placebo groups, future trials might benefit from recruiting larger cohorts while preserving sex parity. Larger sample sizes could ensure adequate power to definitively observe sex differences, including in preclinical trial phases and drug development research. 22 Stratification within sex subgroups, such as by menopausal state or gender‐biased risk factors (e.g., lower education in women than men) might help disentangle sex and gender contributions to any male/female difference in a drug's clinical effect. 50 , 51

CLARITY AD also reported differences in drug effect for older versus younger participants, and between APOE ε4 non‐carriers or heterozygotes versus homozygotes. 7 Well‐powered subgroup analyses might thus be needed for demographic categories in addition to sex. Cohort enrichment techniques (e.g., subject selection via cognitive decline prediction) could enhance the power to detect effects within subgroups without requiring more participants. 52 , 53 Advanced trial designs such as those relying on digital twins might also require fewer participants than conventional randomized placebo‐controlled trials. 54

For now, however, subgroup analyses in AD trials should be reported and interpreted cautiously. As evidenced by recent works examining CLARITY AD's sex difference, the current style of reporting AD trial subgroup results in forest plots might lead to misinterpretations of statistically non‐significant effects in low‐powered subgroups. The CLARITY AD paper's authors themselves stated that the trial was not powered to evaluate subgroups separately. 19 The phase 3 aducanumab and donanemab trials could also suffer from such misinterpretations. Unless a trial is powered to evaluate efficacy within each subgroup, single P values (and/or confidence intervals) in subgroup analyses should only be used to indicate whether treatment was significantly more effective in one subgroup than another. Trials should still be powered to evaluate such differences.

Overall, our analysis has some limitations. We used a linear model to generate our simulations, but AD progresses non‐linearly over the full disease timeline. 55 However, our simulations cover trajectories over 18 months only (the CLARITY AD trial duration), so linear assumptions are reasonable. Linear mixed effects models also capture inter‐participant heterogeneity in cognitive decline around a cohort‐level average trend, even across participants with MCI or mild dementia. 56 Future analyses might use non‐linear models, including ones that make no assumptions about disease trajectories when analyzing trial data. 56

We also do not include placebo effects, so our simulations essentially compare a drug effect to ADNI's standard of care. ADNI participants were recruited across Canada and the United States, where care standards can differ between and within the countries. 57 , 58 Also, ADNI participants are predominantly highly educated, White, and male. 59 , 60 , 61 CLARITY AD was more diverse, with participants having been recruited from across three continental regions (North America, Europe, and Asia‐Pacific). 7 Future work might use open datasets from diverse past international trials (e.g., from the Critical Path for Alzheimer's Disease). 62

To conclude, our results combine with the published CLARITY AD sex outcome to suggest that lecanemab has lower clinical benefits in females than in males. Results from CLARITY AD's open‐label extension might clarify the longer term impact of lecanemab in females. Also, certain types of amyloid‐targeting drugs might function differently in females and males. Research into possible mechanisms could be accelerated by drug developers sharing recent AD trial data. 6 , 7 , 8

CONFLICT OF INTEREST STATEMENT

Daniel Andrews declares no conflicts of interest. Dr. Simon Ducharme has received grants or contracts from the Alzheimer's Drug Discovery Foundation, the Canadian Institutes for Health Research (CIHR), the Fonds de recherche du Québec, Novo Nordisk, Biogen, Janssen, Alnylam, and Innodem Neurosciences. Dr. Ducharme has received consulting fees from Eisai, QuRALIS, and Eli Lilly. Dr. Ducharme has received payment or honoraria from Eisai for presentations, manuscript writing, or educational events. Dr. Ducharme has participated on a data safety monitoring or advisory board for IntelGenX and Aviado Bio. Dr. Ducharme is co‐founder and holds stock or stock options in AFX Medical Inc. Dr. Howard Chertkow has received research grants from the following organizations: CIHR, Alzheimer's Society of Canada, BrightFocus Ltd. (grant number A2022046S), National Institutes of Health (grant number 1R01AG075111‐01A1). Dr. Chertkow has received industry‐associated grants from the following organizations: IntelGenx Corp., Alector Inc., Eli Lilly & Co., Biogen MA Inc., and Hoffman‐La Roche. Dr. Chertkow has participated on National Advisory boards for Eisai, Biogen, and Lilly. Dr. Chertkow is the Scientific Director of the Canadian Consortium on Neurodegeneration in Aging. Dr. Chertkow heads up a clinical trials unit and is site investigator for phase 2 and 3 international drug studies sponsored by Roche, Biogen, Eisai, BMS, Alector, IntelGenX, and Anavex. Dr. Maria Pia Sormani has received consulting fees from Biogen, Merck, Sanofi, Roche, Novartis, Alexion, and Immunic. Dr. Sormani has participated on a data safety monitoring or advisory board for Novartis and Sanofi. Dr. D. Louis Collins declares no conflicts of interest. Author disclosures are available in the supporting information.

CONSENT STATEMENT

ADNI received ethics approval from all participating institutions and received informed consent from all participants.

DIVERSITY STATEMENT

Our study examines a possible sex difference in the clinical efficacy of lecanemab, an anti‐amyloid drug designed to slow Alzheimer's disease progression. Sex is thus a core consideration in all parts of this work. Participant characteristics such as race or gender (as different than sex) were not considered as disaggregated factors due to data availability limitations. However, our discussion of the limitations of standard trial subgroup analyses also applies to analyses of demographic groupings other than sex. Future work with more diverse datasets than ADNI will explore possible differences in drug efficacy between additional subgroups.

Supporting information

Supporting Information

ALZ-21-e14467-s001.pdf (1.7MB, pdf)

ACKNOWLEDGMENTS

Data collection and sharing for this project was funded by the Alzheimer's Disease Neuroimaging Initiative (ADNI; National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH‐12‐2‐0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie; Alzheimer's Association; Alzheimer's Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol‐Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann‐La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC; Johnson & Johnson Pharmaceutical Research & Development LLC; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. The present study was supported by a donation from the Famille Louise & André Charron  and a research grant (Principal Investigator: Dr. D. Louis Collins) from the Canadian Institutes of Health Research (CIHR; Project Grant FRN‐165921). Dr. Collins has also received funding from the Brain Canada Foundation. Daniel Andrews is supported by McGill University and by a CIHR Frederick Banting and Charles Best Canada Graduate Scholarships Doctoral Award.

Andrews D, Ducharme S, Chertkow H, Sormani MP, Collins DL. The higher benefit of lecanemab in males compared to females in CLARITY AD is probably due to a real sex effect. Alzheimer's Dement. 2025;21:e14467. 10.1002/alz.14467

Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (https://adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp‐content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.

Contributor Information

Daniel Andrews, Email: daniel.andrews2@mail.mcgill.ca.

D. Louis Collins, Email: louis.collins@mcgill.ca.

REFERENCES

  • 1. Alzheimer's Association . Alzheimer's Association 2024 Alzheimer's disease facts and figures. Alzheimers Dement. 2024;20(5):3708‐3821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Lopez‐Lee C, Torres ERS, Carling G, Gan L. Mechanisms of sex differences in Alzheimer's disease. Neuron. 2024;112(8):1208‐1221. doi: 10.1016/j.neuron.2024.01.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Rajan KB, Weuve J, Barnes LL, McAninch EA, Wilson RS, Evans DA. Population estimate of people with clinical Alzheimer's disease and mild cognitive impairment in the United States (2020‐2060). Alzheimers Dement. 2021;17(12):1966‐1975. doi: 10.1002/alz.12362 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Schwartz JB, Weintraub S. Treatment for Alzheimer disease—sex and gender effects need to be explicitly analyzed and reported in clinical trials. JAMA Netw Open. 2021;4(9):e2124386. doi: 10.1001/jamanetworkopen.2021.24386 [DOI] [PubMed] [Google Scholar]
  • 5. Buckley RF, Gong J, Woodward M. A call to action to address sex differences in Alzheimer disease clinical trials. JAMA Neurol. 2023;80(8):769‐770. doi: 10.1001/jamaneurol.2023.1059 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Budd Haeberlein S, Aisen PS, Barkhof F, et al. Two randomized phase 3 studies of aducanumab in early Alzheimer's disease. J Prev Alzheimers Dis. 2022;9:197‐210. Published online 2022. doi: 10.14283/jpad.2022.30 [DOI] [PubMed] [Google Scholar]
  • 7. van Dyck CH, Swanson CJ, Aisen P, et al. Lecanemab in early Alzheimer's disease. N Engl J Med. 2023;388:9‐21Published online 2022. doi: 10.1056/NEJMoa2212948 [DOI] [PubMed] [Google Scholar]
  • 8. Sims JR, Zimmer JA, Evans CD, et al. Donanemab in early symptomatic Alzheimer disease: the TRAILBLAZER‐ALZ 2 randomized clinical trial. JAMA. 2023;330(6):512. doi: 10.1001/jama.2023.13239 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Sormani MP. Subgroup analysis in MS trials. Mult Scler J. 2017;23(1):34‐35. doi: 10.1177/1352458515625808 [DOI] [PubMed] [Google Scholar]
  • 10. Wang R, Lagakos SW, Ware JH, Hunter DJ, Drazen JM. Statistics in medicine—reporting of subgroup analyses in clinical trials. N Engl J Med. 2007;357(21):2189‐2194. doi: 10.1056/NEJMsr077003 [DOI] [PubMed] [Google Scholar]
  • 11. Sormani MP, Bruzzi P. Reporting of subgroup analyses from clinical trials. Lancet Neurol. 2012;11(9):747. doi: 10.1016/S1474-4422(12)70181-3 [DOI] [PubMed] [Google Scholar]
  • 12. Mahase E. Alzheimer's disease: FDA approves lecanemab amid cost and safety concerns. BMJ. 2023;380:73. doi: 10.1136/bmj.p73 [DOI] [PubMed] [Google Scholar]
  • 13. United States FDA . UPDATED INFORMATION: June 9, 2023: Meeting of the Peripheral and Central Nervous System Drugs Advisory Committee Meeting Announcement‐06/09/2023. United States Food and Drug Administration (FDA). 2023. Accessed September 17, 2023. https://www.fda.gov/advisory‐committees/advisory‐committee‐calendar/updated‐information‐june‐9‐2023‐meeting‐peripheral‐and‐central‐nervous‐system‐drugs‐advisory [Google Scholar]
  • 14. Office of the United States FDA Commissioner . FDA Converts Novel Alzheimer's Disease Treatment to Traditional Approval (lecanemab). United States Food and Drug Administration (FDA). 2023. Accessed September 17, 2023. https://www.fda.gov/news‐events/press‐announcements/fda‐converts‐novel‐alzheimers‐disease‐treatment‐traditional‐approval [Google Scholar]
  • 15. Teli A, Dhande P. Advancing Alzheimer's care: a novel therapy with lecanemab. Egypt J Neurol Psychiatry Neurosurg. 2023;59(1):1‐5. doi: 10.1186/s41983-023-00747-y [DOI] [Google Scholar]
  • 16. Kurkinen M. Lecanemab (Leqembi) is not the right drug for patients with Alzheimer's disease. Adv Clin Exp Med. 2023;32(9):943‐947. doi: 10.17219/acem/171379 [DOI] [PubMed] [Google Scholar]
  • 17. Strohl WR. Structure and function of therapeutic antibodies approved by the US FDA in 2023. Antib Ther. 2024;7(2):132‐156. doi: 10.1093/abt/tbae007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Doggrell SA. More failure with solanezumab – this time in preclinical Alzheimer's disease. Expert Opin Biol Ther. 2024;24(3):119‐123. doi: 10.1080/14712598.2024.2325551 [DOI] [PubMed] [Google Scholar]
  • 19. van Dyck CH, Sabbagh M, Cohen S. Lecanemab in early Alzheimer's disease. Reply. N Engl J Med. 2023;388(17):1630‐1632. doi: 10.1056/NEJMc2301380 [DOI] [PubMed] [Google Scholar]
  • 20. Lynch MA. A case for seeking sex‐specific treatments in Alzheimer's disease. Front Aging Neurosci. 2024;16:1346621. doi: 10.3389/fnagi.2024.1346621 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Choi SA, Jee HJ, Bormate KJ, Kim Y, Jung YS. Sex differences in the preventive effect of cardiovascular and metabolic therapeutics on dementia. Biomol Ther. 2023;31(6):583‐598. doi: 10.4062/biomolther.2023.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Ferretti MT, Iulita MF, Cavedo E, et al. Sex differences in Alzheimer disease—the gateway to precision medicine. Nat Rev Neurol. 2018;14(8):457‐469. doi: 10.1038/s41582-018-0032-9 [DOI] [PubMed] [Google Scholar]
  • 23. Sohn D, Shpanskaya K, Lucas JE, et al. Sex differences in cognitive decline in subjects with high likelihood of mild cognitive impairment due to Alzheimer's disease. Sci Rep. 2018;8(1):7490. doi: 10.1038/s41598-018-25377-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Lin KA, Choudhury KR, Rathakrishnan BG, et al. Marked gender differences in progression of mild cognitive impairment over 8 years. Alzheimers Dement Transl Res Clin Interv. 2015;1(2):103‐110. doi: 10.1016/j.trci.2015.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Barnes LL, Wilson RS, Bienias JL, Schneider JA, Evans DA, Bennett DA. Sex differences in the clinical manifestations of Alzheimer disease pathology. Arch Gen Psychiatry. 2005;62(6):685‐691. doi: 10.1001/archpsyc.62.6.685 [DOI] [PubMed] [Google Scholar]
  • 26. van Rossum IA, Vos SJB, Burns L, et al. Injury markers predict time to dementia in subjects with MCI and amyloid pathology. Neurology. 2012;79(17):1809‐1816. doi: 10.1212/WNL.0b013e3182704056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Vos SJB, Verhey F, Frölich L, et al. Prevalence and prognosis of Alzheimer's disease at the mild cognitive impairment stage. Brain. 2015;138(5):1327‐1338. doi: 10.1093/brain/awv029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Jack CR, Wiste HJ, Vemuri P, et al. Brain beta‐amyloid measures and magnetic resonance imaging atrophy both predict time‐to‐progression from mild cognitive impairment to Alzheimer's disease. Brain. 2010;133(11):3336‐3348. doi: 10.1093/brain/awq277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Scheltens NME, Tijms BM, Heymans MW, et al. Prominent non‐memory deficits in Alzheimer's disease are associated with faster disease progression. J Alzheimers Dis. 2018;65(3):1029‐1039. doi: 10.3233/JAD-171088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Jutten RJ, Sikkes SAM, Van der Flier WM, et al. Finding treatment effects in Alzheimer trials in the face of disease progression heterogeneity. Neurology. 2021;96(22):e2673‐e2684. doi: 10.1212/WNL.0000000000012022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. The Alzheimer's Disease Neuroimaging Initiative (ADNI) . ADNI1 Procedures Manual. The Alzheimer's Disease Neuroimaging Initiative; 2004. http://adni.loni.usc.edu/wp‐content/uploads/2010/09/ADNI_GeneralProceduresManual.pdf [Google Scholar]
  • 32. The Alzheimer's Disease Neuroimaging Initiative (ADNI) . ADNI2 Procedures Manual. The Alzheimer's Disease Neuroimaging Initiative; 2011. https://adni.loni.usc.edu/wp‐content/uploads/2008/07/adni2‐procedures‐manual.pdf [Google Scholar]
  • 33. The Alzheimer's Disease Neuroimaging Initiative (ADNI) . ADNI GO Procedures Manual. The Alzheimer's Disease Neuroimaging Initiative; 2009. https://adni.loni.usc.edu/wp‐content/uploads/2008/07/ADNI_GO_Procedures_Manual_06102011.pdf [Google Scholar]
  • 34. The Alzheimer's Disease Neuroimaging Initiative (ADNI) . ADNI3 Procedures Manual. The Alzheimer's Disease Neuroimaging Initiative; 2016. https://adni.loni.usc.edu/wp‐content/uploads/2024/02/ADNI3_Procedures_Manual_v3.0.pdf [Google Scholar]
  • 35. Eisai Inc .. A Study to Confirm Safety and Efficacy of Lecanemab in Participants With Early Alzheimer's Disease (Clarity AD): NCT03887455. ClinicalTrials.gov. 2022. https://clinicaltrials.gov/ct2/show/NCT03887455
  • 36. Villeneuve S, Rabinovici GD, Cohn‐Sheehy BI, et al. Existing Pittsburgh compound‐B positron emission tomography thresholds are too high: statistical and pathological evaluation. Brain. 2010;138(7):2020‐2033. doi: 10.1093/brain/awv112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Bucci M, Chiotis K, Nordberg A. Alzheimer's disease profiled by fluid and imaging markers: tau PET best predicts cognitive decline. Mol Psychiatry. 2021;26(10):5888‐5898. doi: 10.1038/s41380-021-01263-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Shaw LM, Trojanowski JQ. Biomarker Core report: Year1 ADNI3, Roche Elecsys immunoassay analyses of ADNI1/GO/2 CSF samples. Alzheimer's Association; 2017. https://www.alz.org/media/documents/ww‐adni‐may‐2017‐biomarker‐core‐shaw.pdf [Google Scholar]
  • 39. O'Bryant SE. Staging dementia using clinical dementia rating scale sum of boxes scores: a Texas Alzheimer's research consortium study. Arch Neurol. 2008;65(8):1091. doi: 10.1001/archneur.65.8.1091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. R Core Team . R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2021. https://www.R‐project.org/ [Google Scholar]
  • 41. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed‐effects models using lme4. J Stat Softw. 2015;67(1):1‐48. doi: 10.18637/jss.v067.i01 [DOI] [Google Scholar]
  • 42. Andrews JS, Desai U, Kirson NY, Zichlin ML, Ball DE, Matthews BR. Disease severity and minimal clinically important differences in clinical outcome assessments for Alzheimer's disease clinical trials. Alzheimers Dement Transl Res Clin Interv. 2019;5(1):354‐363. doi: 10.1016/j.trci.2019.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Bartlett MS, Fowler RH. Properties of sufficiency and statistical tests. Proc R Soc Lond Ser—Math Phys Sci. 1937;160(901):268‐282. doi: 10.1098/rspa.1937.0109 [DOI] [Google Scholar]
  • 44. Snedecor GW, Cochran WG. Statistical Methods. 8th ed. Iowa State University Press; 1989. http://www.gbv.de/dms/bowker/toc/9780813815619.pdf [Google Scholar]
  • 45. Nakagawa S, Schielzeth H. A general and simple method for obtaining R2 from generalized linear mixed‐effects models. Methods Ecol Evol. 2013;4(2):133‐142. doi: 10.1111/j.2041-210x.2012.00261.x [DOI] [Google Scholar]
  • 46. Nakagawa S, Johnson PCD, Schielzeth H. The coefficient of determination R2 and intra‐class correlation coefficient from generalized linear mixed‐effects models revisited and expanded. J R Soc Interface. 2017;14:20170213. doi: 10.1098/rsif.2017.0213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Cohen S, Van Dyck CH, Gee M, et al. Lecanemab Clarity AD: quality‐of‐life results from a randomized, double‐blind phase 3 trial in early Alzheimer's disease. J Prev Alzheimers Dis. 2023;10:771‐777. Published online 2023. doi: 10.14283/jpad.2023.123 [DOI] [PubMed] [Google Scholar]
  • 48. Kahle‐Wrobleski K, Ye W, Henley D, et al. Assessing quality of life in Alzheimer's disease: implications for clinical trials. Alzheimers Dement Diagn Assess Dis Monit. 2017;6(1):82‐90. doi: 10.1016/j.dadm.2016.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Bédard M, Molloy DW, Squire L, Dubois S, Lever JA, O'Donnell M. The Zarit Burden Interview: a new short version and screening version. Gerontologist. 2001;41(5):652‐657. doi: 10.1093/geront/41.5.652 [DOI] [PubMed] [Google Scholar]
  • 50. Vemuri P, Knopman DS, Lesnick TG, et al. Evaluation of amyloid protective factors and Alzheimer disease neurodegeneration protective factors in elderly individuals. JAMA Neurol. 2017;74(6):718‐726. doi: 10.1001/jamaneurol.2017.0244 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Livingston G, Huntley J, Sommerlad A, et al. Dementia prevention, intervention, and care: 2020 report of the Lancet Commission. Lancet. 2020;396(10248):413‐446. doi: 10.1016/S0140-6736(20)30367-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Shafiee N, Dadar M, Ducharme S, Collins DL. Automatic prediction of cognitive and functional decline can significantly decrease the number of subjects required for clinical trials in early Alzheimer's disease. J Alzheimer's Dis. 2021;84(3). doi: 10.3233/JAD-210664 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Zandifar A, Fonov VS, Ducharme S, Belleville S, Collins DL. MRI and cognitive scores complement each other to accurately predict Alzheimer's dementia 2 to 7 years before clinical onset. NeuroImage Clin. 2020;25:102121. doi: 10.1016/j.nicl.2019.102121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Walsh J. An updated digital twin generator for Alzheimer's disease (version 3.1). Unlearn. 2023. Accessed August 1, 2023. https://unlearnai.substack.com/p/an‐updated‐digital‐twin‐generator
  • 55. Raket LL. Statistical disease progression modeling in Alzheimer disease. Front Big Data. 2020;3:24. doi: 10.3389/fdata.2020.00024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Donohue MC, Aisen PS. Mixed model of repeated measures versus slope models in Alzheimer's disease clinical trials. J Nutr Health Aging. 2012;16(4):360‐364. doi: 10.1007/s12603-012-0047-7 [DOI] [PubMed] [Google Scholar]
  • 57. Office of the US Assistant Secretary for Planning and Evaluation . National Plan to Address Alzheimer's Disease. US; Department of Health and Human Services; 2022. Accessed September 22, 2023. https://aspe.hhs.gov/collaborations‐committees‐advisory‐groups/napa/napa‐documents/napa‐national‐plan [Google Scholar]
  • 58. Public Health Agency of Canada . A Dementia Strategy for Canada: Together We Aspire. Public Health Agency of Canada; 2021. Accessed September 22, 2023. https://www.canada.ca/en/public‐health/services/publications/diseases‐conditions/dementia‐strategy.html [Google Scholar]
  • 59. Petersen RC, Aisen PS, Beckett LA, et al. Alzheimer's Disease Neuroimaging Initiative (ADNI): clinical characterization. Neurology. 2010;74(3):201‐209. doi: 10.1212/WNL.0b013e3181cb3e25 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Weber CJ, Carrillo MC, Jagust W, et al. The Worldwide Alzheimer's Disease Neuroimaging Initiative: ADNI‐3 updates and global perspectives. Alzheimers Dement Transl Res Clin Interv. 2021;7(1):e12226. doi: 10.1002/trc2.12226 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Aisen PS, Petersen RC, Donohue M, Weiner MW; Alzheimer's Disease Neuroimaging Initiative . Alzheimer's Disease Neuroimaging Initiative 2 clinical core: progress and plans. Alzheimers Dement. 2015;11(7):734‐739. doi: 10.1016/j.jalz.2015.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Sivakumaran S, Hanan NJ, Cui Z, et al. The Critical Path for Alzheimer's Disease (CPAD) Consortium: a platform for pre‐competitive data sharing, standardization, and analysis to support quantitative tools for AD drug development. Alzheimers Dement. 2021;17(S9):e051903. doi: 10.1002/alz.051903 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

ALZ-21-e14467-s001.pdf (1.7MB, pdf)

Articles from Alzheimer's & Dementia are provided here courtesy of Wiley

RESOURCES