Abstract
Background:
The Learning Early About Peanut Allergy (LEAP) trial showed that early dietary introduction of peanut reduced the risk of developing peanut allergy by age 60 months in infants at high risk for peanut allergy. In this secondary analysis of LEAP data, we aimed to determine risk subgroups within these infants and estimate their respective intervention effects of early peanut introduction.
Methods:
LEAP raw data were retrieved from ITNTrialShare.org. Conditional random forest was applied to participants in the peanut avoidance arm to select statistically important features for the classification and regression tree (CART) analysis to group infants based on their risk of peanut allergy at 60 months of age. Intervention effects were estimated for each derived risk subgroup using data from both arms. Our main model was generated based on baseline data when the participants were 4-11 months old. Specific IgE measurements were truncated to account for the limit of detection commonly used by laboratories in clinical practice.
Results:
The model found infants with higher predicted probability of peanut allergy at 60 months of age had a similar relative risk reduction, but a greater absolute risk reduction in peanut allergy with early introduction of peanut, than those with lower probability. The intervention effects were significant across all risk subgroups. Participants with baseline peanut sIgE ≥0.22 kU/L (n=78) had an absolute risk reduction of 40.4% (95% CI 27.3, 51.9) whereas participants with baseline peanut sIgE<0.22 kU/L and baseline Ara h 2 sIgE <0.10 kU/L (n=226) had an absolute risk reduction of 6.5% (95% CI 2.6, 11.0). These findings were consistent in sensitivity analyses using alternative models.
Conclusion:
In this study, risk subgroups were determined among infants from the LEAP trial based on the probability of developing peanut allergy and the intervention effects of early peanut introduction were estimated. This may be relevant for further risk assessment and personalized clinical decision-making.
Keywords: Early introduction, Intervention effect estimation, Learning Early About Peanut Allergy (LEAP), Peanut allergy, Prevention, Risk subgroups
Graphical Abstract

Introduction
The prevalence of food allergy has increased over the past few decades, and studies indicate that hospitalizations due to food-induced anaphylaxis have also increased worldwide.1,2 Peanut allergy often manifests in early childhood and only about 20% of patients will outgrow their peanut allergy.3,4
In the early 2000s, delaying the oral introduction of allergens was the primary food allergy prevention strategy, but this recommendation was later withdrawn in many countries in the mid-2000s due to lack of evidence supporting the recommendation.5,6 In the past 10 years, several clinical trials have now shown a beneficial effect of early and sustained consumption of egg and peanut for the prevention of food allergy to these allergens.7–9 Among these studies, the findings of the Learning Early About Peanut Allergy (LEAP) trial showed that the prevalence of peanut allergy could be significantly reduced in high-risk patients with or without evidence of sensitization to peanut.7 Guidelines worldwide have since changed now recommending peanut introduction in infancy.10–13
Predictive modeling of peanut allergy focused on identifying risk factors and the utility of biomarkers as tools for peanut allergy diagnosis at the time of testing has been extensively studied.14–20 However, a clinical decision model that can be used for identifying subgroups of patients who may have varying risks for future peanut allergy and benefit differently from early introduction of peanut is lacking. Recent studies have focused on the effect of early introduction of peanut within pre-specified subgroups.21,22 A prognostic model that can predict the probability of peanut allergy development in high-risk infants avoiding peanut consumption has not been reported. This would allow for examination of the heterogeneity of the effect of intervening with early peanut introduction. Such a model could be part of the shared decision making in the approach to peanut allergy and a useful tool for the prevention of peanut allergy via early introduction. The goal of this study was to identify risk subgroups within infants at high-risk for peanut allergy according to their predicted probability of developing peanut allergy if avoiding peanut. To do this, we performed a secondary analysis of publicly available data from the LEAP trial, a randomized clinical trial with well-characterized clinical data.7 We additionally estimated the intervention effect of early introduction of peanut for the risk subgroups that we identified.
Methods
Study population and samples
The LEAP study was a randomized controlled trial studying whether early introduction of peanut could reduce the prevalence of peanut allergy at age 60 months among high-risk infants. The trial enrolled 640 infants from 4 to 11 months of age with severe eczema, egg allergy, or both diagnoses, and a peanut skin prick test (SPT) of ≤4mm, and randomized them to peanut consumption (i.e. intervention) or peanut avoidance until 60 months of age.7 In this work, the intention-to-treat population of 314 participants from the avoidance arm was used to develop prognostic models to group the infants according to their predicted risk of peanut allergy at 60 months. To calculate intervention effects, a sub-population with 307 participants from the consumption arm (after removal of 7 participants who reacted to the baseline peanut oral food challenge and never received the intervention), was utilized along with participants in the avoidance arm.
The individual participant-level data of LEAP are made available through ImmPort (SDY660) and ITN TrialShare (ITN032AD, www.itntrialshare.org) from where the data were retrieved.23–25 More details about the study population and data preprocessing can be found in the Supplemental methods and Table S1.
Throughout the manuscript, ‘baseline’ refers to measurements at the start of the LEAP trial when the participants were between 4 and 11 months old. ‘12-month’ and ‘60-month’ refer to the visits in LEAP when the participants were 12 and 60 months old, respectively.
Data analysis
The primary outcome is the allergy status at 60 months of age as determined by oral food challenge or diagnostic algorithm in the LEAP trial. The data analysis was carried out in two stages (Figure 1).
Figure 1. Analysis overview.

Raw data from the LEAP trial were downloaded from ITN TrialShare and preprocessed. Data from the baseline and/or 12-month (i.e., when the participants were 12 months old) visits were used to generate 4 different models. For each model, the first stage was the selection of important variables (see supplement for a complete list of included variables) for predicting peanut allergy status at 60 months of age using a conditional random forest on the participants from the avoidance arm. Important variables were then used to create a decision tree to group the infants in the avoidance arm based on their risk of peanut allergy at 60 months of age. In the second stage, for models including only baseline data, the intervention effects of early peanut introduction were estimated for each risk subgroup (i.e., terminal node in the decision tree).
The first stage applied classification and regression tree (CART) modeling to derive risk subgroups of infants based on their predicted risk of peanut allergy at 60 months if avoiding peanut using data from 314 participants in the avoidance arm. A conditional random forest (RF) was used to select variables, based on importance scores, that were then included in the CART modeling. The variables that were included in this variable selection step via RF are shown in Table 1. For the main model (model 1), all immunoglobulin E (IgE) baseline data were truncated to account for the limit of detection (LOD) commonly used for IgE measurements by laboratories in clinical practice (i.e., 0.09 kU/L) to derive a model with more clinical utility than when relying on a resolution of values at the lower limit of the measurements that is usually not available in clinical practice.
Table 1.
Variables presented to the RF for variable selection for the main model (model 1)
| Demographic variables | Age, sex, race |
| Blood biomarkers | Peanut sIgE, peanut specific IgG4, sIgE to Ara h 1, Ara h 2, Ara h 3, Ara h 8, and Ara h 9 |
| Clinical variables | Severe eczema status, egg allergy status, SPT stratum |
In the second stage, utilizing data from participants from the avoidance and consumption arms, the intervention effect was estimated for each risk subgroup identified from the model using baseline data with LOD adjustment (model 1) from the first stage. Given the RCT design, participants randomized to receiving the intervention are considered exchangeable to those randomized to the control. However, the balance may not be maintained in each of the subgroups. To ensure the covariate balance, we applied stabilized inverse probability treatment weighting (sIPTW) and estimated the averaged treatment effect after ensuring the covariate balance in each risk subgroup. See Supplemental methods for details for each stage of data analysis.
Sensitivity analysis
We performed analyses as described in the first stage to develop two additional models: one based solely on 12-month data (model 2) and another combining both baseline and 12-month data (model 3). These models were to assess the impact of 12-month data and change in covariates between baseline and 12 months of age on predictions of peanut allergy at 60 months of age.
Additionally, we examined a model using baseline data without sIgE LOD modification (model 4) to determine its influence on peanut allergy prediction at 60 months and intervention effects. Details are available in the Supplemental methods.
Results
Study population
The baseline characteristics of 314 participants in the avoidance and 307 participants in the consumption arm of the LEAP trial were shown in Tables 2 and S2, respectively, and further described in the corresponding section of the supplemental results and Figure S1.
Table 2.
Demographics and clinical characteristics of participants in the avoidance arm
| Variable | Non-allergic | Allergic | P* | Overall |
|---|---|---|---|---|
| N | 260 | 54 | 314 | |
|
| ||||
| Sex, male, no. (%) | 164 (63.1) | 39 (72.2) | 0.262 | 203 (64.6) |
|
| ||||
| Race, no. (%) | 0.002 | |||
| White | 207 (79.6) | 31 (57.4) | 238 (75.8) | |
| Asian | 6 (2.3) | 5 (9.3) | 11 (3.5) | |
| Black | 20 (7.7) | 6 (11.1) | 26 (8.3) | |
| Mixed | 27 (10.4) | 12 (22.2) | 39 (12.4) | |
|
| ||||
| Age, month (median [IQR]) | 7.80 [6.47, 9.11] | 7.84 [5.96, 8.85] | 0.49 | 7.82 [6.42, 9.10] |
|
| ||||
| SPT-positive Stratum, no. (%) | 33 (12.7) | 18 (33.3) | <0.001 | 51 (16.2) |
|
| ||||
| Eczema, no. (%) | 225 (86.5) | 52 (96.3) | 0.073 | 277 (88.2) |
|
| ||||
| Egg allergy, no. (%) | 160 (61.5) | 43 (79.6) | 0.018 | 203 (64.6) |
|
| ||||
| Peanut sIgE at baseline, kU/L (median [IQR]) | 0.02 [0.00, 0.13] | 0.43 [0.08, 2.35] | <0.001 | 0.03 [0.00, 0.31] |
|
| ||||
| Peanut sIgE at 12 months, kU/L (median [IQR]) | 0.02 [0.00, 0.15] | 1.29 [0.26, 3.97] | <0.001 | 0.05 [0.00, 0.43] |
|
| ||||
| Ara h 1 sIgE at baseline, kU/L (median [IQR]) | 0.00 [0.00, 0.00] | 0.03 [0.00, 0.13] | <0.001 | 0.00 [0.00, 0.02] |
|
| ||||
| Ara h 1 sIgE at 12 months, kU/L (median [IQR]) | 0.00 [0.00, 0.02] | 0.10 [0.02, 0.51] | <0.001 | 0.00 [0.00, 0.04] |
|
| ||||
| Ara h 2 sIgE at baseline, kU/L (median [IQR]) | 0.00 [0.00, 0.04] | 0.04 [0.03, 0.12] | <0.001 | 0.00 [0.00, 0.04] |
|
| ||||
| Ara h 2 sIgE at 12 months, kU/L (median [IQR]) | 0.00 [0.00, 0.03] | 0.05 [0.03, 0.26] | <0.001 | 0.00 [0.00, 0.04] |
|
| ||||
| Ara h 3 sIgE at baseline, kU/L (median [IQR]) | 0.00 [0.00, 0.02] | 0.02 [0.00, 0.08] | <0.001 | 0.00 [0.00, 0.02] |
|
| ||||
| Ara h 3 sIgE at 12 months, kU/L (median [IQR]) | 0.00 [0.00, 0.02] | 0.05 [0.02, 0.25] | <0.001 | 0.00 [0.00, 0.04] |
|
| ||||
| Ara h 8 sIgE at baseline, kU/L (median [IQR])^ | 0.00 [0.00, 0.00] | 0.00 [0.00, 0.00] | 0.012 | 0.00 [0.00, 0.00] |
|
| ||||
| Ara h 8 sIgE at 12 months, kU/L (median [IQR]) | 0.00 [0.00, 0.00] | 0.00 [0.00, 0.00] | 0.001 | 0.00 [0.00, 0.00] |
|
| ||||
| Ara h 9 sIgE at baseline, kU/L (median [IQR]) | 0.00 [0.00, 0.00] | 0.00 [0.00, 0.00] | 0.01 | 0.00 [0.00, 0.00] |
|
| ||||
| Ara h 9 sIgE at 12 months, kU/L (median [IQR]) | 0.00 [0.00, 0.00] | 0.00 [0.00, 0.03] | <0.001 | 0.00 [0.00, 0.00] |
|
| ||||
| Peanut sIgG4 at baseline, μg/L (median [IQR]) | 70.00 [70.00, 70.00] | 70.00 [70.00, 70.00] | 0.005 | 70.00 [70.00, 70.00] |
|
| ||||
| Peanut sIgG4 at 12 months, μg/L (median [IQR]) | 70.00 [70.00, 70.00] | 70.00 [70.00, 70.00] | 0.002 | 70.00 [70.00, 70.00] |
|
| ||||
| Peanut wheal at baseline, mm (median [IQR]) | 0.00 [0.00, 0.00] | 0.00 [0.00, 2.00] | <0.001 | 0.00 [0.00, 0.00] |
|
| ||||
| Peanut wheal at 12 months, mm (median [IQR]) | 0.00 [0.00, 0.00] | 2.00 [0.00, 4.00] | <0.001 | 0.00 [0.00, 1.00] |
P values by Wilcoxon rank-sum test or Chi-square test. P-values calculated at logarithmic scale for serum biomarkers.
For Ara h 8 sIgE at baseline, there were 8 (14.8%) allergic participants and 15 (5.8%) non-allergic participants with values outside IQR; for Ara h 8 sIgE at 12 months, there were 10 (18.5%) allergic participants and 17 (6.5%) non-allergic participants with values outside IQR; for Ara h 9 sIgE at baseline, there were 10 (18.5%) allergic participants and 21 (8.1%) non-allergic participants with values outside IQR; for peanut sIgG4 at baseline, there were 8 (14.8%) allergic participants and 12 (4.6%) non-allergic participants with values outside IQR; for peanut sIgG4 at 12 months, there were 12 (22.2%) allergic participants and 21 (8.1%) non-allergic participants with values outside IQR;
Grouping infants avoiding peanut by their probability of peanut allergy at 60 months via CART modeling
A model using baseline data with LOD was developed to group the infants randomized to peanut avoidance based on their estimated probability of peanut allergy at 60 months. In the RF analysis, Ara h 2 and peanut sIgE at baseline were selected as important variables in predicting peanut allergy status at 60 months and were therefore used in the CART analysis (Figure 2A). A complexity parameter of 0.00769, corresponding to two splits, was found to be optimal for the CART model (Figure S2A), resulting in a tree with three terminal nodes (Figure 2B). The CART model had an AUC of 0.76, a sensitivity of 0.72, and specificity of 0.79 for predicting peanut allergy at 60 months (Table S3). When the peanut sIgE at baseline was ≥0.22 kU/L, the predicted probability of peanut allergy at 60 months was 41.38% (95% CI: [31.08%, 52.45%]). Participants with a peanut sIgE at baseline of <0.22 kU/L and Ara h 2 sIgE ≥0.10 kU/L showed a predicted probability of peanut allergy at 60 months of 50% (95% CI: [13.95%, 86.05%]). Participants with a baseline peanut sIgE <0.22 kU/L and Ara h 2 sIgE <0.10 kU/L had a probability of peanut allergy at 60 months of 6.79% (95% CI: [3.98%, 11.66%]).
Figure 2. Permutation importance from conditional RF variable selection for weighted CART and decision tree using baseline data with limit-of-detection adjustment.

A. Permutation importance score from the conditional RF. Important variables (peanut and Ara h 2 sIgE) were selected for the weighted CART analysis. B. Decision tree of CART analysis and intervention effect estimation per terminal node. sIgE in kU/L. Details in supplement.
Utilizing the predicted peanut allergy status from the CART model and the measured peanut allergy status at 60 months, the participants fell into four categories in the confusion matrix (true positive [N=39], true negative [N=206], false positive [N=54], and false negative [N=15]). Given that peanut sIgE at baseline was selected as one of the important variables, we explored longitudinal peanut sIgE levels and average trajectories for the participants in each one of the four cells in the confusion matrix (Figure 3). Details on peanut sIgE trajectories are in the supplemental results.
Figure 3. Trajectories of peanut sIgE grouped by the peanut allergy status predicted by the model using baseline data with LOD (model 1) and the observed allergy status at 60 months of age.

Each gray line represents one of the 314 infants from the avoidance arm used in the development of the CART. The blue line represents the average peanut sIgE.
Early peanut consumption intervention effects estimation
To estimate the intervention effect for each risk subgroup, the participants were classified using the CART model (Figure 2B). Of the 307 participants from the consumption arm, 78 were classified to the third terminal node and 4 of those participants were observed to be peanut allergic at 60 months. For this node, the data suggested that, among the 165 participants from both arms (n=87 avoidance, n=78 consumption arm), 3.88% (95% CI: [1.02%, 11.58%]) would be allergic at 60 months had these participants received the intervention, while their estimated proportion of peanut allergy had these participants not received the intervention was 44.29% (95% CI: [33.65%, 55.45%]). This suggests that the early peanut consumption intervention among the participants with a baseline peanut sIgE ≥0.22 kU/L could lead to a −40.41% difference in proportion of peanut allergy at 60 months of age with a 95% CI of [−51.93%, −27.28%]. The probability of peanut allergy at 60 months among participants classified to the second terminal node with peanut sIgE <0.22 kU/L and Ara h 2 sIgE ≥0.10 kU/L could be reduced by an absolute value of 50.00% (95% CI: [−86.05%, −27.85%]) with the early peanut consumption intervention. The first terminal node, predicted by peanut sIgE <0.22 kU/L and Ara h 2 sIgE <0.10 kU/L, had an average intervention effect of −6.49% (95% CI [−11.00%, −2.55%]). Due to the small number of participants classified into the second terminal node, the intervention effect calculation was performed without sIPTW adjustment. For the other two nodes, the SMD for all baseline characteristics were reduced after sIPTW adjustment (Figure S3A).
Sensitivity analysis
Model 2, which was developed using 12-month data only (Figures S4–S5), had higher AUC than that of model 1 (Figure 4, Table S3). For model 3 (Figures S6–S7), using both baseline and 12-month data to account for changes in biomarkers between visits, RF selected two variables reflecting changes between the two visits as important (i.e., difference in log Ara h 2 and slope of Ara h 2 sIgE), but they were not used by the final CART model (Figure S6B). Lastly, the optimal decision tree of model 4 using the same baseline data as model 1 without LOD modification closely resembled model 1 except for a lower splitting value for the Ara h 2 sIgE node (Figures S8–S9), and an insignificant intervention effect in the first terminal node. More detailed results of the sensitivity analysis are in the supplemental results.
Figure 4. Receiver operating characteristic curves for the 4 CART models.

‘12m data’ refers to data when the participants were 12 months old. AUC: area under the receiver operating characteristic curve.
Discussion
We utilized publicly available data from the LEAP trial to develop a model for grouping infants into risk subgroups based on their estimated risk of peanut allergy at 60 months of age if dietary peanut is avoided and estimated the individualized intervention effect of early introduction of peanut for each risk subgroup predicted by the model. We used a machine learning approach based on RF and CART. RF is an ensemble learning method used for both classification and regression by creating multiple decision trees during training and then using their combined predictions to make more accurate and robust predictions. Unlike standard regression procedures, RF allowed us to measure the importance of predictors. CART is also a type of decision tree algorithm that has been widely used due to high interpretability without the need of assumptions for underlying data distribution. A similar machine learning approach has previously been used in a study to predict the severity of peanut challenge outcomes.26 Our analysis provided patient-centered estimates of intervention effects accounting for all independent variables selected by RF importance measure and utilized in the CART analyses simultaneously. In contrast, the one variable at a time approach of traditional subgroup analysis is more susceptible to false positive and false negative results.27
Two recent studies have focused on the intervention effect of early introduction of peanut within prespecified subgroups using data from multiple trials. Logan et al. estimated the intervention effect among a number of different prespecified risk strata in pooled data from the LEAP and Enquiring About Tolerance (EAT) trials.7,28 Their causal effect subgroup analyses of the per-protocol population showed graded and significant beneficial effects of early introduction of peanut across all eczema severity and baseline peanut sIgE subgroups with the intervention being more beneficial among severe patients.21 These findings are consistent with the results of our study in that the intervention effects were graded. Contrasting Logan et al.’s methods, we used a machine learning approach and applied real-world LOD to determine the risk subgroups based on data from the avoidance arm, resulting in different sets of risk subgroups between the studies.
Robert et al. studied the timing of introduction of peanut for peanut allergy prevention using data from LEAP, Peanut Allergy Sensitization (PAS), and EAT studies, and their model indicated that the amount of reduction in peanut allergy in the general population decreased with every month of delayed introduction.22 Furthermore, their model highlighted that the younger the participants, the higher their risk of peanut allergy by 36/60 months if peanuts were not introduced, as shown in Figure E3 of their paper.22 This aligns with our findings that interventions were especially beneficial for those with higher probability of peanut allergy.
Both studies suggest a greater intervention effect the earlier peanut is introduced into the diet.21,22 To account for variation in age at baseline (4-11 months), age was included in the list of variables used in our analysis. However, age was not selected by the RF variable selection process. This suggests that, while increasing age of introduction was shown to be associated with decreasing preventive benefit of early peanut introduction, it may not be as important as the variables that were selected for stratifying infants within the LEAP trial into risk subgroups when avoiding peanut.
Greenhawt et al. performed a secondary analysis on the LEAP trial data using a logistic regression model with nonlinear interaction and found that the difference of probability of peanut tolerance between intervention and avoidance arms grew larger with increasing peanut SPT wheal size, suggesting that early peanut introduction may be more beneficial in subjects with greater peanut SPT wheal size.29 Our findings were similar in that there was a better intervention effect for the risk subgroups with higher predicted probability of peanut allergy characterized by higher peanut sIgE or higher Ara h 2 sIgE if avoiding peanut. Sever et al. developed a statistical model to impute peanut allergy status using LEAP data from the time of study outcome assessment when a food challenge result was not available, differing from our approach predicting peanut allergy at 60 months using baseline data.30 Additionally, peanut sIgE and Ara h 2 sIgE were identified by RF and used by the CART as predictors of peanut allergy at 60 months in our main analysis. When measured 1 week to 6 months prior to diagnosis, both biomarkers have been previously identified alone or in combination with other variables as good predictors of peanut allergy diagnosis.14–20
Our main model relied on baseline data in which sIgE measurements were truncated to reflect the LOD used by laboratories in clinical practice to increase clinical utility. We performed a sensitivity analysis utilizing baseline data without adjustment for real-world LOD (model 4). The resulting model was very similar to our main model with the exception that one of the risk subgroups (peanut sIgE <0.22 kU/L, Ara h 2 sIgE <0.024 kU/L) had a statistically insignificant intervention effect of −2.18% (95% CI:[−6.69, 2.23]) implying that there might be a subgroup of infants within the LEAP participants that has a low risk of peanut allergy at 60 months of age even without intervention. However, the risk of peanut allergy for this subgroup was nevertheless greatly reduced by 62.46% from 3.49% to 1.31% by the intervention.
The majority of misclassification in our model came from false positive predictions. Given that the model predicted the probability of peanut allergy at 60 months based on baseline data and showed that children significantly benefit from early introduction of peanut across all risk subgroups, children with a false positive prediction would likely consume peanut as part of early introduction without adverse reaction. Conversely, false negative prediction may result in infrequent or unsustained peanut consumption due to caregiver perception of low risk. Additional discussion on the main model and sensitivity analysis is available in the supplement.
After the findings from the LEAP trial showed the significant beneficial effect of early oral introduction and sustained exposure of peanut for prevention, the complementary feeding recommendations in the United States changed to recommend earlier oral peanut introduction, particularly among those at high-risk. Guidelines were also updated in the UK, Australia, and Japan.10–12,31,32 Despite these new recommendations, the acceptability of early peanut introduction remains unclear on the part of caregivers.31 The EAT trial had a below 50% adherence rate to the protocol for high-dose consumption due to feeding difficulties and symptoms with food consumption.33 A survey among caregivers found that only 31% showed willingness to introduce peanut before or at 6 months of age with 40% of caregivers showing willingness to introduce peanut after 11 months of age.34 Furthermore, the intervention in LEAP trial was early and sustained introduction of peanut; emphasizing maintenance of consistent exposure, which requires commitment and persistence, and a recent study implied that early introduction of peanut alone without sustained exposure might not be enough to prevent peanut allergy.35 The models from this study may be helpful to aid clinicians in developing personalized approaches based on individual biomarker data, for educating caregivers about the benefits of early introduction and to counter any possible hesitancy.
To note is that all 6 peanut allergy cases in the study population of the intervention arm discontinued the intervention during the study period, with 5 out of the 6 participants discontinuing the intervention due to peanut-induced allergic symptoms.7 Thus, the intervention effects estimated in this study include the effect of being unable to comply with the protocol due to adverse symptoms associated with the intervention. For infants who cannot tolerate early introduction of peanut due to allergy, the focus should transition from prevention to treatment. Chua et al. suggested that peanut oral immunotherapy is an effective and safe intervention for infants who do not tolerate early introduction of peanut.36 On the other hand, three participants that discontinued the intervention due to adverse symptoms were non-allergic at 60 months.
Another limitation is the lack of additional public datasets that would allow us to test the model’s performance against an external validation dataset, which limits the model’s generalizability. Additionally, the dataset in this study had a small number of cases, which resulted in even smaller sample size in some of the terminal nodes and large uncertainty in the intervention effect, but the effects in some risk subgroups were still significantly beneficial due to how effective the intervention was.
Lastly, the study results are limited to high-risk infants as specified by the inclusion and exclusion criteria of the LEAP trial. Specifically, the LEAP trial included children aged 4-11 months who had an egg allergy, severe eczema, or both diagnoses. The eligibility criteria intended to identify a cohort of infants with an elevated risk for peanut allergy for enrollment into the LEAP study. As a result, in our model, the identified strata with low probability of peanut allergy do not represent a low-risk group in the general population. The LEAP trial also excluded infants with a peanut SPT wheal of greater than 4 mm as they were considered likely to have peanut allergy.37 Our models cannot be applied to those who were excluded by the LEAP trial’s criteria, including infants with a peanut SPT wheal >4 mm. Further studies are needed to see if the results might be accurate and consistent for these excluded infants. Given that the pediatric population that our model applies to is considered high-risk, it is likely that they are being evaluated by both pediatricians and allergists, who would obtain sIgE measurements, making the model applicable to clinical practice.
Conclusions
In conclusion, we used publicly available data from the LEAP trial to develop a prediction model for infants at high-risk for peanut allergy grouping them into risk subgroups based on their estimated risk of peanut allergy at 60 months of age, if they avoided peanut. The calculated individualized intervention effects were significant across all risk subgroups. These results support the messaging that early and sustained introduction of peanut be recommended to infants at high-risk for peanut allergy as defined by the LEAP trial, regardless of their probability of peanut allergy development. In addition, the intervention effect results showed that infants with higher probability of peanut allergy according to our model benefited more from the early introduction of peanut. Our findings suggest that within this high-risk population, further stratification can occur to identify infants for which the early intervention may be most important.
Supplementary Material
Key messages.
Machine learning identified risk subgroups for peanut allergy at 60 months within LEAP participants.
Baseline peanut and Ara h 2-specific-IgE were selected as predictors of 60-months peanut allergy.
Infants with higher predicted probability of peanut allergy benefit more from the early introduction intervention.
Acknowledgments:
We thank Tingting Qiu, MPH, for her help with data management. We thank the study investigators for making the individual participant level data from the LEAP trial available on ITN TrialShare.
Funding:
This work was supported in part by the National Institutes of Health [grant numbers UH2AI145837, 2KL2TR001426-05A1]; and the Cincinnati Children’s Hospital Medical Center [Research Innovation and Pilot Funding (RIP)]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Abbreviations:
- CART
Classification and Regression Tree
- CI
Confidence Interval
- EAT
Enquiring About Tolerance
- LEAP
Learning Early About Peanut Allergy
- LOD
Limit of detection
- PAS
Peanut Allergy Sensitization
- RF
Random forest
- sIgE
Specific immunoglobulin E
- sIPTW
Stabilized inverse probability treatment weighting
- SPT
Skin prick test
Footnotes
Conflict of interest
The authors indicate no relevant conflict of interest.
Reference
- 1.Tang MLK, Mullins RJ. Food allergy: is prevalence increasing? Internal Medicine Journal. 2017;47(3):256–261. doi: 10.1111/imj.13362 [DOI] [PubMed] [Google Scholar]
- 2.Pouessel G, Turner PJ, Worm M, et al. Food-induced fatal anaphylaxis: From epidemiological data to general prevention strategies. Clinical & Experimental Allergy. 2018;48(12):1584–1593. doi: 10.1111/cea.13287 [DOI] [PubMed] [Google Scholar]
- 3.Deschildre A, Elegbédé CF, Just J, et al. Peanut-allergic patients in the MIRABEL survey: characteristics, allergists’ dietary advice and lessons from real life. Clin Exp Allergy. 2016;46(4):610–620. doi: 10.1111/cea.12681 [DOI] [PubMed] [Google Scholar]
- 4.Venter C, Maslin K, Patil V, et al. The prevalence, natural history and time trends of peanut allergy over the first 10 years of life in two cohorts born in the same geographical location 12 years apart. Pediatr Allergy Immunol. 2016;27(8):804–811. doi: 10.1111/pai.12616 [DOI] [PubMed] [Google Scholar]
- 5.Fisher HR, Keet CA, Lack G, du Toit G. Preventing Peanut Allergy: Where Are We Now? J Allergy Clin Immunol Pract. 2019;7(2):367–373. doi: 10.1016/j.jaip.2018.11.005 [DOI] [PubMed] [Google Scholar]
- 6.Sampson HA. Managing peanut allergy: demands aggressive intervention in prevention and treatment. British Medical Journal. 1996;312(7038):1050–1052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Du Toit G, Roberts G, Sayre PH, et al. Randomized Trial of Peanut Consumption in Infants at Risk for Peanut Allergy. New England Journal of Medicine. 2015;372(9):803–813. doi: 10.1056/NEJMoa1414850 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Natsume O, Kabashima S, Nakazato J, et al. Two-step egg introduction for prevention of egg allergy in high-risk infants with eczema (PETIT): a randomised, double-blind, placebo-controlled trial. Lancet. 2017;389(10066):276–286. doi: 10.1016/S0140-6736(16)31418-0 [DOI] [PubMed] [Google Scholar]
- 9.Perkin MR, Logan K, Tseng A, et al. Randomized Trial of Introduction of Allergenic Foods in Breast-Fed Infants. New England Journal of Medicine. 2016;374(18):1733–1743. doi: 10.1056/NEJMoa1514210 [DOI] [PubMed] [Google Scholar]
- 10.Australasian Society of Clinical Immunology and Allergy. ASCIA Guidelines for infant feeding and allergy prevention. Accessed October 22, 2022. https://www.allergy.org.au/hp/papers/infant-feeding-and-allergy-prevention-clinical-update
- 11.Ebisawa M, Ito K, Fujisawa T, Committee for Japanese Pediatric Guideline for Food Allergy, The Japanese Society of Pediatric Allergy and Clinical Immunology, The Japanese Society of Allergology. Japanese guidelines for food allergy 2017. Allergol Int. 2017;66(2):248–264. doi: 10.1016/j.alit.2017.02.001 [DOI] [PubMed] [Google Scholar]
- 12.Scientific Advisory Committee on Nurtition. Assessing the health benefits and risks of the introduction of peanut and hen’s egg into the infant diet before six months of age in the UK: A Joint Statement from the Scientific Advisory Committee on Nutrition and the Committee on Toxicity of Chemicals in food, Consumer products and the Environment. Published online 2017. Accessed October 22, 2022. https://cot.food.gov.uk/sites/default/files/jointsacncotallergystatementfinal2.pdf [Google Scholar]
- 13.Fleischer DM, Chan ES, Venter C, et al. A Consensus Approach to the Primary Prevention of Food Allergy Through Nutrition: Guidance from the American Academy of Allergy, Asthma, and Immunology; American College of Allergy, Asthma, and Immunology; and the Canadian Society for Allergy and Clinical Immunology. J Allergy Clin Immunol Pract. 2021;9(1):22–43.e4. doi: 10.1016/j.jaip.2020.11.002 [DOI] [PubMed] [Google Scholar]
- 14.Martinet J, Couderc L, Renosi F, Bobée V, Marguet C, Boyer O. Diagnostic Value of Antigen-Specific Immunoglobulin E Immunoassays against Ara h 2 and Ara h 8 Peanut Components in Child Food Allergy. Int Arch Allergy Immunol. 2016;169(4):216–222. doi: 10.1159/000446181 [DOI] [PubMed] [Google Scholar]
- 15.Beyer K, Grabenhenrich L, Härtl M, et al. Predictive values of component-specific IgE for the outcome of peanut and hazelnut food challenges in children. Allergy. 2015;70(1):90–98. doi: 10.1111/all.12530 [DOI] [PubMed] [Google Scholar]
- 16.Rajput S, Sharma V, Hughes SM, Ewing CI, Arkwright PD. Allergy testing in predicting outcome of open food challenge to peanut. J Allergy Clin Immunol. 2018;141(1):457–458. doi: 10.1016/j.jaci.2017.05.001 [DOI] [PubMed] [Google Scholar]
- 17.Preece K, Bhatia R, Belcher J, et al. The fraction of exhaled nitric oxide improves prediction of clinical allergic reaction to peanut challenge in children. Clin Exp Allergy. 2014;44(3):371–380. doi: 10.1111/cea.12258 [DOI] [PubMed] [Google Scholar]
- 18.Johannsen H, Nolan R, Pascoe EM, et al. Skin prick testing and peanut-specific IgE can predict peanut challenge outcomes in preschoolchildren with peanut sensitization. Clin Exp Allergy. 2011;41(7):994–1000. doi: 10.1111/j.1365-2222.2011.03717.x [DOI] [PubMed] [Google Scholar]
- 19.DunnGalvin A, Daly D, Cullinane C, et al. Highly accurate prediction of food challenge outcome using routinely available clinical data. J Allergy Clin Immunol. 2011;127(3):633-639.e1-3. doi: 10.1016/j.jaci.2010.12.004 [DOI] [PubMed] [Google Scholar]
- 20.Klemans RJB, Otte D, Knol M, et al. The diagnostic value of specific IgE to Ara h 2 to predict peanut allergy in children is comparable to a validated and updated diagnostic prediction model. J Allergy Clin Immunol. 2013;131(1):157–163. doi: 10.1016/j.jaci.2012.08.010 [DOI] [PubMed] [Google Scholar]
- 21.Logan K, Bahnson HT, Ylescupidez A, et al. Early introduction of peanut reduces peanut allergy across risk groups in pooled and causal inference analyses. Allergy. n/a(n/a). doi: 10.1111/all.15597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Roberts G, Bahnson HT, Toit GD, et al. Defining the window of opportunity and target populations to prevent peanut allergy. Journal of Allergy and Clinical Immunology. 2023;151(5):1329–1336. doi: 10.1016/j.jaci.2022.09.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bhattacharya S, Andorf S, Gomes L, et al. ImmPort: disseminating data to the public for the future of immunology. Immunol Res. 2014;58(2-3):234–239. doi: 10.1007/s12026-014-8516-1 [DOI] [PubMed] [Google Scholar]
- 24.Bhattacharya S, Dunn P, Thomas CG, et al. ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Sci Data. 2018;5:180015. doi: 10.1038/sdata.2018.15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Asare AL, Carey VJ, Rotrosen D, Nepom GT. Clinical trial data access: Opening doors with TrialShare. J Allergy Clin Immunol. 2016;138(3):724–726. doi: 10.1016/j.jaci.2016.05.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Chinthrajah RS, Purington N, Andorf S, et al. Development of a tool predicting severity of allergic reaction during peanut challenge. Ann Allergy Asthma Immunol. 2018;121(1):69–76.e2. doi: 10.1016/j.anai.2018.04.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kent DM, Paulus JK, van Klaveren D, et al. The Predictive Approaches to Treatment effect Heterogeneity (PATH) Statement. Ann Intern Med. 2020;172(1):35–45. doi: 10.7326/M18-3667 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Perkin MR, Logan K, Bahnson HT, et al. Efficacy of the Enquiring About Tolerance (EAT) study among infants at high risk of developing food allergy. J Allergy Clin Immunol. 2019;144(6):1606–1614.e2. doi: 10.1016/j.jaci.2019.06.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Greenhawt M, Fleischer DM, Chan ES, et al. LEAPing through the looking glass: secondary analysis of the effect of skin test size and age of introduction on peanut tolerance after early peanut introduction. Allergy. 2017;72(8):1254–1260. doi: 10.1111/all.13100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Sever ML, Calatroni A, Roberts G, et al. Developing a Prediction Model for Determination of Peanut Allergy Status in the Learning Early About Peanut Allergy (LEAP) Studies. The Journal of Allergy and Clinical Immunology: In Practice. 2023;11(7):2217–2227.e9. doi: 10.1016/j.jaip.2023.04.032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Abrams EM, Shaker M, Greenhawt M, Mack DP. International Peanut Allergy Prevention, 6 Years After the Learning Early About Peanut Study. J Allergy Clin Immunol Pract. 2022;10(1):71–77. doi: 10.1016/j.jaip.2021.07.015 [DOI] [PubMed] [Google Scholar]
- 32.Togias A, Cooper SF, Acebal ML, et al. Addendum guidelines for the prevention of peanut allergy in the United States: Report of the National Institute of Allergy and Infectious Diseases-sponsored expert panel. J Allergy Clin Immunol. 2017;139(1):29–44. doi: 10.1016/j.jaci.2016.10.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Perkin MR, Bahnson HT, Logan K, et al. Factors influencing adherence in a trial of early introduction of allergenic food. Journal of Allergy and Clinical Immunology. 2019;144(6):1595–1605. doi: 10.1016/j.jaci.2019.06.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Greenhawt M, Chan ES, Fleischer DM, et al. Caregiver and expecting caregiver support for early peanut introduction guidelines. Ann Allergy Asthma Immunol. 2018;120(6):620–625. doi: 10.1016/j.anai.2018.03.001 [DOI] [PubMed] [Google Scholar]
- 35.Soriano VX, Peters RL, Moreno-Betancur M, et al. Association Between Earlier Introduction of Peanut and Prevalence of Peanut Allergy in Infants in Australia. JAMA. 2022;328(1):48–56. doi: 10.1001/jama.2022.9224 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Chua GT, Greenhawt M, Shaker M, et al. The Case for Prompt Salvage Infant Peanut Oral Immunotherapy Following Failed Primary Prevention. J Allergy Clin Immunol Pract. 2022;10(10):2561–2569. doi: 10.1016/j.jaip.2022.05.040 [DOI] [PubMed] [Google Scholar]
- 37.Du Toit G, Roberts G, Sayre PH, et al. Identifying infants at high risk of peanut allergy: the Learning Early About Peanut Allergy (LEAP) screening study. J Allergy Clin Immunol. 2013;131(1):135-143.e1-12. doi: 10.1016/j.jaci.2012.09.015 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
