Summary
The atopic march lacks early identification methods for high-risk children. In this study, we assessed whether the risk of atopic diseases in infants could be predicted using a polygenic score (PGS) for total immunoglobulin E (IgE) levels. The PGS estimated using the polygenic model generated by PRS-CS was significantly correlated with log-transformed IgE levels (ρ = 0.200, p < 2.2 × 10−16). Assessment of the risk from birth to 2 years of age in a Japanese birth cohort (n = 17,154) applying the estimated PGS revealed significantly elevated incidence risk ratios in the highest PGS quintile (Q5) compared with those in the reference quintiles (Q1–Q3) for food allergy (1.51-fold; 95% confidence interval: 1.30–1.76), atopic dermatitis (1.30-fold; 1.12–1.51), and both conditions (1.88-fold; 1.46–2.43). These findings address critical gaps in allergy and PGS research among non-European populations, suggesting the contribution of genetic predisposition to IgE production in early-onset allergic diseases and supporting the use of PGS in early intervention.
Keywords: allergic disease, birth cohort, HLA, phenome-wide association study, polygenic score, atopic march, immunoglobulin E, food allergy, atopic dermatitis, genetic predisposition
We report that genetic predisposition influencing serum immunoglobulin E (sIgE) levels is associated with the development of allergies in children, including food allergy and atopic dermatitis. Polygenic scores for sIgE may help identify children at high genetic risk, enabling early intervention to reduce allergy-related disease burden.
Introduction
Allergic diseases are common health issues in children, profoundly affecting their quality of life.1,2 Food allergies (FAs), for instance, affect approximately 10% of infants in the United States3 and Australia,4 with evidence suggesting rising prevalence rates in non-Western countries, although data from these regions remain limited.5
In childhood, allergic conditions often follow a sequential progression known as the “atopic march,” which reflects the natural history of allergic diseases.6,7 This progression typically begins with atopic dermatitis (AD [MIM: 603165]) and advances to immunoglobulin E (IgE)-mediated FAs, asthma (AS [MIM: 600807]), and allergic rhinitis (AR [MIM: 607154]).7,8 In particular, AD is strongly correlated with an increased risk of developing food-specific IgE antibodies and subsequent FAs,9,10 as well as other allergic conditions.11
The risk of allergic diseases varies among individuals and is significantly influenced by genetic predispositions. Family history is a well-established risk factor for atopic diseases.12,13 Recent genome-wide association studies (GWASs) have identified associated genetic factors and indicated the existence of a shared genetic background across allergic diseases.14,15 Among these, IgE—a key mediator in allergic diseases—has shown substantial heritability, ranging from 36% to 78% in segregation analyses16 and twin studies.17 Genetic variants associated with IgE levels have been identified in IL4R (MIM: 147781), FCER1A (MIM: 147140), RAD50 (MIM: 604040), STAT6 (MIM: 601512), IL13 (MIM: 147683], and HLA loci (HLA-A [MIM: 142800], HLA-G [MIM: 142871], and HLA-DQA2 [MIM: 613503]) using GWASs.18,19,20 IL4R encodes the alpha chain of the receptors for interleukin (IL)-4 and IL-13 and interacts with the JAK1/2/3-STAT6 pathway, which regulates IgE production and is implicated in the development of AS and other allergic diseases.21,22,23 These findings underscore the genetic overlap between total IgE levels and various allergic conditions.
Given the central role of IgE in allergic disease development, we hypothesized that evaluation of the genetic predisposition to elevated IgE levels would offer a promising strategy for predicting the overall risk of atopic diseases. To test this hypothesis, in this study, we aimed to develop and validate a polygenic score (PGS)-based model for the assessment of IgE levels and evaluate its potential for allergy risk prediction. Our study used data from one of the largest Asian birth cohorts, enabling the classification of children based on their genetic predisposition for IgE production and estimation of their allergy risks. This approach is particularly relevant in children, in whom genetic influences are pronounced due to limited environmental exposure, offering an opportunity for early and comprehensive risk assessment. By addressing the lack of data from non-European cohorts, our findings contribute to the growing field of personalized medicine and provide a foundation for targeted prevention strategies for childhood allergies, aiming to improve the quality of life of high-risk children through optimized early interventions.
Methods
Study population
The Tohoku Medical Megabank Community-Based Cohort (TMM CommCohort) study design has been described in previous studies.24,25 Briefly, residents from Iwate and Miyagi prefectures, located on the Pacific coast of northeast Japan, were recruited between May 2013 and March 2016. This study included three datasets within this cohort: TMM67K, TMM18K, and TMM9K. TMM67K included participants recruited during public health checkups, while TMM18K and TMM9K included individuals who visited assessment centers for the cohort study. The TMM9K dataset was used for training, and the remaining datasets were used for validation. Overlapping samples in TMM9K were excluded from other datasets. Phenotypes were determined using a baseline survey, and total IgE concentrations were measured using ImmunoCAP total IgE reagents.20 Affected individuals were identified based on self-reported medical histories obtained from self-administered questionnaires.25
The TMM Project Birth and Three-Generation (TMM BirThree) cohort study has been described previously.26 Briefly, pregnant women residing in Miyagi and Iwate prefectures were recruited into the cohort from 2013 to 2016, and genotypic and phenotypic data were collected from the participants, their newborns, and their families. The parent-reported medical histories of the newborns surveyed at approximately 1, 6, 12, and 24 months after birth were used for phenotyping. If a parent affirmed the question, “Has your child ever been diagnosed with any diseases by a clinician?” and selected immunological diseases like “allergic conjunctivitis, AR, pollinosis,” “AD,” “FA,” or “bronchial asthma,” the child was classified as an affected individual. Children with other responses were classified as control subjects, excluding instances where the parent did not answer the first question. The observation period started 1 month before the first survey date (approximately 1 month post-birth) and ended either when the event was reported or on the last survey day. Questionnaires submitted 36 months after birth were excluded from the study.
The study was approved by the institutional review boards of Iwate Medical University (HG H25-2 and MH2023-024) and the Tohoku University Tohoku Medical Megabank Organization (ToMMo; first edition: 2012-4-617, latest edition: 2021-4-113). Written consent was obtained from all participants in the TMM study. Personal genotype and phenotype data analyses were conducted exclusively using the ToMMo supercomputer, which is a standalone computing system. This study adhered to the ethical guidelines of the Declaration of Helsinki.
The datasets used in this study are not publicly accessible due to ethical constraints. However, access may be obtained upon approval from the Ethical Committee of Iwate Medical University, Ethical Committee of the Tohoku University Tohoku Medical Megabank Organization (ToMMo), and Materials and Information Distribution Review Committee of the TMM Project.
Genotyping and imputation
Participants from the TMM CommCohort were genotyped using the Axiom Japonica Array v.2 (JPAv2), as described in previous studies,27,28 whereas those in the TMM BirThree cohort were genotyped using either JPAv2 (“BirThree 1”) or the Axiom Japonica Array NEO (JPA-NEO; “BirThree 2”).29 Participants with a call rate < 0.95 and gender discrepancies between genotypic and cohort data were excluded. Genotypes failing to meet the quality control criteria (call rate < 0.99, Hardy-Weinberg equilibrium exact test p < 0.05, and minor-allele frequency [MAF] < 0.05) were also omitted. Pre-phasing and imputation were performed using SHAPEIT2 (r837) and IMPUTE4 (r300.3) with 3.5KJPNv2 haplotype reference panels.28,30,31 The variant positions in the genome were based on GRCh37.
PGS
To generate the PGS model, single-nucleotide polymorphism weights were estimated using a publicly available GWAS summary of 9,260 Japanese residencies20 in jMorp30 (ID: TGA000004). Candidate models were constructed as previously described32 using three methods: linkage disequilibrium (LD) pruning and thresholding (P+T), LDpred,33 and PRS-CS.34 For the P+T method, 24 models were generated under different LD pruning conditions using one of the six parameters for p values (1, 5 × 10−1, 5 × 10−2, 5 × 10−4, 5 × 10−6, or 5 × 10−8) and one of the four parameters for R2 values (0.2, 0.4, 0.6, or 0.8). For LDpred, seven models were generated for ρ values (1, 0.3, 0.1, 0.03, 0.01, 0.003, and 0.001). For PRS-CS, six models were generated for each ϕ value (1 × 10−1, 1 × 10−2, 1 × 10−4, 1 × 10−6, 1 × 10−8, and “auto”). Individual PGSs were calculated using Plink2 (v.2.00a2LM)35 for each model and were standardized to Z scores. Variants with low imputation quality (R2 < 0.3) were excluded using the “--minimac3-r2-filter 0.3” option. In addition, to avoid potential sample overlap with the GWAS of total IgE,20 individuals genotyped with the Illumina Infinium OmniExpressExome-8 BeadChip in the TMM CommCohort were excluded. Model fitness was assessed using Spearman’s rank correlation coefficient (ρ) between the PGS from each candidate model and the residuals of log-transformed and Z score-standardized total IgE levels, obtained from a linear regression model adjusted for age and sex in the TMM9K dataset.
For comparison, PGSs based on publicly available models from the PGS Catalog36 were calculated using PLINK2 (v.2.0.0-a.6.9). PGSs based on GRCh38/hg38 positions were converted to GRCh37 positions using CrossMap.37 Associations between each PGS and allergic phenotypes were evaluated using logistic regression adjusted for age, sex, and principal components (PCs) 1–10; in the analysis using the birth cohorts (BirThree 1 and 2), age was excluded from the adjustment.
Statistical analysis
A phenome-wide association study (PheWAS) of standardized PGSs was conducted using logistic regression for binomial phenotypes adjusted for age, sex, and the top 10 genotype PCs in R (v.4.2.1). The list of phenotypes was based on our previous study25 and updated to characterize the PGS for IgE levels, including immune-related, inflammatory, and other diseases relevant to internal medicine. In contrast, relatively rare cancers, diseases of the sensory organs (e.g., macular degeneration and deafness), obesity, and hypertension were excluded, as they were considered less relevant to this objective. The results from each validation dataset were pooled using the R package “meta” (v.6.2-1) with a fixed-effect model. To account for multiple testing in the PheWAS analysis, p values were adjusted using the Benjamini-Hochberg procedure to control the false discovery rate (FDR). Associations with FDR-adjusted p values (q values) below 0.05 were considered statistically significant. The receiver operating characteristic (ROC) curve and area under the ROC curve (AUC) for PGS to detect phenotypes were estimated using the R package “pROC” (v.1.18.0) without adjustment. The confidence interval (CI) for AUC was estimated using the DeLong method.
For birth cohort data, the odds ratio (OR) was estimated for each PGS quintile using logistic regression with covariates, such as sex and the top 10 genotype PCs. A trend analysis was conducted for the standardized PGS using the same regression approach. The Venn diagram was drawn using the R package “gplots” (v.3.1.3.1). Incidence rates per 1,000 person months were calculated for the lowest-to-middle PGS groups (Q1–Q3), Q4, and Q5. Incidence rate ratios (IRRs) with 95% CIs were estimated using the metainc function in the R package “meta” (v.6.2-1) based on the summed number of events and person months in the exposure (Q4 or Q5) and the control (Q1–Q3) groups.
Mendelian randomization (MR) analysis was performed using the ivreg function in the “AER” package (v.1.2.14) on R (v.4.2.1), with adjustments for age, sex, and PC1–10. Instrumental variables (IVs) for serum IgE were selected from a previous GWAS20 using LD clumping in PLINK2 (v.2.0.0-a.6.9) with the following options: “--clump-p1 5e-8 --clump-p2 5e-8 --clump-r2 0.001 --clump-kb 10000.” All IVs were confirmed to be significantly associated (F-statistics > 10) with log-transformed and standardized total IgE levels in the training dataset (TMM9K) using a linear regression model adjusted for sex, age, and PC1–10.
Causal mediation analysis was performed using the “mediation” package (v.4.5.0) in R (v.4.2.1). The mediator model was constructed using a binomial generalized linear model (GLM) with a probit link function, adjusting for PGS (exposure), sex, and the top PC1–10. The outcome model had the same structure as the mediator model but included the mediator as an additional covariate. The 95% CIs for the effect sizes (β) were estimated using a quasi-Bayesian approximation via Monte Carlo simulation with 2,000 iterations.
HLA imputation and association study
Genotyped data from JPAv2 were used for human leukocyte antigen (HLA) imputation following the procedure described in a previous study.31 Briefly, quality control was conducted using Plink2,excluding participants and genotypes with call rates of <0.99. Genotypes with a Hardy-Weinberg equilibrium exact test p < 1 × 10−10 and MAF < 20 were also omitted. The genotyped data were pre-phased using Eagle (v.2.4.1). The 54KJPN-HLA reference panel constructed by TMM,38 formatted using the NomenCleaner and MakeReference programs in SNP2HLA from the HLA-TAPAS pipeline,39,40 was used in this study. After converting the file format using bcftools (v.1.18)41 and savvy (v.2.0.1), the reference panel was used for the imputation of the pre-phased genotype data using Minimac4 (v.4.1.6).42 The HDS value, which is an estimated phased haploid alternate allele dosage, of the imputed alleles was used for downstream association.
For the association study, participants with call rates < 0.99 were excluded. Additionally, genotypes with a call rate < 0.99, Hardy-Weinberg equilibrium exact test p < 1 × 10−5, and MAF < 0.01 were excluded. Participants with close kinship within the second degree were excluded using the KING method,43 and principal-component analysis (PCA) was conducted using LD-pruned variants in Plink2 using the “--indep-pairwise 1500 150 0.03” option.
Association tests were performed using BOLT-LMM (v.2.3.6)44 for quantitative traits (such as common logarithmic total IgE) or SAIGE (v.0.44)45 for binary traits (other phenotypes). Subjects estimated to be monozygotic (MZ)-twin or the same individual were excluded from the analysis. Imputed HLA alleles with an MAF of ≥0.01 and INFO >0.3 were analyzed, with age, sex, and 10 genotype PCs as covariates. To represent the association study, the direction of the effect was aligned with that of the minor allele as the effect allele. To account for multiple testing in the HLA association test, p values below the Bonferroni-corrected threshold (p = 0.05 divided by the number of alleles tested) were considered statistically significant.
Results
Model selection and validation
The study design is shown in Figure S1. The characteristics of the dataset used in this study are presented in Table 1. To evaluate the genetic predisposition for IgE production, we constructed a suitable polygenic model for the present study. A total of 37 models were generated for PGS using three different methods (Figure 1A). The PGS estimated from the model generated using PRS-CS with ϕ = 1 × 10−6 correlated best with the logarithmic total IgE levels after adjusting for age and sex (ρ = 0.200, p < 2.2 × 10−16). Therefore, we selected this model for the subsequent analysis.
Table 1.
Characteristics of the study population at baseline
|
Role |
Model selection |
Validation |
Application to birth cohort |
||
|---|---|---|---|---|---|
| TMM9K | TMM67K | TMM18K | BirThree 1 | BirThree 2 | |
| n | 7,482 | 47,766 | 11,698 | 9,425 | 7,729 |
| Female, % | 66.5 | 60.3 | 75.6 | 48.5 | 48.5 |
| Age, year | 56.4 ± 13.2 | 60.3 ± 11.4 | 58.0 ± 12.9 | 0 | 0 |
| IgE, IU/mL | 190.6 ± 568.0 | 189.9 ± 578.4 | 169.0 ± 514.6 | N/D | N/D |
| log10IgEb | 1.77 ± 0.67 | 1.77 ± 0.65 | 1.72 ± 0.64 | N/D | N/D |
| Array | JPAv2 | JPAv2 | JPAv2 | JPAv2 | JPA-NEO |
| Residencya | Iwate | Iwate and Miyagi | Miyagi | Miyagi | Miyagi |
| Recruited | assessment center | health checkup | assessment center | assessment center | assessment center |
N/D, no data available; IgE, immunoglobulin E; JPA-NEO, Japonica Array NEO; JPAv2, Japonica Array v.2; PGS, polygenic score; TMM, Tohoku Medical Megabank.
Prefecture in Japan.
Common logarithmic IgE value.
Figure 1.
Optimized polygenic model for IgE correlates with total IgE levels
(A) The model with the highest Spearman’s ρ coefficient with logarithmic total IgE levels among the 37 candidate models identified as the best model. The horizontal dotted line in the PRS-CS (right) image indicates the ρ value for the model generated using the “auto” mode. The model generated by PRS-CS at ϕ = 1 × 10−6 showed the highest ρ.
(B) Bimodal distribution of standardized PGS calculated using the best model in the training dataset (TMM9K) and two validation datasets (TMM18K and TMM67K). The dotted curve indicates a normal distribution.
(C) Correlation of PGS in the validation datasets. PGS correlated significantly (p < 2.2 × 10−16) with logarithmic total IgE levels adjusted for age, sex, and the top ten genotype principal components (PCs) and then was standardized. Dotted lines indicate the diagonal. The color indicates the density of the subject.
IgE, immunoglobulin E; PGS, polygenic score.
The PGS distribution was bimodal (Figure 1B). In the TMM18K and TMM67K validation datasets, the PGS showed a significant correlation with log-transformed total IgE levels (ρ = 0.189 and 0.174, respectively; p < 2.2 × 10−16 in Spearman’s rank correlation test) after adjusting for age, sex, and the top 10 genotype PCs, followed by standardization (Figure 1C). In the best model, the proportions of variants within the major histocompatibility complex (MHC) region were 1.35% and 1.01% in the top and bottom percentile groups of effect weights, respectively (Figure S2A). Stratification of the TMM9K dataset by allele type with the top three effect weights (both positive and negative) clearly shifted the PGS distribution (Figures S2B and S2C).
PheWAS
A PheWAS of 63 phenotypes (Table S1) and subsequent meta-analysis of the results from two validation datasets (TMM67K and TMM18K) identified eight associations for PGS (Benjamini-Hochberg q < 0.05) (Figure 2A; Table S2). Among these, three were allergic disorders (cedar pollinosis, hives, and FAs), and two were autoimmune diseases (Graves’ disease [MIM: 275000] and Hashimoto’s disease [MIM: 140300]). A one standard deviation (SD) increase in PGS increased the risk of allergic disorders, such as cedar pollinosis (OR per SD: 1.10 [95% CI: 1.09–1.12]), hives (1.04 [1.02–1.07]), and FAs (1.10 [1.04–1.15]). Conversely, Graves’ and Hashimoto’s diseases exhibited protective effects (0.84 [0.78–0.90] and 0.90 [0.85–0.96], respectively).
Figure 2.
Phenome-wide association study of PGS and disease risk
(A) The association between binary phenotypes and standardized PGS analyzed using a logistic regression model adjusted for age, sex, and the top 10 genotype PCs in the genotype data. Phenotypes are colored according to the ICD-11 category. The q value indicates the adjusted p value obtained using the Benjamini-Hochberg method. The red horizontal line indicates q = 0.05. An increasing or decreasing odds ratio is indicated using an upward or downward arrowhead, respectively.
(B) Comparison of the risk analyzed using the same method based on the standardized common logarithmic total IgE instead of the PGS.
PC, principal component; IBD, inflammatory bowel disease; IgE, immunoglobulin E; PGS, polygenic score.
Subsequently, an analysis was conducted using total IgE levels instead of PGS for comparison (Figure 2B; Table S3). The three allergic disorders were also significantly correlated with the total IgE levels: cedar pollinosis (OR: 2.71 [95% CI: 2.65–2.77]), hives (1.35 [1.32–1.39]), and FAs (1.59 [1.51–1.68]). However, other conditions like Graves’ disease (1.07 [1.00–1.14]), Hashimoto’s disease (0.95 [0.90–1.01]), dental cavities (1.00 [0.98–1.01]), colon cancer (MIM: 114500) (1.02 [0.95–1.10]), and inflammatory bowel disease (IBD [MIM: 266600]) (0.98 [0.86–1.12]) did not show significant associations.
Furthermore, MR analyses were conducted using IVs significantly associated with total IgE levels (F-statistics > 10; Table S4). The pooled results indicated significant causal effects (Benjamini-Hochberg q < 0.05) of IgE levels on FAs, AR, cedar pollinosis, AS, and hives in the positive direction and on Graves’ disease and Hashimoto’s disease in the negative direction (Figure S3; Tables S5 and S6). In contrast, no causal association between IgE levels and colon cancer, dental cavities, or IBD was identified, suggesting different roles of IgE in these phenotypes.
For these diseases, PGS showed modest AUC values in both validation datasets (Figures S4 and S5), such as FAs (AUC in TMM18K and TMM67K: 0.549 [95% CI: 0.523–0.576] and 0.517 [0.500–0.535], respectively), cedar pollinosis (0.528 [0.517–0.539] and 0.527 [0.522–0.532]), and hives (0.518 [0.504–0.532] and 0.515 [0.506–0.523]).
HLA alleles influence PGS and allergic disease risk
To assess the contribution of HLA alleles to the polygenic model, we estimated their effects on the PGS and related phenotypes. HLA alleles were imputed with high quality (mean INFO value of approximately 1.0) for an MAF > 0.01 (Figure S6).
We then performed association analyses between imputed HLA alleles and various phenotypes. To validate the reliability of our association results prior to detailed interpretation, we compared them with previously reported findings identified through literature searches (Table S7). The direction of association identified in our study was consistent with that reported in previous studies for diseases such as Graves’ disease, Hashimoto’s disease, and IBD, supporting the robustness of our HLA association analyses.
The effects (β) of HLA alleles on total IgE levels, PGS, and disease risk were estimated through meta-analysis of the validation datasets (Figure 3; Tables S8–S10). A significant correlation was observed between the effects of PGS and total IgE levels (Figure 3A; Spearman’s ρ coefficient for all HLA allele types, ρn = 0.824). Most HLA alleles were significantly associated with IgE levels and PGS. Additionally, the effects of HLA alleles on PGS significantly correlated with their effects on phenotypes (Figure 3B), although a few HLA alleles were directly associated with specific phenotypes. These correlations were more pronounced when the alleles were significantly associated with both total IgE levels and PGS (ρs).
Figure 3.
Influence of HLA alleles on PGS and total IgE levels
(A) Effect sizes (β) of HLA alleles for common logarithmic total IgE or standardized PGS estimated using a linear mixed model with BOLT-LMM, adjusted for age, sex, and the top 10 genotype PCs. Dark blue and larger points represent effect sizes significantly associated with PGS and IgE levels, respectively, under Bonferroni correction. The shapes of the points indicate HLA classes. ρn and ρs indicate Spearman’s ρ coefficients across all HLA alleles in the analysis (89 alleles) and those significantly associated with PGS and total IgE levels (48 alleles), respectively.
(B) Effect sizes for the phenotypes estimated using SAIGE under the same conditions described above. The blue and gray areas indicate the approximate line obtained using the least-squares method and its 95% CI, respectively. ∗p < 0.05, Spearman’s correlation test.
PC, principal component; HLA, human leukocyte antigen; IBD, inflammatory bowel disease; IgE, immunoglobulin E; MHC, major histocompatibility complex; PGS, polygenic score; CI, confidence interval.
Next, we simultaneously compared the effects of HLA alleles on the various phenotypes (Figure S7). Heatmap analysis revealed that phenotypes with similar correlations (positive Spearman’s ρ) with PGS, such as FAs, hives, and cedar pollinosis, shared common profiles. Clustering analysis showed that HLA alleles with higher effect sizes for these allergic diseases, total IgE levels, dental cavities, and IBD included A∗24:02, B∗52:01, C∗12:02, DRB1∗15:01 or DRB1∗15:02, DQA1∗01:02 or DQA1∗01:03, DQB1∗06:01 or DQB1∗06:02, DPB1∗09:01, and DPA1∗02:01. In contrast, the relevance of these haplotypes was reduced in conditions such as colon cancer, Graves’ disease, and Hashimoto’s disease.
PGS predicts allergy risk in birth cohort study
The model developed for allergy risk prediction was applied to the TMM BirThree birth cohort (Figure 4A; Table S11). This cohort comprised two datasets, designated BirThree 1 and BirThree 2, which differed in genotyping arrays (JPAv2 and JPA-NEO, respectively).
Figure 4.
Allergy risk stratified by PGS quintile
(A) Standardized PGS distribution in the TMM BirThree cohort is represented by the timing of each survey after birth. The dotted vertical line and curve indicate the highest PGS quintile (Q5) and normal distribution, respectively.
(B) Allergy prevalence by PGS quintile in the 24 month (2 year) survey (left and middle). The OR was estimated using a logistic model with sex and the top 10 genotype PCs as covariates, with results pooled by meta-analysis using a fixed-effects model (right). Bars indicate 95% CIs.
(C) Incidence rate per 1,000 person months for the lowest-middle PGS groups (Q1–Q3), Q4, and Q5 (left and middle). Error bars indicate 95% CIs. IRRs were calculated using the Q1–Q3 group as the baseline (right), with 95% CIs indicated in square brackets.
(D) Prevalence of food allergies. The color codes show the PGS quintile as shown in (B). † indicates significance in trend analysis: †p < 0.05, ††p < 0.005, and †††p < 0.0005. ∗ indicates significance in meta-analysis: ∗p < 0.05, ∗∗p < 0.005, and ∗∗∗p < 0.0005.
CI, confidence interval; IRR, incidence rate ratio; OR, odds ratio; PC, principal component; PGS, polygenic score.
In both datasets, the prevalence of FAs in the 24 month survey exhibited a significant positive trend with increasing PGS quintiles (Figure 4B). Individuals in the highest PGS quintile (Q5) demonstrated a significantly elevated risk of FAs (OR: 1.95 [95% CI: 1.57–2.43]) and AD (1.33 [1.09–1.62]) in the meta-analysis of the two datasets (Figure 4B; Table S12). The incidence of affected individuals increased in subsequent surveys, with AD-affected individuals rising in the 6 month survey and FA-affected individuals increasing around the 12 month mark (Figure S8).
The incidence rates of FAs (per 1,000 person months) were 4.45 (95% CI: 3.74–5.24) in BirThree 1 and 4.26 (3.50–5.14) in BirThree 2 (Figure 4C; Table S13). The highest PGS group showed significant IRRs for FAs and AD: 1.51-fold (95% CI: 1.30–1.76) and 1.30-fold (1.12–1.51), respectively, compared with the lowest-to-intermediate (Q1–Q3) groups. Additionally, FA prevalence according to allergen type showed a significant trend in the PGS quintile (Figure 4D; Table S14). In the highest PGS group, the prevalence rates for egg white allergy were 7.39% (95% CI: 5.84–8.94) in BirThree 1 and 6.24% (95% CI: 4.66–7.83) in BirThree 2.
Furthermore, individuals in the highest (Q5) PGS group showed an increased risk of dual phenotypes during the follow-up period compared with those in the intermediate (Q3) group (Figure 5A). The highest PGS group exhibited significantly elevated risks for FAs and AS (OR in meta-analysis: 3.00 [95% CI: 1.30–6.93]), AD and FAs (1.89 [1.33–2.69]), and AS and AR/allergic conjunctivitis/pollinosis (2.40 [1.07–5.37]) compared to the lowest PGS group (Q1) (Figure 5B; Table S15). Compared with the lowest-to-intermediate PGS groups, the highest PGS group showed significant IRRs for FAs and AS, AD and FAs, and AD and AS (Figure 5C; Table S16).
Figure 5.
Risk of dual phenotypes in PGS quintiles
(A) Venn diagram of affected individuals during the follow-up period. The increased number of affected individuals in Q5 compared with Q3 is indicated in bold.
(B) OR estimated for individuals who developed both phenotypes during the OR follow-up period and were treated as affected individuals .
(C) Incidence and IRRs of both phenotypes. The incidence rates and IRRs of individuals with both phenotypes were calculated. Affected individuals were identified when the second phenotype was reported, and all other individuals were included in the control group. The estimation method and representation are the same as those shown in Figure 4. ∗p < 0.05, ∗∗p < 0.005, and ∗∗∗p < 0.0005.
AD, atopic dermatitis; AR, atopic rhinitis; AS, asthma; CI, confidence interval; FA, food allergy; IRR, incidence rate ratio; OR, odds ratio; PGS, polygenic score.
The PGS effectively predicted the risk of FAs within a 2 year follow-up period, with AUC values of 0.559 (95% CI: 0.534–0.584) for BirThree 1 and 0.546 (95% CI: 0.518–0.574) for BirThree 2 (0.518–0.574) (Figure S9). Similarly, PGS predicted the risk of dual allergies (AD and FAs), with AUC values of 0.573 (95% CI: 0.528–0.618) for BirThree 1 and 0.553 (0.504–0.601) for BirThree 2 (Figure S10).
Interactions among the risks of atopic diseases were assessed using causal mediation analysis (Figure S11; Table S17). In models with significant total effects (q < 0.05), FAs demonstrated a significant mediation effect in the PGS-to-AD pathway, suggesting a potential causal interplay between these two allergic phenotypes.
To further characterize the PGS developed in this study, we compared it to publicly available PGS models (Figures S12–15; Tables S18–22). The resulting correlation matrix and phenotype association patterns indicated that PGS in this study showed a relatively low correlation with those in other studies and lacked distinctive associations with most individual phenotypes, except for FAs in the birth cohort and IgE levels.
Discussion
The present study demonstrated that PGS correlates significantly not only with total IgE levels but also with various immune-related phenotypes, including allergies. An analysis using birth cohort data indicated that PGS was associated with the risk of FAs and AD. Furthermore, individuals in the highest PGS quintile exhibited an elevated risk for both phenotypes. These findings suggest that the PGS for IgE levels may serve as a promising early predictor of atopic diseases.
Currently, few risk factors that predict allergy risk earlier than eczema have been identified.46,47,48,49 A study of IgE sensitization at 4 years and comorbidity incidence at age 8 years found a 38% attributable risk, indicating that IgE levels alone may not fully explain the development of eczema, AR, and AS despite their shared mechanisms. The atopic march might involve a complex interplay of environmental and genetic factors rather than a straightforward increase in IgE levels.50 This complexity may account for the moderate AUC of our PGS for atopic phenotypes despite its significant correlation with IgE levels.
Previous studies have explored the associations of genetic risk scores for pediatric allergies, such as those for AD15,51 and AS14; however, longitudinal studies on non-European populations are limited. To our knowledge, this study represents the largest evaluation of PGS for allergies in a birth cohort. These disease-specific PGSs are often associated with the risk of other atopic diseases; however, their cross-disease applicability and potential biases introduced by shared environmental factors remain unexplored. In contrast, our PGS, which was specifically tuned to correlate with IgE levels, demonstrated unique phenotype associations differing from those observed with previously reported PGSs. These findings support that the model reported in this study offers a statistically robust "IgE-based" estimate of allergy risk without requiring further phenotype-specific optimization.
A population-based study conducted in the United States from 2015 to 2016 found that 7.6% of 38,408 children had FAs (95% CI: 7.1–8.1) based on parent-proxy responses.3 Among them, 10.0% (95% CI: 7.2–13.7) of children aged 2 years were reported to have at least one type of FA. Reliable epidemiological data have been limited in other regions, including Asia.13 In the present study, the FA prevalence in the TMM BirThree cohort in the 2-year-old survey was approximately 5.5% (see details in Table S8), which is considerably lower than that in the United States study.3 Differences in common allergens between the studies—such as milk (4.3% [95% CI: 2.2–8.3]), peanut (2.4% [95% CI: 1.6–3.6]), and egg (1.4% [95% CI: 0.8–2.4]) in the United States3 compared to lower rates of milk (∼1.5%), peanut (∼0.5%), egg white (∼5.3%), and egg yolk (∼3.3%) allergies in the BirThree 1 and 2 datasets—suggest regional variations. Our study suggests an association between the risk of egg white allergy and PGS, highlighting the need for comprehensive surveys that include genetic risks and recent food environments around children to better address allergy prevalence.
Current guidelines recommend that women should consume a healthy diet without eliminating or increasing the consumption of potentially allergenic foods during pregnancy or breastfeeding to prevent the development of FAs.13,52 Additionally, the early introduction of allergens, such as peanuts53 and hen eggs,54 reduced the risk rather than avoidance.55,56 The present PGS could help identify children at a higher genetic risk for FAs before severe symptoms develop, potentially guiding early intervention strategies.
Our analysis also evaluated the contribution of HLA alleles to PGS. We observed significant associations between the HLA alleles and PGS, total IgE levels, and phenotypes. This indicates that our model incorporates not only conventional genetic variants but also HLA allele effects. Moreover, total IgE levels and allergic diseases showed similar allele profiles. Among these, A∗24:02-B∗52:01-C∗12:02-DRB1∗15:02-(DRB5∗0102, not determined in this study)-DQA1∗01:03-DQB1∗06:01-DPB1∗09:01 and DRB1∗15:01-DQB1∗06:02 have been identified as haplotypes in a previous study57 and could serve as targets for identifying causal genes.
Recent multiethnic studies have reported associations between HLA alleles and total IgE levels, such as A∗02:01 (β: −0.06 [95% CI: −0.08 to −0.03] and −0.05 [−0.08 to −0.02] in discovery and replication analyses, respectively) and DQA1∗03:01 (−0.07 [−0.10 to −0.03] and −0.08 [−0.12 to −0.05]).58 These alleles were replicated in the present analysis, with β: −0.029 (95% CI: −0.034 to −0.024) and −0.032 (−0.035 to −0.029) for the meta-analysis of A∗02:01 and DQA1∗03:01, respectively. Considering the antigen specificity of HLA alleles and their requirement for antigen peptides to stimulate the immune response, these alleles may influence total IgE levels transethnically and transculturally in distinct antigen pools in different environments.
Future research should explore the association between PGS and nonallergic phenotypes. For example, dental anomalies and severe susceptibility to caries have been reported in patients with hyper-IgE syndrome (MIM: 147060).59 IBD, colon cancer, Graves’ disease, and Hashimoto’s disease were not significantly associated with total IgE levels but were associated with PGS, coinciding with their mechanisms that do not directly lead to a Th2 response, accompanied by IgE production.
This study has several limitations. The TMM CommCohort dataset includes related individuals,60 which may have influenced the results to a certain extent. The best model was selected and validated using Japanese genotype data imputed by the Japanese genotype panel. Therefore, additional validation is recommended before using other populations in future studies. Additionally, our phenotyping was based on self- or parent-proxy-reported surveys, which could lead to an overestimation of allergy prevalence and make it impossible to distinguish non-IgE-mediated FAs.
In conclusion, our study evaluated an approach using PGS for total IgE levels to estimate future atopic risks and elucidate the genetic contributions to these phenotypes. These findings offer important insights into the genetic factors that influence atopic diseases and provide a potential tool for early prediction and intervention.
Despite these promising results, further validation in other populations is required to ensure the robustness of the model. This approach could help inform early interventions for individuals at a high genetic risk of developing allergies, potentially guiding clinical decisions and preventive measures. Future research should focus on refining the PGS model for broader applications, including considering diverse populations and non-IgE-mediated allergies.
Acknowledgments
This study was supported by the Tohoku Medical Megabank (TMM) Project (Special Account for the Reconstruction of the Great East Japan Earthquake) of the Ministry of Education, Culture, Sports, Science and Technology (MEXT) and the Japan Agency for Medical Research and Development (AMED) under grant no. JP23tm0124006. The supercomputer system in the TMM Project under grant number JP21tm0424601 was used for the data analysis. Y.S. and A.S. were supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI grant JP23K09747.
Author contributions
Conceptualization, Y.S. and A.S.; design of the work, Y.S. and T.H.; acquisition of data, Y.O.-Y., S. Komaki, S.M., H.O., N.N., A.N., M.O., M.I., T.O., K.T., A.H., and S. Kuriyama; analysis of data, Y.S.; interpretation of data, Y.S., T.H., Y.O.-Y., S. Komaki, S.M., H.O., and A.S.; writing – original draft, Y.S.; writing – review & editing, Y.S., T.H., Y.O.-Y., S. Komaki, S.M., H.O., K.T., A.H., N.N., A.N., M.O., M.I., T.O., S. Kuriyama, M.S., and A.S.
Declaration of interests
T.H. is a board member of Genome Analytics Japan, Inc.
Published: July 21, 2025
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.ajhg.2025.06.015.
Contributor Information
Yoichi Sutoh, Email: ysutoh@iwate-med.ac.jp.
Atsushi Shimizu, Email: ashimizu@iwate-med.ac.jp.
Web resources
OMIM, https://www.omim.org
Supplemental information
References
- 1.de Silva D., Halken S., Singh C., Muraro A., Angier E., Arasi S., Arshad H., Beyer K., Boyle R., du Toit G., et al. Preventing food allergy in infancy and childhood: Systematic review of randomised controlled trials. Pediatr. Allergy Immunol. 2020;31:813–826. doi: 10.1111/pai.13273. [DOI] [PubMed] [Google Scholar]
- 2.Protudjer J.L.P., Jansson S.-A., Heibert Arnlind M., Bengtsson U., Kallström-Bengtsson I., Marklund B., Middelveld R., Rentzos G., Sundqvist A.-C., Åkerström J., et al. Household costs associated with objectively diagnosed allergy to staple foods in children and adolescents. J. Allergy Clin. Immunol. Pract. 2015;3:68–75. doi: 10.1016/j.jaip.2014.09.021. [DOI] [PubMed] [Google Scholar]
- 3.Gupta R.S., Warren C.M., Smith B.M., Blumenstock J.A., Jiang J., Davis M.M., Nadeau K.C. The Public Health Impact of Parent-Reported Childhood Food Allergies in the United States. Pediatrics. 2018;142 doi: 10.1542/peds.2018-1235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Osborne N.J., Koplin J.J., Martin P.E., Gurrin L.C., Lowe A.J., Matheson M.C., Ponsonby A.-L., Wake M., Tang M.L.K., Dharmage S.C., et al. Prevalence of challenge-proven IgE-mediated food allergy using population-based sampling and predetermined challenge criteria in infants. J. Allergy Clin. Immunol. 2011;127 doi: 10.1016/j.jaci.2011.01.039. [DOI] [PubMed] [Google Scholar]
- 5.Leung A.S.Y., Wong G.W.K., Tang M.L.K. Food allergy in the developing world. J. Allergy Clin. Immunol. 2018;141:76–78.e1. doi: 10.1016/j.jaci.2017.11.008. [DOI] [PubMed] [Google Scholar]
- 6.Ricci G., Patrizi A., Baldi E., Menna G., Tabanelli M., Masi M. Long-term follow-up of atopic dermatitis: retrospective analysis of related risk factors and association with concomitant allergic diseases. J. Am. Acad. Dermatol. 2006;55:765–771. doi: 10.1016/j.jaad.2006.04.064. [DOI] [PubMed] [Google Scholar]
- 7.Hill D.A., Spergel J.M. The atopic march: Critical evidence and clinical relevance. Ann. Allergy Asthma Immunol. 2018;120:131–137. doi: 10.1016/j.anai.2017.10.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Schuler C.F., Billi A.C., Maverakis E., Tsoi L.C., Gudjonsson J.E. Novel insights into atopic dermatitis. J. Allergy Clin. Immunol. 2023;151:1145–1154. doi: 10.1016/j.jaci.2022.10.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Geba G.P., Li D., Xu M., Mohammadi K., Attre R., Ardeleanu M., Musser B. Attenuating the atopic march: Meta-analysis of the dupilumab atopic dermatitis database for incident allergic events. J. Allergy Clin. Immunol. 2023;151:756–766. doi: 10.1016/j.jaci.2022.08.026. [DOI] [PubMed] [Google Scholar]
- 10.Kulig M., Bergmann R., Klettke U., Wahn V., Tacke U., Wahn U. Natural course of sensitization to food and inhalant allergens during the first 6 years of life. J. Allergy Clin. Immunol. 1999;103:1173–1179. doi: 10.1016/s0091-6749(99)70195-8. [DOI] [PubMed] [Google Scholar]
- 11.Zheng T., Yu J., Oh M.H., Zhu Z. The atopic march: progression from atopic dermatitis to allergic rhinitis and asthma. Allergy Asthma Immunol. Res. 2011;3:67–73. doi: 10.4168/aair.2011.3.2.67. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ravn N.H., Halling A.-S., Berkowitz A.G., Rinnov M.R., Silverberg J.I., Egeberg A., Thyssen J.P. How does parental history of atopic disease predict the risk of atopic dermatitis in a child? A systematic review and meta-analysis. J. Allergy Clin. Immunol. 2020;145:1182–1193. doi: 10.1016/j.jaci.2019.12.899. [DOI] [PubMed] [Google Scholar]
- 13.Sampath V., Abrams E.M., Adlou B., Akdis C., Akdis M., Brough H.A., Chan S., Chatchatee P., Chinthrajah R.S., Cocco R.R., et al. Food allergy across the globe. J. Allergy Clin. Immunol. 2021;148:1347–1364. doi: 10.1016/j.jaci.2021.10.018. [DOI] [PubMed] [Google Scholar]
- 14.Namjou B., Lape M., Malolepsza E., DeVore S.B., Weirauch M.T., Dikilitas O., Jarvik G.P., Kiryluk K., Kullo I.J., Liu C., et al. Multiancestral polygenic risk score for pediatric asthma. J. Allergy Clin. Immunol. 2022;150:1086–1096. doi: 10.1016/j.jaci.2022.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Simard M., Madore A.-M., Girard S., Waserman S., Duan Q., Subbarao P., Sears M.R., Moraes T.J., Becker A.B., Turvey S.E., et al. Polygenic risk score for atopic dermatitis in the Canadian population. J. Allergy Clin. Immunol. 2021;147:406–409. doi: 10.1016/j.jaci.2020.04.057. [DOI] [PubMed] [Google Scholar]
- 16.Meyers D.A., Beaty T.H., Freidhoff L.R., Marsh D.G. Inheritance of total serum IgE (basal levels) in man. Am. J. Hum. Genet. 1987;41:51–62. [PMC free article] [PubMed] [Google Scholar]
- 17.Jacobsen H.P., Herskind A.M., Nielsen B.W., Husby S. IgE in unselected like-sexed monozygotic and dizygotic twins at birth and at 6 to 9 years of age: high but dissimilar genetic influence on IgE levels. J. Allergy Clin. Immunol. 2001;107:659–663. doi: 10.1067/mai.2001.113565. [DOI] [PubMed] [Google Scholar]
- 18.Weidinger S., Gieger C., Rodriguez E., Baurecht H., Mempel M., Klopp N., Gohlke H., Wagenpfeil S., Ollert M., Ring J., et al. Genome-wide scan on total serum IgE levels identifies FCER1A as novel susceptibility locus. PLoS Genet. 2008;4 doi: 10.1371/journal.pgen.1000166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Granada M., Wilk J.B., Tuzova M., Strachan D.P., Weidinger S., Albrecht E., Gieger C., Heinrich J., Himes B.E., Hunninghake G.M., et al. A genome-wide association study of plasma total IgE concentrations in the Framingham Heart Study. J. Allergy Clin. Immunol. 2012;129:840–845.e21. doi: 10.1016/j.jaci.2011.09.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Shido K., Kojima K., Shirota M., Yamasaki K., Motoike I.N., Hozawa A., Ogishima S., Minegishi N., Tanno K., Katsuoka F., et al. GWAS Identified IL4R and the Major Histocompatibility Complex Region as the Associated Loci of Total Serum IgE Levels in 9,260 Japanese Individuals. J. Invest. Dermatol. 2021;141:2749–2752. doi: 10.1016/j.jid.2021.02.762. [DOI] [PubMed] [Google Scholar]
- 21.Georas S.N., Donohue P., Connolly M., Wechsler M.E. JAK inhibitors for asthma. J. Allergy Clin. Immunol. 2021;148:953–963. doi: 10.1016/j.jaci.2021.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Takeuchi I., Yanagi K., Takada S., Uchiyama T., Igarashi A., Motomura K., Hayashi Y., Nagano N., Matsuoka R., Sugiyama H., et al. STAT6 gain-of-function variant exacerbates multiple allergic symptoms. J. Allergy Clin. Immunol. 2023;151:1402–1409.e6. doi: 10.1016/j.jaci.2022.12.802. [DOI] [PubMed] [Google Scholar]
- 23.Suratannon N., Ittiwut C., Dik W.A., Ittiwut R., Meesilpavikkai K., Israsena N., Ingrungruanglert P., Dalm V.A.S.H., van Daele P.L.A., Sanpavat A., et al. A germline STAT6 gain-of-function variant is associated with early-onset allergies. J. Allergy Clin. Immunol. 2023;151:565–571.e9. doi: 10.1016/j.jaci.2022.09.028. [DOI] [PubMed] [Google Scholar]
- 24.Hozawa A., Tanno K., Nakaya N., Nakamura T., Tsuchiya N., Hirata T., Narita A., Kogure M., Nochioka K., Sasaki R., et al. Study Profile of the Tohoku Medical Megabank Community-Based Cohort Study. J. Epidemiol. 2021;31:65–76. doi: 10.2188/jea.JE20190271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sutoh Y., Komaki S., Yamaji T., Suzuki S., Katagiri R., Sawada N., Ono K., Ohmomo H., Hachiya T., Otsuka-Yamasaki Y., et al. Low MICA gene expression confers an increased risk of Graves’ disease: a Mendelian randomization study. Thyroid. 2022;32:188–195. doi: 10.1089/thy.2021.0417. [DOI] [PubMed] [Google Scholar]
- 26.Kuriyama S., Metoki H., Kikuya M., Obara T., Ishikuro M., Yamanaka C., Nagai M., Matsubara H., Kobayashi T., Sugawara J., et al. Cohort Profile: Tohoku Medical Megabank Project Birth and Three-Generation Cohort Study (TMM BirThree Cohort Study): rationale, progress and perspective. Int. J. Epidemiol. 2020;49:18–19m. doi: 10.1093/ije/dyz169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Minegishi N., Nishijima I., Nobukuni T., Kudo H., Ishida N., Terakawa T., Kumada K., Yamashita R., Katsuoka F., Ogishima S., et al. Biobank Establishment and Sample Management in the Tohoku Medical Megabank Project. Tohoku J. Exp. Med. 2019;248:45–55. doi: 10.1620/tjem.248.45. [DOI] [PubMed] [Google Scholar]
- 28.Yasuda J., Kinoshita K., Katsuoka F., Danjoh I., Sakurai-Yageta M., Motoike I.N., Kuroki Y., Saito S., Kojima K., Shirota M., et al. Genome analyses for the Tohoku Medical Megabank Project towards establishment of personalized healthcare. J. Biochem. 2019;165:139–158. doi: 10.1093/jb/mvy096. [DOI] [PubMed] [Google Scholar]
- 29.Sakurai-Yageta M., Kumada K., Gocho C., Makino S., Uruno A., Tadaka S., Motoike I.N., Kimura M., Ito S., Otsuki A., et al. Japonica Array NEO with increased genome-wide coverage and abundant disease risk SNPs. J. Biochem. 2021;170:399–410. doi: 10.1093/jb/mvab060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tadaka S., Saigusa D., Motoike I.N., Inoue J., Aoki Y., Shirota M., Koshiba S., Yamamoto M., Kinoshita K. jMorp: Japanese Multi Omics Reference Panel. Nucleic Acids Res. 2018;46:D551–D557. doi: 10.1093/nar/gkx978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sakaue S., Kanai M., Tanigawa Y., Karjalainen J., Kurki M., Koshiba S., Narita A., Konuma T., Yamamoto K., Akiyama M., et al. A cross-population atlas of genetic associations for 220 human phenotypes. Nat. Genet. 2021;53:1415–1424. doi: 10.1038/s41588-021-00931-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Sutoh Y., Hachiya T., Otsuka-Yamasaki Y., Komaki S., Minabe S., Ohmomo H., Sasaki M., Shimizu A. Healthy lifestyle practice correlates with decreased obesity prevalence in individuals with high polygenic risk: TMM CommCohort study. J. Hum. Genet. 2025;70:9–15. doi: 10.1038/s10038-024-01280-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Vilhjálmsson B.J., Yang J., Finucane H.K., Gusev A., Lindström S., Ripke S., Genovese G., Loh P.-R., Bhatia G., Do R., et al. Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores. Am. J. Hum. Genet. 2015;97:576–592. doi: 10.1016/j.ajhg.2015.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ge T., Chen C.-Y., Ni Y., Feng Y.-C.A., Smoller J.W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 2019;10:1776. doi: 10.1038/s41467-019-09718-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chang C.C., Chow C.C., Tellier L.C., Vattikuti S., Purcell S.M., Lee J.J. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lambert S.A., Gil L., Jupp S., Ritchie S.C., Xu Y., Buniello A., McMahon A., Abraham G., Chapman M., Parkinson H., et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat. Genet. 2021;53:420–425. doi: 10.1038/s41588-021-00783-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhao H., Sun Z., Wang J., Huang H., Kocher J.-P., Wang L. CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics. 2014;30:1006–1007. doi: 10.1093/bioinformatics/btt730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Mimori T., Yasuda J., Kuroki Y., Shibata T.F., Katsuoka F., Saito S., Nariai N., Ono A., Nakai-Inagaki N., Misawa K., et al. Construction of full-length Japanese reference panel of class I HLA genes with single-molecule, real-time sequencing. Pharmacogenomics J. 2019;19:136–146. doi: 10.1038/s41397-017-0010-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Luo Y., Kanai M., Choi W., Li X., Sakaue S., Yamamoto K., Ogawa K., Gutierrez-Arcelus M., Gregersen P.K., Stuart P.E., et al. A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response. Nat. Genet. 2021;53:1504–1516. doi: 10.1038/s41588-021-00935-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Jia X., Han B., Onengut-Gumuscu S., Chen W.-M., Concannon P.J., Rich S.S., Raychaudhuri S., de Bakker P.I.W. Imputing amino acid polymorphisms in human leukocyte antigens. PLoS One. 2013;8 doi: 10.1371/journal.pone.0064683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Danecek P., Bonfield J.K., Liddle J., Marshall J., Ohan V., Pollard M.O., Whitwham A., Keane T., McCarthy S.A., Davies R.M., Li H. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10:giab008. doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Howie B., Fuchsberger C., Stephens M., Marchini J., Abecasis G.R. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat. Genet. 2012;44:955–959. doi: 10.1038/ng.2354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Manichaikul A., Mychaleckyj J.C., Rich S.S., Daly K., Sale M., Chen W.-M. Robust relationship inference in genome-wide association studies. Bioinformatics. 2010;26:2867–2873. doi: 10.1093/bioinformatics/btq559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Loh P.-R., Tucker G., Bulik-Sullivan B.K., Vilhjálmsson B.J., Finucane H.K., Salem R.M., Chasman D.I., Ridker P.M., Neale B.M., Berger B., et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 2015;47:284–290. doi: 10.1038/ng.3190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zhou W., Nielsen J.B., Fritsche L.G., Dey R., Gabrielsen M.E., Wolford B.N., LeFaive J., VandeHaar P., Gagliano S.A., Gifford A., et al. Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies. Nat. Genet. 2018;50:1335–1341. doi: 10.1038/s41588-018-0184-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.van der Hulst A.E., Klip H., Brand P.L.P. Risk of developing asthma in young children with atopic eczema: a systematic review. J. Allergy Clin. Immunol. 2007;120:565–569. doi: 10.1016/j.jaci.2007.05.042. [DOI] [PubMed] [Google Scholar]
- 47.Dharmage S.C., Lowe A.J., Matheson M.C., Burgess J.A., Allen K.J., Abramson M.J. Atopic dermatitis and the atopic march revisited. Allergy. 2014;69:17–27. doi: 10.1111/all.12268. [DOI] [PubMed] [Google Scholar]
- 48.von Kobyletzki L.B., Bornehag C.-G., Hasselgren M., Larsson M., Lindström C.B., Svensson Å. Eczema in early childhood is strongly associated with the development of asthma and rhinitis in a prospective cohort. BMC Dermatol. 2012;12:11. doi: 10.1186/1471-5945-12-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hu C., Nijsten T., van Meel E.R., Erler N.S., Piketty C., de Jong N.W., Pasmans S.G.M.A., de Jongste J.C., Duijts L. Eczema phenotypes and risk of allergic and respiratory conditions in school age children. Clin. Transl. Allergy. 2020;10:7. doi: 10.1186/s13601-020-0310-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Holguin F. The atopic march: IgE is not the only road. Lancet Respir. Med. 2014;2:88–90. doi: 10.1016/S2213-2600(13)70243-1. [DOI] [PubMed] [Google Scholar]
- 51.Paternoster L., Savenije O.E.M., Heron J., Evans D.M., Vonk J.M., Brunekreef B., Wijga A.H., Henderson A.J., Koppelman G.H., Brown S.J. Identification of atopic dermatitis subgroups in children from 2 longitudinal birth cohorts. J. Allergy Clin. Immunol. 2018;141:964–971. doi: 10.1016/j.jaci.2017.09.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Greer F.R., Sicherer S.H., Burks A.W., COMMITTEE ON NUTRITION, and SECTION ON ALLERGY AND IMMUNOLOGY. SECTION ON ALLERGY AND IMMUNOLOGY The Effects of Early Nutritional Interventions on the Development of Atopic Disease in Infants and Children: The Role of Maternal Dietary Restriction, Breastfeeding, Hydrolyzed Formulas, and Timing of Introduction of Allergenic Complementary Foods. Pediatrics. 2019;143 doi: 10.1542/peds.2019-0281. [DOI] [PubMed] [Google Scholar]
- 53.Du Toit G., Roberts G., Sayre P.H., Bahnson H.T., Radulovic S., Santos A.F., Brough H.A., Phippard D., Basting M., Feeney M., et al. Randomized trial of peanut consumption in infants at risk for peanut allergy. N. Engl. J. Med. 2015;372:803–813. doi: 10.1056/NEJMoa1414850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Perkin M.R., Logan K., Tseng A., Raji B., Ayis S., Peacock J., Brough H., Marrs T., Radulovic S., Craven J., et al. Randomized Trial of Introduction of Allergenic Foods in Breast-Fed Infants. N. Engl. J. Med. 2016;374:1733–1743. doi: 10.1056/NEJMoa1514210. [DOI] [PubMed] [Google Scholar]
- 55.Fisher H.R., Du Toit G., Bahnson H.T., Lack G. The challenges of preventing food allergy: Lessons learned from LEAP and EAT. Ann. Allergy Asthma Immunol. 2018;121:313–319. doi: 10.1016/j.anai.2018.06.008. [DOI] [PubMed] [Google Scholar]
- 56.Fleischer D.M., Chan E.S., Venter C., Spergel J.M., Abrams E.M., Stukus D., Groetch M., Shaker M., Greenhawt M. A Consensus Approach to the Primary Prevention of Food Allergy Through Nutrition: Guidance from the American Academy of Allergy. J. Allergy Clin. Immunol. Pract. 2021;9:22–43.e4. doi: 10.1016/j.jaip.2020.11.002. [DOI] [PubMed] [Google Scholar]
- 57.Nakajima F., Nakamura J., Yokota T. Analysis of HLA haplotypes in Japanese, using high resolution allele typing. MHC. 2001;8:1–32. doi: 10.12667/mhc.8.1. [DOI] [Google Scholar]
- 58.Daya M., Cox C., Acevedo N., Boorgula M.P., Campbell M., Chavan S., Cho M.H., David G.L., Kachroo P., Lasky-Su J., et al. Multiethnic genome-wide and HLA association study of total serum IgE level. J. Allergy Clin. Immunol. 2021;148:1589–1595. doi: 10.1016/j.jaci.2021.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Esposito L., Poletti L., Maspero C., Porro A., Pietrogrande M.C., Pavesi P., Dellepiane R.M., Farronato G. Hyper-IgE syndrome: dental implications. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 2012;114:147–153. doi: 10.1016/j.oooo.2012.04.005. [DOI] [PubMed] [Google Scholar]
- 60.Sutoh Y., Hachiya T., Otsuka-Yamasaki Y., Tokutomi T., Yoshida A., Kotozaki Y., Komaki S., Minabe S., Ohmomo H., Tanno K., et al. Reference-Based Standardization Approach Stabilizing Small Batch Risk Prediction via Polygenic Score. Genet. Epidemiol. 2025;49 doi: 10.1002/gepi.70002. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





