Abstract
Hispanic/Latinos have been underrepresented in genome-wide association studies (GWAS) for anthropometric traits despite their notable anthropometric variability, ancestry proportions, and high burden of growth stunting and overweight/obesity. To address this knowledge gap, we analyzed densely imputed genetic data in a sample of Hispanic/Latino adults to identify and fine-map genetic variants associated with body mass index (BMI), height, and BMI-adjusted waist-to-hip ratio (WHRadjBMI). We conducted a GWAS of 18 studies/consortia as part of the Hispanic/Latino Anthropometry (HISLA) Consortium (stage 1, n = 59,771) and generalized our findings in 9 additional studies (stage 2, n = 10,538). We conducted a trans-ancestral GWAS with summary statistics from HISLA stage 1 and existing consortia of European and African ancestries. In our HISLA stage 1 + 2 analyses, we discovered one BMI locus, as well as two BMI signals and another height signal each within established anthropometric loci. In our trans-ancestral meta-analysis, we discovered three BMI loci, one height locus, and one WHRadjBMI locus. We also identified 3 secondary signals for BMI, 28 for height, and 2 for WHRadjBMI in established loci. We show that 336 known BMI, 1,177 known height, and 143 known WHRadjBMI (combined) SNPs demonstrated suggestive transferability (nominal significance and effect estimate directional consistency) in Hispanic/Latino adults. Of these, 36 BMI, 124 height, and 11 WHRadjBMI SNPs were significant after trait-specific Bonferroni correction. Trans-ancestral meta-analysis of the three ancestries showed a small-to-moderate impact of uncorrected population stratification on the resulting effect size estimates. Our findings demonstrate that future studies may also benefit from leveraging diverse ancestries and differences in linkage disequilibrium patterns to discover novel loci and additional signals with less residual population stratification.
Keywords: Hispanic/Latino, anthropometrics, obesity, diversity, trans-ancestral or trans-ethnic, fine-mapping, population stratification
Large-scale genetic analyses of diverse populations hold great potential for advancing the field. In our sample of Hispanic/Latino adults, and when combined with publicly available results from other ancestries, we identified, described, and characterized 42 novel genomic findings. We additionally illustrated that novel biologic insights can be garnered from studying ancestrally diverse populations. Overall, our findings indicate the added value of building large, more diverse genome-wide association studies in the future.
Introduction
A complex interplay between political, social, and economic factors has led to an increasing obesogenic global environment in which many low-to-middle income nations have experienced a rapid transition from under-nutrition and growth stunting to over-nutrition and obesity.1 In Latin America, by 2016, 35% of the total population was overweight (body mass index [BMI] 25 to <30 kg/m2) and another 23% was living with obesity (BMI ≥ 30 kg/m2).2 In Mexico, it is projected that by 2050 only 12% of men and 9% of women will have a healthy weight (BMI < 25 kg/m2).3 In South America in 2010–2011, the prevalence of obesity was 36%, but abdominal obesity (based on waist circumference) was even more common (53%).4
Ancestry may also play a role in anthropometric-related health disparities in Hispanic/Latino populations. Previous studies have described the historical contexts leading to admixture in Latin American populations5,6 as characterized by highly diverse (variable) ancestral proportions7, 8, 9 from any of the following regions: the Americas, Europe, Africa, and East Asia.10, 11, 12, 13, 14, 15 The proportion of Native American ancestry is associated with obesity-related traits, and even more strongly associated with height.16,17 Height is inversely associated with proportion of Native American ancestry, even after taking into account that over time populations globally have become taller due to mainly non-genetic nutritional factors.16 The ultimate drivers of this association remain unclear; it is possible that genetic factors and/or socio-economic factors strongly associated with Native American ancestry could be responsible. Recent studies are starting to provide relevant insights into this topic, including a recent genome-wide association study (GWAS) in Peru18 that identified a missense variant in the FBN1 gene (rs200342067) that has the largest effect size so far described for common height-associated variants in human populations. In the 1000 Genomes Project samples, rs200342067 is only present in two Latin American samples (MXL, 0.78%; and PEL, 4.12%), and yet the authors reported that this missense variant shows subtle evidence of positive selection in the Peruvian population.18
In the US, as in other high-income nations, both the population size and diversity in national origins (backgrounds) of Hispanic/Latinos have been increasing over the past several decades,19 with 24% of the US adult population identifying as Hispanic/Latino by 2065.19 US Hispanic/Latino adults and children/adolescents face a greater burden of obesity than their non-Hispanic white counterparts.20, 21, 22, 23
Thus, there is a need to study Hispanic/Latino populations to fully address these disparities.23,24 Specifically, we sought to understand the role that Native American or other under-studied components of admixture have on the genetic architecture of anthropometric traits in Hispanic/Latinos, and their relationship with gene expression. To date, no large-scale GWAS of anthropometric traits has been conducted among Hispanic/Latino populations; we therefore performed a large-scale genomic study of multiple anthropometric traits, including BMI, height, and waist-to-hip ratio adjusted for BMI (WHRadjBMI), in Hispanic/Latino populations to describe what may be novel loci, or new signals in established loci, for this population.
Materials and methods
Hispanic/Latino study samples
The Hispanic/Latino Anthropometry (HISLA) Consortium is comprised of 27 studies/consortia of adult participants. First, HISLA stage 1 includes 17 studies and one consortium (Consortium for the Analysis of the Diversity and Evolution of Latin America [CANDELA]17) collectively representing up to 59,771 adults, depending on the trait, from Brazil, Chile, Colombia, Mexico, Peru, or the US with self-reported heritage from across Spanish-speaking Latin America, or Native American heritage, primarily Pima and Zuni25 (Table S1). HISLA stage 2 includes 9 studies with up to 10,538 adults from across Spanish-speaking Latin America or with related heritage and living in the US (Table S1).
This study was approved by the institutional review boards of the University of North Carolina at Chapel Hill, and all contributing studies had received prior institutional review boards approval for each study’s activities.
Anthropometric traits
BMI is a commonly derived index of obesity risk and is calculated as the ratio of body weight to height squared (kg/m2). Adult height was measured or self-reported using either metric or US units and then converted to meters. Waist-to-hip ratio (WHR) is used to capture central fat deposition and is derived from the circumference of the waist at the umbilicus compared with the circumference of the hip at the maximum protrusion of the gluteal muscles.
Residuals were calculated by sex and/or case status, adjusting for age, age2, and study-specific covariates (e.g., center; principal components [PCs]). For WHR, we also adjusted for BMI when creating the residuals to isolate the central deposition of fat from overall body mass. Residuals were then used to create inverse normalizations of BMI and WHRadjBMI, and Z scores of height (=residual/standard deviation for all residuals). In family-based studies, the residuals were calculated in women and men together, adjusting for age, sex, and other study covariates including PCs. Descriptive statistics on the covariates and anthropometric measures are provided for each study’s analytic sample in Table S2. Only one family-based study in stage 1 and two non-family-based studies in stage 2 (Genetics of Latinos Diabetic Retinopathy, 0.3% <18 years; and Mapping the Genes for Hypertension, Insulin Resistance, and Salt Sensitivity Study, 3.9%) included a small subset of adolescents aged 15–17 years, each less than 5% of the total sample. All other study samples included individuals aged 18–98 years.
Childhood/adolescence study samples, anthropometric traits, and obesity
We assembled an independent sample of children/adolescents with anthropometrics from three studies from the US, Mexico, and Chile (Table S3). The distribution of covariates and anthropometrics of the samples of children/adolescents in each analysis are described in Table S4. First, childhood/adolescent obesity was defined as ≥95th BMI-for-age percentile (versus ≤50th BMI-for-age percentile), based on the Centers for Disease Control and Prevention growth curves,26 as done in previous analyses of childhood obesity.27 We used these two analyses to look up the BMI and height findings from our adult HISLA meta-analysis as well as our trans-ancestral analyses. This resulted in 1,814 children/adolescents aged 2–18 years in a case-control analysis of childhood obesity (Tables S3 and S4). Second, BMI and height-for-age Z scores were calculated in children/adolescents aged 5–18 years from the US and Chile (Table S4) based on the more international reference growth curves from the World Health Organization.28 In Viva la Familia, a family-based study,29 these residuals were calculated adjusting for sex in the combined sample. The resulting BMI and height-for-age Z scores were available for 1,914 and 1,945 children/adolescents, respectively.
SNP imputation and statistical analyses
We generated autosomal genome-wide imputed data based on 1000 Genomes phase 1 and 3 references, except for two studies that contributed Exomechip and MetaboChip (Illumina, San Diego, CA) genotypes and one study that blended genotypes from multiple platforms (Tables S5 and S6). Principal-component analyses (PCA) were conducted in each study (see select examples provided in Figures S1–S3) to capture the main components of genetic ancestry from the Americas, Europe, Africa, and East Asia. Studies with samples from related individuals accommodated this non-independence by projecting their PCA from the reference to the study sample, and by accounting for relatedness using either generalized estimating equations30 or mixed linear models.9,31 Assuming an additive genetic model, we tested for the association of over 20 million autosomal variants on our traits, accounting for all trait- or study-specific covariates (e.g., center, PCs).
Meta-analyses of HISLA stage 1 + 2
The studies of the HISLA Consortium were meta-analyzed in two stages: discovery (stage 1) and replication (stage 2). Stage 1 included a total sample of 59,771 individuals with data on BMI, 56,161 with height, and 42,455 with WHRadjBMI. All stage 1 studies/consortia provided full genome-wide analysis results. All SNPs that met our significance criteria were brought forward for replication in stage 2, which included 10,538 individuals with data on BMI, 8,110 with height, and 4,393 with WHRadjBMI. All reported association results passed our quality control criteria; i.e., variants with low quality (info score < 0.4 or R2 < 0.3), minor allele count (MAC) < 5, or sample size < 100 were removed. We meta-analyzed effects across all studies using a fixed-effect inverse variance weighted meta-analysis with genomic control in METAL.32 Given the unique patterns of admixture and ancestry represented by the Brazilian or Native American samples, we conducted sensitivity analyses in stage 1 studies (i.e., comparing the inclusion and exclusion of the Baependi Heart Study, the 1982 Pelotas Birth Cohort Study, and the Family Investigation of Nephropathy and Diabetes substudy of individuals of Pima and Zuni heritage) to assess the influence of these three studies on the meta-analysis results. CANDELA was retained in all analyses as <10% of the consortium’s samples came from Brazil, primarily originating from the South of Brazil and being characterized as having high European heritage and less Native American or African admixture.17 We provide the quantile-quantile plots for all analyses in Figure S4.
Regional plots of all GWAS significant HISLA stage 1 findings were plotted using LocusZoom. From stage 1, we selected lead variants for replication that met genome-wide significance (p < 5 × 10−8) that were independent of each other. In cases where stage 2 studies did not have the lead variant, we selected two proxies per lead variant with a linkage disequilibrium (LD) r2 ≥ 0.9 using 1000 Genomes AMR. Stage 2 studies provided a list of the requested lead variants and/or their proxies from stage 1 for replication. Stage 2 studies were meta-analyzed and subsequently combined with stage 1 using METAL.25 Effect heterogeneity was assessed through I2 across all 27 HISLA adult studies/consortia by entering each study separately into the meta-analysis, irrespective of stage. The characteristics of the final SNP array data used in the HISLA adult studies and the children/adolescent Hispanic/Latino studies are summarized separately in Tables S5 and S6.
Meta-analyses of HISLA stage 1 with other ancestral consortia
In addition to a Hispanic/Latino-only meta-analysis, we combined the HISLA stage 1 meta-analysis with data from previous large-scale GWAS meta-analyses of European (the Genetic Investigation of Anthropometric Traits [GIANT] Consortium,33, 34, 35 N ∼ 300,000) and/or African (the African Ancestry Anthropometry Genetics Consortium [AAAGC],36,37 N ∼ 50,000) descent populations. We used fixed-effect inverse variance weighted meta-analytic techniques in METAL to generate our trans-ancestral meta-analysis.32 We then assessed (1) the transferability of the findings from the BMI, height,38 and WHRadjBMI39 trans-ancestral meta-analyses to an independent sample of Hispanic/Latino children/adolescents or (2) the replication of the signal in the British subsample GWAS of the United Kingdom Biobank (UKBB). LD plots and regional plots are shown in the supplemental information (Figures S5–S53).
Thresholds for conditional signals, discovery, and transferability
We conducted approximate conditional analyses using genome-wide complex trait analysis (version 1.93.1) software. For HISLA analyses, we used our stage 1 discovery results with the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) as the LD reference dataset. For the approximate conditional trans-ancestral analyses, we used our trans-ancestral results from HISLA stage 1, AAAGC, or GIANT and a trans-ancestral LD reference dataset of Europeans and African Americans from the Atherosclerosis Risk in Communities (ARIC) cohort, and Hispanic/Latinos from the HCHS/SOL cohort, as a representation of the ancestry distribution of our meta-analysis. In both conditional analyses (HISLA-only and trans-ancestral results), we first identified all independent SNPs using the --cojo-slct command. Then, we conditioned each of these independent SNPs on all known SNPs from GWAS (curated in the GWAS catalog) or reported as part of targeted-array analyses published through December 2019 (BMI,33,36,38,40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55 height,34,38,47,50,55,56 and WHRadjBMI35,36,39,44,46,54,55,57, 58, 59, 60, 61) within 10 Mb (±5 Mb) of the lead SNP. The trans-ancestral meta-analysis results with a p < 5 × 10−8 after conditioning on known SNPs were taken forward for replication in the British subsample of the UKBB.
In this paper, we consider a signal as replicated when the effect of an allele is observed in two independent populations with the same or overlapping ancestral background, whereas generalization (transferability) refers to the observation of the same signal in an independent sample but with a distinct ancestral background, or distinct period of the life course. Furthermore, SNP associations were then defined as either newly discovered or established, depending on their location. An established locus was defined as an SNP association within ±500 kb of at least one previously identified index SNP, otherwise the association was considered a newly discovered locus.
We designated our Hispanic/Latino SNP associations within either newly discovered or established loci as “novel” if they met the following criteria for replication: (1) were associated at p < 5 × 10−8 in HISLA stage 1 and directionally consistent in the stage 2 independent sample, and (2) the addition of stage 2 samples improved the estimated p value of the stage 1 + 2 meta-analysis. For the trans-ancestral analyses, the designation of a signal as novel was based on SNPs that were: (1) associated at p < 5 × 10−8 in the combined HISLA, AAAGC, and GIANT meta-analysis, and (2) directionally consistent with the trans-ancestral meta-analysis and associated at p < 5 × 10−2 in an independent sample of Hispanic/Latino children/adolescents (generalized across age period) or in the British subsample GWAS from the UKBB (replication).
Hispanic/Latino SNP effects were considered to transfer (or generalize) to Hispanic/Latino children/adolescents or to African or European ancestry adults if they were: (1) directionally consistent, (2) associated at p < 5 × 10−2, and (3) had a heterogeneity of I2 < 75% in the Hispanic/Latino children/adolescent lookups, the adult AAAGC, or the adult GIANT GWAS lookups. SNP effects of variants previously associated with anthropometric traits in non-Hispanic/Latino populations (i.e., index published SNPs) were considered to be transferable (generalizable) to Hispanic/Latinos only if they were: (1) directionally consistent, (2) displayed a p < 5 × 10−2, and (3) had little to moderate effect heterogeneity (I2 < 75%) in stage 1.
Fine-mapping methods
We used FINEMAP62 for analyses of the newly discovered loci identified as part of the HISLA stage 1 meta-analysis or trans-ancestral meta-analysis, for both established and novel loci. For the established loci, we included index SNP associations published as of April 2018 (BMI,33,36,40,42, 43, 44,46,48,51, 52, 53, 54 height,34,50,56 and WHRadjBMI35,36,44,46,59) prior to the publications with the UKBB results.38,39 We used a 1 Mb region subset of the summary statistics from the stage 1 meta-analyses and HCHS/SOL9 unrelated sample set (N ∼ 7,670) to calculate the LD for each locus. For consistency with the FINEMAP package, we refer to the results using similar language; however, we do not mean to imply that the SNP(s) in the “causal set” are causative variants.
For trans-ancestral fine-mapping of the novel loci or new signals identified in the trans-ancestral meta-analysis of HISLA, AAAGC, and GIANT, we used a 1 Mb region defining each locus using the summary statistics of the given meta-analysis. We calculated the LD for Hispanic/Latino samples using the HCHS/SOL9 unrelated sample (N ∼ 7,670). For African and European ancestry samples, we calculated the LD using the ARIC unrelated sample that included self-reported African ancestry (N ∼ 2,800) and European ancestry (N ∼ 9,700). We weighted the LD matrices by the GWAS sample sizes for each trait (HISLA range, ∼42,400–56,100; AAAGC, 20,300–42,700; GIANT, 210,000–330,000).
All regions allowed up to a maximum of 10 causal variants, as defined by FINEMAP. The cumulative 95th percentile credible set was calculated from the estimated posterior probabilities. Convergence failed for three regions (lead SNPs at known height loci: rs2902635, rs6900530, and rs4425978) using the stochastic approach. For these three regions, we used the conditional approach to determine number of causal variants.
Gene expression and other bioinformatic analyses
We performed association analyses of measured whole blood gene expression in 606 individuals from the Cameron County Hispanic Cohort.63 RNA sequencing was conducted using 150 bp paired-end reads on the Illumina NovaSeq 6000 by Vanderbilt Technologies for Advanced Genomics. Initial sequencing quality was checked by FastQC.64 STAR-2.7.8a was applied to align sequencing reads alignment to the human genome reference (UCSC, hg38),65 and the aligned reads were assigned to genes using featureCounts.66 We excluded either samples with less than 15 million total aligned reads, a rate of successful alignment of less than 20%, or less than 15 million total assigned reads. The sequencing library size was normalized using DESeq267 and read counts were transformed using variance stabilizing transformations (vst in DESeq2 package). We performed expression quantitative trait loci (eQTL) analysis with our top HISLA SNP findings, by modeling SNP dosages (exposure) in a linear regression of gene expression levels (outcomes), for each gene within the 1 Mb interval around each lead SNP. We inverse normalized the gene expression levels and adjusted for age, sex, and three PCs to capture population substructure. Bonferroni correction for each region varied according to the number of SNPs tested.
To gain further insight into the possible functional role of the identified variants and to assess their relevance to other phenotypes, we conducted bioinformatic queries of our potentially novel loci and new signals within known loci in multiple publicly available databases, including PhenoScanner,68 RegulomeDB,69 Haploreg,70 UCSC GenomeBrowser,71 and GTEx.72
Trans-ancestral findings to account for population structure in previous GWAS
We demonstrated the degree to which the present trans-ancestral meta-analysis could lessen the bias induced by population stratification, using height from HISLA as an example. We first conducted PCA on the four European populations (CEU, GBR, IBS, and TSI) from 1000 Genomes. We excluded the Finnish population because of its known unique demographic history that could drive or dominate the top PCs in a limited sample.33 We only used biallelic SNPs with minor allele frequency (MAF) > 5% in the four European populations, and then pruned them by both distance and LD using PLINK 1.9.73 Specifically, we pruned the dataset such that no two SNPs were closer than 2 kb, and then pruned using a 50 SNP LD window (moving in steps of 5 SNPs), such that no SNPs had r2 > 0.2. We further removed SNPs in regions of long-range LD.74 PCA was performed on the remaining SNPs using Eigensoft version 7.2.1.
We performed linear regressions of individual PC values on the allelic genotype count for each polymorphic variant in the four European populations from 1000 Genomes and used the resulting regression coefficients as the estimate of the variant’s PC loading. For each PC, we then computed Pearson correlation coefficients of PC loadings and effect sizes (of variants with MAF > 1%) from each GWAS summary statistic. We estimated p values based on Jackknife standard errors, by splitting the genome into 1,000 blocks with an equal number of variants. If the GWAS summary statistics are not biased by residual stratification (in this case due to European geographical structure), the correlation coefficients would be expected to be zero. If there was significant correlation in either the GIANT dataset or the HISLA stage 1, AAAGC, and GIANT trans-ancestral meta-analysis, we then further evaluated the improvement of bias due to stratification in trans-ancestral meta-analysis by comparing the correlation coefficients in the trans-ancestral meta-analysis with those in GIANT. Restricting to variants shared between GIANT and the trans-ancestral meta-analysis, we computed their difference in correlation coefficients of PC loadings and effect sizes, and estimated p values again based on Jackknife standard errors from 1,000 equal sized blocks.
Results
Discovery of one BMI locus in Hispanic/Latino adults
The first goal of this study was to conduct a genome-wide meta-analysis of anthropometric traits in Hispanic/Latino adults to identify loci in an under-studied population (Figure 1). All regional plots of all potentially novel GWAS significant HISLA stage 1 findings are shown in the supplemental information (Figures S6–S11).
No novel anthropometric loci were identified in all HISLA stage 1 samples combined. Yet, when we excluded the samples of exclusively Brazilian or Native American heritage from stage 1, we discovered one locus for adult BMI at PAX3 on chromosome 2 in the HISLA stage 1 sample (Table S7) and replicated this locus in HISLA stage 2 (Table 1). The lead SNP at this locus, rs994108, is in moderate LD with a previously-reported SNP (rs7559271, r2 = 0.46 in 1000 Genomes AMR) (Figure 2) and lies on the same haplotype as reported to influence facial morphology, including position of the nasion (the deepest point on the nasal bridge where the nose meets the forehead) in Europeans75 and Hispanic/Latino76 descent individuals. Other PAX3 variants in lower LD with the lead SNP have also been associated with nasion position,77 monobrow, and male-pattern baldness.78,79 PAX3 is a well-known transcription factor in normal embryonic neural crest development and differentiation.80 Neural crest cells can give rise to mesenchymal stem cells,81 which can in turn give rise to adipocytes;81, 82, 83 thus, the possible role of PAX3 in adipogenesis may at least partially explain the association signal with BMI near this gene.
Table 1.
Trait | Locus name | SNP RSID | Genomic regionb | Chr | Position (hg19) | Effect/other alleles | Stage | EAF | Beta | SE | p value | HetISq | N | Novel? Yes/noc |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Novel loci | ||||||||||||||
BMI | PAX3a | rs994108 | intergenic | 2 | 223057288 | C/A | stage 1: discovery | 0.390 | 0.041 | 0.007 | 1.62 × 10−8 | 0 | 43,048 | yes |
stage 2: validation | 0.394 | 0.030 | 0.016 | 5.65 × 10−2 | 8.0 | 9,336 | ||||||||
stage 1 + 2 | 0.389 | 0.038 | 0.006 | 2.19 × 10−9 | 0 | 52,384 | ||||||||
AAAGC | 0.526 | 0.007 | 0.007 | 3.26 × 10−1 | 0 | 42,751 | ||||||||
GIANT | 0.342 | <0.001 | 0.004 | 9.81 × 10−1 | – | 233,955 | ||||||||
ARRDC3 | rs1505851 | intronic | 5 | 90893954 | T/C | stage 1: discovery | 0.741 | 0.041 | 0.007 | 2.287 × 10−8 | 0 | 52,365 | no | |
stage 2: validation | 0.709 | 0.005 | 0.017 | 7.62 × 10−1 | 33.3 | 9,336 | ||||||||
stage 1 + 2 | 0.735 | 0.035 | 0.007 | 1.16 × 10−7 | 14.1 | 61,701 | ||||||||
AAAGC | 0.307 | 0.027 | 0.008 | 7.00 × 10−4 | 46.6 | 42,752 | ||||||||
GIANT | 0.680 | 0.001 | 0.004 | 7.90 × 10−1 | – | 233,999 | ||||||||
WHRadjBMI (women only) | DOCK2a | rs6879439 | intronic | 5 | 169314869 | C/T | stage 1: discovery | 0.520 | 0.060 | 0.010 | 1.02 × 10−8 | 0 | 18,591 | no |
stage 2: validation | 0.526 | 0.013 | 0.028 | 6.54 × 10−1 | 28.7 | 2,747 | ||||||||
stage 1 + 2 | 0.515 | 0.049 | 0.0093 | 1.57 × 10−7 | 1.9 | 23,382 | ||||||||
AAAGC | 0.440 | 0.012 | 0.012 | 3.09 × 10−1 | 0 | 15,600 | ||||||||
GIANT | 0.610 | 0.003 | 0.005 | 6.30 × 10−1 | – | 86,317 | ||||||||
WHRadjBMI (sex combined) |
TAOK3 |
rs115981023 |
intronic |
12 |
118751105 |
A/G |
stage 1: discovery | 0.009 | 0.328 | 0.057 | 1.08 × 10−8 | 44.8 | 19,640 | no |
stage 2: validation | 0.004 | −0.339 | 0.687 | 6.22 × 10−1 | 0 | 1,340 | ||||||||
stage 1 + 2 | 0.009 | 0.308 | 0.057 | 5.18 × 10−8 | 52.0 | 20,980 | ||||||||
AAAGC | 0.050 | 0.027 | 0.027 | 3.07 × 10−1 | 0 | 15,601 | ||||||||
GIANT | 0.002 | no proxy | ||||||||||||
New signals in known loci | ||||||||||||||
BMI | ADCY5a | rs17361324 | intronic | 3 | 123131254 | T/C | stage 1: discovery | 0.280 | 0.042 | 0.008 | 2.60 × 10−8 | 0 | 43,333 | yes |
stage 2: validation | 0.269 | 0.035 | 0.018 | 4.70 × 10−2 | 0 | 9,035 | ||||||||
stage 1 + 2 | 0.278 | 0.041 | 0.007 | 2.84 × 10−9 | 0 | 52,368 | ||||||||
AAAGC | 0.119 | 0.023 | 0.011 | 3.85 × 10−2 | 0 | 42,682 | ||||||||
GIANT | 0.253 | 0.013 | 0.004 | 9.90 × 10−4 | – | 320,704 | ||||||||
ILRUN | rs148899910 | intergenic | 6 | 34232259 | C/G | stage 1: discovery | 0.275 | 0.040 | 0.007 | 9.03 × 10−9 | 0 | 54,105 | yes | |
stage 2: validation | 0.282 | 0.049 | 0.017 | 4.43 × 10−3 | 0 | 9,035 | ||||||||
stage 1 + 2 | 0.276 | 0.041 | 0.006 | 1.24 × 1010 | 0 | 63,140 | ||||||||
AAAGC | 0.316 | −0.016 | 0.008 | 5.02 × 10−2 | 30.0 | 42,750 | ||||||||
GIANTd | 0.017 | 0.036 | 0.012 | 3.99 × 10−3 | – | 216,522 | ||||||||
Height | B4GALNT3 | rs215226 | intronic | 12 | 591300 | A/G | stage 1: discovery | 0.550 | −0.032 | 0.005 | 5.53 × 10−9 | 22.1 | 52,156 | yes |
stage 2: validation | 0.565 | −0.020 | 0.017 | 2.37 × 10−1 | 19.3 | 6,906 | ||||||||
stage 1 + 2 | 0.551 | −0.031 | 0.005 | 1.98 × 10−9 | 21.8 | 59,062 | ||||||||
AAAGC | 0.772 | −0.031 | 0.009 | 8.99 × 10−4 | 24.0 | 41,327 | ||||||||
GIANT | 0.633 | 0.006 | 0.004 | 1.10 × 10−1 | – | 220,370 |
Chr, chromosome; EAF, effect allele frequency; HetIsq, heterogeneity I2; N, sample size; WHRadjBMI, waist-to-hip ratio adjusted for BMI; AAAGC, African American Anthropometry Genetics Consortium; GIANT, Genetic Investigation of Anthropometric Traits Consortium.
All studies were meta-analyzed using METAL,32 with each study entered individually into the stage 1 + 2 meta-analysis.
These BMI and WHRadjBMI analyses did not include Brazilian and/or Native American samples.
Human Genome Organisation-approved gene names.
New loci or signals are those that were replicated by HISLA stage 2 results that are directionally consistent with stage 1 and remained genome-wide significant after meta-analysis with stage 1.
Proxy GIANT, rs1573905 (r2 = 0.96 AMR).
Another BMI SNP (rs1505851-T) near ARRDC3 on chromosome 5 associated at genome-wide significance in HISLA stage 1 (Table S7; Figure S6), but did not replicate in stage 2 (MAF = 71%) or generalize to AAAGC or GIANT (effect allele frequency = 31% or 68%; Table 1). However, the association was directionally consistent and showed some signal in AAAGC (p = 7 × 10−4). LD patterns at ARRDC3 appear to be similar across the three ancestries (albeit with a smaller LD block for 1000 Genomes AFR), meaning that lack of generalization may be related to frequency differences, haplotype effects, or a false positive (Figure S5A).
We identified two WHRadjBMI loci at DOCK2 and TAOK3 at genome-wide significance in HISLA stage 1 after excluding the Brazilian and Native American samples (Table S7; Figures S7 and S8), yet neither met the p value threshold for replication in HISLA stage 2. The DOCK2 association for WHRadjBMI observed among women only in stage 1 (Figure S7) was directionally consistent in the female stage 2 sample. There were more SNPs in high LD with rs6879439 in 1000 Genomes AMR (0.8 ≤ r2 ≤ 1) than for AFR and EUR references (Figure S5B), which may explain why this SNP association did not generalize to AAAGC or GIANT (Table 1).
The genome-wide significant stage 1 TAOK3 association was led by a low frequency variant (rs115981023-A, MAF = 0.9%), but the associations at this variant were not directionally consistent across stages (Table 1; Figure S8). In fact, patterns of very high LD were seen with rs115981023 in 1000 Genomes AMR and AFR (Figure S5C), even though this variant is seen in African ancestry at a higher frequency (e.g., MAF = 0.9% in HISLA versus 5% in AAAGC). Similarly, rs115981023 exhibited moderate heterogeneity across stage 1 samples after excluding Brazilian and Native American samples (I2 = 45%); evidence of moderate heterogeneity remained (I2 = 52%) in the combined meta-analysis of HISLA stage 1 and 2 samples (Table 1). Finally, this variant was the least frequent in European ancestry (MAF = 0.2% in GIANT), which explained the lack of a proxy for generalization in GIANT (Table 1).
No potentially novel loci were identified for height in HISLA stage 1, and the exclusion of the Brazilian and Native American samples did not reveal additional height or WHRadjBMI loci.
Discovery of three signals in established loci for BMI and height in Hispanic/Latino adults
At two established loci for BMI, we identified additional signals at ADCY5 and near ILRUN (Table S7). These signals were both independent of any previously published anthropometric findings (Table S8; Figures S9 and S10). We replicated these signals in stage 2 with directional consistency and in the combined stage 1 + 2 meta-analysis at GWAS significance (Table 1). We also identified one additional signal for height in an established height locus, B4GALNT3, which was independent of the previously reported SNPs for height (Tables S7 and S8; Figure S11). We replicated this signal in stage 2 with directional consistency and a stage 1 + 2 meta-analysis that was GWAS significant (Table 1). In additional gene expression and bioinformatics analyses (Tables S18–S20), we found that each of the three additional signals in established anthropometric loci is supported by an eQTL in whole blood in Hispanic/Latino populations (Table S18), and an eQTL in other relevant tissues, e.g., thyroid, esophagus, artery, using publicly available (non-Hispanic/Latino) datasets (Tables S19 and S20).
Fine-mapping of Hispanic/Latino anthropometric findings
We fine-mapped the PAX3 locus for BMI and the three additional signals in known loci (BMI, ADCY5 and ILRUN; height, B4GALNT3; Table S9). For the three BMI loci, FINEMAP revealed one potential causal set for each locus at PAX3, ADCY5, and ILRUN loci. For the PAX3 locus, this 95th percentile credible set contained only nine plausibly causal SNPs, with the lead SNP rs994108 having a very high posterior probability of being causal (0.89, Table S21). However, functional annotation of this SNP was unremarkable (Tables S22 and S23). In contrast, for ADCY5 and ILRUN, FINEMAP revealed one causal configuration for each locus but with much greater uncertainty of the likely functional variant given the size of the credible sets, which contained 14 and 22 SNPs in the credible region for ADCY5 and ILRUN, respectively. The posterior probability of the best lead SNP at these loci was relatively low with the best posterior probabilities of 0.23 for rs17361324 (ADCY5), and 0.11 for rs73420913 (ILRUN), respectively. Interestingly, however, the best candidate for causality at PAX3 and ADCY5 loci were the lead SNPs from the HISLA meta-analysis; for ILRUN, the FINEMAP and HISLA SNPs were in high LD (rs73420913 had an r2 = 0.96 with the lead HISLA SNP rs148899910), providing greater support for the prioritization of these SNPs for functional interrogation. For the B4GALNT3 height locus, FINEMAP revealed six causal configurations. Four of the variants (rs11063185, rs215230, rs7303572, and rs11063184) with each configuration had a posterior probability >0.99 and contained only the variant itself in the 95th percentile credible set. One variant (rs215223) had a posterior probability of 0.93 and thus included two variants in the 95th percentile credible set. The sixth 95th percentile credible set had a lead variant with a posterior probability of 45% but contained a total of 1,621 additional variants, all of which had very small posterior probabilities (i.e., ≤0.05).
Transferability of adult loci/signals from Hispanic/Latinos to consortia of other ancestral backgrounds
To assess how well the effect estimates are transferable to other populations, we looked up the BMI and height findings from Hispanic/Latinos in the AAAGC and GIANT meta-analysis results (Table 1). The BMI signal at the ADCY5 locus (rs17361324) transferred to both AAAGC and GIANT with directional consistency (beta = 0.13–0.23) and at nominal significance (p < 5 × 10−2). The lead SNP (rs148899910) representing the BMI signal near ILRUN was not available in GIANT; the signal only appeared to be transferable to GIANT (at proxy SNP rs1573905, r2 = 0.96–1 in 1000 Genomes AMR and EUR; Table 1). The signal for height in B4GALNT3 (rs215226) was directionally consistent and nominally significant in AAAGC only. In all cases, the effect sizes observed in GIANT and AAAGC were attenuated compared with the effect sizes from HISLA stage 1.
Relevance of adult Hispanic/Latino anthropometric findings to childhood/adolescence
We looked up our novel HISLA findings in Hispanic/Latino children/adolescents using BMI-for-age and height-for-age Z scores, as well as a case-control study of childhood obesity. Two of the three novel BMI signals were directionally consistent with the anticipated effect on the odds of obesity during childhood/adolescence, one of which was nominally significant (rs17361324 at ADCY5; p = 2.2 × 10−2). None of the HISLA findings generalized at nominal significance with the BMI/height-for-age Z scores, but were directionally consistent with the corresponding effect in adulthood (Table S10). This may have been due to the small available sample size of Hispanic/Latino children/adolescents.
Transferability of established anthropometric loci to Hispanic/Latino adults
We assessed how many established anthropometric loci, described previously in predominantly non-Hispanic/Latino European samples, could be transferred to Hispanic/Latino adults, in light of the available Hispanic/Latino sample size from stage 1. As shown in Table S11, the index SNPs at 336 of 1,247 (26.9%) previously reported BMI loci were suggestively transferable at nominal significance to Hispanic/Latinos. Of these BMI loci, 36 SNPs in the HISLA stage 1 displayed directional consistency with the literature and Bonferroni significance (Table S11). Furthermore, one BMI locus was genome-wide significant at the same published variant and another 12 BMI loci were genome-wide significant at another SNP within 1 Mb and in moderate to high LD (r2 ≥ 0.52 in AMR) with the reported index SNP (Table S7).
Table S12 shows that a slightly higher percentage of known height loci (1,177 of 3,806, or 30.9%) were transferable to Hispanic/Latinos. Of these loci, 124 SNPs were directionally consistent and Bonferroni significant (Table S12). Ten height loci were genome-wide significant at the same lead variant, and another 39 height loci were associated at genome-wide significance at another SNP within 1 Mb (0.05 ≤ r2 ≤ 0.98 in AMR; Table S7).
Finally, Tables S13–S15 show that 143 of 694 (20.6%) known WHRadjBMI in both sexes combined, 133 of 567 (23.5%) in women-only, and 28 of 173 (16.2%) in men-only loci were transferable to Hispanic/Latinos at nominal significance. Of these, a total of 15 loci were associated with WHRadjBMI at Bonferroni significance in the combined, women- or men-only analyses (Tables S13–S15). None of the index SNPs from the previous literature for WHRadjBMI reached genome-wide significance; however, we did observe genome-wide significant evidence for association of an SNP with WHRadjBMI in strong LD with the index variant (r2 = 0.92 in AMR) for the HOXC13 signal (Table S7).
Replication of five novel loci and 33 new signals in established loci for adult anthropometric traits from a trans-ancestral meta-analysis
Our secondary goal was to assemble a trans-ancestral meta-analysis of HISLA stage 1, AAAGC and GIANT consortia results to identify additional novel loci and fine-map established loci by leveraging differences in allele frequencies across populations (Figure 1). As anticipated, this trans-ancestral meta-analysis of HISLA, AAAGC, and GIANT revealed new insights, including 8 novel loci and 35 new signals in established loci that were associated at genome-wide significance (Table S16; Figures S12–S53) and independent of established SNPs within a 10 Mb region (Table 2). Of this set, 5 loci (3 BMI, 1 height, and 1 WHRadjBMI) and 33 signals in established loci (3 BMI, 28 height, and 2 WHRadjBMI) were generalized using the adult British subsample of the UKBB. In some cases, the significance in the trans-ancestral results were driven more by the AAAGC and/or HISLA consortia, which could explain the lack of association in the UKBB British subsample (Table S16; Figure S54).
Table 2.
Trait | SNP rsid | Chr | Position (hg19) | Locus namec | Effect/other alleles | EAF |
N | Unconditioned meta-analysis results |
Conditioned on all known SNPs within 10 Mb region |
UKBB (validation results) |
|||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
HISLA | AAAGC | GIANT | Beta | SE | p value | HetISq | Beta | SE | p value | EAF | N | Beta | SE | p value | |||||||
Novel locia | |||||||||||||||||||||
BMI | rs4675117 | 2 | 227769794 | RHBDD1 | T/C | 0.421 | 0.104 | 0.383 | 343,628 | 0.017 | 0.003 | 8.56 × 10−8 | 0 | 0.019 | 0.003 | 2.23 × 10−9 | 0.383 | 336,107 | 0.006 | 0.002 | 1.82 × 10−2 |
BMI | rs9860730 | 3 | 64701146 | ADAMTS9-AS2 | A/G | 0.354 | 0.222 | 0.767 | 428,763 | −0.016 | 0.003 | 1.67 × 10−8 | 0 | −0.015 | 0.003 | 3.80 × 10−8 | 0.712 | 336,107 | −0.008 | 0.003 | 4.54 × 10−3 |
BMI | rs150992 | 5 | 98275197 | CHD1-DT | A/G | 0.778 | 0.645 | 0.700 | 439,077 | 0.018 | 0.003 | 5.40 × 10−10 | 0 | 0.017 | 0.003 | 1.02 × 10−8 | 0.693 | 336,107 | 0.005 | 0.003 | 3.74 × 10−2 |
Height | rs17375290 | 1 | 61334177 | NFIA | A/G | 0.830 | 0.645 | 0.793 | 364,636 | 0.017 | 0.003 | 3.47 × 10−8 | 0 | 0.017 | 0.003 | 3.00 × 10−8 | 0.794 | 336,474 | 0.002 | 0.002 | 4.58 × 10−1 |
Height | rs10737541 | 1 | 168214098 | ANKRD36BP1 | T/G | 0.399 | 0.645 | 0.196 | 319,809 | −0.018 | 0.003 | 1.60 × 10−9 | 0 | −0.018 | 0.003 | 3.16 × 10−10 | 0.226 | 336,474 | −0.004 | 0.002 | 4.37 × 10−2 |
Height | rs4618485 | 6 | 73555917 | KCNQ5 | A/G | 0.750 | 0.645 | 0.592 | 348,626 | 0.014 | 0.003 | 4.72 × 10−8 | 39.2 | 0.018 | 0.003 | 4.34 × 10−12 | 0.604 | 336,474 | 0.003 | 0.002 | 7.02 × 10−2 |
Height | rs17493997 | 8 | 82044302 | PAG1 | C/G | 0.493 | 0.645 | 0.340 | 325,906 | −0.015 | 0.003 | 3.42 × 10−8 | 0 | −0.017 | 0.003 | 2.73 × 10−10 | 0.299 | 336,474 | −0.0003 | 0.002 | 8.56 × 10−1 |
WHRadjBMI (sex combined) | rs16873543 | 6 | 45577134 | RUNX2 | T/C | 0.716 | 0.645 | 0.767 | 209,552 | −0.018 | 0.004 | 3.20 × 10−6 | 0 | −0.022 | 0.004 | 9.65 × 10−9 | 0.724 | 484,563 | −0.008 | 0.002 | 5.50 × 10−4 |
New signals in established locib | |||||||||||||||||||||
BMI | rs10540 | 11 | 494662 | RNH1 | A/G | 0.200 | 0.031 | 0.092 | 470,714 | −0.021 | 0.004 | 1.01 × 10−7 | 0 | −0.023 | 0.004 | 5.75 × 10−9 | 0.135 | 336,107 | −0.007 | 0.004 | 4.16 × 10−2 |
BMI | rs4807179 | 19 | 1956035 | CSNK1G2 | A/G | 0.494 | 0.218 | 0.525 | 309,507 | 0.020 | 0.003 | 2.75 × 10−10 | 0 | 0.018 | 0.003 | 1.52 × 10−8 | 0.632 | 336,107 | 0.014 | 0.002 | 1.06 × 10−8 |
BMI | rs4813428 | 20 | 21451848 | NKX2-2 | T/C | 0.181 | 0.093 | 0.108 | 321,797 | 0.029 | 0.005 | 2.89 × 10−10 | 0 | 0.029 | 0.005 | 1.46 × 10−10 | 0.093 | 336,107 | 0.013 | 0.004 | 2.47 × 10−3 |
Height | rs4912122 | 1 | 19876438 | NKX2-2 | A/G | 0.383 | 0.837 | 0.383 | 334,951 | −0.015 | 0.003 | 6.33 × 10−9 | 0 | −0.019 | 0.003 | 1.45 × 10−13 | 0.350 | 336,474 | −0.012 | 0.002 | 1.58 × 10−11 |
Height | rs4425978 | 1 | 42243878 | HIVEP3 | T/C | 0.389 | 0.189 | 0.580 | 351,587 | 0.014 | 0.003 | 2.14 × 10−8 | 0 | 0.016 | 0.003 | 3.63 × 10−10 | 0.533 | 336,474 | 0.008 | 0.002 | 5.41 × 10−6 |
Height | rs618555 | 1 | 86481084 | COL24A1 | T/C | 0.241 | 0.064 | 0.353 | 320,239 | 0.019 | 0.003 | 5.47 × 10−12 | 0 | 0.016 | 0.003 | 2.90 × 10−8 | 0.311 | 336,474 | 0.008 | 0.002 | 1.67 × 10−5 |
Height | rs6545538 | 2 | 56217900 | MIR216A | A/G | 0.258 | 0.628 | 0.258 | 305,704 | 0.022 | 0.003 | 1.23 × 10−13 | 0 | 0.019 | 0.003 | 2.59 × 10−11 | 0.266 | 336,474 | 0.011 | 0.002 | 1.25 × 10−8 |
Height | rs2741311 | 2 | 233239743 | ALPP | T/C | 0.076 | 0.018 | 0.050 | 463,609 | 0.046 | 0.005 | 1.41 × 10−21 | 49.6 | 0.030 | 0.005 | 4.78 × 10−10 | 0.080 | 336,474 | 0.033 | 0.003 | 2.39 × 10−24 |
Height | rs6935954 | 6 | 26255451 | HIST1H2BH | A/G | 0.306 | 0.103 | 0.408 | 345,378 | 0.042 | 0.003 | 3.06 × 10−59 | 46.5 | 0.018 | 0.003 | 3.29 × 10−12 | 0.425 | 336,474 | −0.027 | 0.002 | 1.10 × 10−54 |
Height | rs6900530 | 6 | 35280971 | DEF6 | T/C | 0.094 | 0.385 | 0.042 | 123,137 | −0.057 | 0.005 | 3.09 × 10−28 | 79.0 | −0.036 | 0.005 | 6.75 × 10−12 | 0.027 | 336,474 | −0.073 | 0.005 | 1.03 × 10−42 |
Height | rs9472006 | 6 | 43067487 | PTK7 | A/G | 0.149 | 0.249 | 0.059 | 212,931 | −0.027 | 0.005 | 4.37 × 10−9 | 0 | −0.034 | 0.005 | 7.04 × 10−14 | 0.041 | 336,474 | −0.013 | 0.004 | 2.86 × 10−3 |
Height | rs3822957 | 6 | 76607280 | MY O 6 | A/G | 0.173 | 0.538 | 0.158 | 279,818 | −0.023 | 0.003 | 4.88 × 10−12 | 84.4 | −0.023 | 0.003 | 3.44 × 10−12 | 0.142 | 336,474 | −0.015 | 0.002 | 1.98 × 10−9 |
Height | rs1342330 | 6 | 144065685 | PHACTR2 | A/T | 0.601 | 0.738 | 0.508 | 353,259 | 0.014 | 0.003 | 1.70 × 10−8 | 0 | 0.017 | 0.003 | 6.95 × 10−12 | 0.520 | 336,474 | 0.006 | 0.002 | 8.84 × 10−4 |
Height | rs6936615 | 6 | 154355100 | OPRM1 | A/G | 0.887 | 0.901 | 0.850 | 415,248 | −0.018 | 0.003 | 2.47 × 10−8 | 0 | −0.020 | 0.003 | 2.33 × 10−9 | 0.830 | 336,474 | −0.003 | 0.002 | 1.73 × 10−1 |
Height | rs991946 | 6 | 166329862 | RP11-252P19.3c | T/C | 0.481 | 0.486 | 0.500 | 379,912 | −0.019 | 0.002 | 1.28 × 10−14 | 36.7 | −0.019 | 0.002 | 9.96 × 10−15 | 0.479 | 336,474 | −0.013 | 0.002 | 3.92 × 10−13 |
Height | rs7816300 | 8 | 109787856 | TMEM74 | T/C | 0.262 | 0.116 | 0.260 | 397,735 | −0.015 | 0.003 | 2.24 × 10−8 | 0 | −0.016 | 0.003 | 5.47 × 10−9 | 0.299 | 336,474 | −0.002 | 0.002 | 3.94 × 10−1 |
Height | rs4520250 | 9 | 88924057 | TUT7 | A/C | 0.235 | 0.069 | 0.433 | 295,945 | 0.015 | 0.003 | 3.80 × 10−8 | 55.9 | 0.016 | 0.003 | 1.48 × 10−8 | 0.339 | 336,474 | 0.010 | 0.002 | 4.36 × 10−8 |
Height | rs7029157 | 9 | 97000863 | snoU13c | T/C | 0.084 | 0.325 | 0.067 | 262,808 | 0.028 | 0.004 | 5.85 × 10−11 | 16.8 | 0.026 | 0.004 | 1.00 × 10−9 | 0.088 | 336,474 | 0.030 | 0.003 | 4.55 × 10−22 |
Height | rs12347744 | 9 | 97575273 | AOPEP | T/C | 0.102 | 0.015 | 0.033 | 454,111 | −0.032 | 0.005 | 6.55 × 10−11 | 65.8 | −0.030 | 0.005 | 6.06 × 10−10 | 0.061 | 336,474 | −0.031 | 0.004 | 1.04 × 10−17 |
Height | rs7024254 | 9 | 109498129 | ZNF462 | A/G | 0.255 | 0.721 | 0.207 | 324,351 | 0.017 | 0.003 | 1.04 × 10−8 | 23.7 | 0.036 | 0.003 | 6.10 × 10−35 | 0.204 | 336,474 | 0.010 | 0.002 | 6.50 × 10−6 |
Height | rs10119624 | 9 | 118305438 | DELEC1 | A/G | 0.612 | 0.437 | 0.683 | 353,374 | 0.021 | 0.003 | 2.56 × 10−16 | 31.1 | 0.022 | 0.003 | 8.02 × 10−17 | 0.671 | 336,474 | 0.012 | 0.002 | 3.20 × 10−10 |
Height | rs2902635 | 10 | 105476045 | SH3PXD2A | T/G | 0.710 | 0.382 | 0.821 | 308,782 | −0.021 | 0.003 | 1.59 × 10−12 | 0 | −0.017 | 0.003 | 1.62 × 10−8 | 0.805 | 336,474 | −0.015 | 0.002 | 7.65 × 10−12 |
Height | rs17659078 | 11 | 2284590 | ASCL2 | A/C | 0.211 | 0.149 | 0.258 | 354,931 | 0.019 | 0.003 | 1.78 × 10−10 | 9.0 | 0.016 | 0.003 | 3.45 × 10−8 | 0.273 | 336,474 | 0.004 | 0.002 | 2.24 × 10−2 |
Height | rs11605693 | 11 | 122837037 | JHY | T/C | 0.591 | 0.384 | 0.448 | 380,447 | −0.017 | 0.002 | 3.44 × 10−12 | 0 | −0.018 | 0.002 | 3.07 × 10−14 | 0.447 | 336,474 | −0.013 | 0.002 | 4.45 × 10−13 |
Height | rs621794 | 11 | 125849462 | CDON | A/G | 0.500 | 0.430 | 0.475 | 380,049 | −0.014 | 0.002 | 1.46 × 10−8 | 0 | −0.014 | 0.002 | 7.29 × 10−9 | 0.429 | 336,474 | −0.009 | 0.002 | 1.76 × 10−7 |
Height | rs11221442 | 11 | 128577624 | FLI1 | C/G | 0.171 | 0.109 | 0.217 | 352,360 | −0.022 | 0.003 | 2.03 × 10−12 | 78.8 | −0.023 | 0.003 | 2.03 × 10−13 | 0.252 | 336,474 | −0.008 | 0.002 | 2.59 × 10−5 |
Height | rs12300112 | 12 | 103147575 | LINC00485 | C/G | 0.075 | 0.242 | 0.008 | 154,014 | 0.041 | 0.006 | 4.31 × 10−13 | 45.2 | 0.038 | 0.006 | 3.11 × 10−11 | 0.027 | 336,474 | 0.038 | 0.005 | 4.61 × 10−12 |
Height | rs11616067 | 12 | 116393174 | MED13L | A/G | 0.841 | 0.904 | 0.750 | 327,941 | 0.021 | 0.003 | 6.38 × 10−12 | 21.0 | 0.018 | 0.003 | 2.80 × 10−9 | 0.768 | 336,474 | 0.012 | 0.002 | 6.01 × 10−9 |
Height | rs17197170 | 14 | 21977962 | METTL3 | A/G | 0.931 | 0.960 | 0.842 | 310,344 | −0.026 | 0.004 | 5.40 × 10−12 | 64.9 | −0.025 | 0.004 | 3.74 × 10−11 | 0.828 | 336,474 | −0.018 | 0.002 | 3.07 × 10−14 |
Height | rs11076551 | 16 | 51109492 | RP11-883G14.4c | A/G | 0.341 | 0.343 | 0.267 | 434,688 | 0.014 | 0.003 | 2.14 × 10−8 | 21.6 | 0.015 | 0.003 | 5.87 × 10−9 | 0.376 | 336,474 | 0.009 | 0.002 | 2.51 × 10−7 |
Height | rs12918773 | 16 | 89741403 | SPATA33 | A/G | 0.121 | 0.036 | 0.149 | 279,104 | −0.024 | 0.004 | 7.45 × 10−9 | 46.1 | −0.024 | 0.004 | 3.50 × 10−9 | 0.112 | 336,474 | −0.024 | 0.003 | 4.71 × 10−19 |
Height | rs1346490 | 19 | 7244233 | INSR | A/C | 0.546 | 0.611 | 0.500 | 333,896 | 0.015 | 0.003 | 3.99 × 10−9 | 12.3 | 0.014 | 0.003 | 4.86 × 10−8 | 0.620 | 336,474 | 0.009 | 0.002 | 1.67 × 10−7 |
Height | rs17457472 | 19 | 17493610 | PLVAP | A/C | 0.037 | 0.062 | 0.028 | 356,712 | −0.051 | 0.007 | 5.70 × 10−14 | 0 | −0.042 | 0.007 | 9.89 × 10−10 | 0.040 | 336,474 | −0.019 | 0.004 | 1.12 × 10−5 |
WHRadjBMI (sex combined) | rs17099388 | 5 | 142095250 | FGF1 | A/G | 0.166 | 0.256 | 0.008 | 105,460 | 0.039 | 0.007 | 3.14 × 10−9 | 0 | 0.037 | 0.007 | 2.47 × 10−8 | 0.041 | 484,563 | 0.025 | 0.005 | 2.00 × 10−7 |
WHRadjBMI (sex combined) | rs7975017 | 12 | 26428793 | SSPN | T/C | 0.303 | 0.717 | 0.167 | 267,044 | −0.021 | 0.004 | 7.57 × 10−9 | 5.5 | −0.021 | 0.004 | 1.05 × 10−8 | 0.239 | 484,563 | −0.014 | 0.002 | 2.00 × 10−9 |
Chr, chromosome; EAF, effect allele frequency; HetIsq, heterogeneity I2; N, sample size; WHRadjBMI, waist-to-hip ratio adjusted for BMI; AAAGC, African American Anthropometry Genetics Consortium; GIANT, Genetic Investigation of Anthropometric Traits Consortium.
Each novel locus was defined by the absence of known (previously published) SNPs within 1 Mb (±500 kb) of the lead SNP.
Each known locus was defined by a 1 Mb region around previously identified SNP(s) for the indicated trait; the known SNP(s), p < 5 × 10−8, at each established locus can be found in Table S16.
Human Genome Organisation approved gene names, unless otherwise indicated.
We looked up the findings from our trans-ancestral meta-analyses in the sample of Hispanic/Latino children/adolescents (Table S17). We found that 2 of the 7 BMI and height trans-ancestral loci, and 17 of the 33 trans-ancestral BMI/height signals in established loci, were directionally consistent between their adult directions of association and the BMI/height-for-age Z scores in children/adolescents. However, this amount of directional consistency was not more than what would have been expected by chance alone (pbinomial > 0.10). Four trans-ancestral SNPs were associated at nominal significance in the child/adolescent sample, each having been already replicated in UKBB (Table S16). Three of these four loci were directionally consistent in the childhood/adolescence results with the trans-ancestral adult findings (Table S17).
Fine-mapping of trans-ancestral anthropometric findings
We also fine-mapped our trans-ancestral findings (Table S21) using FINEMAP to pinpoint individual variants and genes within each locus region that have a direct effect on the trait. FINEMAP uses a shotgun stochastic search algorithm84 that iterates through causal configurations of SNPs by concentrating efforts on the configurations with non-negligible probability. Within a 1 Mb region, we report (1) the causal configuration of SNPs for a given trait that had the highest posterior probability and (2) the posterior probability of being causal for each of the SNPs.
For four of the five trans-ancestral loci (three BMI loci and one WHRadjBMI locus), there was one SNP within the configuration with the highest posterior probability. For the height locus near ANKRD36BP1, there were two SNPs in the configuration with the highest posterior probability. In all five loci, the SNP with the highest posterior probability from each of these credible sets was either the exact SNP with the strongest GWAS evidence or in high LD (r2 between 0.70 and 0.99 in each ancestry) with the lead GWAS SNP. Two of these five regions had strong prioritization given high posterior probabilities (≥0.8) and small 95th percentile credible sets: (1) for BMI, the CHD1-DT region had a posterior probability of 0.88 for rs150992 with three SNPs in the credible set, and (2) for height, the ANKRD36BP1 region had a posterior probability of 0.93 for rs10737541 with five SNPs in the credible set. From the functional annotations (Tables S22 and S23), we find that all three of the BMI loci, the height loci, and WHRadjBMI loci have enhancer marks and eQTLs, most of which are in highly relevant tissues, e.g., adipose, brain, muscle, thyroid.
For the other trans-ancestral loci, the posterior probabilities were lower, between 0.09 and 0.42, yet four loci (rs9860730, rs17375290, rs4324883, and rs9463108) still had relatively few SNPs (<10) in the 95th percentile credible sets, suggesting a narrow window (combination of variants) around the causal variant. For example, functional annotations of rs17375290, the lead GWAS SNP in the NFIA locus associated with height, show it to have promoter markers in muscle, CADD score of 13.29 (CADD >10 ranks variants among the top 10% potentially deleterious), and an eQTL with FGGY in osteoclast tissue (Tables S22 and S23). Three of the other SNPs in the credible set (rs599989, rs1762881, and rs17121184) have nominally significant (p = 0.01–0.005) eQTLs with FGGY in osteoclast tissue, but are not in high LD with rs17375290 (r2 = 0.03–0.1). Diseases associated with FGGY include autosomal recessive lateral sclerosis and spastic paraplegia type 7, which are known to affect height.
Within the 33 trans-ancestral signals in known loci, 31 had configurations with more than 1 putative causal SNP (e.g., more than 1 credible set). This made sense given these are loci with multiple independent signals, as described by our earlier conditional analyses. Among the putative causal SNPs within each locus, there were a number of SNPs that represented known signals (either the exact SNP or something in high LD among all ancestries). We found that, for many of these, the credible sets contained <10 SNPs. Among the 33 signals in known loci, 26 included a putative causal SNP that is the lead GWAS SNP reported here or an SNP in high LD (r2 > 0.75) with the lead GWAS SNP, suggesting causality for this signal in general, although perhaps maybe not initially described at the most-putatively causal SNP(s). For these putatively causal SNPs, the posterior probabilities ranged from 0.09 to 1. Twenty-two of these SNPs had 95th percentile credible sets that contained <10 SNPs and 15 also had posterior probability ≥0.8.
Many have functional annotations that support the fine-mapping results (Tables S22 and S23). For example, we find eQTLs for the three BMI signals and enhancer marks for rs4807179 in relevant tissues, including adipose, brain, muscle, and/or thyroid. The lead SNPs of these credible sets had posterior probabilities >0.75 and the credible sets included <10 SNPs. Of the 28 identified height signals, we find 13 putatively causal SNPs that are the lead GWAS SNP, or are in high LD (r2 > 0.75) with it, have <10 SNPs in the credible set and have eQTLs in relevant tissues, including muscle, thyroid, adipose, lung, and osteoclasts. Some also have promoter or enhancer marks in some of the same tissues. For the two WHRadjBMI signals, both have three SNPs in the most probable causal configurations. One of these causal SNPs for each region is either the lead GWAS SNP (rs7975017) or an SNP in high LD (rs17099388 and rs6895040 LD: AFR r2 = 1.0; AMR r2 = 1.0; EUR r2 = 1.0), has a posterior probability ≥0.95, and is the only SNP in the credible set. Furthermore, for rs7975017, we find eQTLs in thyroid for multiple genes (BHLHE41, SSPN, and AC022509.3 from GTEx) and enhancer marks in multiple tissues including those related to the WHRadjBMI trait, e.g., thyroid, muscle, fat, bone, and adrenal gland. Overall, across many of the loci and secondary signals, FINEMAP revealed SNPs with somewhat strong prioritization (posterior probability ≥0.8) and, at some loci, putatively causal SNPs in small 95th percentile credible sets, thus demonstrating the utility of trans-ancestral approaches to fine-mapping GWAS loci.
Trans-ancestral findings to account for population structure in previous GWAS
Previous height GWAS utilizing only European ancestry samples are known to exhibit signatures of residual stratification, which manifest in effect size estimates of height-associated SNPs being correlated with geographical structure in Europe.85, 86, 87, 88 In theory, this bias should be lessened with addition of non-European samples in a trans-ancestry GWAS, since geographical structure across different continental ancestries are not expected to be correlated with each other. We demonstrate this hypothesis empirically using the HISLA data. The first two PCs in the PCA of European populations (Figure S55) reflect geographical or population structure in Europe, corresponding to the north-south and southeast-southwest axes of variation, respectively. We found that the bias in effect size estimates due to stratification is most obvious for height as this phenotype is known to differ across Europe.85,89,90 Effect sizes on height estimated from the GIANT and our trans-ancestral meta-analysis were both highly correlated with the loadings of the first PCA (rho = 0.125, p = 3.2 × 10−94 in GIANT; rho = 0.105, p = 3.4 × 10−70 in meta-analysis). The correlation was much lower in AAAGC and HISLA (rho = 0.012, p = 2.17 × 10−4 in AAAGC; rho = 0.007, p = 9.2 × 10−2 in HISLA; Figure 3A). Importantly, the magnitude of correlation was lessened in meta-analysis compared GIANT alone (p = 6.6 × 10−9), consistent with our hypothesis. Other traits were not a priori known to be as differentiated across Europe as height, and thus the degree of correlation between effect sizes and PC loadings are much lower in GIANT (e.g., rho = −0.025 for BMI; Figures 3B–3E).
Discussion
Hispanic/Latinos are a unique population with continental admixture from the Americas, Africa, and Europe,10, 11, 12, 13, 14 and yet are underrepresented in GWAS. Herein, we present results from a large-scale meta-analysis of anthropometric traits on an ancestrally diverse sample of Hispanic/Latino adults (Figures S1–S3). We have assembled a landmark consortium of Hispanics/Latinos to discover and map a total of 6 novel loci and 36 novel signals using both Hispanic/Latino population-specific and trans-ancestral discovery efforts (Figure 1). Numerous previously-reported anthropometric-SNP associations were suggestively (at nominal significance) or strongly (at Bonferroni significance) transferable to Hispanic/Latino adults. For example, between 16% and 31% of anthropometric variants transferred to Hispanic/Latino adults, depending on the given trait or sex-specific analyses conducted (Tables S11–S13). In total, 67 previously reported loci reached genome-wide significance in our Hispanic/Latino adult sample at the same index or another lead SNP, the majority of which were in high LD in 1000 Genomes EUR or AMR (Table S7). Moreover, we observed that four of seven of our HISLA findings were transferable to other ancestral populations at nominal significance.
We note that, even though these findings provide additional evidence for transferability of common loci for anthropometrics,91 still a number of previously reported anthropometric loci may not be transferable to this population in part due to variability in allele frequencies, effect sizes across ancestral populations, or our relatively smaller sample compared with European consortia.55 Thus, absence of generalization does not equate to a lack of relevance to Hispanic/Latino adults or children, especially given that Hispanic/Latinos are under-studied population in genetic research and larger/comparable sample sizes are currently unavailable.
Our conditional and fine-mapping analyses revealed 36 signals in established anthropometric loci, which independently replicated in HISLA stage 2 or the UKBB British subsample. In addition, our lead SNPs for the BMI signals discovered at ADCY5 (from the HISLA meta-analysis) and ADAMTS9-AS2 (from the trans-ancestral meta-analysis) are both nominally associated with childhood obesity status aged between 2 and 18 years. Three of our trans-ancestral signals in established height loci also displayed association with height-for-age Z scores in children/adolescents aged between 5 and 18 years. These observations support the premise that diverse and trans-ancestral studies represent a valuable tool for leveraging ancestral differences and similarities both within and across populations to identify multiple signals in established association regions, identify putative variants that may account for some of the missing heritability of complex diseases, or reveal promising genes and SNPs for functional follow-up.
In light of the notable ancestral, geographical or environmental diversity of the samples analyzed in our meta-analyses, we observed evidence of allele frequency differences for many of our Hispanic/Latino (Figure 4) and trans-ancestry findings (Figure S53). Similar to reports from other diverse genome-wide analyses,55 this allele frequency heterogeneity may explain heterogeneity in effects seen across consortia in our trans-ancestral HISLA, AAAGC, and GIANT meta-analysis (e.g., IGF2BP2 I2 = 78.7; MYO6 with I2 = 84.4, Tables 2 and S16). Our use of fixed-effect meta-analyses may have failed to identify loci with effect heterogeneity unrelated to allele frequency or LD differences across populations; future studies should address this limitation by considering trans-ancestral random-effects meta-analysis, local ancestry and haplotype analyses as these studies explore sources of heterogeneity in large, diverse datasets. These observations reinforce how studies of one predominant ancestry group, such as Europeans, may fail to identify additional loci or, more likely, new signals in known loci that have allele frequency differences across ancestral populations.
Residual uncorrected stratification in GWAS could result in biased estimates of effect sizes.34 For example, effect sizes on height from GIANT were reported to be significantly correlated with north-south axis of variation in Europe, suggesting residual uncorrected stratification,85, 86, 87 which we also observe here. Note that the residual stratification is subtle, and while the effect sizes may be biased, this does not imply that the identified associations are spurious. For example, compared with effect sizes on height from UKBB, which is based on a single homogeneous population and results in better control of population stratification, the genetic correlation between GIANT and UKBB was 0.94.85
Of the three traits studied here, height is the most stratified in Europe. The correlation coefficient between effect sizes on height and PC loadings reached 0.125 in the GIANT only for PC1, while it was much smaller for other traits (e.g., the maximum |rho| = 0.042 in GIANT on WHR using only males on PC1). The decrease in bias in the trans-ancestral meta-analysis was also obvious in height. The correlation with PC1 was non-significant in HISLA (rho = 0.007) and statistically significant but weak in AAAGC (rho = 0.012), consistent with a decreased impact of European population stratification on the estimate of effect size in AAAGC and HISLA. This decreased correlation could be due to large non-European ancestries in these populations (African and Native American, respectively) that make these populations affected by population stratification in Europe; it could also be that, by using European ancestry-based loadings, we are less likely to detect non-European based population stratification patterns or that smaller sample sizes in these cohorts result in greater noise in effect size estimates. Regardless of the reason, compared with GIANT alone, trans-ancestral meta-analysis of the three cohorts showed less impact of uncorrected stratification on effect size estimates, even though the sample sizes in AAAGC and HISLA are comparably small. For other traits, the conclusions are qualitatively similar: that trans-ancestral meta-analysis lessened the bias due to stratification, even though the bias in GIANT was not as strong in the first place.
Gene expression and bioinformatic analyses of our population-specific (Tables S18–S20) and trans-ancestral findings in newly discovered loci (Tables S22 and S23) revealed important insights into the underlying biology of obesity, bone development, and growth. For example, the previously reported BMI locus ILRUN has also been associated with adult height92,93 and height change during puberty.94 The previously described BMI signal was lead at rs205262, an eQTL for another gene within the region (SNRPC) in European ancestry samples.33 A second signal (rs75398113) has also been reported at SNRPC for extremes of the BMI distribution.95 Yet, our signal led by rs148899910 is more than 300 kb away and in low LD with these two index SNPs (r2 = 0.01–0.05 in 1000 Genomes AMR). More recently, rs148899910 has been associated with height in Korean women.96 Furthermore, variants in high LD with rs148899910 in 1000 Genomes AMR are associated with type 2 diabetes in individuals of East Asian ancestry97 (rs4711389 has r2 = 0.9 in 1000 Genomes AMR with rs148899910), and with BMI-adjusted waist circumference in individuals of European ancestry (rs202228093 and rs2780226 each have an r2 > 0.7 with rs148899910 in 1000 Genomes AMR).44,98 Using whole blood gene expression data from 606 participants of the Cameron County Hispanic Cohort, we find evidence that our BMI signal at rs148899910 is an eQTL for increased gene expression of C6orf1 (p = 3 × 10−7) and not any other genes in the region (Table S18). Taken together, this signal shows associations across a wide array of anthropometric phenotypes.
In general, the lead SNPs from our HISLA-only meta-analyses appear relatively benign (not pathogenic) based on CADD and FATHMM-XF scores (Table S20). Yet, all SNPs potentially change motifs. Both rs17361324 (ADCY5) and rs215226 (B4GALNT3) have enhancer and promoter histone marks and eQTLs in the respective genes in relevant tissues. For BMI, there is an eQTL for rs17361324-ADCY5 in thyroid, and ADCY5 has been previously associated with type 2 diabetes,99 BMI,100 central obesity traits,39 height,47 birth outcomes,101, 102, 103 and a number of other phenotypes. In addition, rs17361324 is proximate to an ADCY5 intronic variant (rs1093467, r2 = 0.3 in 1000 Genomes AMR) that is highly conserved across species (Haploreg v.4.1). For height, there is an eQTL for rs215226-B4GALNT3 in aortic (coronary) and tibial nerves. The lead SNP for the height signal in B4GALNT3, rs215226, has enhancer histone marks in bone and muscle, and promoter marks in muscle tissue. In addition, the variant rs215226 (B4GALNT3) has a posterior probability of 1 in FINEMAP analyses (see Table S9). Other interesting information about these regions is provided in Table S19.
The lead SNPs in our trans-ancestral loci were mainly located in intronic and intergenic regions (Table S22) and were benign. One exception was the locus C11orf63 associated with height led by rs11605693, which showed pathogenic scores for CADD and FATHMM-XF (CADD score = 17.1 and FATHMM-XF score = 0.87). This lead SNP has an eQTL in C11orf63 for adipose, tibial nerve, and testis. C11orf63, junctional cadherin complex regulator, is responsible for ependymal cells that line the brain and spinal cord.
Among the trans-ancestral findings, a BMI signal in the established locus RNH1 was led by rs10540 (posterior probability of 0.82), and is an eQTL for a wide range of tissues and genes (see Tables S21 and S23). Another signal in a known locus for height, led by rs12918773, has a posterior probability of 0.98 and is one of four casual variants suggested from fine-mapping in the locus (Table S21), has an eQTL (in lung, thyroid, tibial nerve and artery, breast, testis) with CDK10, a gene also associated with growth retardation.104 In addition, rs1342330 led the newly discovered signal in a known height locus, and has a low regulomeDB score at 2b, and several enhancer and promoter histone marks in relevant tissues (Tables S22). As an intronic variant, it is an eQTL in the pancreas with PHACTR2 (Table S23), a gene associated with body dysmorphic disorder.105 While many of our discovered loci/signals appeared to be benign based on CADD and FATHMM-XF scores, they still show enhancer and promoter histone marks in trait-relevant tissues, such as adipose tissue, bone, muscle, thymus, brain, and adrenal gland.
As described above, in this study we were able to (1) discover six additional loci with a notably smaller analytic size than other anthropometric consortia, such as GIANT. We also (2) discovered 36 signals in established loci in HISLA or our trans-ancestral meta-analysis, and (3) generated trans-ancestral effect estimates with better control for population structure. Taken together, these findings indicate the added value of building large, more diverse GWAS in the near future.
Large-scale analyses of diverse populations hold great potential for advancing the field of genetic epidemiology.55 This study illustrates how studying admixed populations, such as Hispanics/Latinos, and highlighting them in trans-ancestral epidemiologic investigations, can yield additional insights into the genetic architecture of anthropometric traits. Future discovery efforts in Hispanic/Latino populations and with other ancestrally diverse populations will help address the concerning research gap between who is studied and who is affected by conditions, such as obesity, to the benefit of both public health and precision medicine.
Acknowledgments
A.G.L. was supported by NIH (T32 HD091058, P2C HD050924, and P30 AG066615). A.G.T. was supported by NIH (T32HL007055). A.R.L. has been supported by the Leverhulme Trust (F/07 134/DF), the Excellence Initiative of Aix-Marseille University - A∗MIDEX (a French “Investissements d’Avenir” programme), the National Natural Science Foundation of China (#31771393), the Scientific and Technology Committee of Shanghai Municipality (18490750300), Ministry of Science and Technology of China (2020YFE0201600), Shanghai Municipal Science and Technology Major Project (2017SHZDZX01), the 111 Project (B13016), and BBSRC (BB/I021213/1). L.F. was supported by NIH (R01CA204797). L.F.-R. was supported by an American Heart Association predoctoral grant (13PRE16100015). M.G., K.L.Y., and K.E.N. were supported by AHA (13GRNT16490017, 15GRNT25880008), R01DK089256, and R01DK101855. L.F.-R., C.A.H., and R.F.J.L. were supported by NIH (R01DK101855). X.R.G. was supported by NIH (R01EY022651). K.E.N. was supported by NIH (R01HD057194, R01DK122503, R01HG010297, R01HL142302, R01HL143885, and R01HG009974). L.G. was supported by NIH (T32 HL129982). Q.Q. was supported by NIH (R01HL060712, R01HL140976, and R01DK119268). S.F.A.G. was supported by the Daniel B. Burke Endowed Chair for Diabetes Research and NIH (R01 HD056465). X.G., M.A.A., Y.-D.I.C., J.Y., and J.I.R. were supported by NIH (EY14684, HL-0767711, HL-0697974, HL-088457, NEI EY11753, UL1-TR-001881). X.L. and K.R. were supported by NIH (R01 HL0767711 and DK-079888). Z.A., T.A.B., J.T., and A.H.X. were supported by HTN-IR funding (HL-0697974) and P.M.G., Y.H., E.I., and K.D.T. were supported by NIH (EY-14684). M.O.G., W.H., K.L., and K.S. were supported by NIH (HL-088457). A.E.J. was supported by NIH (K99/R00 HL130580). E.M.J. was supported by NIH (R01 CA063446, R01 CA077305, DOD RP9590546, and CBCRP 7PB-0068). J.M. was supported by the American Diabetes Association 1-19-ICTS-068 and by U01HG011723. C.W.K.C. was supported by R35GM142783. Study-specific acknowledgments are available as supplemental data.
Declaration of interests
S.M.G. and A.M.S. receive funding from Seven Bridges Genomics to develop tools for the NHLBI BioData Catalyst consortium. All others authors declare no competing interests.
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.xhgg.2022.100099.
Contributor Information
Lindsay Fernández-Rhodes, Email: fernandez-rhodes@psu.edu.
Mariaelisa Graff, Email: migraff@email.unc.edu.
Web resources
LocusZoom, https://locuszoom.org.
GWAS catalog, https://www.ebi.ac.uk/gwas/.
Eigensoft version 7.2.1, https://github.com/DReichLab/EIG/archive/v7.2.1.tar.gz.
NHGRI-EBI catalog, https://www.ebi.ac.uk/gwas/.
Supplemental information
Data and code availability
The HISLA meta-analysis results (GWAS catalog access IDs: height, GCST90095033; BMI, GCST90095034; WHRadjBMI [sex combined], GCST90095035; WHRadjBMI [men], GCST90095036; WHRadjBMI [women], GCST90095037), and the trans-ancestral HISLA, AAAGC, and GIANT meta-analysis results (GWAS catalog access ID: height, GCST90095038; BMI, GCST90095039; WHRadjBMI [sex combined], GCST90095040; WHRadjBMI [men], GCST90095041; WHRadjBMI [women], GCST90095042) are available through the NHGRI-EBI catalog.
References
- 1.World Health Organization Obesity and overweight. 2020. https://www.who.int/en/news-room/fact-sheets/detail/obesity-and-overweight
- 2.Organización de las Naciones Unidas para la Alimentación y la Agricultura América Latina y el Caribe: panorama de la seguridad alimentaria y nutricional. Sistemas alimentarios sostenibles para poner fin al hambre y la malnutrición, 2016. 2017. http://iris.paho.org/xmlui/handle/123456789/33680
- 3.Mueller M., Purnell T.S., Mensah G.A., Cooper L.A. Reducing racial and ethnic disparities in hypertension prevention and control: what will it take to translate research into practice and policy? Am. J. Hypertens. 2015;28:699–716. doi: 10.1093/ajh/hpu233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lanas F., Bazzano L., Rubinstein A., Calandrelli M., Chen C.S., Elorriaga N., Gutierrez L., Manfredi J.A., Seron P., Mores N., et al. Prevalence, distributions and determinants of obesity and central obesity in the southern cone of America. PLoS One. 2016;11:e0163727. doi: 10.1371/journal.pone.0163727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gravel S., Zakharia F., Moreno-Estrada A., Byrnes J.K., Muzzio M., Rodriguez-Flores J.L., Kenny E.E., Gignoux C.R., Maples B.K., Guiblet W., et al. Reconstructing native American migrations from whole-genome and whole-exome data. PLoS Genet. 2013;9:e1004023. doi: 10.1371/journal.pgen.1004023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Moreno-Estrada A., Gravel S., Zakharia F., McCauley J.L., Byrnes J.K., Gignoux C.R., Ortiz-Tello P.A., Martinez R.J., Hedges D.J., Morris R.W., et al. Reconstructing the population genetic history of the Caribbean. PLoS Genet. 2013;9:e1003925. doi: 10.1371/journal.pgen.1003925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gonzalez Burchard E., Borrell L.N., Choudhry S., Naqvi M., Tsai H.J., Rodriguez-Santana J.R., Chapela R., Rogers S.D., Mei R., Rodriguez-Cintron W., et al. Latino populations: a unique opportunity for the study of race, genetics, and social environment in epidemiological research. Am. J. Public Health. 2005;95:2161–2168. doi: 10.2105/AJPH.2005.068668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Moreno-Estrada A., Gignoux C.R., Fernandez-Lopez J.C., Zakharia F., Sikora M., Contreras A.V., Acuna-Alonzo V., Sandoval K., Eng C., Romero-Hidalgo S., et al. Human genetics. The genetics of Mexico recapitulates native American substructure and affects biomedical traits. Science. 2014;344:1280–1285. doi: 10.1126/science.1251688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Conomos M.P., Laurie C.A., Stilp A.M., Gogarten S.M., McHugh C.P., Nelson S.C., Sofer T., Fernandez-Rhodes L., Justice A.E., Graff M., et al. Genetic diversity and association studies in US Hispanic/Latino populations: applications in the Hispanic community health study/study of Latinos. Am. J. Hum. Genet. 2016;98:165–184. doi: 10.1016/j.ajhg.2015.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bryc K., Velez C., Karafet T., Moreno-Estrada A., Reynolds A., Auton A., Hammer M., Bustamante C.D., Ostrer H. Colloquium paper: genome-wide patterns of population structure and admixture among Hispanic/Latino populations. Proc. Natl. Acad. Sci. U S A. 2010;107:8954–8961. doi: 10.1073/pnas.0914618107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Halder I., Yang B.Z., Kranzler H.R., Stein M.B., Shriver M.D., Gelernter J. Measurement of admixture proportions and description of admixture structure in different U.S. populations. Hum. Mutat. 2009;30:1299–1309. doi: 10.1002/humu.21045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Shtir C.J., Marjoram P., Azen S., Conti D.V., Le Marchand L., Haiman C.A., Varma R. Variation in genetic admixture and population structure among Latinos: the Los Angeles Latino eye study (LALES) BMC Genet. 2009;10:71. doi: 10.1186/1471-2156-10-71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wang S., Ray N., Rojas W., Parra M.V., Bedoya G., Gallo C., Poletti G., Mazzotti G., Hill K., Hurtado A.M., et al. Geographic patterns of genome admixture in Latin American Mestizos. PLoS Genet. 2008;4:e1000037. doi: 10.1371/journal.pgen.1000037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Manichaikul A., Palmas W., Rodriguez C.J., Peralta C.A., Divers J., Guo X., Chen W.M., Wong Q., Williams K., Kerr K.F., et al. Population structure of Hispanics in the United States: the multi-ethnic study of atherosclerosis. PLoS Genet. 2012;8:e1002640. doi: 10.1371/journal.pgen.1002640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Chiang C.W., Gajdos Z.K., Korn J.M., Kuruvilla F.G., Butler J.L., Hackett R., Guiducci C., Nguyen T.T., Wilks R., Forrester T., et al. Rapid assessment of genetic ancestry in populations of unknown origin by genome-wide genotyping of pooled samples. PLoS Genet. 2010;6:e1000866. doi: 10.1371/journal.pgen.1000866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Spear M.L., Diaz-Papkovich A., Ziv E., Yracheta J.M., Gravel S., Torgerson D.G., Hernandez R.D. Recent shifts in the genomic ancestry of Mexican Americans may alter the genetic architecture of biomedical traits. Elife. 2020;9:e56029. doi: 10.7554/eLife.56029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ruiz-Linares A., Adhikari K., Acuna-Alonzo V., Quinto-Sanchez M., Jaramillo C., Arias W., Fuentes M., Pizarro M., Everardo P., de Avila F., et al. Admixture in Latin America: geographic structure, phenotypic diversity and self-perception of ancestry based on 7,342 individuals. PLoS Genet. 2014;10:e1004572. doi: 10.1371/journal.pgen.1004572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Asgari S., Luo Y., Akbari A., Belbin G.M., Li X., Harris D.N., Selig M., Bartell E., Calderon R., Slowikowski K., et al. A positively selected FBN1 missense variant reduces height in Peruvian individuals. Nature. 2020;582:234–239. doi: 10.1038/s41586-020-2302-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Pew Research Center Modern immigration wave brings 59 million to U.S., driving population growth and change through 2065. 2015. http://www.pewhispanic.org/2015/09/28/modern-immigration-wave-brings-59-million-to-u-s-driving-population-growth-and-change-through-2065/
- 20.Ogden C.L., Carroll M.D., Kit B.K., Flegal K.M. Prevalence of obesity and trends in body mass index among US children and adolescents, 1999-2010. JAMA. 2012;307:483–490. doi: 10.1001/jama.2012.40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Flegal K.M., Carroll M.D., Kit B.K., Ogden C.L. Prevalence of obesity and trends in the distribution of body mass index among US adults, 1999-2010. JAMA. 2012;307:491–497. doi: 10.1001/jama.2012.39. [DOI] [PubMed] [Google Scholar]
- 22.Ogden C.L., Carroll M.D., Kit B.K., Flegal K.M. Prevalence of childhood and adult obesity in the United States, 2011-2012. JAMA. 2014;311:806–814. doi: 10.1001/jama.2014.732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Isasi C.R., Ayala G.X., Sotres-Alvarez D., Madanat H., Penedo F., Loria C.M., Elder J.P., Daviglus M.L., Barnhart J., Siega-Riz A.M., et al. Is acculturation related to obesity in Hispanic/Latino adults? Results from the Hispanic community health study/study of Latinos. J. Obes. 2015;2015:186276. doi: 10.1155/2015/186276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.NCD Risk Factor Collaboration Height and body-mass index trajectories of school-aged children and adolescents from 1985 to 2019 in 200 countries and territories: a pooled analysis of 2181 population-based studies with 65 million participants. Lancet. 2020;396:1511–1524. doi: 10.1016/S0140-6736(20)31859-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Knowler W.C., Coresh J., Elston R.C., Freedman B.I., Iyengar S.K., Kimmel P.L., Olson J.M., Plaetke R., Sedor J.R., Seldin M.F., et al. The family investigation of nephropathy and diabetes (FIND): design and methods. J. Diabetes Complications. 2005;19:1–9. doi: 10.1016/j.jdiacomp.2003.12.007. [DOI] [PubMed] [Google Scholar]
- 26.Centers for Disease Control and Prevention . Centers for Disease Control and Prevention; 2016. CDC Growth Charts. [Google Scholar]
- 27.Bradfield J.P., Vogelezang S., Felix J.F., Chesi A., Helgeland O., Horikoshi M., Karhunen V., Lowry E., Cousminer D.L., Ahluwalia T.S., et al. A trans-ancestral meta-analysis of genome-wide association studies reveals loci associated with childhood obesity. Hum. Mol. Genet. 2019;28:3327–3338. doi: 10.1093/hmg/ddz161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.World Health Organization Growth reference data for 5-19 years. https://www.cdc.gov/growthcharts/cdc_charts.htm
- 29.Butte N.F., Cai G., Cole S.A., Comuzzie A.G. Viva la Familia study: genetic and environmental contributions to childhood obesity and its comorbidities in the Hispanic population. Am. J. Clin. Nutr. 2006;84:646–654. doi: 10.1093/ajcn/84.3.646. quiz 673-644. [DOI] [PubMed] [Google Scholar]
- 30.Lin D.Y., Tao R., Kalsbeek W.D., Zeng D.L., Gonzalez F., Fernandez-Rhodes L., Graff M., Koch G.G., North K.E., Heiss G. Genetic association analysis under complex survey sampling: the Hispanic community health study/study of Latinos. Am. J. Hum. Genet. 2014;95:675–688. doi: 10.1016/j.ajhg.2014.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chen M.H., Yang Q. GWAF: an R package for genome-wide association analyses with family data. Bioinformatics. 2010;26:580–581. doi: 10.1093/bioinformatics/btp710. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Willer C.J., Li Y., Abecasis G.R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Locke A.E., Kahali B., Berndt S.I., Justice A.E., Pers T.H., Day F.R., Powell C., Vedantam S., Buchkovich M.L., Yang J., et al. Genetic studies of body mass index yield new insights for obesity biology. Nature. 2015;518:197–206. doi: 10.1038/nature14177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wood A.R., Esko T., Yang J., Vedantam S., Pers T.H., Gustafsson S., Chu A.Y., Estrada K., Luan J., Kutalik Z., et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 2014;46:1173–1186. doi: 10.1038/ng.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Shungin D., Winkler T.W., Croteau-Chonka D.C., Ferreira T., Locke A.E., Magi R., Strawbridge R.J., Pers T.H., Fischer K., Justice A.E., et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature. 2015;518:187–196. doi: 10.1038/nature14132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Ng M.C.Y., Graff M., Lu Y., Justice A.E., Mudgal P., Liu C.T., Young K., Yanek L.R., Feitosa M.F., Wojczynski M.K., et al. Discovery and fine-mapping of adiposity loci using high density imputation of genome-wide association studies in individuals of African ancestry: African Ancestry Anthropometry Genetics Consortium. PLoS Genet. 2017;13:e1006719. doi: 10.1371/journal.pgen.1006719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Graff M., Justice A.E., Young K.L., Marouli E., Zhang X., Fine R.S., Lim E., Buchanan V., Rand K., Feitosa M.F., et al. Discovery and fine-mapping of height loci via high-density imputation of GWASs in individuals of African ancestry. Am. J. Hum. Genet. 2021;108:564–582. doi: 10.1016/j.ajhg.2021.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yengo L., Sidorenko J., Kemper K.E., Zheng Z., Wood A.R., Weedon M.N., Frayling T.M., Hirschhorn J., Yang J., Visscher P.M., et al. Meta-analysis of genome-wide association studies for height and body mass index in approximately 700000 individuals of European ancestry. Hum. Mol. Genet. 2018;27:3641–3649. doi: 10.1093/hmg/ddy271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Pulit S.L., Stoneman C., Morris A.P., Wood A.R., Glastonbury C.A., Tyrrell J., Yengo L., Ferreira T., Marouli E., Ji Y., et al. Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry. Hum. Mol. Genet. 2019;28:166–174. doi: 10.1093/hmg/ddy327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Akiyama M., Okada Y., Kanai M., Takahashi A., Momozawa Y., Ikeda M., Iwata N., Ikegawa S., Hirata M., Matsuda K., et al. Genome-wide association study identifies 112 new loci for body mass index in the Japanese population. Nat. Genet. 2017;49:1458–1467. doi: 10.1038/ng.3951. [DOI] [PubMed] [Google Scholar]
- 41.Berndt S.I., Gustafsson S., Magi R., Ganna A., Wheeler E., Feitosa M.F., Justice A.E., Monda K.L., Croteau-Chonka D.C., Day F.R., et al. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture. Nat. Genet. 2013;45:501–512. doi: 10.1038/ng.2606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gong J., Nishimura K.K., Fernandez-Rhodes L., Haessler J., Bien S., Graff M., Lim U., Lu Y., Gross M., Fornage M., et al. Trans-ethnic analysis of metabochip data identifies two new loci associated with BMI. Int. J. Obes. (Lond) 2018;42:384–390. doi: 10.1038/ijo.2017.304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Gong J., Schumacher F., Lim U., Hindorff L.A., Haessler J., Buyske S., Carlson C.S., Rosse S., Buzkova P., Fornage M., et al. Fine mapping and identification of BMI loci in African Americans. Am. J. Hum. Genet. 2013;93:661–671. doi: 10.1016/j.ajhg.2013.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Graff M., Scott R.A., Justice A.E., Young K.L., Feitosa M.F., Barata L., Winkler T.W., Chu A.Y., Mahajan A., Hadley D., et al. Genome-wide physical activity interactions in adiposity - a meta-analysis of 200,452 adults. PLoS Genet. 2017;13:e1006528. doi: 10.1371/journal.pgen.1006528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Hoffmann T.J., Choquet H., Yin J., Banda Y., Kvale M.N., Glymour M., Schaefer C., Risch N., Jorgenson E. A large multiethnic genome-wide association study of adult body mass index identifies novel loci. Genetics. 2018;210:499–515. doi: 10.1534/genetics.118.301479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Justice A.E., Winkler T.W., Feitosa M.F., Graff M., Fisher V.A., Young K., Barata L., Deng X., Czajkowski J., Hadley D., et al. Genome-wide meta-analysis of 241,258 adults accounting for smoking behaviour identifies novel loci for obesity traits. Nat. Commun. 2017;8:14977. doi: 10.1038/ncomms14977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Kichaev G., Bhatia G., Loh P.R., Gazal S., Burch K., Freund M.K., Schoech A., Pasaniuc B., Price A.L. Leveraging polygenic functional enrichment to improve GWAS power. Am. J. Hum. Genet. 2019;104:65–75. doi: 10.1016/j.ajhg.2018.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Monda K.L., Chen G.K., Taylor K.C., Palmer C., Edwards T.L., Lange L.A., Ng M.C., Adeyemo A.A., Allison M.A., Bielak L.F., et al. A meta-analysis identifies new loci associated with body mass index in individuals of African ancestry. Nat. Genet. 2013;45:690–696. doi: 10.1038/ng.2608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Okada Y., Kubo M., Ohmiya H., Takahashi A., Kumasaka N., Hosono N., Maeda S., Wen W., Dorajoo R., Go M.J., et al. Common variants at CDKAL1 and KLF9 are associated with body mass index in East Asian populations. Nat. Genet. 2012;44:302–306. doi: 10.1038/ng.1086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Tachmazidou I., Suveges D., Min J.L., Ritchie G.R.S., Steinberg J., Walter K., Iotchkova V., Schwartzentruber J., Huang J., Memari Y., et al. Whole-genome sequencing coupled to imputation discovers genetic signals for anthropometric traits. Am. J. Hum. Genet. 2017;100:865–884. doi: 10.1016/j.ajhg.2017.04.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Turcot V., Lu Y., Highland H.M., Schurmann C., Justice A.E., Fine R.S., Bradfield J.P., Esko T., Giri A., Graff M., et al. Protein-altering variants associated with body mass index implicate pathways that control energy intake and expenditure in obesity. Nat. Genet. 2018;50:26–41. doi: 10.1038/s41588-017-0011-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wen W., Cho Y.S., Zheng W., Dorajoo R., Kato N., Qi L., Chen C.H., Delahanty R.J., Okada Y., Tabara Y., et al. Meta-analysis identifies common variants associated with body mass index in East Asians. Nat. Genet. 2012;44:307–311. doi: 10.1038/ng.1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Wen W., Zheng W., Okada Y., Takeuchi F., Tabara Y., Hwang J.Y., Dorajoo R., Li H., Tsai F.J., Yang X., et al. Meta-analysis of genome-wide association studies in East Asian-ancestry populations identifies four new loci for body mass index. Hum. Mol. Genet. 2014;23:5492–5504. doi: 10.1093/hmg/ddu248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Winkler T.W., Justice A.E., Graff M., Barata L., Feitosa M.F., Chu S., Czajkowski J., Esko T., Fall T., Kilpelainen T.O., et al. The influence of age and sex on genetic associations with adult body size and shape: a large-scale genome-wide interaction study. PLoS Genet. 2015;11:e1005378. doi: 10.1371/journal.pgen.1005378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wojcik G.L., Graff M., Nishimura K.K., Tao R., Haessler J., Gignoux C.R., Highland H.M., Patel Y.M., Sorokin E.P., Avery C.L., et al. Genetic analyses of diverse populations improves discovery for complex traits. Nature. 2019;570:514–518. doi: 10.1038/s41586-019-1310-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Marouli E., Graff M., Medina-Gomez C., Lo K.S., Wood A.R., Kjaer T.R., Fine R.S., Lu Y., Schurmann C., Highland H.M., et al. Rare and low-frequency coding variants alter human adult height. Nature. 2017;542:186–190. doi: 10.1038/nature21039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cho Y.S., Go M.J., Kim Y.J., Heo J.Y., Oh J.H., Ban H.J., Yoon D., Lee M.H., Kim D.J., Park M., et al. A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat. Genet. 2009;41:527–534. doi: 10.1038/ng.357. [DOI] [PubMed] [Google Scholar]
- 58.Heid I.M., Jackson A.U., Randall J.C., Winkler T.W., Qi L., Steinthorsdottir V., Thorleifsson G., Zillikens M.C., Speliotes E.K., Magi R., et al. Meta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distribution. Nat. Genet. 2010;42:949–960. doi: 10.1038/ng.685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Justice A.E., Karaderi T., Highland H.M., Young K.L., Graff M., Lu Y., Turcot V., Auer P.L., Fine R.S., Guo X., et al. Protein-coding variants implicate novel genes related to lipid homeostasis contributing to body-fat distribution. Nat. Genet. 2019;51:452–469. doi: 10.1038/s41588-018-0334-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Scuteri A., Sanna S., Chen W.M., Uda M., Albai G., Strait J., Najjar S., Nagaraja R., Orru M., Usala G., et al. Genome-wide association scan shows genetic variants in the FTO gene are associated with obesity-related traits. PLoS Genet. 2007;3:e115. doi: 10.1371/journal.pgen.0030115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Wen W., Kato N., Hwang J.Y., Guo X., Tabara Y., Li H., Dorajoo R., Yang X., Tsai F.J., Li S., et al. Genome-wide association studies in East Asians identify new loci for waist-hip ratio and waist circumference. Sci. Rep. 2016;6:17958. doi: 10.1038/srep17958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Benner C., Spencer C.C., Havulinna A.S., Salomaa V., Ripatti S., Pirinen M. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics. 2016;32:1493–1501. doi: 10.1093/bioinformatics/btw018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Fisher-Hoch S.P., Rentfro A.R., Salinas J.J., Perez A., Brown H.S., Reininger B.M., Restrepo B.I., Wilson J.G., Hossain M.M., Rahbar M.H., et al. Socioeconomic status and prevalence of obesity and diabetes in a Mexican American community, Cameron County, Texas, 2004-2007. Prev. Chronic Dis. 2010;7:A53. [PMC free article] [PubMed] [Google Scholar]
- 64.Babraham Bioinformatics FastQC. A quality control tool for high throughput sequence data. 2010. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- 65.Dobin A., Davis C.A., Schlesinger F., Drenkow J., Zaleski C., Jha S., Batut P., Chaisson M., Gingeras T.R. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Liao Y., Smyth G.K., Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- 67.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Staley J.R., Blackshaw J., Kamat M.A., Ellis S., Surendran P., Sun B.B., Paul D.S., Freitag D., Burgess S., Danesh J., et al. PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics. 2016;32:3207–3209. doi: 10.1093/bioinformatics/btw373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Boyle A.P., Hong E.L., Hariharan M., Cheng Y., Schaub M.A., Kasowski M., Karczewski K.J., Park J., Hitz B.C., Weng S., et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012;22:1790–1797. doi: 10.1101/gr.137323.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Ward L.D., Kellis M. HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease. Nucleic Acids Res. 2016;44:D877–D881. doi: 10.1093/nar/gkv1340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Kent W.J., Sugnet C.W., Furey T.S., Roskin K.M., Pringle T.H., Zahler A.M., Haussler D. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Lonsdale J., Thomas J., Salvatore M., Phillips R., Lo E., Shad S., Hasz R., Walters G., Garcia F., Young N., et al. The genotype-tissue expression (GTEx) project. Nat. Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Chang C.C., Chow C.C., Tellier L.C., Vattikuti S., Purcell S.M., Lee J.J. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Price A.L., Weale M.E., Patterson N., Myers S.R., Need A.C., Shianna K.V., Ge D., Rotter J.I., Torres E., Taylor K.D., et al. Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 2008;83:132–135. doi: 10.1016/j.ajhg.2008.06.005. author reply 135–139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Paternoster L., Zhurov A.I., Toma A.M., Kemp J.P., St Pourcain B., Timpson N.J., McMahon G., McArdle W., Ring S.M., Smith G.D., et al. Genome-wide association study of three-dimensional facial morphology identifies a variant in PAX3 associated with nasion position. Am. J. Hum. Genet. 2012;90:478–485. doi: 10.1016/j.ajhg.2011.12.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Adhikari K., Fuentes-Guajardo M., Quinto-Sanchez M., Mendoza-Revilla J., Camilo Chacon-Duque J., Acuna-Alonzo V., Jaramillo C., Arias W., Lozano R.B., Perez G.M., et al. A genome-wide association scan implicates DCHS2, RUNX2, GLI3, PAX1 and EDAR in human facial variation. Nat. Commun. 2016;7:11616. doi: 10.1038/ncomms11616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Xiong Z., Dankova G., Howe L.J., Lee M.K., Hysi P.G., de Jong M.A., Zhu G., Adhikari K., Li D., Li Y., et al. Novel genetic loci affecting facial shape variation in humans. Elife. 2019;8:e49898. doi: 10.7554/eLife.49898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Yap C.X., Sidorenko J., Wu Y., Kemper K.E., Yang J., Wray N.R., Robinson M.R., Visscher P.M. Dissection of genetic variation and evidence for pleiotropy in male pattern baldness. Nat. Commun. 2018;9:5407. doi: 10.1038/s41467-018-07862-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Hagenaars S.P., Hill W.D., Harris S.E., Ritchie S.J., Davies G., Liewald D.C., Gale C.R., Porteous D.J., Deary I.J., Marioni R.E. Genetic prediction of male pattern baldness. PLoS Genet. 2017;13:e1006594. doi: 10.1371/journal.pgen.1006594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Nelms B.L., Labosky P.A. Transcriptional Control of Neural Crest Development. Morgan & Claypool Life Sciences; 2010. Pax genes. [PubMed] [Google Scholar]
- 81.Shakhova O., Sommer L. StemBook. Harvard Stem Cell Institute; 2010. Neural crest-derived stem cells.https://www.ncbi.nlm.nih.gov/books/NBK44752/ [PubMed] [Google Scholar]
- 82.Lee G., Kim H., Elkabetz Y., Al Shamy G., Panagiotakos G., Barberi T., Tabar V., Studer L. Isolation and directed differentiation of neural crest stem cells derived from human embryonic stem cells. Nat. Biotechnol. 2007;25:1468–1475. doi: 10.1038/nbt1365. [DOI] [PubMed] [Google Scholar]
- 83.Billon N., Iannarelli P., Monteiro M.C., Glavieux-Pardanaud C., Richardson W.D., Kessaris N., Dani C., Dupin E. The generation of adipocytes by the neural crest. Development. 2007;134:2283–2292. doi: 10.1242/dev.002642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Hans C., Dobra A., West M. Shotgun stochastic search for “large p” regression. J. Am. Stat. Assoc. 2007;102:507–516. [Google Scholar]
- 85.Sohail M., Maier R.M., Ganna A., Bloemendal A., Martin A.R., Turchin M.C., Chiang C.W., Hirschhorn J., Daly M.J., Patterson N., et al. Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies. Elife. 2019;8:e39702. doi: 10.7554/eLife.39702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Berg J.J., Harpak A., Sinnott-Armstrong N., Joergensen A.M., Mostafavi H., Field Y., Boyle E.A., Zhang X., Racimo F., Pritchard J.K., et al. Reduced signal for polygenic adaptation of height in UK Biobank. Elife. 2019;8:e39725. doi: 10.7554/eLife.39725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Chen M., Sidore C., Akiyama M., Ishigaki K., Kamatani Y., Schlessinger D., Cucca F., Okada Y., Chiang C.W.K. Evidence of polygenic adaptation in Sardinia at height-associated loci ascertained from the Biobank Japan. Am. J. Hum. Genet. 2020;107:60–71. doi: 10.1016/j.ajhg.2020.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Chen M., Chiang C.W.K. Allele frequency differentiation at height-associated SNPs among continental human populations. Eur. J. Hum. Genet. 2021;29:1542–1548. doi: 10.1038/s41431-021-00938-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Campbell C.D., Ogburn E.L., Lunetta K.L., Lyon H.N., Freedman M.L., Groop L.C., Altshuler D., Ardlie K.G., Hirschhorn J.N. Demonstrating stratification in a European American population. Nat. Genet. 2005;37:868–872. doi: 10.1038/ng1607. [DOI] [PubMed] [Google Scholar]
- 90.Robinson M.R., Hemani G., Medina-Gomez C., Mezzavilla M., Esko T., Shakhbazov K., Powell J.E., Vinkhuyzen A., Berndt S.I., Gustafsson S., et al. Population genetic differentiation of height and body mass index across Europe. Nat. Genet. 2015;47:1357–1362. doi: 10.1038/ng.3401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Carlson C.S., Matise T.C., North K.E., Haiman C.A., Fesinmeyer M.D., Buyske S., Schumacher F.R., Peters U., Franceschini N., Ritchie M.D., et al. Generalization and dilution of association results from European GWAS in populations of non-European ancestry: the PAGE study. PLoS Biol. 2013;11:e1001661. doi: 10.1371/journal.pbio.1001661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Soranzo N., Rivadeneira F., Chinappen-Horsley U., Malkina I., Richards J.B., Hammond N., Stolk L., Nica A., Inouye M., Hofman A., et al. Meta-analysis of genome-wide scans for human adult stature identifies novel loci and associations with measures of skeletal frame size. PLoS Genet. 2009;5:e1000445. doi: 10.1371/journal.pgen.1000445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Weedon M.N., Lango H., Lindgren C.M., Wallace C., Evans D.M., Mangino M., Freathy R.M., Perry J.R., Stevens S., Hall A.S., et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat. Genet. 2008;40:575–583. doi: 10.1038/ng.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Cousminer D.L., Berry D.J., Timpson N.J., Ang W., Thiering E., Byrne E.M., Taal H.R., Huikari V., Bradfield J.P., Kerkhof M., et al. Genome-wide association and longitudinal analyses reveal genetic loci linking pubertal height growth, pubertal timing and childhood adiposity. Hum. Mol. Genet. 2013;22:2735–2747. doi: 10.1093/hmg/ddt104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Riveros-McKay F., Mistry V., Bounds R., Hendricks A., Keogh J.M., Thomas H., Henning E., Corbin L.J., Understanding Society Scientific Group. O'Rahilly S., et al. Genetic architecture of human thinness compared to severe obesity. PLoS Genet. 2019;15:e1007603. doi: 10.1371/journal.pgen.1007603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Cho H.W., Jin H.S., Eom Y.B. A genome-wide association study of novel genetic variants associated with anthropometric traits in Koreans. Front. Genet. 2021;12:669215. doi: 10.3389/fgene.2021.669215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Spracklen C.N., Horikoshi M., Kim Y.J., Lin K., Bragg F., Moon S., Suzuki K., Tam C.H.T., Tabara Y., Kwak S.H., et al. Identification of type 2 diabetes loci in 433,540 East Asian individuals. Nature. 2020;582:240–245. doi: 10.1038/s41586-020-2263-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Christakoudi S., Evangelou E., Riboli E., Tsilidis K.K. GWAS of allometric body-shape indices in UK Biobank identifies loci suggesting associations with morphogenesis, organogenesis, adrenal cell renewal and cancer. Sci. Rep. 2021;11:10688. doi: 10.1038/s41598-021-89176-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Vujkovic M., Keaton J.M., Lynch J.A., Miller D.R., Zhou J., Tcheandjieu C., Huffman J.E., Assimes T.L., Lorenz K., Zhu X., et al. Discovery of 318 new risk loci for type 2 diabetes and related vascular outcomes among 1.4 million participants in a multi-ancestry meta-analysis. Nat. Genet. 2020;52:680–691. doi: 10.1038/s41588-020-0637-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Zhu Z., Guo Y., Shi H., Liu C.L., Panganiban R.A., Chung W., O'Connor L.J., Himes B.E., Gazal S., Hasegawa K., et al. Shared genetic and experimental links between obesity-related traits and asthma subtypes in UK Biobank. J. Allergy Clin. Immunol. 2020;145:537–549. doi: 10.1016/j.jaci.2019.09.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Freathy R.M., Mook-Kanamori D.O., Sovio U., Prokopenko I., Timpson N.J., Berry D.J., Warrington N.M., Widen E., Hottenga J.J., Kaakinen M., et al. Variants in ADCY5 and near CCNL1 are associated with fetal growth and birth weight. Nat. Genet. 2010;42:430–435. doi: 10.1038/ng.567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Zhang G., Feenstra B., Bacelis J., Liu X., Muglia L.M., Juodakis J., Miller D.E., Litterman N., Jiang P.P., Russell L., et al. Genetic associations with gestational duration and spontaneous preterm birth. N. Engl. J. Med. 2017;377:1156–1167. doi: 10.1056/NEJMoa1612665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Yang X.L., Zhang S.Y., Zhang H., Wei X.T., Feng G.J., Pei Y.F., Zhang L. Three novel loci for infant head circumference identified by a joint association analysis. Front. Genet. 2019;10:947. doi: 10.3389/fgene.2019.00947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Windpassinger C., Piard J., Bonnard C., Alfadhel M., Lim S., Bisteau X., Blouin S., Ali N.B., Ng A.Y.J., Lu H., et al. CDK10 mutations in humans and mice cause severe growth retardation, spine malformations, and developmental delays. Am. J. Hum. Genet. 2017;101:391–403. doi: 10.1016/j.ajhg.2017.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Daelemans C., Ritchie M.E., Smits G., Abu-Amero S., Sudbery I.M., Forrest M.S., Campino S., Clark T.G., Stanier P., Kwiatkowski D., et al. High-throughput analysis of candidate imprinted genes and allele-specific gene expression in the human term placenta. BMC Genet. 2010;11:25. doi: 10.1186/1471-2156-11-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The HISLA meta-analysis results (GWAS catalog access IDs: height, GCST90095033; BMI, GCST90095034; WHRadjBMI [sex combined], GCST90095035; WHRadjBMI [men], GCST90095036; WHRadjBMI [women], GCST90095037), and the trans-ancestral HISLA, AAAGC, and GIANT meta-analysis results (GWAS catalog access ID: height, GCST90095038; BMI, GCST90095039; WHRadjBMI [sex combined], GCST90095040; WHRadjBMI [men], GCST90095041; WHRadjBMI [women], GCST90095042) are available through the NHGRI-EBI catalog.