Skip to main content
Human Molecular Genetics logoLink to Human Molecular Genetics
. 2014 Oct 3;24(4):1155–1168. doi: 10.1093/hmg/ddu510

A novel common variant in DCST2 is associated with length in early life and height in adulthood

Ralf JP van der Valk 1,2,3,, Eskil Kreiner-Møller 5,, Marjolein N Kooijman 1,2,3,, Mònica Guxens 6,7,8,, Evangelia Stergiakouli 9,, Annika Sääf 12, Jonathan P Bradfield 13, Frank Geller 15, M Geoffrey Hayes 16,17, Diana L Cousminer 18, Antje Körner 20, Elisabeth Thiering 21,22, John A Curtin 25, Ronny Myhre 26, Ville Huikari 28, Raimo Joro 31, Marjan Kerkhof 33,34, Nicole M Warrington 37,38, Niina Pitkänen 39, Ioanna Ntalla 41,42, Momoko Horikoshi 43,44, Riitta Veijola 45, Rachel M Freathy 47, Yik-Ying Teo 48,49,50, Sheila J Barton 51, David M Evans 9,38, John P Kemp 9,38, Beate St Pourcain 9,10,11, Susan M Ring 9,10, George Davey Smith 9, Anna Bergström 12, Inger Kull 53,54, Hakon Hakonarson 13,55,14, Frank D Mentch 13, Hans Bisgaard 5, Bo Chawes 5, Jakob Stokholm 5, Johannes Waage 5, Patrick Eriksen 5, Astrid Sevelsted 5, Mads Melbye 15,56; Early Genetics and Lifecourse Epidemiology (EAGLE) Consortium73,84, Cornelia M van Duijn 1, Carolina Medina-Gomez 1,3,4, Albert Hofman 1,3, Johan C de Jongste 2,3, H Rob Taal 1,2, André G Uitterlinden 1,3,4; Genetic Investigation of ANthropometric Traits (GIANT) Consortium73,84, Loren L Armstrong 16,17, Johan Eriksson 18, Aarno Palotie 18,57,59,58, Mariona Bustamante 6,7,8,61, Xavier Estivill 7,8,61,62, Juan R Gonzalez 6,7,8, Sabrina Llop 7,63, Wieland Kiess 20, Anubha Mahajan 43, Claudia Flexeder 22, Carla MT Tiesler 21,22, Clare S Murray 25, Angela Simpson 25, Per Magnus 27, Verena Sengpiel 64, Anna-Liisa Hartikainen 29, Sirkka Keinanen-Kiukaanniemi 28, Alexandra Lewin 65, Alexessander Da Silva Couto Alves 65, Alexandra I Blakemore 66, Jessica L Buxton 66, Marika Kaakinen 28,65,30, Alina Rodriguez 65,67, Sylvain Sebert 28, Marja Vaarasmaki 46, Timo Lakka 31,68,69, Virpi Lindi 31, Ulrike Gehring 70, Dirkje S Postma 34,35, Wei Ang 37, John P Newnham 37, Leo-Pekka Lyytikäinen 71,72, Katja Pahkala 39,38, Olli T Raitakari 39,74, Kalliope Panoutsopoulou 76, Eleftheria Zeggini 76, Dorret I Boomsma 77,78,79, Maria Groen-Blokhuis 77,78,79, Jorma Ilonen 40,32, Lude Franke 80, Joel N Hirschhorn 81,60,82, Tune H Pers 81,60,83, Liming Liang 89, Jinyan Huang 89,85, Berthold Hocher 86,87,88, Mikael Knip 19,89,90, Seang-Mei Saw 48,91,92, John W Holloway 52, Erik Melén 12,54, Struan FA Grant 13,55,14, Bjarke Feenstra 15, William L Lowe 16,17, Elisabeth Widén 18, Elena Sergeyev 20, Harald Grallert 23,24, Adnan Custovic 25, Bo Jacobsson 26,64, Marjo-Riitta Jarvelin 28,65,30,93,94, Mustafa Atalay 31, Gerard H Koppelman 34,36, Craig E Pennell 37, Harri Niinikoski 39,75, George V Dedoussis 42, Mark I Mccarthy 43,44,95, Timothy M Frayling 47, Jordi Sunyer 6,7,8,62,, Nicholas J Timpson 9,, Fernando Rivadeneira 1,3,4,, Klaus Bønnelykke 5,, Vincent WV Jaddoe 1,2,3,‡,*, for the Early Growth Genetics (EGG) Consortium
PMCID: PMC4447786  PMID: 25281659

Abstract

Common genetic variants have been identified for adult height, but not much is known about the genetics of skeletal growth in early life. To identify common genetic variants that influence fetal skeletal growth, we meta-analyzed 22 genome-wide association studies (Stage 1; N = 28 459). We identified seven independent top single nucleotide polymorphisms (SNPs) (P < 1 × 10−6) for birth length, of which three were novel and four were in or near loci known to be associated with adult height (LCORL, PTCH1, GPR126 and HMGA2). The three novel SNPs were followed-up in nine replication studies (Stage 2; N = 11 995), with rs905938 in DC-STAMP domain containing 2 (DCST2) genome-wide significantly associated with birth length in a joint analysis (Stages 1 + 2; β = 0.046, SE = 0.008, P = 2.46 × 10−8, explained variance = 0.05%). Rs905938 was also associated with infant length (N = 28 228; P = 5.54 × 10−4) and adult height (N = 127 513; P = 1.45 × 10−5). DCST2 is a DC-STAMP-like protein family member and DC-STAMP is an osteoclast cell-fusion regulator. Polygenic scores based on 180 SNPs previously associated with human adult stature explained 0.13% of variance in birth length. The same SNPs explained 2.95% of the variance of infant length. Of the 180 known adult height loci, 11 were genome-wide significantly associated with infant length (SF3B4, LCORL, SPAG17, C6orf173, PTCH1, GDF5, ZNFX1, HHIP, ACAN, HLA locus and HMGA2). This study highlights that common variation in DCST2 influences variation in early growth and adult height.

INTRODUCTION

Fetal and infancy length growth are important measures of development in early life. Early length growth seems to be associated with height in adulthood (1). It has been shown that fetal and infant growth are independently associated with higher risks of cardiovascular disease, type 2 diabetes and many other complex diseases. Previous findings suggested genetic links between fetal growth and metabolism (2,3). However, these studies mainly focused on birth weight as early growth measure. Skeletal growth is a different measure of development in early life. Skeletal growth during fetal life and infancy is a complex trait with heritability estimates of 26–72% (4). Although correlated with each other, fetal, infant and adult skeletal growth may be influenced by different genetic factors. Many common genetic variants have been identified for adult height (5), but not much is known about the genetics of skeletal growth in early life. Although, several rare genetic defects with large effects on length at birth and during infancy have been found (6,7), common genetic variants that influence normal variation in birth and infant length have not yet been identified. Therefore, we aimed to identify common genetic variants influencing early length growth, also in perspective of their effect on adult stature.

RESULTS

To identify common genetic variants associated with birth length, we examined 2 201 971 million directly genotyped and imputed SNPs with birth length in 22 independent discovery studies with genome-wide association (GWA) or Metabochip data (Stage 1; N = 28 459; Fig. 1). Birth length was measured using standardized procedures (Supplementary Material, Tables S1 and S2). Studies with self-reported measurements were excluded a priori. Birth length was standardized using growth analyzer (http://www.growthanalyser.org), transforming birth length into sex- and age-adjusted standard deviation scores (SDS). We used the North-European 1991 reference panel to compare results between studies. We applied linear regression between number of alleles or dosages obtained from imputations and standardized birth length (full details in Materials and Methods).

Figure 1.

Figure 1

Study design.

Gene identification

In the discovery phase (Stage 1), we found seven independent top SNPs with suggestive evidence of association (P < 1 × 10−6) with birth length (Supplementary Material, Figs. S1 and S2, QQ- and Manhattan plot). Four SNPs mapped to loci already known to be associated with adult height (Supplementary Material, Table S3, LCORL, PTCH1, GPR126 and HMGA2) (5). The 3 SNPs reflecting potentially novel associations were taken forward in nine independent replication studies (Stage 2; N = 11 995; Fig. 1). Only one of the three SNPs displayed significant evidence for replication in Stage 2 and reached genome-wide significance in the joint analysis (Stages 1 + 2; P < 5 × 10−8; Table 1). This novel association arose from SNP rs905938, mapping to chromosome 1q22 in DC-STAMP domain containing 2 (DCST2) (Fig. 2, regional association plot). Each C allele [minor allele frequency (MAF) = 0.24] of rs905938 was associated with an increase (standardized) of 0.046 SDS in birth length (standard error = 0.008, P = 2.46 × 10−8; explained variance = 0.05%). The genome-wide significantly associated SNP showed low degree of heterogeneity between the discovery studies (P = 0.93, I2 = 0%). Figure 3 shows the forest plot of the associations between rs905938[C] and birth length across all studies. Other suggestive loci in the discovery analysis are shown in Supplementary Material, Table S3 (P < 1 × 10−5). Summary statistics of all SNPs are available at http://egg-consortium.org.

Table 1.

Summary statistics of the three novel SNPs at P < 1 × 10−6 in the discovery analysis and the replication follow-up results

Marker MAF β SE P n I2 HetP
Discovery (Stage 1)
 rs905938[C] at 1q22 (DCST2) 0.24 0.050 0.010 2.59 × 10−7 28 327 0.0 0.930
 rs12545524[G] at 8q22.1 (near GDF6) 0.14 0.078 0.014 1.54 × 10−8 22 170 6.6 0.376
 rs11037473[A] at 11p11.2 (nearest genes TTC17-HSD17B12) 0.06 −0.109 0.021 2.17 × 10−7 22 259 0.0 0.735
Replication (Stage 2)
 rs905938[C] at 1q22 (DCST2) 0.23 0.035 0.015 1.99 × 10−2 11 908
 rs12545524[G] at 8q22.1 (near GDF6) 0.11 −0.012 0.017 4.67 × 10−1 17 614
 rs11037473[A] at 11p11.2 (nearest genes TTC17-HSD17B12) 0.08 −0.035 0.020 8.06 × 10−2 17 606
Discovery + replication (Stages 1 + 2)
 rs905938[C] at 1q22 (DCST2) 0.24 0.046 0.008 2.46 × 10−8 40 235
 rs12545524[G] at 8q22.1 (near GDF6) 0.13 0.042 0.011 9.08 × 10−5 39 784
 rs11037473[A] at 11p11.2 (nearest genes TTC17-HSD17B12) 0.07 −0.069 0.014 1.49 × 10−6 39 865

SNPs markers are identified according to their standard rs numbers (NCBI build 36). Independent novel SNPs with a strong suggestive effect in the discovery analysis on birth length are shown (P < 1 × 10−6). SNPs in loci that are known to be associated with adult height were excluded for replication efforts (adult height loci: LCORL, PTCH1, GPR126 and HMGA2). MAF, minor allele frequency; SE, standard error. β reflects differences in standardized birth length per minor allele. P values are obtained from linear regression of each SNP against standardized birth length adjusted for sex and gestational age. We included both GWA and metabochip cohorts in our discovery analysis, rs905938 is on the metabochip, and rs12545524 and rs11037473 are not, this explains the differences in numbers (n). Derived inconsistency statistic I2 and HetP values reflect heterogeneity across discovery studies with the use of Cochran's Q tests.

Figure 2.

Figure 2

Regional association plot of 1q22 in the 22 birth length discovery studies (N = 28 459). SNPs are plotted with their P values (as −log10 values; left y-axis) as a function of genomic position (x-axis). Estimated recombination rates (right y-axis) taken from HapMap are plotted to reflect the local LD-structure around the top associated SNP (‘white open diamond’) and the correlated proxies (‘circles’ according to a black-to-gray scale from r2 = 0 to 1). The joint analysis P value of discovery and replication studies is reported with the ‘white square’ (N = 40 235).

Figure 3.

Figure 3

Forest plot of the associations between rs905938[C] and birth length. *Replication studies. The ‘black diamond’ indicates the overall effect size and the confidence interval of the 31 studies.

Functional analyses

We assessed common variants with deleterious functional implications in linkage disequilibrium (LD, r2 > 0.80) with rs905938 using HaploReg (8). There were no non-synonymous variants in LD with rs905938. We found three putative functional intronic variants in high LD with rs905938. Details are depicted in Supplementary Material, Table S4. Subsequently, we assessed whether variants in the identified locus were involved in the regulation of messenger RNA expression (eQTLs) in genome-wide expression datasets of lymphoblastoid cell lines (LCLs, N = 1830) (9,10). We found cis eQTLs [false discovery rate (FDR) < 1% account for all SNP-probe pairs that were within 1 Mb of each other) for transcripts of PBXIP1, GBA and ADAM15. Yet, rs905938 and the cis eQTL SNPs were not in perfect LD (r2 < 0.80, Supplementary Material, Table S5). Therefore, we cannot exclude that multiple independent effects arise from the same region of association.

DCST2 and growth phenotypes

We tested the associations of rs905938[C] with ‘fetal growth’ measures in the 1st, 2nd and 3rd trimester of pregnancy in the Generation R Study (N = 5756) (11), infant length at 1 year of age (range 6–18 months; N = 28 228) in the Early Growth Genetics (EGG) consortium (12), and adult height in the Genetic Investigation of Anthropometric Traits (GIANT) consortium (N = 127 513) (5). Rs905938[C] was not associated with ‘fetal growth’ measures, but was associated with infant length and adult height (P < 0.05; Table 2).

Table 2.

Associations of rs905938[C] in DCST2 related to birth length with ‘fetal growth’ measures, infant length and adult height

β SE P
Generation R: fetal growth (N = 5756)
First trimester
 Crown-rump length (n = 1126) 0.003 0.045 0.952
Second trimester
 Femur length (n = 5361) −0.035 0.023 0.129
Third trimester
 Femur length (n = 5532) −0.015 0.022 0.490
EGG: infant length
 Infant length at 1 year of age (N = 28 228) 0.035 0.010 5.54 × 10−4
GIANT: adult height
 Adult height (N = 127 513) 0.024 0.006 1.45 × 10−5

rs905938 C-allele with a genome-wide significant effect on birth length is shown (P < 5 × 10−8) in relation to ‘fetal growth’ measures, infant length and adult height. SE, standard error. β reflects difference in standard deviation scores per minor allele.

Known adult height loci in relation to birth and infant length

We also explored whether common genetic variants known to be associated with adult height (5) influenced birth length variation. We found that 17 out of 180 known adult height loci were associated with birth length (FDR < 5%, Supplementary Material, Table S6; Fig. 4, QQ-plot of 180 SNPs and birth length). We then calculated a height-increasing-alleles score of the 180 known height loci (5) to predict birth length in the Generation R Study (N = 2085; Fig. 5). The score composed of variants associated with adult height explained 0.13% of the variance in birth length (P = 0.1), in contrast to the ∼10% of the phenotypic variation in adult height reported in the original manuscript (5).

Figure 4.

Figure 4

QQ-plots of the 180 known adult height SNPs with birth and infant length. QQ-plot of the 180 known adult height SNPs in association with birth length (upper panel) in 22 studies (N = 28 459) and with infant length (lower panel) in 19 studies (N = 28 238). The black dots represent observed P values and the diagonal lines represent the expected P values under the null distribution.

Figure 5.

Figure 5

Height-increasing-alleles score of known adult height SNPs predicting birth and infant length. Genetic risk-allele scores (sum of height-increasing alleles weighted by known effect on adult height (5) transformed to standard deviation Z-scores) in the Generation R study plotted against length adjusted for sex and age. The distribution of the genetic risk-allele score is depicted as bars. (A) Mean birth length plotted against the genetic score (N = 2085). (B) Mean infant length plotted against the genetic score (N = 2385).

To evaluate whether different common genetic variants influenced both birth and infant length, we tested 2 193 675 million SNPs for association with infant length in almost the same set of samples used for the analysis of birth length (19 studies, N = 28 238; Supplementary Material, Table S7). We identified genome-wide significant associations at 11 genetic loci (Supplementary Material, Figs S3 and S4, QQ- and Manhattan plot), which all are known to be associated with adult height (Table 3, SNPs in or near SF3B4, LCORL, SPAG17, C6orf173, PTCH1, GDF5, ZNFX1, HHIP, ACAN, HLA locus and HMGA2) (5,13). In addition, we found that variants in 58 of the adult height loci were associated with infant length at an FDR of 5% (Supplementary Material, Table S8; Fig. 4, QQ-plot of 180 SNPs and infant length). Next, we tested in the Generation R Study (N = 2385) how much of the phenotypic variance in infant length was explained by the score composed of height-increasing-alleles. Variants from the 180 known adult height loci together explained 2.95% of the variance in infant length (P = 3.10 × 10−17, Fig. 5).

Table 3.

Summary statistics of the eleven known adult height SNPs in association with infant length at P < 5 × 10−8

Marker MAF β SE P n I2 HetP
rs7536458[G] at 1p12 (SPAG17) 0.25 −0.064 0.010 9.61 × 10−11 28234 0.0 0.403
rs11205303[C] at 1q21.2 (SF3B4) 0.34 0.087 0.011 1.79 × 10−16 26559 0.0 0.864
rs1380294[T] at 4p15.31 (LCORL) 0.15 −0.108 0.014 2.54 × 10−14 23079 13.7 0.184
rs1812175[A] at 4q28-q32(HHIP) 0.18 −0.068 0.011 2.33 × 10−9 28227 0.0 0.398
rs592229[G] at (HLA locus) 0.43 0.048 0.009 2.22 × 10−8 28223 0.6 0.326
rs9385399[T] at 6q22.32 (C6orf173) 0.46 0.055 0.009 1.68 × 10−10 28224 0.0 0.943
rs1984119[C] at 9q22.3 (PTCH1) 0.26 −0.063 0.010 1.77 × 10−10 28197 0.0 0.490
rs7970350[T] at 12q15 (HMGA2) 0.49 −0.047 0.009 2.90 × 10−8 28226 0.0 0.426
rs2280470[A] at 15q26.1 (ACAN) 0.36 0.053 0.009 6.43 × 10−9 27443 0.0 0.436
rs143384[G] at 20q11.2 (GDF5) 0.44 0.058 0.009 2.87 × 10−10 28232 0.0 0.996
rs1567865[T] at 20q13.13 (ZNFX1) 0.21 0.063 0.010 1.10 × 10−9 28229 22.5 0.104

SNPs markers are identified according to their standard rs numbers (NCBI build 36). The total sample includes data of 19 independent datasets (N = 28 238). MAF, minor allele frequency; SE, standard error. β reflects differences in standardized infant length per minor allele. P values are obtained from linear regression of each SNP against standardized infant length adjusted for sex and age. We included both GWA and metabochip cohorts in our discovery analysis, this explains the differences in numbers (n). Derived inconsistency statistic I2 and HetP values reflect heterogeneity across discovery studies with the use of Cochran's Q tests.

DEPICT analysis of birth and infant length

Finally, we used a pathway analysis tool called DEPICT (Pers et al., unpublished data) to prioritize genes at associated regions, search for reconstituted gene sets that were enriched in genes near associated variants, and identify tissue and cell types in which genes from loci associated with birth and infant length were highly expressed (full details in Materials and Methods). For both traits, we used independent SNPs (r2 < 0.05) associated at P < 1 × 10−5, from 21 birth length and 44 infant length loci. There were no pathways significantly overrepresented in the birth length results. In contrast, for infant length DEPICT significantly prioritized nine genes which were overrepresented (FDR < 5%, Supplementary Material, Table S9), including three known Mendelian human stature genes (ACAN, GDF5 and PTCH1) as well as several relevant reconstituted gene sets (e.g. abnormal sternum ossification, regulation of osteoblast proliferation and WNT signaling, Supplementary Material, Table S10). There was no significant enrichment for particular tissue or cell types for any of the two traits.

DISCUSSION

In the present study we identified one previously unknown locus (rs905938 in DCST2 at 1q22) to be associated with birth length at a genome-wide significant level. This common genetic variant was also associated with infant length and adult height.

It was not possible to identify eQTLs for transcripts of DCST2 in the MRCA and MRCE databases, as there were no probes available (9). Also, there was no significant eQTL of DCST2 in immortalized LCLs (10). However, DCST2 is a DC-STAMP-like protein family member and DC-STAMP is an important regulator of osteoclast cell-fusion in bone homeostasis (1416). The transcripts of PBXIP1, GBA and ADAM15 were in weak LD with our lead SNP rs905938. The PBXIP1 protein is known to regulate estrogen receptor functions (17). Mutations in the GBA gene cause Gaucher disease, and strong associations with Parkinson's disease and dementia with Lewy bodies have been described (1821). ADAM15 is prominently expressed in osteoblasts and to a lesser extent in osteoclasts (22). A study in mice showed that ADAM15 is required for normal skeletal homeostasis and that its absence causes increased nuclear translocation of β-catenin in osteoblasts leading to increased osteoblast proliferation and function, which results in higher trabecular and cortical bone mass (23). The 1q22 locus is a complex region harboring multiple interesting genes that could affect birth length. We emphasize that we could not specifically pinpoint the causal gene(s) as our lead SNP (rs905938) was not in perfect LD with our cis eQTL SNPs.

Although, there is some overlap between adult height loci and birth length, which is illustrated by 17 shared loci, the genetic architecture of adult height seems more similar to the genetic architecture of infant length than birth length [58 shared loci for infant length, based on conservative statistical method (FDR)]. One point of consideration for the interpretation of our findings is the potential of measurement error for birth length (24). This may lead to less power to detect novel genetic variants as standard errors of SNPs could be increased. The estimate of the risk-allele score slope of Figure 5 is not influenced by measurement error and the differences in the slopes suggest that birth and infant length are influenced by distinct genetic variants. We found that the SNP effects for birth length of 137 of the 180 established height loci were in the same direction as reported in the GIANT paper (5) (Supplementary Material, Table S6; probability of success = 0.761, P = 6.25 × 10−13). One hundred sixty-two of the 180 loci were in the same direction for infant length (Supplementary Material, Table S8; probability of success = 0.900, P = 2.20 × 10−16).

Four SNPs associated with birth length (P < 1 × 10−5) are in or near loci known to be associated with birth weight (LCORL, HMGA2, ADCY5 and ADRB1). LCORL is associated with birth weight, birth length, infant length and adult height, but we could not find an obvious link between the gene and adult-onset diseases. HMGA2 is associated with aortic root size (25), type 2 diabetes (26), and many other traits like tooth development, head circumference and brain structure (12,27). ADCY5 is also associated with type 2 diabetes and ADRB1 with adult blood pressure (2,3). These findings highlight genetic links between fetal growth and metabolism (2,3,26). As we found overlap between genetic variants of birth weight and birth length, we looked-up the effect of rs905938 in DCST2 on birth weight in a previous EGG study (3). Rs905938 was associated with birth weight, but weaker as compared with birth length (β = 0.035 SDS, SE = 0.010, P = 2.35 × 10−4, N = 26 558).

In conclusion, in the present study we identified one novel locus (rs905938 in DCST2 at 1q22) associated with birth length at a genome-wide significant level. This common genetic variant was also associated with infant length and adult height, with decreasing magnitude of the associations in later life (0.046 SDS for birth length, 0.035 SDS for infant length and 0.024 SDS for adult height). To our knowledge, no phenotype has been previously associated with the DCST2 gene and while the gene is expressed in osteoclasts, its function should be further studied.

MATERIALS AND METHODS

Stage 1: discovery genome-wide association analyses of birth length

We combined 21 population-based studies with GWA or Metabochip data and birth length available (total N = 28 459 individuals). One of our discovery cohorts had two independent sub-samples within their study leading to a total of 22 independent GWA/Metabochip sub-samples for our analysis: one sub-sample from the Avon Longitudinal Study of Parents and Children (ALSPAC, GWA, n = 4816); Children, Allergy, Milieu, Stockholm, Epidemiology [Swedish] (BAMSE, GWA, n = 423); Children's Hospital Of Philadelphia (CHOP, GWA, n = 432); Copenhagen Study on Asthma in Childhood 2000 (COPSAC-2000, GWA, n = 348); Copenhagen Study on Asthma in Childhood Registry (COPSAC-Registry, GWA, n = 1111); Danish National Birth Cohort (DNBC, GWA, n = 932); Generation R Study (Generation R, GWA, n = 2085); Hyperglycemia and Adverse Pregnancy Outcomes study (HAPO, GWA, n = 1325); Helsinki Birth Cohort Study (HBCS, GWA, n = 1572); Infancia y Medio Ambiente (INMA, GWA, n = 848); Leipzig Childhood Obesity cohort (LEIPZIG, Metbochip, n = 607); Lifestyle Immune System Allergy study (LISA, GWA, n = 552); Manchester Asthma and Allergy Study (MAAS, GWA, n = 402); Norwegian Mother and Child Cohort study (MOBA, GWA, n = 832); Northern Finland Birth Cohorts 1966 (NFBC66, GWA, n = 4642); Northern Finland Birth Cohorts 1986 (NFBC86, Metabochip, n = 4652); Physical Activity and Nutrition in Children study (PANIC, Metabochip, n = 319); two sub-samples from the Prevention and Incidence of Asthma and Mite Allergy birth cohort study (PIAMA1, GWA, n = 283; PIAMA2, GWA, n = 195); The Western Australian Pregnancy Cohort Study (RAINE, GWA, n = 1272); Special Turku Coronary Risk Factor Intervention Project (STRIP, Metabochip, n = 614); and TEENs of Attica: Genes and Environment (TEENAGE, GWA, n = 197). While no systematic phenotypic differences were observed between the sub-samples of the PIAMA birth cohort study, they were analyzed separately due to genotyping on different platforms and at different time periods. Genotypes within each study were obtained using high-density SNP arrays and then imputed for ∼2.5 M HapMap SNPs (Phase II, release 22; http://hapmap.ncbi.nlm.nih.gov/). The basic characteristics, exclusions applied (for example, individuals of non-European ancestry, family related individuals), genotyping, quality control and imputation methods for each discovery study are presented in Supplementary Material, Table S1.

Statistical analysis within discovery studies

In all studies, birth length was measured using standardized procedures. Studies with self-reported measurements were excluded a priori. Birth length was standardized using growth analyzer (http://www.growthanalyser.org), transforming birth length into sex- and age-adjusted SDS. We used the North-European 1991 reference panel to compare results between studies. Multiple births and twins were excluded from all analyses. We applied linear regression between number of alleles or dosages obtained from imputations and standardized birth length. The GWA analysis per study was performed using MaCH2qtl (28), SNPTEST (29), PLINK (30) or PropABEL (31). The secured data exchange and storage were facilitated by the Erasmus Medical Center, Department of Internal Medicine (32).

Meta-analysis of discovery studies

Prior to meta-analysis, SNPs with a MAF <0.01 and poorly imputed SNPs [r2hat <0.3 (MaCH); proper_info <0.4 (IMPUTE2); R2_BEALE <0.4 (BEAGLE)] were filtered. Genomic control (GC) (33) was applied to adjust the statistics generated within each cohort (see Supplementary Material, Table S1 for individual study λ values). Four out of the twenty-two sub-samples were genotyped on Metabochips. These SNP-arrays were enriched with ‘adult height SNPs'. Normal variation in early length growth seems to be associated with height in adulthood (1). Therefore, we assumed more true-positive hits in these studies and did not apply GC in these studies (GIANT et al., unpublished data). Details of any additional corrections for study specific population structure are given in the Supplementary Material, Table S1. Inverse variance fixed-effects meta-analyses were analyzed using METAL (released 2010-08-01) (34) by two meta-analysts in parallel and blinded to obtain identical results. After the METAL meta-analysis, we filtered SNPs with a MAF <0.05 and SNPs that were not available in at least 12 sub-samples to avoid false-positive findings. We used Cochran's Q test and the derived inconsistency statistic I2 to assess evidence of between-study heterogeneity of the effect sizes. The meta-analysis results were obtained for a total of 2 201 971 SNPs. SNPs that crossed the threshold of P ≤ 1 × 10−6 were considered to represent strong suggestive evidence of association with birth length. SNPs that were already known to be associated with adult height were excluded for the replication analysis (5). The explained variance of the top SNPs were calculated in one of the largest cohorts, the Generation R Study (n = 2085).

Stage 2: replication analysis of top birth length SNPs

In the discovery phase, we found seven independent SNPs with strong suggestive evidence of association (P < 1 × 10−6) with birth length. Four SNPs were already known to be associated with adult height (5). These SNPs were excluded for follow-up analyses. The three remaining novel SNPs were followed-up in replication studies. We included both GWA and Metabochip studies in our discovery analysis. Rs905938 was on our Metabochips, and rs12545524 and rs11037473 were not. This results in differences in numbers for our top SNPs in the discovery and replication analyses. rs905938 was taken forward in 9 independent replication studies (N = 11 995), rs12545524 and rs11037473 in 13 independent replication studies including the four discovery Metabochip studies (N = 17 679). Details of the replication studies are presented in Supplementary Material, Table S2. Within the replication studies, we analyzed the association between number of alleles and standardized birth length. Combined effect estimates and heterogeneity between cohorts was calculated using fixed effects meta-analyses in R Version 2.8.1 (The R foundation for Statistical Computing, library rmeta). Top SNPs that crossed the significant threshold of P-replication ≤0.05 and the widely accepted genome-wide significance threshold of P ≤ 5 × 10−8 for all studies combined were considered to represent robust evidence of association with birth length. The institutional review boards for human studies approved the protocols and written consent was obtained from the participating subjects or their caregivers if required by the institutional review board.

DEPICT analysis

We used the novel Data-driven Expression-Prioritized Integration for Complex Traits (DEPICT) method (Pers et al., unpublished data). DEPICT is designed to systematically identify the most likely causal gene at a given locus, gene sets that are enriched in genetic associations, and tissues and cell types in which genes from associated loci are highly expressed. First, DEPICT assigns genes to associated SNPs using LD r2 > 0.5 distance to define locus boundaries, merges overlapping loci and discards loci mapping within the extended major histocompatibility complex region (chromosome 6, base pairs 25 000–35 000). Next, the DEPICT method prioritizes genes within a given associated locus based on the genes' functional similarity to genes from other associated loci. Genes that are highly similar to genes from other loci obtain low prioritization P values, and simulated GWAS results are used to adjust for gene length bias as well as other potential confounders. There can be several prioritized genes in a given locus. Next, DEPICT conducts gene set enrichment analysis by testing whether genes in associated loci enrich for reconstituted versions of known pathways, gene sets as well as protein complexes. Leveraging the guilt by association hypothesis that genes co-expressing with genes from a given gene set are likely to be part of that gene set (see Cvejic et al. (35), for details), the gene set reconstitution is accomplished by identifying genes that were co-expressed with genes in a given gene set based on a panel of 77 840 gene expression microarrays. Gene sets from the following repositories were reconstituted: 5984 protein complexes that were derived from 169 810 high-confidence experimentally derived protein–protein interactions (36); 2473 phenotypic gene sets derived from 211 882 gene–phenotype pairs from the Mouse Genetics Initiative (37); 737 Reactome database pathways (38); 184 KEGG database pathways (39); and 5083 Gene Ontology database terms (40). Finally, DEPICT conducts tissue and cell type enrichment analysis, by testing whether genes in associated loci are highly expressed in any of 209 Medical Subject Heading annotations of 37 427 microarrays from the Affymetrix U133 Plus 2.0 Array platform (see Wood et al. (41) and Geller et al. (42) for previous applications of DEPICT). In this work, 21 autosomal SNPs for birth length and 44 autosomal SNPs for infant length were used as input to DEPICT resulting in 21 and 41 non-overlapping loci, respectively, that covered a total of 34 genes and 83 genes, respectively. The gene prioritization, gene set enrichment and tissue/cell type enrichment analyses were run using the default settings in DEPICT.

SUPPLEMENTARY MATERIAL

Supplementary Material is available at HMG online.

FUNDING

R.M.F. is supported by a Sir Henry Wellcome Postdoctoral Fellowship (Wellcome Trust grant 085541/Z/08/Z). T.H.P. is supported by The Danish Council for Independent Research Medical Sciences (FSS) The Alfred Benzon Foundation. B.F. is supported by an Oak Foundation fellowship. M.M. is a Wellcome Trust Senior Investigator (Wellcome Trust grant 090532) and a NIHR Senior Investigator. T.M.F. is supported by the European Research Council grant: SZ-245 50371-GLUCOSEGENES-FP7-IDEAS-ERC. F.R. (VIDI 016.136.367) and V.W.V.J. (VIDI 016.136.361) received grants from the Netherlands Organization for Health Research and Development. The other authors did not receive funding for this manuscript.

Supplementary Material

Supplementary Data

Acknowledgements

Avon Longitudinal Study of Parents And Children (ALSPAC): We are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. GWAS data were generated by Sample Logistics and Genotyping Facilities at the Wellcome Trust Sanger Institute and LabCorp (Laboratory Corporation of America) supported by 23andMe. The Wellcome Trust and Swiss National Science Foundation funded the expression data. Ethical approval was obtained from the ALSPAC Law and Ethics Committee and the Local Ethics Committees. Please note that the study website contains details of all the data that is available through a fully searchable data dictionary (http://www.bris.ac.uk/alspac/researchers/data-access/data-dictionary/).

BAMSE: Supported by the Swedish Research Council, the Swedish Heart Lung Foundation, the Centre for Allergy Research (CfA), Stockholm County Council (ALF) and SFO Program in Epidemiology, KI.

Children's Hospital Of Philadelphia (CHOP): The authors thank the network of primary care clinicians and the patients and families for their contribution to this project and to clinical research facilitated by the Pediatric Research Consortium (PeRC) at The Children's Hospital of Philadelphia. R. Chiavacci, E. Dabaghyan, A. (Hope) Thomas, K. Harden, A. Hill, C. Johnson-Honesty, C. Drummond, S. Harrison, F. Salley, C. Gibbons, K. Lilliston, C. Kim, E. Frackelton, F. Mentch, G. Otieno, K. Thomas, C. Hou, K. Thomas and M.L. Garris provided expert assistance with genotyping and/or data collection and management. The authors would also like to thank S. Kristinsson, L.A. Hermannsson and A. Krisbjörnsson of Raförninn ehf for extensive software design and contributions. This research was financially supported by an Institute Development Award from the Children's Hospital of Philadelphia, a Research Development Award from the Cotswold Foundation and NIH grant R01 HD056465.

COPSAC-2000/Registry: We gratefully express our gratitude to the children and families of the COPSAC2000 cohort study for all their support and commitment. We acknowledge and appreciate the unique efforts of the COPSAC research team.

Danish National Birth Cohort (DNBC): Support for the Danish National Birth Cohort was obtained from the Danish National Research Foundation, the Danish Pharmacists' Fund, the Egmont Foundation, the March of Dimes Birth Defects Foundation, the Augustinus Foundation and the Health Fund of the Danish Health Insurance Societies. The generation of GWAS genotype data for the DNBC samples was carried out within the GENEVA consortium with funding provided through the NIH Genes, Environment and Health Initiative (GEI) (U01HG004423). Assistance with phenotype harmonization and genotype cleaning, as well as with general study coordination, was provided by the GENEVA Coordinating Center (U01HG004446). Genotyping was performed at Johns Hopkins University Center for Inherited Disease Research, with support from the NIH GEI (U01HG004438).

The Generation R Study (Generation R): The Generation R Study is conducted by the Erasmus Medical Center in close collaboration with the School of Law and Faculty of Social Sciences of the Erasmus University Rotterdam, the Municipal Health Service Rotterdam area, Rotterdam, the Rotterdam Homecare Foundation, Rotterdam and the Stichting Trombosedienst & Artsenlaboratorium Rijnmond (STAR-MDC), Rotterdam. We gratefully acknowledge the contribution of children and parents, general practitioners, hospitals, midwives and pharmacies in Rotterdam. The study protocol was approved by the Medical Ethical Committee of the Erasmus Medical Centre, Rotterdam. Written informed consent was obtained from all participants. The generation and management of GWAS genotype data for the Generation R Study were done at the Genetic Laboratory of the Department of Internal Medicine, Erasmus MC, the Netherlands. We would like to thank Karol Estrada, Dr Tobias A. Knoch, Anis Abuseiris, Luc V. de Zeeuw and Rob de Graaf, for their help in creating GRIMP, BigGRID, MediGRID and Services@MediGRID/D-Grid, (funded by the German Bundesministerium fuer Forschung und Technology; grants 01 AK 803 A-H, 01 IG 07015 G) for access to their grid computing resources. We thank Mila Jhamai, Manoushka Ganesh, Pascal Arp, Marijn Verkerk, Lizbeth Herrera and Marjolein Peters for their help in creating, managing and QC of the GWAS database. Also, we thank Karol Estrada for their support in creation and analysis of imputed data. The Generation R Study is made possible by financial support from the Erasmus Medical Center, Rotterdam, the Erasmus University Rotterdam and the Netherlands Organization for Health Research and Development.

Hyperglycemia and Adverse Pregnancy Outcome (HAPO) Study: We are indebted to the participants, investigators and research staff of the HAPO study at each of the following centers: Newcastle and Brisbane, Australia; Bridgetown, Barbados; Toronto, Canada; Hong Kong, Hong Kong; Bangkok, Thailand; Belfast and Manchester, UK; Bellflower, CA, Chicago, IL, Cleveland, OH and Providence, RI, USA. This work was supported by US National Institutes of Health (NIH) grants (HD34242, HD34243, HG004415 and CA141688) and by the American Diabetes Association. Genotype cleaning and general study coordination were provided by the GENEVA Coordinating Center (U01HG004446). Genotyping was performed at the Broad Institute of MIT and Harvard, with funding support from the NIH GEI (U01HG04424), and Johns Hopkins University Center for Inherited Disease Research, with support from the NIH GEI (U01HG004438) and the NIH contract ‘High throughput genotyping for studying the genetic contributions to human disease’ (HHSN268200782096C).

Helsinki Birth Cohort Study (HBCS): The Helsinki Birth Cohort Study (HBCS/HBCS 1934-44) thanks Professor David Barker and Tom Forsen. Major financial support was received from the Academy of Finland (project grants 209072, 129255 grant) and British Heart Foundation. The DNA extraction, sample quality control, biobank up-keep and aliquoting were performed at the National Institute for Health and Welfare, Helsinki, Finland.

The INMA Project: This study was funded by grants from Instituto de Salud Carlos III (Red INMA G03/176 and CB06/02/0041), FIS-FEDER 03/1615, 04/1509, 04/1112, 04/1931, 05/1079, 05/1052, 06/1213, 07/0314, 09/02647, 11/01007, 11/02591, 13/02032, 13/1944, PI041436, PI081151, CP11/00178, 97/0588, 00/0021-2, PI061756 and PS0901958, Spanish Ministry of Science and Innovation (SAF2008-00357), European Commission (ENGAGE project and grant agreement HEALTH-F4-2007-201413), Fundació La Marató de TV3, Generalitat de Catalunya-CIRIT 1999SGR 00241 and Conselleria de Sanitat Generalitat Valenciana. Part of the DNA extractions and genotyping was performed at the Spanish National Genotyping Centre (CEGEN-Barcelona). The authors are grateful to Silvia Fochs, Anna Sànchez, Maribel López, Nuria Pey, Muriel Ferrer, Amparo Quiles, Sandra Pérez, Gemma León, Elena Romero, Maria Andreu, Nati Galiana, Maria Dolores Climent and Amparo Cases for their assistance in contacting the families and administering the questionnaires. The authors would particularly like to thank all the participants for their generous collaboration. A full roster of the INMA Project Investigators can be found at http://www.proyectoinma.org/presentacion-inma/listado-investigadores/en_listado-investigadores.html.

Leipzig Obesity Childhood Cohort: The Leipzig Childhood Obesity cohort is supported by grants from Integrated Research and Treatment Centre (IFB) Adiposity Diseases FKZ: 01EO1001, from the German Research Foundation for the Clinical Research Center ‘Obesity Mechanisms' CRC1052/1 C05. We are grateful to all the patients and families for contributing to the study. We highly appreciate the support of the Obesity Team and Auxo Team of the Leipzig University Children's Hospital for management of the patients and to the Pediatric Research Center Lab Team for support with DNA banking.

Lifestyle—Immune System—Allergy (LISA) Study Munich: Generation of GWA data in the LISAplus study in Munich were covered by Helmholtz Zentrum Munich, Helmholtz Centre for Environmental Research. In addition, this work was supported by the Kompetenznetz Adipositas (Competence Network Obesity) funded by the Federal Ministry of Education and Research (FKZ: 01GI1121A).The authors thank all families for participation in the study; the obstetric units for allowing recruitment and the LISA study teams for excellent work.

Manchester Asthma and Allergy Study (MAAS): We would like to thank the children and their parents for their continued support and enthusiasm. We greatly appreciate the commitment they have given to the project. We would also like to acknowledge the hard work and dedication of the study team (post-doctoral scientists, research fellows, nurses, physiologists, technicians and clerical staff). MAAS was supported by the Asthma UK Grants No 301 (1995–1998), No 362 (1998–2001), No 01/012 (2001–2004), No 04/014 (2004–2007) and The Moulton Charitable Foundation (2004-current); age 11 years clinical follow-up is funded by the Medical Research Council (MRC) Grant G0601361.

Norwegian Mother Child Cohort (MoBa): his work was supported by grants from the Norwegian Research Council (FUGE 183220/S10, FRIMEDKLI-05 ES236011), Swedish Medical Society (SLS 2008-21198), Jane and Dan Olsson Foundations and Swedish government grants to researchers in the public health service (ALFGBG-2863, ALFGBG-11522), and the European Community's Seventh Framework Programme (FP7/2007–2013), ENGAGE Consortium, grant agreement HEALTH-F4-2007–201413. The Norwegian Mother and Child Cohort Study was also supported by the Norwegian Ministry of Health and the Ministry of Education and Research, NIH/NIEHS (contract no. N01-ES-75558), NIH/NINDS (grant no.1 UO1 NS 047537-01 and grant no.2 UO1 NS 047537-06A1), and the Norwegian Research Council/FUGE (grant no. 151918/S10). We are grateful to all the participating families in Norway who take part in this ongoing cohort study. Researchers interested in using MoBa data must obtain approval from the Scientific Management Committee of MoBa and from the Regional Committee for Medical and Health Research Ethics for access to data and biological material.

Northern Finland Birth Cohort 1966 (NFBC1966) and 1985–1986 (NFBC1986): We acknowledge late Professor Paula Rantakallio (launch of NFBC1966 and initial data collection), Ms Sarianna Vaara (data collection), Ms Tuula Ylitalo (administration), Mr Markku Koiranen (data management), Ms Outi Tornwall and Ms Minttu Jussila (DNA biobanking).

The PANIC Study: We thank the voluntary children and their families who participated in The PANIC Study. The study protocol was approved by the Research Ethics Committee of the Hospital District of Northern Savo. All children and their parents gave their informed written consent. The PANIC Study has been financially supported by grants from the Ministry of Social Affairs and Health of Finland, the Ministry of Education and Culture of Finland, the University of Eastern Finland, the Finnish Innovation Fund Sitra, the Social Insurance Institution of Finland, the Finnish Cultural Foundation, the Juho Vainio Foundation, the Foundation for Paediatric Research, the Paulo Foundation, the Paavo Nurmi Foundation, the Diabetes Research Foundation, Kuopio University Hospital (EVO-funding number 5031343) and the Research Committee of the Kuopio University Hospital Catchment Area for the State Research Funding.

The prevention and incidence of asthma and mite allergy birth cohort study (PIAMA1 and PIAMA2): The PIAMA birth cohort study is a collaboration of the Institute for Risk Assessment Sciences, University Utrecht (B. Brunekreef), Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht (H.A. Smit), Centre for Prevention and Health Services Research, National Institute for Public Health and the Environment, Bilthoven (A.H. Wijga), Department of Pediatrics, Division of Respiratory Medicine, Erasmus MC-Sophia, Rotterdam (J.C.d.J.), the Departments of Epidemiology (M.K.), Pulmonology (D.S.P.) and Pediatric Pulmonology and Pediatric Allergology (G.H.K.) of the University Medical Center Groningen and the Department of Immunopathology, Sanquin Research, Amsterdam (R.C. Aalberse), the Netherlands. The study team gratefully acknowledges the participants in the PIAMA birth cohort study, and all coworkers who helped conducting the medical examinations, field work and data management. The PIAMA study was funded by grants from the Dutch Asthma Foundation (grant 3.4.01.26, 3.2.06.022, 3.4.09.081 and 3.2.10.085CO), the ZON-MW Netherlands Organization for Health Research and Development (grant 912-03-031), the Stichting Astmabestrijding and the Ministry of the Environment. Genome-wide genotyping was funded by the European Commission as part of GABRIEL (a multidisciplinary study to identify the genetic and environmental causes of asthma in the European Community) contract number 018996 under the Integrated Program LSH-2004-1.2.5-1 Post genomic approaches to understand the molecular basis of asthma aiming at a preventive or therapeutic control.

The Western Australian Pregnancy (RAINE) Cohort: The authors are grateful to the Raine Study participants and their families, and to the Raine Study research staff for cohort coordination and data collection. The authors gratefully acknowledge the NH&MRC for their long-term contribution to funding the study over the last 20 years and also the following Institutions for providing funding for Core Management of the Raine Study: The University of Western Australia (UWA), Raine Medical Research Foundation, UWA Faculty of Medicine, Dentistry and Health Sciences, The Telethon Institute for Child Health Research, Curtin University and Women and Infants Research Foundation. The authors gratefully acknowledge the assistance of the Western Australian DNA Bank (National Health and Medical Research Council of Australia National Enabling Facility). The authors also acknowledge the support of the National Health and Medical Research Council of Australia (Grant ID 403981 and ID 003209) and the Canadian Institutes of Health Research (Grant ID MOP-82893). The study was conducted with appropriate institutional ethics approval, and written informed consent was obtained from mothers at all follow-ups and participants at the year 17 follow-up.

Special Turku Coronary Risk Factor Intervention Project (Strip): The study was approved by the Joint Commission on Ethics of the Turku University and the Turku University Central Hospital. Informed consent was obtained from all parents at the beginning of the trial and from the children at 15 years of age. The STRIP study was financially supported by Academy of Finland (grants 206374 and 251360); Juho Vainio Foundation; Finnish Cardiac Research Foundation; Finnish Cultural Foundation; Finnish Ministry of Education and Culture; Sigrid Juselius Foundation; Yrjö Jahnsson Foundation; C.G. Sundell Foundation; Special Governmental Grants for Health Sciences Research, Turku University Hospital; Foundation for Pediatric Research; and Turku University Foundation.

TEENAGE: TEENAGE study has been co-financed by the European Union (European Social Fund—ESF) and Greek national funds through the Operational Program ‘Education and Lifelong Learning’ of the National Strategic Reference Framework (NSRF)—Research Funding Program: Heracleitus II Investing in knowledge society through the European Social Fund. This work was funded by the Wellcome Trust (098051). We would like to thank all study participants and their families as well as all volunteers for their contribution in this study. We thank the following staff from the Sample Management and Genotyping Facilities at the Wellcome Trust Sanger Institute for sample preparation, quality control and genotyping: Dave Jones, Doug Simpkin, Emma Gray, Hannah Blackburn, Sarah Edkins.

Berlin Birth Cohort (BBC): The Berlin Birth Cohort study was funded by the Deutsche Forschungsgemeinschaft (DFG), Else Kröner-Fresenius Foundation, Jackstädt-Foundation and a research grant of the University of Potsdam, Germany. Details of the study are provided in refs (4345). We deeply acknowledge the contribution of the participating families. Replication genotyping was supported by ENGAGE Framework VII HEALTH-F4-2007-201413 and Wellcome Trust 098381.

Diabetes Prediction and Prevention (DIPP): The DIPP study was supported by the following grants: International: Juvenile Diabetes Research Foundation International (grants 4-1998-274, 4-1999-731, 4-2001-435); European Union (grant BMH4-CT98-3314); Novo Nordisk Foundation; the Academy of Finland (Decision No. 250124 and Centre of Excellence in Molecular Systems Immunology and Physiology Research 2012–2017, Decision No. 250114), the Sigrid Jusélius Foundation, and the Special Research Funds for University Hospitals in Finland. We thank the participating DIPP families and the DIPP staff.

Exeter Family Study Of Childhood Health (EFSOCH): The EFSOCH study was supported by South West NHS Research and Development, Exeter NHS Research and Development, the Darlington Trust, and the Peninsula NIHR Clinical Research Facility at the University of Exeter. We are extremely grateful to the EFSOCH study participants and the EFSOCH study team. The opinions given in this paper do not necessarily represent those of NIHR, the NHS or the Department of Health. Genotyping of EFSOCH DNA samples was supported by Diabetes UK grant RD08/0003692. The local research ethics committees approved the study, and all adult participants gave informed written consent.

Singapore Cohort study Of the Risk factors for Myopia study (SCORM): This study is supported by the National Medical Research Council Grant NMRC/0695/2002, Singapore.

Southampton Women’'s Survey (SWS): The SWS was funded by the Medical Research Council Of Great Britain, the British Heart Foundation, the European Union Framework 7 Programme, the NIHR Nutrition Biomedical Research Centre, University of Southampton, and Arthritis Research UK. We thank the mothers of the Southampton Women's Survey who gave us their time and the team of dedicated research nurses and ancillary staff for their assistance.

Netherlands Twin Register (NTR): This project was supported by ARRA RC2 2MH08995; the European Research Council (Genetics of Mental Illness, ERC-230374); Spinozapremie (NWO/SPI 56-464-14192) and Twin-family database for behavior genetics and genomics studies (NWO 480-04-004).

Conflict of Interest statement. None declared.

REFERENCES

  • 1.Paternoster L., Howe L.D., Tilling K., Weedon M.N., Freathy R.M., Frayling T.M., Kemp J.P., Smith G.D., Timpson N.J., Ring S.M., et al. Adult height variants affect birth length and growth rate in children. Hum. Mol. Genet. 2011;20:4069–4075. doi: 10.1093/hmg/ddr309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Freathy R.M., Mook-Kanamori D.O., Sovio U., Prokopenko I., Timpson N.J., Berry D.J., Warrington N.M., Widen E., Hottenga J.J., Kaakinen M., et al. Variants in ADCY5 and near CCNL1 are associated with fetal growth and birth weight. Nat. Genet. 2010;42:430–435. doi: 10.1038/ng.567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Horikoshi M., Yaghootkar H., Mook-Kanamori D.O., Sovio U., Taal H.R., Hennig B.J., Bradfield J.P., St Pourcain B., Evans D.M., Charoen P., et al. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nat. Genet. 2013;45:76–82. doi: 10.1038/ng.2477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Mook-Kanamori D.O., van Beijsterveldt C.E., Steegers E.A., Aulchenko Y.S., Raat H., Hofman A., Eilers P.H., Boomsma D.I., Jaddoe V.W. Heritability estimates of body size in fetal life and early childhood. PLoS One. 2012;7:e39901. doi: 10.1371/journal.pone.0039901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lango Allen H., Estrada K., Lettre G., Berndt S.I., Weedon M.N., Rivadeneira F., Willer C.J., Jackson A.U., Vedantam S., Raychaudhuri S., et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature. 2010;467:832–838. doi: 10.1038/nature09410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Woods K.A., Camacho-Hubner C., Savage M.O., Clark A.J. Intrauterine growth retardation and postnatal growth failure associated with deletion of the insulin-like growth factor I gene. N. Engl. J. Med. 1996;335:1363–1367. doi: 10.1056/NEJM199610313351805. [DOI] [PubMed] [Google Scholar]
  • 7.Abuzzahab M.J., Schneider A., Goddard A., Grigorescu F., Lautier C., Keller E., Kiess W., Klammt J., Kratzsch J., Osgood D., et al. IGF-I receptor mutations resulting in intrauterine and postnatal growth retardation. N. Engl. J. Med. 2003;349:2211–2222. doi: 10.1056/NEJMoa010107. [DOI] [PubMed] [Google Scholar]
  • 8.Ward L.D., Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 2012;40:D930–e39934. doi: 10.1093/nar/gkr917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Liang L., Morar N., Dixon A.L., Lathrop G.M., Abecasis G.R., Moffatt M.F., Cookson W.O. A cross-platform analysis of 14 177 expression quantitative trait loci derived from lymphoblastoid cell lines. Genome Res. 2013;23:716–726. doi: 10.1101/gr.142521.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Granell R., Henderson A.J., Timpson N., St Pourcain B., Kemp J.P., Ring S.M., Ho K., Montgomery S.B., Dermitzakis E.T., Evans D.M., et al. Examination of the relationship between variation at 17q21 and childhood wheeze phenotypes. J. Allergy Clin. Immunol. 2013;131:685–694. doi: 10.1016/j.jaci.2012.09.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Jaddoe V.W., van Duijn C.M., Franco O.H., van der Heijden A.J., van Iizendoorn M.H., de Jongste J.C., van der Lugt A., Mackenbach J.P., Moll H.A., Raat H., et al. The Generation R Study: design and cohort update 2012. Eur. J. Epidemiol. 2012;27:739–756. doi: 10.1007/s10654-012-9735-1. [DOI] [PubMed] [Google Scholar]
  • 12.Taal H.R., St Pourcain B., Thiering E., Das S., Mook-Kanamori D.O., Warrington N.M., Kaakinen M., Kreiner-Moller E., Bradfield J.P., Freathy R.M., et al. Common variants at 12q15 and 12q24 are associated with infant head circumference. Nat. Genet. 2012;44:532–538. doi: 10.1038/ng.2238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Weedon M.N., Lango H., Lindgren C.M., Wallace C., Evans D.M., Mangino M., Freathy R.M., Perry J.R., Stevens S., Hall A.S., et al. Genome-wide association analysis identifies 20 loci that influence adult height. Nat. Genet. 2008;40:575–583. doi: 10.1038/ng.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kukita T., Wada N., Kukita A., Kakimoto T., Sandra F., Toh K., Nagata K., Iijima T., Horiuchi M., Matsusaki H., et al. RANKL-induced DC-STAMP is essential for osteoclastogenesis. J. Exp. Med. 2004;200:941–946. doi: 10.1084/jem.20040518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Yagi M., Miyamoto T., Sawatani Y., Iwamoto K., Hosogane N., Fujita N., Morita K., Ninomiya K., Suzuki T., Miyamoto K., et al. DC-STAMP is essential for cell–cell fusion in osteoclasts and foreign body giant cells. J Exp Med. 2005;202:345–351. doi: 10.1084/jem.20050645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jansen B.J., Eleveld-Trancikova D., Sanecka A., van Hout-Kuijer M., Hendriks I.A., Looman M.G., Leusen J.H., Adema G.J. OS9 interacts with DC-STAMP and modulates its intracellular localization in response to TLR ligation. Mol. Immunol. 2009;46:505–515. doi: 10.1016/j.molimm.2008.06.032. [DOI] [PubMed] [Google Scholar]
  • 17.Manavathi B., Lo D., Bugide S., Dey O., Imren S., Weiss M.J., Humphries R.K. Functional regulation of pre-B-cell leukemia homeobox interacting protein 1 (PBXIP1/HPIP) in erythroid differentiation. J. Biol. Chem. 2012;287:5600–5614. doi: 10.1074/jbc.M111.289843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Stone D.L., Tayebi N., Orvisky E., Stubblefield B., Madike V., Sidransky E. Glucocerebrosidase gene mutations in patients with type 2 Gaucher disease. Hum. Mutat. 2000;15:181–188. doi: 10.1002/(SICI)1098-1004(200002)15:2<181::AID-HUMU7>3.0.CO;2-S. [DOI] [PubMed] [Google Scholar]
  • 19.Sidransky E., Nalls M.A., Aasly J.O., Aharon-Peretz J., Annesi G., Barbosa E.R., Bar-Shira A., Berg D., Bras J., Brice A., et al. Multicenter analysis of glucocerebrosidase mutations in Parkinson’s disease. N. Engl. J. Med. 2009;361:1651–1661. doi: 10.1056/NEJMoa0901281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Chahine L.M., Qiang J., Ashbridge E., Minger J., Yearout D., Horn S., Colcher A., Hurtig H.I., Lee V.M., Van Deerlin V.M., et al. Clinical and biochemical differences in patients having Parkinson disease with vs without GBA mutations. JAMA Neurol. 2013;70:852–858. doi: 10.1001/jamaneurol.2013.1274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Nalls M.A., Duran R., Lopez G., Kurzawa-Akanbi M., McKeith I.G., Chinnery P.F., Morris C.M., Theuns J., Crosiers D., Cras P., et al. A multicenter study of glucocerebrosidase mutations in dementia with Lewy bodies. JAMA Neurol. 2013;70:727–735. doi: 10.1001/jamaneurol.2013.1925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Inoue D., Reid M., Lum L., Kratzschmar J., Weskamp G., Myung Y.M., Baron R., Blobel C.P. Cloning and initial characterization of mouse meltrin beta and analysis of the expression of four metalloprotease-disintegrins in bone cells. J. Biol. Chem. 1998;273:4180–4187. doi: 10.1074/jbc.273.7.4180. [DOI] [PubMed] [Google Scholar]
  • 23.Marzia M., Guaiquil V., Horne W.C., Blobel C.P., Baron R., Chiusaroli R. Lack of ADAM15 in mice is associated with increased osteoblast function and bone mass. Biol. Chem. 2011;392:877–885. doi: 10.1515/BC.2011.080. [DOI] [PubMed] [Google Scholar]
  • 24.Johnson T.S., Engstrom J.L., Warda J.A., Kabat M., Peters B. Reliability of length measurements in full-term neonates. J. Obstet. Gynecol. Neonatal. Nurs. 1998;27:270–276. doi: 10.1111/j.1552-6909.1998.tb02649.x. [DOI] [PubMed] [Google Scholar]
  • 25.Vasan R.S., Glazer N.L., Felix J.F., Lieb W., Wild P.S., Felix S.B., Watzinger N., Larson M.G., Smith N.L., Dehghan A., et al. Genetic variants associated with cardiac structure and function: a meta-analysis and replication of genome-wide association data. JAMA. 2009;302:168–178. doi: 10.1001/jama.2009.978-a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Voight B.F., Scott L.J., Steinthorsdottir V., Morris A.P., Dina C., Welch R.P., Zeggini E., Huth C., Aulchenko Y.S., Thorleifsson G., et al. Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis. Nat. Genet. 2010;42:579–589. doi: 10.1038/ng.609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ikram M.A., Fornage M., Smith A.V., Seshadri S., Schmidt R., Debette S., Vrooman H.A., Sigurdsson S., Ropele S., Taal H.R., et al. Common variants at 6q22 and 17q21 are associated with intracranial volume. Nat. Genet. 2012;44:539–544. doi: 10.1038/ng.2245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Li Y., Willer C.J., Ding J., Scheet P., Abecasis G.R. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet. Epidemiol. 2010;34:816–834. doi: 10.1002/gepi.20533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Marchini J., Howie B., Myers S., McVean G., Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 2007;39:906–913. doi: 10.1038/ng2088. [DOI] [PubMed] [Google Scholar]
  • 30.Purcell S., Neale B., Todd-Brown K., Thomas L., Ferreira M.A., Bender D., Maller J., Sklar P., de Bakker P.I., Daly M.J., et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Aulchenko Y.S., Struchalin M.V., van Duijn C.M. ProbABEL package for genome-wide association analysis of imputed data. BMC Bioinformatics. 2010;11:134. doi: 10.1186/1471-2105-11-134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Estrada K., Abuseiris A., Grosveld F.G., Uitterlinden A.G., Knoch T.A., Rivadeneira F. GRIMP: a web- and grid-based tool for high-speed analysis of large-scale genome-wide association using imputed data. Bioinformatics. 2009;25:2750–2752. doi: 10.1093/bioinformatics/btp497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Devlin B., Roeder K. Genomic control for association studies. Biometrics. 1999;55:997–1004. doi: 10.1111/j.0006-341x.1999.00997.x. [DOI] [PubMed] [Google Scholar]
  • 34.Willer C.J., Li Y., Abecasis G.R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Cvejic A., Haer-Wigman L., Stephens J.C., Kostadima M., Smethurst P.A., Frontini M., van den Akker E., Bertone P., Bielczyk-Maczynska E., Farrow S., et al. SMIM1 underlies the Vel blood group and influences red blood cell traits. Nat. Genet. 2013;45:542–545. doi: 10.1038/ng.2603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Lage K., Karlberg E.O., Storling Z.M., Olason P.I., Pedersen A.G., Rigina O., Hinsby A.M., Tumer Z., Pociot F., Tommerup N., et al. A human phenome–interactome network of protein complexes implicated in genetic disorders. Nat. Biotechnol. 2007;25:309–316. doi: 10.1038/nbt1295. [DOI] [PubMed] [Google Scholar]
  • 37.Bult C.J., Richardson R.J., Blake J.A., Kadin J.A., Ringwald M., Eppig J.T. Mouse genome informatics in a new age of biological inquiry. Proceedings of the IEEE International Symposium on Bio-Informatics and Biomedical Engineering; 29–32; 2000. The Mouse Genome Database Group. [Google Scholar]
  • 38.Croft D., O'Kelly G., Wu G., Haw R., Gillespie M., Matthews L., Caudy M., Garapati P., Gopinath G., Jassal B., et al. Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. 2011;39:D691–D697. doi: 10.1093/nar/gkq1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kanehisa M., Goto S., Sato Y., Furumichi M., Tanabe M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012;40:D109–D114. doi: 10.1093/nar/gkr988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T., et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wood A.R., Esko T., Yang J., Vedantam S., Pers T.H., Gustafsson S., Chu A.Y., Estrada K., Luan J., Kutalik Z., et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat. Genet. 2014 doi: 10.1038/ng.3097. doi:10.1038/ng.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Geller F., Feenstra B., Carstensen L., Pers T.H., van Rooij I.A., Korberg I.B., Choudhry S., Karjalainen J.M., Schnack T.H., Hollegaard M.V., et al. Genome-wide association analyses identify variants in developmental genes associated with hypospadias. Nat. Genet. 2014 doi: 10.1038/ng.3063. [DOI] [PubMed] [Google Scholar]
  • 43.Hocher B., Chen Y.P., Schlemm L., Burdack A., Li J., Halle H., Pfab T., Kalk P., Lang F., Godes M. Fetal sex determines the impact of maternal PROGINS progesterone receptor polymorphism on maternal physiology during pregnancy. Pharmacogenet Genomics. 2009;19:710–718. doi: 10.1097/FPC.0b013e328330bc7a. [DOI] [PubMed] [Google Scholar]
  • 44.Pfab T., Slowinski T., Godes M., Halle H., Priem F., Hocher B. Low birth weight, a risk factor for cardiovascular diseases in later life, is already associated with elevated fetal glycosylated hemoglobin at birth. Circulation. 2006;114:1687–1692. doi: 10.1161/CIRCULATIONAHA.106.625848. [DOI] [PubMed] [Google Scholar]
  • 45.Hocher B., Slowinski T., Stolze T., Pleschka A., Neumayer HH., Halle H. Association of maternal G protein beta3 subunit 825T allele with low birthweight. Lancet. 2000;355:1241–1242. doi: 10.1016/S0140-6736(00)02094-8. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Human Molecular Genetics are provided here courtesy of Oxford University Press

RESOURCES