Skip to main content
Genetics logoLink to Genetics
. 2016 Feb 10;202(4):1313–1328. doi: 10.1534/genetics.115.183905

Estimating Modifying Effect of Age on Genetic and Environmental Variance Components in Twin Models

Liang He *,†,1, Mikko J Sillanpää ‡,§, Karri Silventoinen *, Jaakko Kaprio *,**,††, Janne Pitkäniemi *,‡‡
PMCID: PMC4827728  PMID: 26868768

Abstract

Twin studies have been adopted for decades to disentangle the relative genetic and environmental contributions for a wide range of traits. However, heritability estimation based on the classical twin models does not take into account dynamic behavior of the variance components over age. Varying variance of the genetic component over age can imply the existence of gene–environment (G × E) interactions that general genome-wide association studies (GWAS) fail to capture, which may lead to the inconsistency of heritability estimates between twin design and GWAS. Existing parametric G × E interaction models for twin studies are limited by assuming a linear or quadratic form of the variance curves with respect to a moderator that can, however, be overly restricted in reality. Here we propose spline-based approaches to explore the variance curves of the genetic and environmental components. We choose the additive genetic, common, and unique environmental variance components (ACE) model as the starting point. We treat the component variances as variance functions with respect to age modeled by B-splines or P-splines. We develop an empirical Bayes method to estimate the variance curves together with their confidence bands and provide an R package for public use. Our simulations demonstrate that the proposed methods accurately capture dynamic behavior of the component variances in terms of mean square errors with a data set of >10,000 twin pairs. Using the proposed methods as an alternative and major extension to the classical twin models, our analyses with a large-scale Finnish twin data set (19,510 MZ twins and 27,312 DZ same-sex twins) discover that the variances of the A, C, and E components for body mass index (BMI) change substantially across life span in different patterns and the heritability of BMI drops to ∼50% after middle age. The results further indicate that the decline of heritability is due to increasing unique environmental variance, which provides more insights into age-specific heritability of BMI and evidence of G × E interactions. These findings highlight the fundamental importance and implication of the proposed models in facilitating twin studies to investigate the heritability specific to age and other modifying factors.

Keywords: twin models, penalized B-splines, age-specific heritability, empirical Bayes predictor, body mass index


TWIN studies have been broadly used as a tool to investigate heritability for decades. A recent comprehensive meta-analysis reported 17,804 heritability estimates for various traits based on twin studies (Polderman et al. 2015). Estimation methods and software such as structural equation modeling (SEM) based on the classical twin models consisting of additive genetic, common, and unshared environmental effects have been well established and extensively adopted (Rijsdijk and Sham 2002). Extensions for longitudinal data or time-to-event phenotypes based on the classical twin models have been proposed (Boomsma et al. 1989; Pitkäniemi et al. 2007). Nevertheless, one of the major issues posing potential challenge is that many twin data sets contain individuals across a wide range of age and the variances of genetic and environmental components for many complex traits such as body mass index (BMI) and height change over age (Lajunen et al. 2009; Elks et al. 2012) or birth cohorts (Silventoinen et al. 2000). For example, the variances of genetic and environmental components and thus the heritability of BMI have been found to diverge with age in a Finnish study of adolescent twins (Lajunen et al. 2009) and another of adolescents and young adults (Ortega-Alonso et al. 2012), implying that potential gene–environment (G × E) interactions play a notable role in these complex traits. During childhood when a twin pair is raised in a similar environment, the common environmental effects may be present in early childhood when children are dependent on their parents and then drop after midchildhood when they start to get more independence (Silventoinen et al. 2009). Unfortunately, relying on the classical twin models in these cases may lead to substantially biased results (Purcell 2002; Zyphur et al. 2013). In many studies, information on age at measurements is available for each twin pair, and ignoring how age potentially modifies the genetic and environmental variances would result in problematic estimates especially when age is distributed widely and considerably different between monozygotic (MZ) and dizygotic (DZ) twin samples.

The gap between the heritability estimated from twin studies and genome-wide association studies (GWAS), known as missing heritability, is observed for many complex traits, and so far little is known about its cause (Manolio et al. 2009; Eichler et al. 2010). As an example, compared with a heritability of from nearly 50% up to 80% estimated from different twin studies for BMI (Elks et al. 2012), thus far, a meta-analysis of multiple GWAS data sets has reported that 97 genome-wide significant loci account for <3% of the variance, while all common single-nucleotide polymorphisms (SNPs) account for ∼20% of the variance in which age has been adjusted for linearly (Locke et al. 2015) and ∼17 million variants account for around 27% of the variance (Yang et al. 2015). While literature has suggested that the fundamental assumption of additive genetics in GWAS might be problematic (Nelson et al. 2013), estimates of heritability from the perspective of twin design should also be refined or revised by taking into account moderators such as age. One of the causes for missing heritability probably lies in the G × E interactions. Increasing variance of the genetic component over age implies the existence of G × E interactions that general GWAS are not able to capture. Hence, it would be of vital concern to explore how the genetic and environmental components evolve and to estimate the variance curves on which there is no prior information available.

Exploration of dynamics of variance components over a period of growth by partitioning the phenotypic variance and covariance with quantitative genetic models can date back to W. Atchley in the early 1980s (Atchley and Rutledge 1980; Cheverud et al. 1983; Pletcher and Geyer 1999; Charmantier et al. 2006). Recently, approaches such as functional GWAS (fGWAS) have been proposed to explore time-varying genetic effects of SNPs by utilizing Legendre orthogonal polynomials and longitudinal data (Wu and Lin 2006; Das et al. 2011; Li and Sillanpää 2015). Different from fGWAS and its recent developments (Das et al. 2011; Li et al. 2015), which are based on a varying-coefficient model, in the case of twin study, we aim at modeling variance functions directly. One of the solutions is to divide samples into multiple strata and to estimate the variances within each stratum (Neale and Cardon 1992). The drawbacks of this strategy include that selection of cut points for strata is often unequivocal and difficult in some scenarios and twins measured at different ages are not always in the same strata. It may result in larger uncertainty of estimates due to reduced sample sizes in each stratum. Another option is to adopt an existing parametric G × E interaction model proposed by Purcell in which age can be modeled by adding a moderator in a linear or quadratic form (Purcell 2002). Unfortunately, the variance curves to be estimated in many cases are not necessarily linear or quadratic. Their patterns can often be fluctuant and dramatically change across life span as shown later in this study. A concrete parametric model is often difficult to determine beforehand. Hence, a nonparametric approach to explore the variance curves of these components can give more realistic understanding on the pattern of the moderating effect provided that the number of estimated parameters is not too large. Most recently, Briley et al. (2015) introduced local structural equation modeling (LOSEM), which estimates G × E interactions using a nonparametric approach based on LOWESS and SEM.

The past two decades have seen rapid advances of employing B-splines, particularly penalized B-splines (P-splines) (O’Sullivan 1986; Eilers and Marx 1996), to estimate an unknown function in the context of epidemiology. With the same advantages of the flexibility as LOSEM, P-splines can further optimally choose the best model by penalizing the smoothness and perform various hypothesis tests (Ruppert et al. 2003). In this work, we propose novel twin models that choose as the starting point the standard twin model based on decomposing total variance into the variance components of additive genetics (A), common environment (C), and unique environment (E) (a.k.a. the ACE model). We generalize the component variances as variance functions with respect to age by using B-splines or P-splines in which the ACE model becomes the special case when all spline coefficients within the components share the same value. We construct expectation-maximization (EM) and Markov chain Monte Carlo (MCMC) algorithms to estimate the variance functions together with their pointwise confidence bands and construct an R package based on these models. We then show through simulation studies and a real data analysis that the proposed models along with the estimation are well behaved and are capable of depicting a refined picture of dynamic behavior of the component variances, which provides more insights into the age-specific heritability. Our results from the analysis of a Finnish twin data set indicate that there exist remarkable age-dependent effects on all A, C, and E components for BMI and age should be taken into account when estimating the heritability of BMI.

Methods

Suppose that we have observed zero-mean normally distributed quantitative phenotypic data YM (an nM×1 vector) and YD (an nD×1 vector) for fM(=nM/2) MZ and fD(=nD/2) DZ twin pairs, respectively. The phenotypes of each twin pair are measured at the same age TM (an fM×1 vector) and TD (an fD×1 vector). An example of the observed data format is shown in Table 1.

Table 1. An example of the data format used for model specification.

MZ twins DZ twins
Twin ID Trait Age Twin ID Trait Age
1 YM(1) TM(1) 1 YD(1) TD(1)
1 YM(2) 1 YD(2)
2 YM(3) TM(2) 2 YD(3) TD(2)
2 YM(4) 2 YD(4)
fM YM(nM1) TM(fM) fD YD(nD1) TD(fD)
fM YM(nM) fD YD(nD)

According to the ACE model, we have a random effect model YM(iM,j)=aM(iM)+cM(iM)+e(iM,j) for the MZ twin pair iM and a model YD(iD,j)=a1D(iD)+a2D(iD,j)+cD(iD)+e(iD,j) for the DZ twin pair iD, where iM{1,,fM}, iD{1,,fD}, and j=1,2. Here we assume that aMN(0, σa2), a1D,a2DN(0, 0.5σa2), cM,cDN(0, σc2), and eN(0, σe2), where σa2, σc2, and σe2(0,+) are the additive genetic, common, and unique environmental components, respectively. Thus, this model implies that the phenotypes follow the following normal distribution,

(YMYD)N((00), (ΣM00ΣD)), (1)

where ΣM=σa21BM+σc21BM+σe2I (an nM×nM matrix), ΣD=σa2A+σc21BD+σe2I (an nD×nD matrix), 1BM=IfM12, 1BD=IfD12, and A=IfDA2, where 12=(1111), A2=(10.50.51), and ⊗ represents the Kronecker product.

Estimation of variance curves using B-splines

We first propose a nonparametric approach, referred to as an ACE(t) model, to estimate the variance curves of the additive genetic and common environmental components using B-splines (de Boor 1978) with a set of evenly distributed knots. We assume that the A and C variances are represented at each time point with their own parameters as follows:

σa2(t), σc2(t),  t=1,,T. (2)

Note that at this stage we assume σe2(t)=σe2 to be constant over time although including an age-dependent unique environmental component can be readily extended. Instead of trying to estimate σa2(t) [or σc2(t)] separately at each time point, we assume that values at adjacent time points are more alike; that is, they can be described as a smooth curve over time. First, values in (2) are expressed using linear combinations of B-spline basis functions as

σa2(t)=ek=1KβkABkA(t)

and

σc2(t)=el=1LβlCBlC(t),

where βA(,+)K and βC(,+)L are the vectors of coefficients. We select K and L B-spline basis functions for the A and C components, respectively. We choose the B-spline basis functions BA(t) and BC(t) with a degree p=2 (or an order of 3) for the following simulation and real data analyses. The degree p=2 is probably appropriate for most traits as the variance curves of the A and C components are generally smooth and do not change sharply within a small age interval. Thus, the numbers of knots for construction of the B-splines for the A and C components are K+p+1 and L+p+1. Replacing the variances σa2 and σc2 in the model (1) with their time-dependent counterparts σa2(t) and σc2(t) leads to the covariance matrices

ΣM=diag(eBTMAβA)12+diag(eBTMCβC)12+σe2I,
ΣD=diag(eBTDAβA)A2+diag(eBTDCβC)12+σe2I,

where BTMA, BTMC, BTDA, and BTDC are fM×K, fM×L, fD×K, and fD×L design matrices of the B-spline basis functions evaluated at TM and TD for the MZ and DZ twins and diag(.) converts a vector to a diagonal matrix with the vector entries as its diagonal elements. The model becomes the classical ACE model when the coefficients share the same value within βA and βC, respectively. The estimates (βA^,βC^) can be obtained using maximum-likelihood estimation (MLE), and the variance curves σa2(t) and σc2(t) are estimated by σa2^(t)=ek=1KβkA^BkA(t) and σc2^(t)=el=1Lβlc^BlC(t). The detailed derivation of the MLE for the ACE(t) model is described in the Appendix.

The pointwise (not simultaneous) confidence bands for the ACE(t) model can be constructed from a Delta method as previously described (Rosenberg 1995; Mao and Zhao 2003). In this case, according to the Delta method the estimates σa2^(t) and σc2^(t) at the time point t follow normal distributions with the variances approximated by Var^(ek=1KβkA^BkA(t))=e2·k=1KβkA^BkA(t)·(BA(t)HβA˜1BA(t)) and Var^(el=1LβlC^BlC(t))=e2·l=1Lβlc^BlC(t)·(BC(t)HβC˜1BC(t)), where H˜1 is the inverse of the estimated Fisher information from the MLE (i.e., the inverse of the Hessian of the negative log-likelihood). The confidence bands can also be estimated using a bootstrap method that is more computationally intensive than the Delta method. A parametric bootstrapping can be performed by first generating the bootstrap phenotypes (YM, YD) based on model (1) with the parameters replaced by their estimates (βA^,βC^,σe2^) and fitting the model to obtain the bootstrap replicates (βA^,βC^,σe2^) from which the confidence bands at the time point t can be derived (Efron and Tibshirani 1994).

Stable estimation of variance curves, using penalized splines

While the estimation and implementation of the ACE(t) model are straightforward and fast, the performance of using B-splines in practice is sensitive to the selection of the number of knots. Fewer knots suffer from large uncertainty while excessive knots can lead to overfitting. A handful of procedures for automatic knot selection have been developed; however, they are generally quite complicated and computationally intensive as pointed out by Ruppert et al. (2003), especially in the context of variance curve estimation based on large-scale twin data. One preferable way is to retain abundant knots and penalize their roughness, which is described as follows.

To reduce the impact of the number of knots, we therefore propose a more stable model using P-splines (O’Sullivan 1986), which we refer to as an ACE(t)-p model. As previously shown, the number of knots in P-splines has small influence on the results and it has been suggested that at most 35–40 knots are generally enough for all sample sizes and for all smooth functions without too many oscillations (Ruppert 2002). We first parameterize the ACE model with three random effects to decompose the covariance matrices and write the subsequent derivations in a matrix form. Specifically, we let Z2M=Z3M=IfM12×1, Z1D=0.5InD, Z2D=0.5IfD12×1, and Z3D=IfD12×1 so that Z2MZ2M=Z3MZ3M=1BM, Z3DZ3D=1BD, and Z1DZ1D+Z2DZ2D=A, where 12×1=(11). According to the first type of parameterization proposed by Rabe-Hesketh et al. (2008) and following the notation of the spline coefficients and design matrices in the ACE(t) model, the covariance matrices are expressed as

ΣM=Z2Mdiag(eBTMAβA)Z2M+Z3Mdiag(eBTMCβC)Z3M+σe2I,
ΣD=Z1Ddiag(eBTDAβA12×1)Z1D+Z2Ddiag(eBTDAβA)Z2D+Z3Ddiag(eBTDCβC)Z3D+σe2I.

The approach using P-splines imposes two penalties on βA and βC, respectively. We add the quadratic penalties λAβADAβA and λCβCDCβC to the log-likelihood,

(2)lpsp(σe2,βA,βC)=log|ΣM|+YM ΣM1YM+log|ΣD|+YDΣD1YD+λAβADAβA+λCβCDCβC,

where DA and DC are the penalty matrices constructed from differences between neighboring coefficients according to Eilers and Marx (1996) (refer to the Appendix for the contents of DA and DC). The penalizing coefficients λA and λC substantially affect the estimation results, and many strategies of selecting their values have been proposed, including generalized cross-validation (Craven and Wahba 1978) and Akaike’s information criterion (Akaike 1969). P-splines in Bayesian models may be implemented via variational Bayes estimation (Li and Sillanpää 2013) and MCMC methods (Crainiceanu et al. 2005). On the other hand, under a framework of a random effects model, quadratic penalty terms can be regarded as assuming the following zero-mean normal distributions to the spline coefficients

βAN(0,σβA2DA),βCN(0,σβC2DC), (3)

where σβA2=1/λA, σβC2=1/λC, and DA, DC are the generalized inverses of DA and DC. Expression (3) provides a connection to estimation of variance components by restricted maximum likelihood (REML) in a linear mixed-model context (Ruppert et al. 2003). To estimate β=(βA,βC), we follow a strategy adopted by Kauermann and Wegener (2011) that first estimates (σβA2, σβC2) by using a marginal likelihood with the spline coefficients β integrated out analytically. Based on this strategy, Krivobokova proposed a unified EM-type iterative procedure (Krivobokova et al. 2008) to estimate the spline coefficients and the penalizing coefficients (now transformed to the variances of random effects in the linear mixed model) simultaneously by estimating the spline coefficients through an iterated weighted least-squares (IWLS) algorithm and employing an approximation algorithm (Breslow and Clayton 1993) to iterate between the estimation of the spline coefficients and minimization of the marginal log-likelihood. More specifically, following the same spirit, the marginal likelihood in our case is

Lpsp(σe2,σβA2,σβC2)=1σβAK˜σβCL˜1|ΣM|1/2|ΣD|1/2e(1/2)[ YMΣM1YM+YMΣD1YD+βADAβA/σβA2+βCDCβC/σβC2]dβAdβC, (4)

where K˜ and L˜ are the numbers of positive eigenvalues of DA and DC. By applying a Laplace approximation to the integrand of (4) with the condition of K, LnM+nD, we obtain the −2 log-likelihood

(2)lpsp(σe2,σβA2,σβC2)=K˜logσβA2+L˜logσβC2+log|ΣM{β^}|+YM ΣM1{β^}YM+log|ΣD{β^}|+YDΣD1{β^}YD+βA^DAβA^σβA2+βC^DCβC^σβC2+log|g(β^)|,

where

g(β)=(1/2)(log|ΣM|+YM ΣM1YM+log|ΣD|+YDΣD1YD+βADAβA/σβA2+βCDCβC/σβC2),

{β^} represents the replacement of β at β^ in the covariance matrices, and β^ is evaluated to minimize g(β). Instead of IWLS in Krivobokova et al. (2008), we use the limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) algorithm (Byrd et al. 1995), which does not require evaluating a Hessian matrix explicitly. The following algorithm computes β^ together with (σβA2^, σβC2^) in an iterative way:

  1. Give initial values to (σe2^(1),σβA2^(1),σβC2^(1)).

  2. Given the values of (σe2^(i1),σβA2^(i1),σβC2^(i1)) from the previous iteration, we find (βA^(i),βC^(i)), minimizing g(β) by using the L-BFGS algorithm.

  3. We find (σe2^(i),σβA2^(i),σβC2^(i)), minimizing (2)lpsp(σe2,σβA2,σβC2) by using the L-BFGS algorithm with (βA^(i),βC^(i)) obtained from step 2.

  4. Iterate between steps 2 and 3 until convergence.

The details of the derivation for the algorithm are given in the Appendix.

While the proposed algorithm provides a type of estimate for β, the construction of confidence intervals (C.I.’s) is far from straightforward. It has been pointed out that general bootstrap methods for the C.I.’s are overly narrow because they include only variance but not bias, which is substantial in the penalized estimation (Goeman 2010). The C.I.’s constructed under the mixed-model framework or the posterior distribution overcome this issue (Wood 2006; Kauermann et al. 2009). Recall that a best prediction (BP) for random effects in the linear mixed model is defined as the mean of the posterior distribution of random effects given data and the variances (Searle et al. 2009). This does not naturally apply to β^ as the estimated spline coefficient β^ from the above algorithm is the mode of the joint distribution of Y=(YM, YD) and β rather than a mean βˇ, which minimizes the mean square error (MSE) of prediction (Searle et al. 2009). Unlike the best linear unbiased predictions (BLUPs) in the linear mixed model (Henderson 1975; Robinson 1991), the posterior mean βˇ in our case is not a linear function of Y because β is in the variance functions. Thus, we propose an MCMC method to numerically approximate the posterior distribution to obtain βˇ, which is the estimated (or empirical) BP of β with (σe2^,σβA2^,σβC2^) plugged in. From the Bayesian perspective, βˇ is an empirical Bayes predictor of β (Carlin and Louis 1997). Then the C.I.’s can be derived either from the conditional MSE of prediction obtained from the posterior distribution of β approximated by the MCMC method or from the marginal prediction error covariance matrix Var(βˇβ) (Booth and Hobert 1998; Skrondal and Rabe-Hesketh 2009), which can be estimated by a parametric bootstrap method. The detailed description of how to construct C.I.’s is given in the Appendix.

Implementation and extension

We developed an R package “ACEt” to implement the proposed models for public use (https://r-forge.r-project.org/R/?group_id=2167). One of the attractive points is that both models can be readily implemented using existing R functions. We utilized the “splines” R package for computing the B-spline design matrices and the “optim” R function to perform the minimizations in which the L-BFGS algorithm with box bounds is used. Although the L-BFGS algorithm does not evaluate the Hessian matrices explicitly, the matrix operations would be time consuming and even impractical when thousands of samples are involved. The situation is particularly exacerbated as the twin models often require a larger sample size to obtain a reliable estimate of the variances. Fortunately, as most of the matrices are sparse and block diagonal, we dramatically reduce the running time by partitioning the matrix operations twin-wisely and implementing the optimization functions in C++.

Our proposed models can be easily modified and extended to handle other twin models such as the AE and CE models. We can also estimate the variance curve of the unique environmental component together with the other components. For example, both variances in the AE model can vary over age, which we refer to as the AE(t) and AE(t)-p models. These models have been implemented and included in the ACEt R package. Their application is demonstrated in the following Finnish BMI twin study.

Data availability

The simulation data is included as an example dataset in the R package.

Results

Simulation study

We performed two simulation studies with different sample sizes to provide a rough picture of how many twin pairs are needed for reliable estimates. In these simulations, we also compared the performance among the ACE(t) models with 5 and 8 interior knots and the ACE(t)-p models with 8 and 12 interior knots. We generated phenotypic data for 5000 MZ and 5000 DZ twin pairs in the first simulation and twofold sample sizes in the second one, using R. We first sampled a vector of age T for the twin pairs that was uniformly distributed from 1 to 50. We fixed the variance of the E component at 1 and chose the following quadratic and power variance functions for the A and C components,

σa2(T)=310(T2550)2,
σc2(T)=T0.2,

of which the shapes are depicted in Figure 1. We then simulated the phenotypic data by sampling from bivariate normal distributions based on Equation 1 with σa2(T) and σc2(T) plugged into the covariance matrices. The average mean square error (AMSE), which is defined as AMSEa,c=1mi=1m(σa,c2(ti)σa,c2(ti)^)2 and takes into account both bias and variance, was used to evaluate the performance of estimation where we chose the number of evaluated time points m=500. In each scenario, we simulated 200 data sets and fitted them with the proposed ACE(t) and ACE(t)-p models to calculate the AMSEs. For the ACE(t) model, we ran an additional analysis to compare the confidence bands obtained from the Delta method and the bootstrap method with 200 replicates.

Figure 1.

Figure 1

Variance functions of the A, C, and E components investigated in the simulation study.

The results presented in Table 2 show that the AMSEs of the ACE(t) model with 8 knots are larger than those with 5 knots for both components. This suggested that 8 knots seemed excessive and could worsen the performance in terms of AMSE for the relatively smooth functions that we chose to investigate here. In contrast, the ACE(t)-p model had smaller AMSEs than the ACE(t) model in all scenarios, confirming its superiority of tuning the penalty term for optimal estimation as long as a sufficient number of knots is given (Ruppert et al. 2003). The ASMEs from the ACE(t)-p model with a different number of knots were more similar to each other, implying that the estimates from P-splines are much less affected by the choice of knots than those from B-splines. The smaller number of knots in the penalized spline estimator performs better in terms of AMSE, which is in accordance with a previous theoretical finding (Claeskens et al. 2009). The AMSEs decreased almost linearly with increasing sample size in all situations; however, accurate estimates still entailed thousands of twin pairs.

Table 2. The empirical AMSE from the ACE(t) and ACE(t)-p models.

Component ACE(t) (5knots) ACE(t) (8knots) ACE(t)-p (8knots) ACE(t)-p (12 knots)
n=20,000
Additive genetic 0.04020788 0.05683567 0.0290397 0.03235355
Common environmental 0.03416853 0.05224053 0.01837206 0.02278777
n=40,000
Additive genetic 0.01770024 0.02705611 0.01339244 0.01574035
Common environmental 0.01588659 0.02550069 0.009042152 0.01346567

The confidence bands from the Delta method were comparable to those from the bootstrap method for both components as plotted in Figure 2. The confidence bands estimated from the bootstrap method were less smooth due to the limited number of resampling (i.e., 200 replicates), which should be adequate for the bootstrap method (Efron and Tibshirani 1994). The confidence bands from these two methods almost coincided, suggesting that the L-BFGS algorithm worked properly as a faster alternative to Newton’s method in this case.

Figure 2.

Figure 2

Confidence bands constructed from the Delta and bootstrap methods for the A (shown in A) and C (shown in B) components.

A Finnish twin study of BMI

We applied both models to a twin data set retrieved from the Finnish Twin Cohort study to investigate age-dependent genetic and environmental components of BMI. The details on collection of the data were described in previous publications for older (Kaprio and Koskenvuo 2002) and younger cohorts (Kaprio et al. 2002). In this analysis, we included 19,510 MZ and 27,312 DZ same-sex twin individuals together with the information on age at the measurement contributed to the CODATwins project (Silventoinen et al. 2015). The distributions of age at the measurement among the MZ and DZ twins were widespread from 11 years to 60 years as shown in Figure 3. We fitted the data set with a linear regression on sex and age before using the residuals from the regression as the phenotypes for the analysis of variance curves. The variance curves of the genetic and common environmental components estimated from the ACE(t) (with 5 interior knots for either component) and the ACE(t)-p models (with 8 interior knots for either component) are shown in Figure 4 and Figure 5. For the ACE(t)-p model, we included more knots to ensure they were sufficient and excessive knots were affected little in terms of AMSE. The 95% pointwise confidence bands were obtained by using the bootstrap method for the ACE(t) model and the MCMC method described in the Appendix for the ACE(t)-p model. The comparable results from both models suggested that the variance of the additive genetic component started from ∼2 at age 11 years and increased rapidly until age 35 years. After that, it leveled off for ∼10 years before growing again to nearly 15, except that the curve from the ACE(t)-p model had more fluctuations. In contrast, the variance of the common environmental component starting at ∼8 at age 11 years dropped markedly before age 20 years and became almost undetectable after that. The estimated variance of the unique environmental component was 2.25. In contrast, the ACE model using the OpenMX package (Boker et al. 2011) yielded estimates of 7.384, 0.000, and 2.265 for the A, C, and E components, respectively, while the AE model gave nearly the same estimates for the A and E components.

Figure 3.

Figure 3

Distributions of age among MZ and DZ twins.

Figure 4.

Figure 4

The variance curves of the A and C components with the 95% confidence bands estimated from the ACE(t) model.

Figure 5.

Figure 5

The variance curves of the A and C components with the 95% confidence bands estimated from the ACE(t)-p model.

The results revealed that the common environmental factor almost vanished after age 20 years, but the growth of the additive genetic factor might be attributed to an increasing unique environmental variance that was assumed to be constant in the ACE(t) and ACE(t)-p models. Therefore, to verify whether a wrong model was used to fit the data after age 20 years, we employed the AE(t) and AE(t)-p models described in Methods to estimate the variance curves after age 20 years, where we allowed both A and E variance components to change. Consequently, as shown in Figure 6 and Figure 7, the variance of the A component leveled off and moderately fluctuated at ∼7.5 and climbed after age 50 years. In contrast, the variance of the E component initially standing at 2.25 grew almost linearly across the entire age interval. The AE(t) (with five interior knots for either component) and AE(t)-p (with eight interior knots for either component) models generated comparable results for the E component while AE(t)-p produced a more wavy and detailed curve for the A component probably because the number of knots in the AE(t) model was too small to catch the irregular and oscillating curve.

Figure 6.

Figure 6

The variance curves after age 20 years of the A and E components with the 95% confidence bands estimated from the AE(t) model.

Figure 7.

Figure 7

The variance curves after age 20 years of the A and E components with the 95% confidence bands estimated from the AE(t)-p model.

To further verify the periodic pattern of the A component after age 20 years, we performed stratified analyses with 5-year and 3-year intervals based on the AE model. The result based on the 5-year interval (Figure 8A) was very close to that from the AE(t) model (i.e., the B-spline model) and the result based on the 3-year interval (Figure 8B) was more similar to that from the AE(t)-p model (i.e., the P-spline model), which showed more periodic fluctuations. Smaller age intervals offered a refined picture but resulted in larger estimation uncertainty.

Figure 8.

Figure 8

The variance curves after age 20 years of the A and E components with the 95% confidence bands estimated from the stratified analyses. (A) Divided by 5-year intervals. (B) Divided by 3-year intervals.

Discussion

In this work, we have proposed two novel twin models to explore dynamic behavior of the variance components over age or time. The estimated variance functions can be utilized to calculate the age-specific heritability. The results of our BMI study demonstrate that the proposed models provide a more detailed dynamic profile of the A, C, and E variance components than the classical twin models. These models are more flexible than the parametric models (Purcell 2002) as they do not require a prespecified assumption of the function form. Compared to LOSEM, which is an exploratory method and the results of which are heavily dependent on the selection of the width of a weighting distribution (Briley et al. 2015), the proposed models based on P-splines are more adaptive to data and straightforward for model comparison using likelihood-ratio tests.

The ACE(t)-p model is more stable than the ACE(t) model in the sense that it is less sensitive to the number of knots. This amounts to the problem in stratification analyses, that is, how to choose age intervals. The ACE(t)-p model solves this issue by penalizing the smoothness to select the best model. Moreover, the simulation results suggest that the ACE(t)-p model has the advantage of providing better estimates in terms of AMSE; however, the benefit comes from the price of more computational intensity with the same number of knots. It is evident from the simulation study that adding excessive knots in the ACE(t) model worsens the performance for smooth functions. The ACE(t)-p model also performs better with fewer knots provided the number of knots is sufficient but its performance is less affected than that of the ACE(t) model. Thus, assuming that the variance components rarely change drastically within a short period of time, a preferable strategy is to use the ACE(t)-p model and to choose a moderate number of knots [a number of 10 for each component seems enough based on our simulation, although 35-40 is almost adequate for all smooth functions (Ruppert 2002)] to ensure that the model can sufficiently capture the dynamic behavior and in the meantime does not substantially lose performance or increase computational intensity.

The simulation study also indicates that given a sample size of 20,000 and a balanced and widespread distribution of age, both models are able to produce slightly reliable estimates of the functions, which implies that the accurate estimation of variance functions requires a relatively large sample size (at least tens of thousands). Given the current implementation using C++ in the ACEt R package, for a study consisting of 10,000 twin pairs, an Intel i5-3210M laptop takes a few seconds for the ACE(t) model, depending on the number of knots, while it takes around two times longer for the ACE(t)-p model under the same setting. The iterative algorithm for the ACE(t)-p model usually converges within 10 steps.

The trend of the A and C components estimated from the Finnish twin data set for BMI before age 20 years is consistent with that in previous reports (Lajunen et al. 2009; Silventoinen et al. 2009) except that the estimated variance of the C component before age 15 years is higher than that from the stratified study (Lajunen et al. 2009). This is probably caused by the lack of adequate observations at the ages of 13 years and 15 years (Figure 2). It is also possible that the steep decline at the beginning is too rapid for the splines with the order of 3 to capture. Given the absence of the C component, our variance curves of the A and E components indicate that the heritability [estimated as σa2(t)/(σa2(t)+σe2(t))] decreases gradually to <50% until age 50 years. This is in accordance with a previous report in which samples were stratified by 10-year age groups (Korkeila et al. 1991) and a meta-analysis of twin studies showing that the heritability of BMI is higher among adolescents than among adults (Elks et al. 2012). Our estimated variance curves suggest that the decreasing heritability of BMI during middle age is due to the remarkable growth of the unique environmental variance, highlighting the importance of the environmental role in controlling BMI among adults. The variance of the genetic component stabilizes after age 20 years, possibly caused by the end of the physiologically developmental phase that is influenced substantially by genetic factors. It may also relate to the effect size of the SNP rs9939609 in FTO on BMI that rises until around age 20 years and then gradually attenuates into adulthood (Hardy et al. 2010). In contrast, the heritability of 76% from the ACE and AE models is overestimated for age >20 years as our variance curves suggest that the heritability falls below 50% after age 40 years.

Comparing the variance curves from the different models in the Finnish BMI study shows that fitting data using a wrong model may lead to severely misleading results and interpretation. For instance, ignoring the age-dependent effect on the E component inflated the estimation of the variance of the genetic component. As the heterogeneous genetic or environmental effects of age seem ubiquitous for many complex traits, previous twin studies based on the classical twin models (Distel et al. 2011; Elks et al. 2012; Sas et al. 2012) might produce an inaccurate estimate of heritability. Explicitly modeling the variance functions is a necessary action if we want to know more about their strength. Our proposed models provide unprecedentedly refined details for the evolution of both genetic and environmental factors, which pave the way to explore the age-specific heritability function. Identification of the dynamic behavior of the genetic factor likely implies the interaction with age of some key genes related to the phenotype such as FTO for BMI. Our results provide support for the need to further investigate time-dependent effects of significantly associated SNPs identified in GWAS. In fact, multiple significant SNPs have been found to affect BMI with different temporal patterns during middle age (Das et al. 2011).

Future work of the proposed models can be focused on the extension to other types of traits such as binary and categorical traits. A full Bayesian method may be utilized to obtain a more accurate estimate and C.I.’s. Moreover, the proposed models can also be utilized to investigate other timescales and other moderators shared by a twin pair such as birth year. Similar to LOSEM, a major limitation is that the age has to be measured at the twin level (Briley et al. 2015). Further generalization to the variance–covariance matrices using covariance functions (Pletcher and Geyer 1999) is necessary to extend the models to investigate other continuous environmental moderators that may differ within a twin pair, for example education, incomes, or some physical environmental factors.

In conclusion, we have proposed two novel twin models that estimate dynamic behavior of the variance components in the classical twin models with respect to age. As an alternative and generalization to the classical twin models, we have demonstrated the performance of these models through the simulation and the Finnish BMI studies. Our results have emphasized the importance and implications of these models in facilitating twin studies to investigate the age-specific heritability. Given the evidence of the important influence of age on both genetic and environmental factors, previous studies on heritability estimated from the classical twin models may need further review. It should be noted that in addition to age the proposed models can be used to study other environmental moderators as well, which will remarkably broaden their application.

Acknowledgments

We are grateful to the editors and two anonymous referees for their constructive comments that helped us to substantially improve the presentation of this article. L.H. was financially supported by the Department of Public Health, University of Helsinki. K.S. was supported by the Academy of Finland (grant 266592). J.K. was supported by the Academy of Finland (grants 265240 and 263278). The research reported in this article was supported, in part, by grants P01 AG043352 and R01AG047310 from the National Institute on Aging. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Appendix

Detailed Derivation of the Estimation in the ACE(t) Model

According to model (1), the estimation of σe2, βA, and βC is straightforward by minimizing the −2 log-likelihood,

(2)l=log|ΣM|+YM ΣM1YM+log|ΣD|+YDΣD1YD.

The score functions are obtained by calculating the partial derivatives with respect to the parameters,

(2)lσe2=tr(ΣM1)YMΣM1ΣM1YM+tr(ΣD1)YDΣD1ΣD1YD,
(2)lβA=(BTMA12×1)diag(eBTMAβA12×1)d(ΣD11BM1BMΣD1YDYDΣD1)1nM×1+(BTDA12×1)diag(eBTDAβA12×1)d(ΣD1AAΣD1YDYDΣD1)1nD×1,
(2)lβC=(BTMC12×1)diag(eBTMCβC12×1)d(ΣD11BM1BMΣD1YDYDΣD1)1nM×1+(BTDC12×1)diag(eBTDCβC12×1)d(ΣD11BD1BDΣD1YDYDΣD1)1nD×1,

where d(·) is the matrix with only its diagonal elements. The Hessian matrix H(βA,βC,σe2) can be obtained by further calculating the partial derivatives of the score functions with respect to the parameters. Thus, the MLEs σe2^, βkA^, and βlc^ can be obtained by using numerical methods such as the Newton–Raphson method. Practically, it would be time-consuming because the estimation of variance generally requires a relatively large sample size so that the large-scale computations for the Hessian matrix would be inevitably involved. Alternatively, we minimize the log-likelihood function without directly calculating the Hessian matrix by using the limited-memory L-BFGS algorithm (Byrd et al. 1995) for bound constrained optimization, which is in the family of quasi-Newton methods. In this algorithm, the approximated Hessian matrix H˜ is computed in a way that is much faster than the Newton–Raphson method.

Detailed Derivation of the Estimation in the ACE(t)-p Model

We use the difference matrix of the second order (Eilers and Marx 1996) to construct DA=ΔATΔA and DC=ΔCTΔC, where

ΔA=(12100121)(K2)×K

and

ΔC=(12100121)(L2)×L.

Given the target function

g(βA,βC)=(1/2)(log|ΣM|+YMΣM1YM+log|ΣD|+YDΣD1YD+βADAβA/σβA2+βCDCβC/σβC2),

the L-BFGS algorithm in step 1 can be performed using the following score functions:

g(βA,βC)βA=12(BTMAdiag(eBTMAβA)d(Z2MΣM1Z2MZ2MΣM1YMYMΣM1Z2M)1fM×1+0.5(BTDA12×1)diag(eBTDAβA12×1)d(ΣD1ΣD1YDYDΣD1)1nD×1+BTDAdiag(eBTDAβA)d(Z2DΣD1Z2DZ2DΣD1YDYDΣD1Z2D)1fD×1+(DA+DA)βAσβA2),
g(βA,βC)βC=12(BTMCdiag(eBTMCβC)d(Z3MΣM1Z3MZ3MΣM1YMYMΣM1Z3M)1fM×1+BTDCdiag(eBTMCβC)d(Z3DΣD1Z3DZ3DΣD1YDYDΣD1Z3D)1fD×1+(DC+DC)βCσβC2).

Through straightforward matrix operations, it follows that the second derivative after applying expectation over (YM,YD) by recalling that E(YPY)=tr(PΣ) for a nonstochastic matrix P is

g(βA,βC)=12(G11G12G12G22),
G11=σβA2d(DA+DA)+BTMAdiag(e2BTMAβA)d(Z2MΣM11BMΣM1Z2M)BTMA+0.25(BTDA12×1)diag(e2BTDAβA12×1)d( ΣD1ΣD1)(BTDA12×1)+BTDAdiag(e2BTDAβA)(d(Z2DΣD1ΣD1Z2D)+0.5d(Z2DΣD11BDΣD1Z2D))BTDA
G12=BTMCdiag(eBTMCβC+BTMAβA)d(Z3MΣM11BMΣM1Z3M)BTMA+0.5BTDCdiag(eBTDCβC+BTDAβA)d(Z3DΣD1ΣD1Z3D+Z3DΣD11BDΣD1Z3D)BTDA,
G22=σβC-2d(DC+DC)+BTMCdiag(e2BTMCβC)d(Z3MΣM-11BMΣM-1Z3M)BTMC+BTDCdiag(e2BTDCβC)d(Z3DΣD-11BDΣD-1Z3D)BTDC.

To perform the L-BFGS algorithm in step 2, we calculate the score functions given (βA^,βC^),

(2)lpsp(σe2,σβA2,σβC2)σβA2=(σβA4)βA^DAβA^+K˜σβA2+tr(g(βA^,βC^)1g(βA^,βC^)σβA2),
(2)lpsp(σe2,σβA2,σβC2)σβC2=(σβC4)βC^DCβC^+L˜σβC2+tr(g(βA^,βC^)1g(βA^,βC^)σβC2),
(2)lpsp(σe2,σβA2,σβC2)σe2=tr(ΣM1)YMΣM1ΣM1YM+tr(ΣD1)YDΣD1ΣD1YD+tr(g(βA^,βC^)1g(βA^,βC^)σe2),

where

g(βA^,βC^)σβA2=0.5σβA4(d(DA+DA)000),
g(βA^,βC^)σβC2=0.5σβC4(000d(DC+DC)),
g(βA^,βC^)σe2=12(G11σe2G12σe2G12σe2G22σe2),

where

G11σe2=BTMAdiag(e2BTMAβA)d(Z2MΣM1ΣM11BMΣM1Z2M+Z2MΣM11BMΣM1ΣM1Z2M)BTMA0.5(BTDA12×1)diag(e2BTDAβA12×1)d(ΣD1ΣD1ΣD1)(BTDA12×1)BTDAdiag(e2BTDAβA)d(2Z2DΣD1ΣD1ΣD1Z2D+Z2DΣD1ΣD11BDΣD1Z2D+Z2DΣD11BDΣD1ΣD1Z2D)BTDA,
G12σe2=BTMCdiag(eBTMCβC+BTMAβA)d(Z3M ΣM1ΣM11BMΣM1Z3M+Z3MΣM11BMΣM1ΣM1Z3M)BTMABTDCdiag(eBTDCβC+BTDAβA)d(Z3DΣD1ΣD1ΣD1Z3D+Z3DΣD1ΣD11BDΣD1Z3D+Z3DΣD11BDΣD1ΣD1Z3D)BTDA,
G22σe2=BTMCdiag(e2BTMCβC)d(Z3MΣM1ΣM11BMΣM1Z3M+Z3MΣM11BMΣM1ΣM1Z3M)BTMCBTDCdiag(e2BTDCβC)d(Z3DΣD1ΣD11BDΣD1Z3D+Z3DΣD11BDΣD1ΣD1Z3D)BTDC.

Construction of the Confidence Intervals in the ACE(t)-p Model

Let us denote Y=(YM,YD) and β=(βA,βC). It has been proposed that treating the P-splines under a framework of a random effects model has the advantages of giving a better estimate of the variance of β^ (Ruppert et al. 2003). In our context, given σβA2 and σβC2, according to the model assumption, g(β) is the logarithm of the joint distribution f(β, Y) of Y and β. However, when we treat β as random effects, a BP βˇ for the random effects defined as the mean of the posterior distribution given data from the perspective of the Bayesian framework, i.e., E(β|y), has the following desirable properties (Searle et al. 2009),

E(βˇ)=Ey(E(β|y))=E(β),
Var(βˇβ)=Ey(Var(β|y)).

It is, however, not as straightforward to obtain E(β|y) and Ey(Var(β|y)) in this case as the BLUP (Henderson 1975) in the linear mixed model because the posterior distribution of β is not available in a closed form. Note that eg(β) is proportional to the posterior distribution of β. Therefore, various numerical methods can be used to approximate the posterior distribution based on eg(β), from which the estimated BP βˇ, the prediction error Var(βˇβ), and thus the C.I.’s can be derived.

Here we give a simple Metropolis–Hastings (MH) algorithm (Hastings 1970) to numerically obtain the posterior distribution of β given σβA2^ and σβC2^, which is proportional to the joint distribution of Y and β,

eg(β)=exp(12(log|ΣM|+YMΣM1YM+log|ΣD|+YDΣD1YD+βADAβAσβA2+βCDCβCσβC2)),

described as follows:

  1. Set the initial value of β(1) at 0.

  2. Propose the next β(i+1) by sampling from N(β(i), σp2I).

  3. Draw u from Uni[0, 1].

  4. If u<e(g(β(i+1))g(β(i))), accept β(i+1). Then go back to step 2.

The variance σp2 of the proposal distribution affects the performance of the MH algorithm in terms of convergence. In our simulation and real data study, we set σp2=0.1 and the trace plots showed the chains mixed quickly. The MCMC samples can give us accurate estimates of βˇ and Var(β|y) under the guarantee of adequate numerical precision. We found in the simulation study that the posterior mean βˇ is often quite close to the mode β^, suggesting that the posterior distribution is highly symmetric. The conditional MSE of prediction Var(β|y) is proposed to be a better estimate of prediction error than Var(βˇβ) (Skrondal and Rabe-Hesketh 2009), which thus is adopted to compute the C.I.’s in our analyses. Alternatively, a parametric bootstrap method can be used to estimate Ey(Var(β|y)) by generating a set of bootstrap replicates y from the empirical distribution f^(y) according to model (1) with the parameters replaced by their estimates from the EM algorithm. Thus, the variances Var^(BA(t)βˇA) and Var^(BC(t)βˇC) at the time t are expressed as BA(t)Var^(βˇAβA)BA(t) and BC(t)Var^(βˇCβC)BC(t), where Var^(βˇAβA) and Var^(βˇCβC) are estimated from the MCMC outputs. It should be pointed out that the estimated variances do not take into account the variability of σβA2^ and σβC2^, which is the price of our Laplace approximation compared to a full Bayesian approach as previously noted (Krivobokova et al. 2008; Skrondal and Rabe-Hesketh 2009). Finally, the C.I.’s can be constructed by assuming the approximate normality of Var(β|y) based on the Bayesian central limit theorem (Carlin and Louis 1997).

Footnotes

Communicating editor: G. A. Churchill

Literature Cited

  1. Akaike H., 1969.  Fitting autoregressive models for prediction. Ann. Inst. Stat. Math. 21: 243–247. [Google Scholar]
  2. Atchley W. R., Rutledge J. J., 1980.  Genetic components of size and shape. I. Dynamics of components of phenotypic variability and covariability during ontogeny in the laboratory rat. Evolution 34: 1161–1173. [DOI] [PubMed] [Google Scholar]
  3. Boker S., Neale M., Maes H., Wilde M., Spiegel M. et al, 2011.  OpenMx: an open source extended structural equation modeling framework. Psychometrika 76: 306–317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Boomsma D. I., Martin N. G., Molenaar P. C., 1989.  Factor and simplex models for repeated measures: application to two psychomotor measures of alcohol sensitivity in twins. Behav. Genet. 19: 79–96. [DOI] [PubMed] [Google Scholar]
  5. Booth J. G., Hobert J. P., 1998.  Standard errors of prediction in generalized linear mixed models. J. Am. Stat. Assoc. 93: 262–272. [Google Scholar]
  6. Breslow N. E., Clayton D. G., 1993.  Approximate inference in generalized linear mixed models. J. Am. Stat. Assoc. 88: 9–25. [Google Scholar]
  7. Briley D. A., Harden K. P., Bates T. C., Tucker-Drob E. M., 2015.  Nonparametric estimates of gene × environment interaction using local structural equation modeling. Behav. Genet. 45: 581–596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Byrd R., Lu P., Nocedal J., Zhu C., 1995.  A limited memory algorithm for bound constrained optimization. SIAM J. Sci. Comput. 16: 1190–1208. [Google Scholar]
  9. Carlin B. P., Louis T. A., 1997.  Bayes and empirical Bayes methods for data analysis. Stat. Comput. 7: 153–154. [Google Scholar]
  10. Charmantier A., Perrins C., McCleery R. H., Sheldon B. C., 2006.  Age-dependent genetic variance in a life-history trait in the mute swan. Proc. Biol. Sci. 273: 225–232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cheverud J. M., Rutledge J. J., Atchley W. R., 1983.  Quantitative genetics of development: genetic correlations among age-specific trait values and the evolution of ontogeny. Evolution 37: 895–905. [DOI] [PubMed] [Google Scholar]
  12. Claeskens G., Krivobokova T., Opsomer J. D., 2009.  Asymptotic properties of penalized spline estimators. Biometrika 96: 529–544. [Google Scholar]
  13. Crainiceanu C., Ruppert D., Wand M. P., 2005.  Bayesian analysis for penalized spline regression using WinBUGS. J. Stat. Softw. 14: 1–24. [Google Scholar]
  14. Craven P., Wahba G., 1978.  Smoothing noisy data with spline functions. Numer. Math. 31: 377–403. [Google Scholar]
  15. Das K., Li J., Wang Z., Tong C., Fu G., et al. , 2011.  A dynamic model for genome-wide association studies. Hum. Genet. 129: 629–639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. de Boor, C., 1978 A Practical Guide to Splines (Applied Mathematical Sciences, Vol. 27). Springer-Verlag, New York.
  17. Distel M. A., Vink J. M., Bartels M., van Beijsterveldt C. E. M., Neale M. C., et al. , 2011.  Age moderates non-genetic influences on the initiation of cannabis use: a twin-sibling study in Dutch adolescents and young adults. Addiction 106: 1658–1666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Efron, B., and R. J. Tibshirani, 1994 An Introduction to the Bootstrap. CRC Press, Cleveland, OH/Boca Raton, FL. [Google Scholar]
  19. Eichler E. E., Flint J., Gibson G., Kong A., Leal S. M., et al. , 2010.  Missing heritability and strategies for finding the underlying causes of complex disease. Nat. Rev. Genet. 11: 446–450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Eilers P. H. C., Marx B. D., 1996.  Flexible smoothing with B-splines and penalties. Stat. Sci. 11: 89–121. [Google Scholar]
  21. Elks C. E., Den Hoed M., Zhao J. H., Sharp S. J., Wareham N. J., et al. , 2012.  Variability in the heritability of body mass index: a systematic review and meta-regression. Front. Endocrinol. 3: 29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Goeman J. J., 2010.  L1 penalized estimation in the Cox proportional hazards model. Biom. J. 52: 70–84. [DOI] [PubMed] [Google Scholar]
  23. Hardy R., Wills A. K., Wong A., Elks C. E., Wareham N. J., et al. , 2010.  Life course variations in the associations between FTO and MC4R gene variants and body size. Hum. Mol. Genet. 19: 545–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hastings W. K., 1970.  Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57: 97–109. [Google Scholar]
  25. Henderson C. R., 1975.  Best linear unbiased estimation and prediction under a selection model. Biometrics 31: 423–447. [PubMed] [Google Scholar]
  26. Kaprio J., Koskenvuo M., 2002.  Genetic and environmental factors in complex diseases: the older Finnish Twin Cohort. Twin Res. Off. J. Int. Soc. Twin Stud. 5: 358–365. [DOI] [PubMed] [Google Scholar]
  27. Kaprio J., Pulkkinen L., Rose R. J., 2002.  Genetic and environmental factors in health-related behaviors: studies on Finnish twins and twin families. Twin Res. Off. J. Int. Soc. Twin Stud. 5: 366–371. [DOI] [PubMed] [Google Scholar]
  28. Kauermann G., Wegener M., 2011.  Functional variance estimation using penalized splines with principal component analysis. Stat. Comput. 21: 159–171. [Google Scholar]
  29. Kauermann G., Claeskens G., Opsomer J. D., 2009.  Bootstrapping for penalized spline regression. J. Comput. Graph. Stat. 18: 126–146. [Google Scholar]
  30. Korkeila M., Kaprio J., Rissanen A., Koskenvuo M., 1991.  Effects of gender and age on the heritability of body mass index. Int. J. Obes. 15: 647–654. [PubMed] [Google Scholar]
  31. Krivobokova T., Crainiceanu C. M., Kauermann G., 2008.  Fast adaptive penalized splines. J. Comput. Graph. Stat. 17: 1–20. [Google Scholar]
  32. Lajunen H.-R., Kaprio J., Keski-Rahkonen A., Rose R. J., Pulkkinen L., et al. , 2009.  Genetic and environmental effects on body mass index during adolescence: a prospective study among Finnish twins. Int. J. Obes. 33: 559–567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Li Z., Sillanpää M. J., 2013.  A Bayesian nonparametric approach for mapping dynamic quantitative traits. Genetics 194: 997–1016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Li Z., Sillanpää M. J., 2015.  Dynamic quantitative trait locus analysis of plant phenomic data. Trends Plant Sci. 20: 822–833. [DOI] [PubMed] [Google Scholar]
  35. Li J., Wang Z., Li R., Wu R., 2015.  Bayesian group Lasso for nonparametric varying-coefficient models with application to functional genome-wide association studies. Ann. Appl. Stat. 9: 640–664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Locke A. E., Kahali B., Berndt S. I., Justice A. E., Pers T. H., et al. , 2015.  Genetic studies of body mass index yield new insights for obesity biology. Nature 518(7538): 197–206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Manolio T. A., Collins F. S., Cox N. J., Goldstein D. B., Hindorff L. A., et al. , 2009.  Finding the missing heritability of complex diseases. Nature 461: 747–753. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Mao W., Zhao L. H., 2003.  Free-knot polynomial splines with confidence intervals. J. R. Stat. Soc. Ser. B Stat. Methodol. 65: 901–919. [Google Scholar]
  39. Neale M., Cardon L., 1992.  Methodology for Genetic Studies of Twins and Families. Springer Science & Business Media; Dordrecht, The Netherlands. [Google Scholar]
  40. Nelson R. M., Pettersson M. E., Carlborg Ö., 2013.  A century after Fisher: time for a new paradigm in quantitative genetics. Trends Genet. 29: 669–676. [DOI] [PubMed] [Google Scholar]
  41. Ortega-Alonso A., Pietiläinen K. H., Silventoinen K., Saarni S. E., Kaprio J., 2012.  Genetic and environmental factors influencing BMI development from adolescence to young adulthood. Behav. Genet. 42: 73–85. [DOI] [PubMed] [Google Scholar]
  42. O’Sullivan F., 1986.  A statistical perspective on ill-posed inverse problems. Stat. Sci. 1: 502–518. [Google Scholar]
  43. Pitkäniemi J., Moltchanova E., Haapala L., Harjutsalo V., Tuomilehto J., et al. , 2007.  Genetic random effects model for family data with long-term survivors: analysis of diabetic nephropathy in type 1 diabetes. Genet. Epidemiol. 31: 697–708. [DOI] [PubMed] [Google Scholar]
  44. Pletcher S. D., Geyer C. J., 1999.  The genetic analysis of age-dependent traits: modeling the character process. Genetics 153: 825–835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Polderman T. J. C., Benyamin B., de Leeuw C. A., Sullivan P. F., van Bochoven A., et al. , 2015.  Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nat. Genet. 47: 702–709. [DOI] [PubMed] [Google Scholar]
  46. Purcell S., 2002.  Variance components models for gene-environment interaction in twin analysis. Twin Res. Off. J. Int. Soc. Twin Stud. 5: 554–571. [DOI] [PubMed] [Google Scholar]
  47. Rabe-Hesketh S., Skrondal A., Gjessing H. K., 2008.  Biometrical modeling of twin and family data using standard mixed model software. Biometrics 64: 280–288. [DOI] [PubMed] [Google Scholar]
  48. Rijsdijk F. V., Sham P. C., 2002.  Analytic approaches to twin data using structural equation models. Brief. Bioinform. 3: 119–133. [DOI] [PubMed] [Google Scholar]
  49. Robinson G. K., 1991.  That BLUP is a good thing: the estimation of random effects. Stat. Sci. 6: 15–32. [Google Scholar]
  50. Rosenberg P. S., 1995.  Hazard function estimation using B-splines. Biometrics 51: 874–887. [PubMed] [Google Scholar]
  51. Ruppert D., 2002.  Selecting the number of knots for penalized splines. J. Comput. Graph. Stat. 11: 735–757. [Google Scholar]
  52. Ruppert, D., M. P. Wand, and R. J. Carroll, 2003 Semiparametric Regression. Cambridge University Press, Cambridge/London/New York. [Google Scholar]
  53. Sas A. A., Jamshidi Y., Zheng D., Wu T., Korf J., et al. , 2012.  The age-dependency of genetic and environmental influences on serum cytokine levels: a twin study. Cytokine 60: 108–113. [DOI] [PubMed] [Google Scholar]
  54. Searle, S. R., G. Casella, and C. E. McCulloch, 2009 Variance Components. John Wiley & Sons, New York/Hoboken, NJ. [Google Scholar]
  55. Silventoinen K., Kaprio J., Lahelma E., Koskenvuo M., 2000.  Relative effect of genetic and environmental factors on body height: differences across birth cohorts among Finnish men and women. Am. J. Public Health 90: 627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Silventoinen K., Rokholm B., J. Kaprio, and T. I. Sørensen, 2009.  The genetic and environmental influences on childhood obesity: a systematic review of twin and adoption studies. Int. J. Obes. 34: 29–40. [DOI] [PubMed] [Google Scholar]
  57. Silventoinen K., Jelenkovic A., Sund R., Honda C., Aaltonen S., et al. , 2015.  The CODATwins Project: the cohort description of collaborative project of development of anthropometrical measures in twins to study macro-environmental variation in genetic and environmental effects on anthropometric traits. Twin Res. Hum. Genet. Off. J. Int. Soc. Twin Stud. 18: 348–360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Skrondal A., Rabe-Hesketh S., 2009.  Prediction in multilevel generalized linear models. J. R. Stat. Soc. Ser. A Stat. Soc. 172: 659–687. [Google Scholar]
  59. Wood S. N., 2006.  On confidence intervals for generalized additive models based on penalized regression splines. Aust. N. Z. J. Stat. 48: 445–464. [Google Scholar]
  60. Wu R., Lin M., 2006.  Functional mapping - how to map and study the genetic architecture of dynamic complex traits. Nat. Rev. Genet. 7: 229–237. [DOI] [PubMed] [Google Scholar]
  61. Yang J., Bakshi A., Zhu Z., Hemani G., Vinkhuyzen A. A. E. et al, 2015.  Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat. Genet. 47: 1114–1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Zyphur M. J., Zhang Z., Barsky A. P., Li W.-D., 2013.  An ACE in the hole: twin family models for applied behavioral genetics research. Leadersh. Q. 24: 572–594. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The simulation data is included as an example dataset in the R package.


Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES