Ge et al. (1) consider the extension of Fisher’s classic model for heritability to the case where there are repeated measurements on subjects. One approach to analyzing repeated measurements is to average observations. The authors show empirically and via simulations that estimates of heritability derived from averaging repeated measurements lead to underestimates of heritability. Some may find the bias revealed by Ge et al. (1) to be surprising because averaging is commonly justified in other settings such as repeated measures ANOVA. In this letter we detail the bias that arises from conflating measurement error and unique environmental variance. This elucidates the authors’ empirical findings, which represent a case with large measurement error exacerbated by only two measurements per subject.
Consider the model for additive genetic, common environmental, and unique environmental components. We use the mixed-model formulation as in ref. 2 but include measurement error. For conciseness, we assume no nuisance covariates. Let be the kth measurement for k = 1, . . . , n (for simplicity, we assume the same n for all subjects) for the jth individual in the ith family. Let denote the additive genetic component, and for dizygotics (DZs) and siblings we add as in ref. 2. Let denote the common environmental component, the unique environmental, and the measurement error (as the authors note, can also include biological transients). For monozygotic twins (MZs),
and for DZs and full siblings,
and, consequently, for both cases. Ge at al. (1) observe and are identifiable if data are not averaged and define measurement-error corrected heritability: .
Now consider (the averaged data). For MZs,
and then , and similarly for DZs and full siblings. Now and are no longer identifiable, so they are conflated as a single term; define . Consequently, ; equivalently, .
Thus, we see that the bias in the averaged data decreases as the number of repeated measurements increases, as well as when the measurement error decreases. Here we have only examined the model-based heritabilities based on identifiability issues, whereas in practice maximum likelihood or other estimators may introduce some additional biases, but the simple analysis here suffices to reveal the crux of the issue. The authors highlight a case with high variability and only two measurements (n = 2), and hence the differences between approaches are large. We commend the authors for pointing out the empirical differences that can arise but also note the role of the number of repeated measurements.
Footnotes
The authors declare no conflict of interest.
References
- 1.Ge T, Holmes AJ, Buckner RL, Smoller JW, Sabuncu MR. Heritability analysis with repeat measurements and its application to resting-state functional connectivity. Proc Natl Acad Sci USA. 2017;114:5521–5526. doi: 10.1073/pnas.1700765114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Rabe-Hasketh S, Skrondal A, Gjessing H. Biometrical modeling of twin and family data using standard mixed model software. Biometrics. 2008;64:280–288. doi: 10.1111/j.1541-0420.2007.00803.x. [DOI] [PubMed] [Google Scholar]
