Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2017 Jul 26;114(32):8602–8607. doi: 10.1073/pnas.1621096114

Detection and quantification of inbreeding depression for complex traits from SNP data

Loic Yengo a,1, Zhihong Zhu a, Naomi R Wray a,b, Bruce S Weir c, Jian Yang a,b, Matthew R Robinson a,d, Peter M Visscher a,b,1
PMCID: PMC5558994  PMID: 28747529

Significance

Inbreeding depression (ID) is the reduction of fitness in offspring of related parents. This phenomenon can be quantified from SNP data through a number of measures of inbreeding. Our study addresses two key questions. How accurate are the different methods to estimate ID? And how and why should investigators choose among the multiple inbreeding measures to detect and quantify ID? Here, we compare the behaviors of ID estimates from three commonly used SNP-based measures of inbreeding and provide both theoretical and empirical arguments to answer these questions. Our work illustrates how to analyze SNP data efficiently to detect and quantify ID, across species and traits.

Keywords: inbreeding depression, directional dominance, quantitative genetics, single-nucleotide polymorphism, homozygosity

Abstract

Quantifying the effects of inbreeding is critical to characterizing the genetic architecture of complex traits. This study highlights through theory and simulations the strengths and shortcomings of three SNP-based inbreeding measures commonly used to estimate inbreeding depression (ID). We demonstrate that heterogeneity in linkage disequilibrium (LD) between causal variants and SNPs biases ID estimates, and we develop an approach to correct this bias using LD and minor allele frequency stratified inference (LDMS). We quantified ID in 25 traits measured in 140,000 participants of the UK Biobank, using LDMS, and confirmed previously published ID for 4 traits. We find unique evidence of ID for handgrip strength, waist/hip ratio, and visual and auditory acuity (ID between −2.3 and −5.2 phenotypic SDs for complete inbreeding; P<0.001). Our results illustrate that a careful choice of the measure of inbreeding combined with LDMS stratification improves both detection and quantification of ID using SNP data.


Mating between close relatives has detrimental consequences on the survival and fertility of resulting offspring (1). This overall reduction of fitness, referred to as inbreeding depression (ID), is observable in a wide range of organisms, including plants (2), animals (3, 4), and humans (5). In humans, major abnormalities are more frequent in children from consanguineous marriages (6) and genes causing rare diseases can be mapped by ascertaining children from such matings (7). To date, although the genetic basis of ID is not completely elucidated, two main hypotheses are proposed to explain this phenomenon: homozygosity for partially recessive deleterious mutations and heterozygous advantage (overdominance) (1, 8). More generally, ID can be estimated for any complex trait, even if the trait is not an obvious component of fitness. For polygenic traits, ID can be detected if there is directional dominance (DD) across loci, which means that the phenotype of individuals who are heterozygous deviates from the average phenotypes of homozygous individuals in a consistent direction. For fitness components, DD is negative; i.e., on average homozygosity reduces fitness.

In practice, ID can be estimated from pedigree studies when the relationships between parents are known (6, 9). However, given the limited number and the small sizes of such studies in humans, contemporary efforts (5, 10) to quantify ID have instead used SNP genotyping platforms to directly estimate inbreeding coefficients (F). SNP data may allow a more accurate evaluation of inbreeding (11), in particular for distant and cryptic inbreeding, and allow inference to be drawn from large population data (10). Conceptually, once a measure of F is derived from SNP data, ID can subsequently be estimated by correlating phenotype with the estimated F.

Genome-wide estimators of F fall in two categories: average homozygosity measures across loci (irrespective of position) and measures of continuous runs of homozygosity (ROH). Using ROH, ID has been reported for diseases (12, 13), height (5), and cognition (10). ROH-based estimates of F (FROH) have been previously shown to better correlate with the unobserved pedigree-inbreeding coefficient compared with other measures of inbreeding (14, 15), which has made them a gold standard. However, the sampling variance of these estimates is large, and consequently large sample sizes (10) are required to detect ID with FROH measures. In addition, FROH estimation depends on arbitrary (although optimized) choices of multiple parameters like the minimum number of SNPs covered by a ROH, the distance between two consecutive ROHs, and the number of heterozygous genotypes allowed in each ROH. Setting ROH length cutoffs ignores the contribution to ID of smaller identity by descent segments due to distant ancestors.

Therefore, quantifying the theoretical properties (bias and variance) of ID estimates derived from FROH is challenging. These two critical limitations led us to consider two other commonly used measures of inbreeding (3, 15), namely the excess of homozygosity inbreeding coefficient (hereafter denoted FHOM) as estimated in PLINK (16) and the correlation between uniting gametes (hereafter denoted FUNI) previously introduced as F^III in Yang et al. (17), as potential efficient measures for detecting and quantifying ID. We present the theory underlying unbiased estimation of ID and compare through simulations the performances of these three measures of inbreeding. We then quantify ID in 25 quantitative traits measured in a large dataset of 140,000 individuals from the UK Biobank, using an approach that is robust to different assumptions on the distribution of effect sizes, to possible directional effects of minor alleles and to population stratification.

Results

Theoretical Determinants of Unbiased Estimation of ID.

We assume that the phenotype of interest is a quantitative trait y with genetic component that is underlain by random additive and dominance effects of m independent causal variants. We denote b=j=1m2pj(1pj)δj as the expected depression in y resulting from complete inbreeding, where pj is the minor allele frequency (MAF) of the jth causal variant and δj the expectation of its dominance effect. In the absence of epistasis, fitness-related phenotypes linearly decrease with increasing inbreeding (Eq. S2). This well-established linear relationship naturally implies the use of linear regression methods to estimate ID.

Least-squares estimates of ID obtained with FUNI converge with increasing sample size toward bUNI=cov[𝒚,FUNI]/var[FUNI]. When explicitly calculating cov[y,FUNI] and var[FUNI] with respect to the genotypes and the effect size distributions, we found under classical assumptions (Eq. S4 and Table S1) that bUNI is unbiased when the average linkage disequilibrium (LD) among observed SNPs equals the weighted (by effect sizes) average LD between causal variants and observed SNPs. Although influenced by the effect size distribution, the consistency of bUNI toward b is mainly driven by differences in LD between causal variants and observed SNPs. Therefore, a simple condition under which bUNI is unbiased is if the causal variants are a random subset of the observed SNPs. However, if the causal variants are enriched in high-LD regions of the genome, bUNI will overestimate the actual inbreeding depression. In contrast, if the causal variants are enriched in a low-LD region like DNAse-I hypersensitive sites or enhancers (18), or if they are enriched among low-frequency variants, bUNI is expected to underestimate the true effect. This is further illustrated in our first simulation (Fig. 1).

Table S1.

Joint distribution of allele frequencies as a function of the LD

Alleles aj Aj Allele frequency
ak pjpk + Djk (1 − pj) pkDjk pk
Ak (1 − pk) pjDjk (1 − pj) (1 − pk) + Djk 1 − pk
pj 1 − pj 1

aj and ak are the minor alleles of SNP j and SNP k, and Aj and Ak are their major alleles, respectively The MAFs of SNP j and SNP k are pj and pk, respectively. Djk is the LD parameter.

Fig. 1.

Fig. 1.

Averaged estimates of inbreeding depression (ID) from 1,000 simulated datasets. Datasets were simulated assuming a true ID parameter b = −3 (horizontal gray line) phenotypic SD for complete inbreeding. In scenario 1 the m=1,000 causal variants were randomly sampled from all observed SNPs, whereas in scenarios 2 and 3 they were respectively sampled from low- and high-LD regions of the genome. In A the expectation of the dominance effects (δj for the jth causal variant) is constant (neutral model) whereas in panel B δj is inversely proportional to the variance of the minor allele count at each causal variant. FHOM, excess homozygosity inbreeding measure; FROH, runs of homozygosity-based inbreeding measures; FUNI, measure of inbreeding based on correlation between uniting gametes; LDMS, LD and minor allele frequency stratified inference; SEM, SE of the mean.

LD heterogeneity between causal variants and SNPs used for inference has been previously shown to determine the consistency of heritability estimates (1921). We leveraged this estimation problem to propose a strategy to correct the differential LD bias when estimating ID. Following a previous approach (19), we explored how stratifying SNPs according to their LD score (22) and their MAFs before analyses (details given in Supporting Information) could correct or at least reduce these biases. We illustrate in our first simulation that LD score and MAF (LDMS) stratification performs well in correcting these biases (Fig. 1).

Similar to that shown with FUNI, we prove that the consistency of ID estimates obtained with FHOM (hereafter denoted bHOM) is also determined by LD differences between SNPs and causal variants (Eq. S5). However, the bias of bHOM cannot simply be predicted by the ratio of the mean LD score in causal variants over the mean LD score in SNPs (Supporting Information). Nevertheless, our derivations predict that bHOM behaves similarly to bUNI with respect to LD differences between causal variants and SNPs. Importantly, we also prove that possible directional effects of minor alleles (DEMA) could confound bHOM because of the correlation between minor allele counts and FHOM (Supporting Information). Such directional effects could arise as a consequence of directional selection (when the minor allele is also the derived allele) as previously reported in human height (23) or simply because of population stratification (PS).

Simulation Study.

The complete description of the simulation study is given in Supporting Information.

Influence of differential MAF and LD between causal variants and SNPs.

We first considered three scenarios to illustrate the influence of LD and MAF heterogeneity between causal variants and SNPs. In all these scenarios, we assumed no DEMA, i.e., parameter s=0 in Eq. S3, and that b = −3 phenotypic SDs. Moreover, we assumed the expectation of the dominance effects to be either constant, i.e., δj=b/j=1m2pj(1pj), or inversely proportional to the variance of the minor allele count, i.e., δj=b/2mpj(1pj). The first assumption corresponds to neutral traits, whereas the second one assigns a larger effect to SNPs with lower MAF and therefore corresponds more to traits under directional selection. Unbiasedness is defined below as when the average estimate of ID over multiple simulation replicates does not significantly differ from the value of b used for simulation.

Scenario 1.

In this scenario the causal variants were randomly sampled from the 3,857,369 autosomal SNPs that passed the genotypes quality control (Supporting Information). As predicted by our derivations, we observed that FUNI-based estimates of b were unbiased when dominance effects are assumed inversely proportional to the variances of allele counts, whereas an overestimation of 14% of b was observed when dominance effects are assumed constant (Fig. 1A). This overestimation is explained by the fact that assuming a constant dominance effect, regardless of allele frequencies, creates an apparent MAF and LD heterogeneity between SNPs and causal variants by relatively up-weighting common SNPs compared with rarer SNPs (Eq. S3). We observed that LDMS stratification, which accounts for that heterogeneity, completely corrected this upward bias as presented in Fig. 1A. In addition, we found that FHOM produced unbiased estimates of b when dominance effects are assumed constant as for a neutral trait (Fig. 1A), but was biased downward (7% of b) when dominance effects are inversely proportional to the variances of allele counts (Fig. 1B). This downward bias can be explained using the same reasoning presented above because in that case assuming dominance effects inversely proportional to the variances of allele counts relatively up-weights rarer SNPs compared with common ones. This downward bias could similarly be corrected using LDMS stratification. We also found that estimates of b obtained with ROH-based measures of inbreeding were strongly biased: +162% of b using the definition of ROH from Joshi et al. (10) [hereafter denoted FROH(1)] and +91% of b using an alternative definition from Gazal et al. (15) or Howrigan et al. (24) [hereafter denoted as FROH(2)]. The main difference between those two definitions of ROH is that FROH(2) requires LD pruning of the SNPs before calling the ROHs, whereas FROH(1) explicitly imposes a constraint on the ROH lengths (here >1.5 Mb). This result highlights that LD pruning improves ID estimation using ROH-based inbreeding measures but still remains insufficient to produce unbiased estimates. Indeed, using more stringent LD pruning thresholds did not change our conclusion (Fig. S1). Overall, we found that LDMS stratified estimates for FUNI and FHOM were unbiased in all cases (Fig. 1 A and B), which emphasizes that this strategy can be safely used even when causal variants are perfectly tagged by SNPs.

Fig. S1.

Fig. S1.

Bias of the ROH-based inbreeding measures (FROH) expressed as the linear regression of the inbreeding coefficient at causal variants (FQTL) over FROH. Unbiasedness is equivalent to a regression coefficient of 1 (red horizontal line). All 3,857,369 SNPs were considered as causal variants. Assumption A: The expectations of dominance effects are equal for all SNPs. Assumption B: The expectation of the dominance effect of each SNP is inversely proportional to the variance of the minor allele count for that SNP.

On average over 1,000 simulation replicates we found that FHOM-associated estimates had smaller standard errors (SE) compared with FUNI or FROH (FROH(1) and FROH(2)) (Fig. S2 A and B). FHOM consequently yielded the largest statistical power whereas FUNI was second best with a power on average 10% below that of FHOM. On the other hand, because of their large SEs, FROH(1) and FROH(2) yielded the smallest statistical power to detect ID. Finally, we found that LDMS stratified estimates had 13% larger SEs compared with nonstratified estimates. This increase in SE corresponds on average over all inbreeding measures to an 8% loss of statistical power and is explained by the larger underlying effective dimensionality (4 LD score strata × 6 MAF strata = 24 parameters actually estimated; Supporting Information) of the LDMS approach compared with the nonstratified inference.

Fig. S2.

Fig. S2.

Averaged SEs of ID estimates from 1,000 simulated datasets. Datasets were simulated assuming a true ID parameter b=3 (horizontal gray line) phenotypic SD for complete inbreeding. In scenario 1 the m = 1,000 causal variants were randomly sampled from all observed SNPs, whereas in scenarios 2 and 3 they were respectively sampled from low- and high-LD regions of the genome. In A the expectation of the dominance effects (δj for the jth causal variant) is constant (neutral model) whereas in B δj is inversely proportional to the variance of the minor allele count at each causal variant. FHOM, excess homozygosity inbreeding measure; FROH(1) and FROH(2), runs of homozygosity-based inbreeding measures; FUNI, measure of inbreeding based on correlation between uniting gametes; LDMS, LD and MAF stratified inference.

Scenarios 2 and 3.

For the two other scenarios we used 1,358,699 SNPs within exons, introns, 3’-UTRs, 5’-UTRs, and promoter regions ±500 bp (SI Materials and Methods, URLs). SNPs within these five genomic (sets of) regions have distinct MAF and LD distributions as shown in Figs. S3 and S4. In scenario 2, we sampled the causal variants among 542,379 intronic SNPs with MAF <5% whereas in scenario 3 causal variants were sampled among 28,341 SNPs within exons, 3’-UTRs, and 5’-UTRs. Our theoretical derivations predict that FUNI and FHOM would underestimate the true ID in scenario 2 because causal variants in that scenario had on average lower LD scores (Fig. S3). Accordingly, we found over 1,000 simulation replicates an underestimation of 19% of the true ID for FUNI and FHOM (Fig. 1). These downward biases could be reduced below 1% of b using LDMS stratification (Fig. 1 A and B) and were not significantly different from 0 (P>0.5). In scenario 3 causal variants had on average larger LD scores and MAF (Figs. S3 and S4). We therefore expected an overestimation of ID estimates in that scenario according to our theoretical derivations. This predicted upward bias was indeed more noticeable in our simulations (40% on average over all inbreeding measures) compared with scenario 2. Still, using LDMS stratification we were able to reduce these biases down to <15% of b on average over all inbreeding measures (Fig. 1) and more specifically <10% of b for FUNI. Overall, LDMS stratification using FUNI yielded the smallest biases compared with all other strategies.

Fig. S3.

Fig. S3.

Mean LD score for SNPs within exons (coding), introns, 3’- and 5’-UTRs, and promoter regions. LD scores were calculated using 3.8 million genotyped and imputed SNPs that passed our quality control.

Fig. S4.

Fig. S4.

MAF distribution for 3.8 million SNPs within exons (coding), introns, 3’- and 5’-UTRs, and promoter regions. The height of each rectangle is proportional to the fraction of SNPs within the functional category and the width is proportional to the fraction of SNPs within the MAF interval.

Influence of DEMA.

Let αj denote the expectation of the additive effect of the minor allele at the jth causal variant. We define s=j=1m2pjαj as an overall measure of DEMA. Under this assumption we prove that estimates of ID obtained with FHOM are confounded because of the correlation between FHOM and minor allele counts (Supporting Information). We illustrate and quantify here that confounding bias. The parameters of this simulation are similar to scenario 1 of the first simulation, except that data are now simulated assuming no ID, i.e., b=0 and using s=10. With s=10, DEMA (which contributes to the additive genetic variance) accounts in our simulations for 0.3% of the total phenotypic variance (Supporting Information). We considered two alternatives for the expectation of the additive effects αj: (i) αj=s/j=1m2pj is constant and (ii) αj=s/2mpj is inversely proportional to the MAF pj. Under both alternatives, we found that estimates of ID obtained with FHOM were severely biased (between −2 and −3 phenotypic SDs whereas the true value is 0; Fig. 2A) unlike those derived from other inbreeding measures. Whether this bias can be corrected is a difficult question in practice. Indeed, under the simplistic scenario considered here where causal variants are well tagged, adjusting for a genetic score summing all minor alleles should completely correct this bias. However, in more realistic situations where causal variants are only partially tagged, this would remain insufficient. In contrast, theory shows that FUNI is orthogonal to minor allele counts and consequently would not be influenced by such directional effects if these exist. This result has motivated our decision to restrict real data analyses to FUNI only even if FHOM had better statistical power in our previous simulations.

Fig. 2.

Fig. 2.

Averaged estimates of ID from 1,000 simulated datasets. In A datasets were simulated assuming no ID, i.e., b=0 and nonnegative expectation for the additive effects (i.e., αj>0, for the jth causal variant). In B datasets were simulated assuming no ID (b=0) and no directional effect of minor alleles (s=0) but including the contribution of the first 10 genotypic PCs to model the effect of population stratification. FHOM, excess homozygosity inbreeding measure; FROH, runs of homozygosity-based inbreeding measures; FUNI, measure of inbreeding based on correlation between uniting gametes; SEM, SE of the mean.

DEMA could also arise as a consequence of PS. To illustrate that second aspect, we performed another simulation with settings similar to scenario 1 of the first simulation (with b=0 and s=0) but now include the effect of 10 genotypic principal components (PC) as a proxy for PS (Supporting information). When varying the contribution of PS to the phenotypic variance from 0 to 5%, we observed that PS had a larger influence on estimates derived from FHOM and FROH. These observations are consistent with our theoretical results (at least for FHOM) as they directly derive from the correlation between FHOM and minor allele counts, the weighted sum of which constitutes the PC. The biases of FHOM- and FROH-associated ID estimates were on average 1 phenotypic SD (Fig. 2B) when PS explained 5% of the phenotypic variance but were significant only for FHOM (Fig. S5B). On the contrary, the biases observed when using FUNI never exceeded 0.5 phenotypic SD (Fig. 2B) and were not statistically different from 0 (Fig. S5B). In conclusion, orthogonality between FUNI and minor allele counts (25) (Supporting Information) guarantees that confounding by DEMA or PS is negligible for ID estimates obtained with this measure.

Fig. S5.

Fig. S5.

Type I error rate associated with the detection of ID. Type I error rate is measured as the fraction of P values <5% from 1,000 simulated datasets. In A datasets were simulated assuming no ID; i.e., b=0 and nonnegative expectation for the additive effects (i.e., αj>0, for the jth causal variant). In B datasets were simulated assuming no ID (b=0) and no directional effect of minor alleles (s=0), but including the contribution of the first 10 genotypic PCs to model the effect of PS. Abbreviations are as defined in Fig. S2.

To summarize this section, we show in our simulations that biases in ID estimates induced by MAF and LD heterogeneity between SNPs and causal variants can be corrected using LDMS stratification. Moreover, we show on average that LDMS correction performs better when applied to FUNI and that FUNI-based ID estimates are robust to DEMA and PS. Overall, FUNI offers the best trade-off between statistical efficiency and unbiasedness in the situations covered in this simulation study. We therefore recommend its use and focus hereafter our analyses on real data to FUNI.

Analysis of UK Biobank Data.

We quantified ID in 25 quantitative traits measured in 140,000 (Table 1) participants from the UK Biobank (Supporting Information). These traits can be grouped into three categories including physical measures (standing height, weight, body mass index, waist and hip circumferences, waist/hip ratio, bone mineral density, body fat percentage, hand grip strength, systolic blood pressure, diastolic blood pressure, heart pulse rate, peak expiratory flow, visual acuity measured on log minimum angle of resolution (MAR) scale, and auditory acuity measured as the speech reception threshold), cognitive traits and educational attainment (fluid intelligence score, mean time to identify matches, maximum number of digits remembered, and age when completed full education), and sex-specific reproductive traits (number of children fathered, number of live births, age at menarche, and age at menopause) (Table S2). Some of these traits like standing height, peak expiratory flow (strongly correlated with forced expiratory volume in 1 s: r=0.79, P<1010), educational attainment, and cognitive ability were previously reported to be associated with inbreeding (10). Beyond quantifying the effect of inbreeding on these traits, we also aimed to evaluate whether differential LD and MAF distribution in causal variants influenced classical least-squares estimates and if so to correct these biases using our LDMS inference. The analyses were performed using linear regression adjusted for age, sex (for traits not specific to males or females), recruitment center, Townsend deprivation index (26) as a proxy for socio-economical status, and the first 10 genotypic PCs. The last three adjustments were considered to account for geographical and socio-economic structures in the UK population, which we found to correlate with levels of inbreeding (Table S3).

Table 1.

Statistically significant estimates of inbreeding depression for eight quantitative traits measured in the UK Biobank

FUNI FUNI(LDMS)
Traits N Estimate SE P value Estimate SE P value PHET
PEF 117,575 (103,781) −4.1 (−4.1) 0.62 (0.74) 4.57 ×1011 (3.48 ×108) −4.12 (−4.19) 0.69 (0.84) 2.25 ×109 (6.35 ×107) 0.941 (0.81)
AA, speech reception threshold 43,175 (38,449) 5.23 (4.44) 1.04 (1.26) 5.55 ×107 (4.45 ×104) 5.34 (4.39) 1.16 (1.45) 4.6 ×106 (2.51 ×103) 0.828 (0.94)
FIS 45,043 (40,089) −3.9 (−3.14) 0.97 (1.23) 5.36 ×105 (1.08 ×102) −4.72 (−4.32) 1.06 (1.42) 8.52 ×106 (2.25 ×103) 0.061 (0.07)
HGS, average of left and right hands 139,623 (122,950) −2.36 (−2.72) 0.54 (0.68) 1.38 ×105 (5.79 ×105) −2.43 (−3.31) 0.6 (0.76) 4.64 ×105 (1.51 ×105) 0.771 (0.1)
NCF 51,494 (45,483) −3.09 (−3.69) 0.96 (1.16) 1.27 ×103 (1.44 ×103) −4.01 (−4.58) 1.07 (1.32) 1.79 ×104 (5.17 ×104) 0.039 (0.14)
MTCIM 138,902 (122,334) 1.94 (1.7) 0.54 (0.68) 3.66 ×104 (1.21 ×102) 2.05 (2.1) 0.6 (0.77) 6.14 ×104 (6.24 ×103) 0.647 (0.26)
WHR 140,295 (123,540) 2.35 (3.03) 0.53 (0.67) 1.12 ×105 (6.18 ×106) 1.97 (2.89) 0.6 (0.76) 7.85 ×104 (1.37 ×104) 0.121 (0.69)
VA, log MAR scale 29,616 (26,596) 4.04 (5.77) 1.20 (1.51) 7.42 ×104 (1.37 ×104) 4.39 (6.68) 1.32 (1.73) 9.04 ×104 (1.14 ×104) 0.518 (0.26)

Effect sizes and standard errors are expressed in phenotypic SD of the trait. Results presented in parentheses were obtained after removing 16,781 related individuals and 19 extreme cases of inbreeding (FUNI >0.15). N: sample size in the analysis. PHET is the P value from the LDMS heterogeneity test comparing nonstratified and LDMS-stratified estimates.

Table S2.

Estimates of inbreeding depression for 25 quantitative traits measured in the UK Biobank

FUNI FHOM FROH(1) FROH(2)
Traits N Estimate (SE) P value Estimate (SE) P value Estimate (SE) P value Estimate (SE) P value
PEF 117,575 −4.1 (0.6) 4.6 × 10−11 −2.7 (0.5) 2.1 × 10−8 −7.6 (1.4) 1.3 × 10−7 −8.5 (1.5) 3.9 × 10−8
AA 43,175 5.2 (1.0) 5.5 × 10−7 3.2 (0.8) 7.5 × 10−5 9.4 (2.4) 1.1 × 10−4 11 (2.6) 1.3 × 10−5
WHR 140,295 2.3 (0.5) 1.1 × 10−5 1.8 (0.4) 3.2 × 10−5 2.7 (1.2) 0.02 4.5 (1.3) 5.5 × 10−4
HGS 139,623 −2.4 (0.5) 1.4 × 10−5 −1.6 (0.4) 1.5 × 10−4 −3.8 (1.2) 1.8 × 10−3 −4.3 (1.3) 1.1 × 10−3
FIS 45,043 −3.9 (1.0) 5.4 × 10−5 −2 (0.8) 8.2 × 10−3 −7.9 (2.2) 3.9 × 10−4 −7.9 (2.4) 9 × 10−4
MTCIM 138,902 1.9 (0.5) 3.7 × 10−4 1.3 (0.4) 2.4 × 10−3 3.7 (1.2) 2.5 × 10−3 3.6 (1.3) 6.2 × 10−3
Height 140,302 −1.9 (0.5) 4.8 × 10−4 −1.1 (0.4) 0.01 −2.7 (1.2) 0.03 −3.5 (1.3) 7.3 × 10−3
VA 29,616 4 (1.2) 7.4 × 10−4 2 (0.9) 0.04 7.7 (2.7) 4.9 × 10−3 7.9 (2.9) 7.3 × 10−3
NCF 51,494 −3.1 (1.0) 1.3 × 10−3 −2.4 (0.7) 1.4 × 10−3 −5.4 (2.2) 0.01 −7.7 (2.4) 1.3 × 10−3
WC 140,233 1.4 (0.5) 7.2 × 10−3 0.91 (0.4) 0.03 1.9 (1.2) 0.12 2.7 (1.3) 0.04
BFat 138,055 1.1 (0.4) 0.05 0.71 (0.4) 0.09 1.6 (1.2) 0.19 1.9 (1.3) 0.14
HR 130,694 1.1 (0.6) 0.05 1.2 (0.4) 5.7 × 10−3 2.6 (1.3) 0.04 3.2 (1.4) 0.02
MNDR 145,75 −3.4 (1.8) 0.06 −2.7 (1.4) 0.06 −6.7 (4.3) 0.19 −8.3 (4.5) 0.07
A-Mena 72,196 1.3 (0.8) 0.08 0.29 (0.6) 0.63 1.4 (1.7) 0.42 1.9 (1.9) 0.3
BMI 139,686 0.94 (0.5) 0.08 0.54 (0.4) 0.21 0.86 (1.2) 0.48 1.3 (1.3) 0.3
BMR 138,047 −0.83 (0.5) 0.12 −0.61 (0.4) 0.16 −2.3 (1.2) 0.05 −2.5 (1.3) 0.05
A-Edu 95,540 −0.6 (0.6) 0.34 −0.36 (0.5) 0.48 0.19 (1.4) 0.89 −0.07 (1.5) 0.96
NLB 59,943 0.8 (0.9) 0.36 0.26 (0.7) 0.71 1.1 (2) 0.57 0.77 (2.1) 0.72
NIM 138,543 −0.45 (0.5) 0.41 −0.79 (0.4) 0.07 −2.9 (1.2) 0.02 −3.1 (1.3) 0.02
A-Meno 42,782 −0.82 (1.0) 0.43 −0.2 (0.9) 0.82 −1.8 (2.5) 0.46 −2 (2.7) 0.44
SBP 130,787 0.32 (0.6) 0.57 0.01 (0.4) 0.99 0.24 (1.3) 0.85 0.79 (1.4) 0.56
BMD 80,698 −0.31 (0.7) 0.65 −0.26 (0.5) 0.64 0.69 (1.5) 0.65 0.08 (1.6) 0.96
DBP 130,831 0.17 (0.56) 0.75 −0.21 (0.4) 0.63 0.8 (1.3) 0.52 1.1 (1.4) 0.39
HC 139,694 0.11 (0.54) 0.83 −0.1 (0.4) 0.81 0.5 (1.2) 0.68 0.17 (1.3) 0.89
Weight 139,916 −0.07 (0.5) 0.89 −0.096 (0.4) 0.82 −0.71 (1.2) 0.56 −0.68 (1.3) 0.59

AA, auditory acuity measured as the speech reception threshold; A-Edu, age when completed full education; A-mena, age at menarche; A-Meno, age at menopause; BFat, body fat percentage; BMD, bone mineral density; BMI, body mass index; BMR, basal metabolic rate; DBP, diastolic blood pressure; FIS, fluid intelligence score; HC, hip circumference and weight; HGS, hand grip strength; HR, heart pulse rate; MNDR, maximum number of digits remembered; MTCIM, mean time to correctly identify matches during cognitive test; NCF, number of children fathered; NIM, number of incorrect matches during cognitive test; NLB, number of live births; PEF, peak expiratory flow; SBP, systolic blood pressure; VA, visual acuity in log MAR scale; WC, waist circumference; WHR, ratio of waist circumference over hip circumference. Effect sizes and SEs are expressed in phenotypic SD. N: sample size in the analysis. Other abbreviations are defined in Fig. S2. P values <0.05/25 = 0.002 are highlighted in boldface type.

Table S3.

Statistical association between the inbreeding measures FUNI (correlation between uniting gametes) and different measures of geographical and socio-economic structure in the UK Biobank

Measure of population structure F statistic Degrees of freedom P value
Age at recruitment 6.84 1, N 8.9 × 10−3
Sex 0.32 1, N 0.57
Recruitment center 2.68 21, N 4.5 × 10−5
Townsend deprivation index Genotypic PCs 47.6 1, N 5.4 × 10−12
PC1 94.6 1, N 2.4 × 10−22
PC2 202 1, N 7.9 × 10−46
PC3 19.4 1, N 1.1 × 10−5
PC4 6.53 1, N 0.01
PC5 1.66 1, N 0.19
PC6 3.15 1, N 0.07
PC7 0.71 1, N 0.40
PC8 >3.35 1, N 0.07
PC9 0.03 1, N 0.86
PC10 1.63 1, N 0.20

Statistical association was measured analysis of variances. N = 140,720.

After Bonferroni correction (P<0.05/25 traits = 2×103), we detected significant ID in eight traits (Table 1), using LDMS stratified inference based on FUNI. Ranked by decreasing magnitude of depression, these traits are auditory acuity (AA), fluid intelligence score (FIS), visual acuity (VA), peak expiratory flow (PEF), number of children fathered (NCF), hand grip strength (HGS), mean time to correctly identify matches (MTCIM), and waist/hip ratio (WHR). This analysis included 16,781 related individuals (estimated to be first, second, or third degree) and 19 participants with extreme inbreeding (FUNI>0.15). As a sensitivity analysis we reran all analyses without related and inbreeding outliers, which reduced the number of traits passing the significance threshold to five. The three traits dropped in this secondary analysis were AA (P=2.51×103 in secondary analysis), FIS (P=2.25×103 in secondary analysis), and MTCIM (P=6.24×103 in secondary analysis). To test whether the differences in ID estimates between full and reduced analyses are significant, we used a jackknife procedure to compare the observed differences with differences generated when randomly excluding 16,800 participants. Over 1,000 resampling events, we found that the observed differences in ID estimates for the eight traits highlighted above were not significantly different from those obtained when excluding random subsets (empirical P>0.14; Fig. S6). We consequently believe that the drop of significance between those two analyses is mainly explained by the reduced statistical power and not by confounding. In addition, we explored how much of ID could be captured at genome-wide significant (GWS) SNPs. We therefore selected trait-specific GWS SNPs from the genome-wide association studies (GWAS) catalog (SI Materials and Methods, URLs) and assessed inbreeding depression for the same traits using FUNI at those loci. We could not, however, detect any significant association with the traits analyzed in our study. Even for height, for which 700 common GWSs are now reported (27), the estimate of inbreeding depression at GWS was only −0.08 SD for complete inbreeding (P=0.072).

Fig. S6.

Fig. S6.

Distribution of the log ratio of absolute differences in ID estimates observed when excluding from our main analysis 16,800 (19 inbreeding outliers and 16,781 first-degree related) participants over differences generated by randomly excluding the same number of participants. These log ratios were >0 (ratio >1) in more than 14% of the 1,000 sampling replicates, suggesting that excluding the 16,800 participants from our main analysis has the same effect as excluding a random subset. AA, auditory acuity measured as the speech reception threshold; FIS, fluid intelligence score; HGS, hand grip strength; MTCIM, mean time to correctly identify matches during cognitive test; NCF, number of children fathered; PEF, peak expiratory flow; VA, visual acuity in log MAR scale; WHR, ratio of waist circumference over hip circumference.

We observed for all traits that ID estimates derived from FROH were systematically larger than those obtained with FUNI (Table S3). As expected, their SEs were also larger. In particular, only four and six traits (of eight detected with FUNI) passed the Bonferroni threshold when using FROH(1) and FROH(2), respectively. On the other hand, ID estimates obtained with FHOM were systematically smaller than those obtained using FUNI (Table S3), with an average over the eight traits significantly associated with FUNI,bHOM0.64×bUNI. The latter observation would be expected if the traits analyzed are under directional selection as observed in our simulations when rarer variants were assumed to have larger effects.

We observed for most traits that LDMS stratified and nonstratified FUNI estimates were similar (Table 1), suggesting weak differential LD and MAF distributions in SNPs tagging causal variants. Nonetheless, a marginally significant (LDMS heterogeneity test P<0.05; Table 1 and Fig. S7) difference could be observed in NCF for which the LDMS ID estimate was 1 SE larger than the nonstratified one (Table 1). This also translated into an improvement of the association P value from 1.27×103 to 1.79×104 (Table 1). We subsequently assessed which component(s) in the LDMS stratification contributed the most to NCF (Fig. S8). We therefore fitted a first multivariate regression model adjusted for four inbreeding coefficients specific to each LD score strata component and then another multivariate regression model adjusted for six inbreeding coefficients specific to each MAF stratum. We chose to fit two different models (for MAF and LD separately) instead of one including 24 covariates to minimize the effects of colinearity between inbreeding measures in each LDMS stratum. We found a nominally significant contribution of SNPs with minor alleles <5% (bFUNI = −4.01 phenotypic SD; P=0.01) but no significant enrichment in LD strata despite a large contribution of the second-lowest LD score stratum (bFUNI = −1.62 phenotypic SD; P=0.12). According to our derivations, this enrichment of ID in lower-frequency SNPs and more generally in low-LD regions explains why nonstratified analyses produced smaller estimates compared with the LDMS approach. These results imply a disproportionate contribution of low-frequency SNPs to ID in NCF.

Fig. S7.

Fig. S7.

P-values distribution of the LD and MAF heterogeneity test. P values were calculated on data generated in simulation 1. A and B correspond to the null hypothesis (H0) of no LD or MAF heterogeneity between SNPs and causal variants. C and D correspond to the alternative hypothesis (H1) of LD and MAF heterogeneity between causal variants and SNPs. A and C compare via a quantile–quantile plot the P-value distribution to the uniform distribution between 0 and 1. B and D represent, for different values of the true inbreeding depression used to simulate the data (b=0, −3, −6, and −9), the proportion of those P values smaller than 0.05.

Fig. S8.

Fig. S8.

Estimates of ID in the NCF from the MAF and LD score stratified analysis (LDMS), using the FUNI (correlation between uniting gametes) inbreeding measure. A corresponds to stratification into six MAF strata, whereas B corresponds to stratification into LD score segments. First LD score quartile corresponds to low LD.

SI Materials and Methods

Statistical Models and Notations.

We consider the following model:

yi=μ+j=1majxij+j=1mdjHij+εi. [S1]

For individual i, yi is the observed value of the fitness component of interest, and xij is the minor allele count at the jth causal SNP (xij{0,1,2}). pj is the MAF of the jth causal SNP, Hij=xij(2xij) is the indicator of heterozygosity, and εi is a residual term capturing nongenetic effects on the observed phenotype. The additive and dominance effect sizes of the minor allele at the jth causal SNP are denoted aj and dj, respectively. For the sake of simplicity, we assume that SNP interactions are negligible and consequently do not appear in Eq. S1. We also assume independence between the m causal variants, between the genotypes and the effect sizes, and between the genetic and the nongenetic effects. Finally we assume the effect sizes to be random and such that 𝔼[aj]=αj and 𝔼[dj]=δj. We denote fi individual is inbreeding coefficient. From Eq. S1 the expectation of yi with respect to the genotypes and the effect sizes can be written as

𝔼[yi|fi]=μ+j=1m2pjαj+2pj(1pj)δjConstant for all individuals+(j=1m2pj(1pj)δj)b:reduction for complete inbreedingfi, [S2]

which translates a linear and often negative relationship between the average fitness component and the inbreeding coefficient fi. The slope b=j=1m2pj(1pj)δj in this linear relationship is the expected depression in the fitness component due to complete inbreeding (fi=1). Eq. S1 can also be written as

yi=μ+sXiQTL+bFiQTL+giA+giD+εi, [S3]

where giA=j=1m(aj𝔼[aj])xij and giD=j=1m(dj𝔼[dj])Hij are the nondirectional additive and dominance breeding values of individual i, respectively. XiQTL and FiQTL are defined as

XiQTL=j=1mαjxijj=1m2pjαjandFiQTL=1j=1mδjHijj=1m2pj(1pj)δj,

and s and b are defined as

s=j=1m2pjαjandb=j=1m2pj(1pj)δj.

Measures of Inbreeding.

We studied three measures of inbreeding. All these measures of inbreeding require individual SNP genotypes and can be used in the absence of any pedigree information. The first inbreeding measure is the excess of homozygosity measure defined here as

FHOM=1k=1pxk(2xk)k=1p2pk(1pk),

where xk is the minor allele count of SNP k, pk is the MAF, and p is the number of genotyped or imputed SNPs available. FHOM is implemented in PLINK2 (command: –het).

The second measure (FUNI) is based on the correlation between uniting gametes. This measure was defined in Yang et al. (17) as

FUNI=1pk=1pxk2(1+2pk)xk+2pk22pk(1pk).

FUNI is implemented in PLINK2 and GCTA software (command: –ibc). The last measure is defined as the proportion of the genome within ROH. More specifically, the inbreeding measure FROH was calculated as the cumulated length (in base pairs) of one individual’s genome within ROHs divided by 3×109 (the approximate length of the autosomal genome in base pairs). We used two definitions of ROHs corresponding to those proposed in Joshi et al. (10) (definition 1) and Gazal et al. (15) (definition 2). Inbreeding measures calculated using definition 1 and definition 2 are respectively denoted as FROH(1) and FROH(2). Both definitions used SNPs with MAF >5% as previously reported in Joshi et al. (10) and Gazal et al. (15).

Definition 1 requires ROHs to be at least 1.5 Mb long, to cover at least 50 SNPs, and to be located at least 1 Mb away from one another. This corresponds to the following PLINK2 command: –homozyg –homozyg-density 50 –homozyg-gap 1000 –homozyg-kb 1500 –homozyg-snp 50 –homozyg-window-het 1 –homozyg-window-missing 5 –homo- zyg-window-snp 50. Definition 2 requires only LD pruning the SNPs (PLINK2 command –indep 50 5 2), but does not impose additional explicit constraint on ROHs besides not allowing any heterozygote in the ROHs. The following PLINK1 command was used to call ROHs under definition 2: –homozyg –homozyg-window-het 0 –homozyg-snp 50 –homozyg-kb 0 –homozyg-density 5000 –homozyg-gap 5000. As explained in Gazal et al. (15), the options –homozyg-density 5000 and –homozyg-gap 5000 are set for PLINK1 to ignore those constraints.

Theoretical Determinants of Unbiased Estimation.

The linear relationship in Eq. S2 naturally implies the use of linear regression to estimate ID. Least-squares estimates of ID derived from the inbreeding measure F asymptotically converge toward b^F=cov[F,y]/𝕍[F].

For F = FUNI, b^F can be written as (proof in SI Appendix)

bUNI=b×(j=1pwjLjC1/pk=1pLkSNP),wherewj=2pj(1pj)δjj=1m2pj(1pj)δj. [S4]

In Eq. S4, LjC is the average LD between the jth causal variants and the p observed SNPs, whereas LkSNP is the average LD between the kth observed SNPs and the p observed SNPs. To better understand Eq. S4, we note that when δj is inversely proportional to 2pj(1pj), wj simplifies into 1/m and therefore bUNI/b is simply the ratio of mean LD score in causal variants over the mean LD score in SNPs. More generally, if LjC1/pk=1pLkSNP (which would hold when causal variants are a subset of SNPs), then Eq. S4 simplifies as bUNI=b(j=1mwj)=b.

For F = FHOM we can derive a somewhat similar expression (proof in SI Appendix),

bHOM=s×(j=1pwj(a)RjC1/Spk=1pSk)+b×(j=1pwj(d)SjC1/Spk=1pSk). [S5]

In Eq. S5, RjC and SjC measure the LD between causal variant and SNPs, whereas Sk measures the LD among SNPs. Detailed derivations of these quantities are given in SI Appendix as well as the proof of unbiasedness when causal variants are randomly sampled from SNPs.

LD Score and MAF (LDMS) Stratified Inference.

We defined the LD score (22) of SNP j as the sum of pairwise r2 between SNP j and all SNPs located within 10 Mb around SNP j’s physical position. To correct potential biases induced by uneven LD (and MAF) distribution between causal variants and SNPs, we used a LD and MAF stratified inference as previously described (19). This approach is summarized below in five steps.

(i) The first step consists of identifying segments of the genome with uniform LD patterns. For this we used a sliding-window approach where each segment contains 2ms SNPs with overlap of ms SNPs with adjacent segments. The number ms was chosen as in Yang et al. (19) to be the average number of SNPs per 100 kb of a chromosome: ms=100×nSNP/LSNP, with nSNP and LSNP being respectively the number of SNPs and the length (in kilobases) of the chromosome. We then calculated the mean LD score of the variants in each segment and took the average of the mean LD scores of two adjacent segments for overlapped regions. This process partitioned the genome into a large number of segments. (ii) The second step consists of stratifying the SNPs into K groups (G1GK) according to the mean LD score of the segments (mLDS) and the MAF interval they belong to. We used here four classes of LD [mLDS<first quartile (low LD), first quartile<mLDS<median, median < mLDS < 3rd quartile, and mLDS>third quartile) and six MAF intervals [(1–5%), (5–10%), (10–20%), (20–30%), (30–40%), and (40–50%)], i.e., 4×6=24 strata in total. (iii) For each group Gk, the third step consists of calculating a measure of inbreeding Fk (could be any of the three measures presented in this study) specific to that group. (iv) In the fourth step we fitted a multivariate linear regression model using all of the Fks as covariates to obtain 𝐛𝐊=(b^1,,b^K), a vector of K regression coefficients corresponding to each group. (v) The fifth and last step comprises collapsing these K estimates into a single genome-wide estimate b^(LDMS): b^(LDMS)=b^1++b^K, where 𝟏K is a K-dimensional vector with all entries equal to 1. The SE of b^(LDMS)) can be obtained from the asymptotic variance–covariance matrix of 𝐛𝐊, using the following relationship:

𝕍[b^(LDMS)]=𝟏K𝕍[𝐛𝐊]𝟏K.

We developed and applied a Wald test to assess the statistical significance of differences between nonstratified and LDMS stratified ID estimates. The test statistic is defined as the squared difference between the two estimates divided by the variance of that difference. The details of that derivation and some additional simulations showing that this test is not inflated in the absence of LD or MAF heterogeneity are presented in SI Appendix and Fig. S7. The LDMS procedure is implemented in GCTA as part of the GCTA-GREML-LDMS analysis (SI Materials and Methods, URLs).

UK Biobank Data.

Inclusion criteria.

We used baseline data from 152,729 participants from the UK Biobank (34). To ensure ancestry homogeneity, we selected individuals who reported to be British, Irish, white, or of any other white background and whose coordinates on the first genetic PC were below 0 (Fig. S9). In total, we included 140,720 participants in this analysis. All participants in the UK Biobank study provided written informed consent.

Fig. S9.

Fig. S9.

First two PCs used to infer ethnicity in the UK Biobank participants. Principal components were calculated using 150,000 independent SNPs.

Phenotyping.

We analyzed 25 traits: standing height, weight, body mass index, waist and hip circumferences, ratio between waist circumference and hip circumference (referred to as WHR), bone mineral density, body fat percentage, HGS, systolic blood pressure, diastolic blood pressure, heart pulse rate, PEF, VA (measured on log MAR scale) and AA (measured as the speech reception threshold), cognitive traits and EA (FIS, MTCIM, maximum number of digits remembered, and age when completed full education), and sex-specific reproductive traits (NCF, number of live births, age at menarche, and age at menopause). For each trait, we excluded values >4 SD away from the mean. The sample sizes specific to each trait are given in Table S2. Finally, all traits were adjusted for age, sex (for traits not specific to males or females), recruitment center, 10 genotypic PCs, and Townsend deprivation index (26) as a proxy for the socio-economic status. After adjustment, we used inverse normal transformed residuals as the traits to be correlated with inbreeding measures. Although focused on FUNI, we also provide ID estimates from other inbreeding measures, in particular from FROH for comparison with previous and future studies using those measures (Table S2).

Genotyping.

We used data from 152,729 participants (including 140,720 analyzed in this study; SI Materials and Methods, Inclusion criteria) genotyped in the first phase of genotyping of the UK Biobank. The first steps of the quality control have been previously described (SI Materials and Methods, URLs). Phasing and imputation were performed using SHAPEIT and IMPUTE2 (SI Materials and Methods, URLs), respectively, as previously described (35). After imputation, we selected 9,493,148 autosomal SNPs with imputation quality r2>0.3, MAF>1%, and Hardy–Weinberg equilibrium test P value >106. Imputed SNPs were then called to the genotypes having the largest posterior probability. Finally, we removed redundancy by LD pruning SNPs with a squared genotype correlation r2>0.9. In total we used 3,857,369 SNPs in this analysis.

Simulation Study.

In all simulations we used 10,000 unrelated (pairwise genetic relationship <0.05) individuals randomly sampled from the 140,720 individuals selected for analysis.

Simulation 1: Influence of MAF and LD distribution in causal variants.

To illustrate the importance of LD and MAF distribution in causal variants, we considered three scenarios. In the first scenario (scenario 1) we sampled the causal variants at random among the 3,857,369 observed SNPs. For the two other scenarios we used 1,358,699 SNPs within exons, introns, 3-UTR, 5-UTR, and promoter regions ±500 bp (SI Materials and Methods, URLs). The LD score and MAF distributions of SNPs within these functional annotations are presented in Figs. S3 and S4. In scenario 2, we sampled the causal variants among 542,379 intronic SNPs with MAF <5% (low-LD scenario) whereas in scenario 3 causal variants were sampled among 28,341 SNPs within exons and 3- and 5-UTRs (high-LD scenario). In all simulations, we sampled 1,000 sets of m=1,000 causal variants each and simulated a phenotype (1 replicate per set of causal variants), using the following modified version of Eq. S3:

yi=μ+sXiQTL+bFiQTL+giA+giD+εi. [S6]

In Eq. S6, giA and giD are respectively defined as in ref. 25 as the scaled additive and orthogonal dominance values:

giA=j=1m(ajαj)[xij2pj2pj(1pj)]andgiD=j=1m(djδj)[xij(1+2pjxij)2pj22pj(1pj)].

Under Eq. S1 and assuming that ajN(αj,σa2) and djN(δj,σd2), the phenotypic variance can be expressed as

𝕍[y]=𝕍[sXQTL+bFQTL]+m(σa2+σd2)+σe2m(σa2+σd2)+σe2, [S7]

where σe2=𝕍[εi] is the variance of the nongenetic effects [derivation of Eq. S7 in SI Appendix]. Zhu et al. (25) found on average across multiple traits that additive variance was about fivefold larger than dominance variance. Following their findings, we fixed in our simulations σa2=5σd2=5H2/6m, σe2=1H2, and H2=0.8, where H2 denotes the heritability of the trait. We observed in our simulations that 𝕍[sXQTL+bFQTL] was negligible and consequently 𝕍[y]1. We used two values of b, b=0 and b=3 SD of the trait, to study statistical power to detect ID. In all simulations, the statistical significance threshold for P values is 5%. The ID parameter b was estimated for each measure of inbreeding by fitting a linear regression model with the phenotype as the response variable and the corresponding measure as the primary covariate.

Simulation 2: Influence of directional effects of minor alleles.

In this simulation we illustrated how directional effects of minor alleles may confound estimates of ID. The settings of this simulation are similar to scenario 1 in the first simulation. We simulated data assuming no inbreeding depression, i.e., b=0 and fixed s=10. We considered two alternatives for the expectation of the additive effects αj: (i) αj=s/j=1m2pj is constant and (ii) αj=s/(2mpj) is inversely proportional to the MAF pj.

We further quantified the effect of PS on ID estimates by running another simulation (also similar to scenario 1 in the first simulation) with b=0, s=0, and now including the effect of the first 10 genotypic PCs (PC1,PC2,,PC10) as a proxy for PS:

yi=μ+θ(k=110PCk)+giA+giD+εi. [S8]

In Eq. S8 θ was chosen so that PS accounts for 1%, 2%, 3%, 4%, and 5% of the total phenotypic variance. PCs were calculated using genotypes from the 10,000 unrelated participants selected for the simulations. Type I error in this simulation was calculated as the fraction of P values (testing for the significance of ID) below 0.05.

Selection of Genome-Wide Significant SNPs from the GWAS Catalog.

To assess how much ID could be captured at GWS SNPs, we assessed inbreeding using FUNI at GWS SNPs from the GWAS catalog (SI Materials and Methods, URLs). We downloaded version 1.0 of the GWAS catalog and selected SNPs so that their associated “DISEASE/TRAIT” field exactly matched “height,” “weight,” “body mass index,” “waist circumference,” “waist-hip ratio,” “bone mineral density,” “body fat percentage,” “systolic blood pressure,” “diastolic blood pressure,” “intelligence” [or “intelligence (childhood)”], “cognitive test performance,” “cognitive function,” “cognitive performance,” “general cognitive ability,” “educational attainment,” “educational attainment (years of education),” “educational attainment (college completion),” “age at first birth,” “menarche (age at onset),” “menopause (age at onset),” “forced expiratory volume in 1 s,” and “lung function (forced expiratory flow between 25% and 75% of forced vital capacity).” In total we selected 2,069 unique SNPs.

URLs.

SI Appendix

Derivation of Equation S2.

Starting with the following equation (Eq. 1 in the main text),

yi=μ+j=1majxij+j=1mdjHij+εi,

and assuming independence between the effect sizes and the genotypes and 𝔼[aj]=αj and 𝔼[dj]=δj, then we can write that

𝔼[yi|fi]=μ+j=1mαj𝔼[xij|fi]+j=1mδj𝔼[Hij|fi],

where fi is individual i’s inbreeding coefficient. We know that 𝔼[xij|fi]=2pj and 𝔼[Hij|fi]=2pj(1pj)(1fi). Hence follows Eq. S2:

𝔼[yi|fi]=μ+j=1m2pjαj+j=1m2pj(1pj)(1fi)δj=μ+j=1m2pjαj+2pj(1pj)δj+(j=1m2pj(1pj)δj)fi.

Derivation of Eqs. S4 and S5.

Additional notations and useful formulas.

We introduce now new notations. Let xj(f) and xj(m) be the number of minor alleles (0 or 1) inherited from the father and the mother, respectively. Then the minor allele count xj is defined as xj=xj(f)+xj(m). We introduce Djk as the LD parameter between minor alleles of SNP j and SNP k. We assume that the LD between paternal and maternal alleles is the same, i.e., that (xj(f),xk(f)) and (xj(m),xk(m)) have the same probability distribution. The latter distribution is presented in Table S1.

Let Ii be the indicator of complete inbreeding; i.e., Ii=1 with probability fi if individual i is completely inbred and Ii=0 if its parents are completely independent. We note f=𝔼[fi]. We have now that

cov[xij(f),xij(m)]=𝔼[𝔼[xij(f)xij(m)]]𝔼[xij(f)]𝔼[xij(m)]=𝔼[[xij(f)xij(m)=1]]pj2=𝔼[[xij(f)xij(m)=1|Ii=1][Ii=1]]+𝔼[[xij(f)xij(m)=1|Ii=0](1[Ii=1])pj2]=𝔼[fipj+(1fi)pj2pj2]=pj(1pj)𝔼[fi]cov[xij(f),xij(m)]=pj(1pj)f.

Moreover, because we assume that LD between the father’s alleles is the same as between the mother’s alleles, then we have that cov[xj(f),xk(f)]=cov[xj(m),xk(m)]=Djk.

Another important formula is

cov[xij(f),xik(m)]=𝔼[𝔼[xij(f)xik(m)]]𝔼[xij(f)]𝔼[xik(m)]=𝔼[[xij(f)xik(m)=1]]pjpk=𝔼[[xij(f)xik(m)=1|Ii=1][Ii=1]]+𝔼[[xij(f)xik(m)=1|Ii=0]×(1[Ii,=1])pjpk]=𝔼[[xij(f)xik(f)=1|Ii=1]fi+pjpk(1fi)pjpk]=𝔼[fi(pjpk+Djk)+pjpk(1fi)pjpk]cov[xij(f),xik(m)]=Djkf.

Following the same reasoning, we can prove that

cov[FUNI(j),FUNI(k)]=rjk2+(1rjk2+Djk(12pj)(12pk)pjpk(1pj)(1pk))ff2,cov[Hj,FUNI(k)]=2pj(1pj)[rjk2+f(1rjk2)],cov[xj,FUNI(k)]=4Djk(12pk)f2pk(1pk),cov[Hj,Hk]=4pjpk(1pj)(1pk)f(1f)+(1f)[4Djk2+2Djk(12pk)(12pj)]cov[Hk,xj]=2Djk(12pk)(1f), [S9]

where FUNI(j) is defined as

FUNI(j)=xj2(1+2pj)xj2pj22pj(1pj).

Getting back to Eqs. S4 and S5.

Eq. S4.

Using the results above we have that

cov[FUNI,y]𝕍[FUNI]=A(I)f+A(R)(1f)B(I)f+B(R)(1f)f2, [S10]

where

A(I)=j=1mk=1p{4Djk(12pk)2pk(1pk)αj2pj(1pj)δj}A(R)=j=1mk=1p{4Djk22pk(1pk)δj}=j=1m2pj(1pj)δj(k=1prjk2)LjC
B(I)=1pl=1pk=1p(1+Dlk(12pl)(12pk)plpk(1pl)(1pk))B(R)=1pl=1p(k=1prlk2)LkSNP.

Given that f1, we may assume that f0 and therefore simplify Eq. S10 into Eq. S4.

Eq. S5.

For FHOM we have that

cov[FHOM,y]𝕍[FHOM]=Sp×C(I)f(1f)+C(R)(1f)D(I)f(1f)+D(R)(1f), [S11]

where

Sp=k=pp2pk(1pk)C(I)=j=1mk=1p{4pjpk(1pj)(1pk)×δj}C(R)=j=1mk=1p{2αjDjk(12pk)+[4Djk2+2Djk(12pk)×(12pj)]×δj}D(I)=l=1pk=1p{4plpk(1pl)(1pk)}=Sp2D(R)=l=1pk=1p{4Dlk2+2Dlk(12pk)(12pl)}.

Therefore, with f=0, we have that

cov[FHOM,y]𝕍[FHOM]=s(j=1mωj(a)Rj(C)1Spk=1pSk)+b(j=1mωj(d)Sj(C)1Spk=1pSk), [S12]

where

ωj(a)=αjj=1m2αjpj,ωj(d)=δjj=1m2pj(1pj)δj,Rj(C)=k=1p2Djk(12pk)Sk=l=1p{4Dlk2+2Dlk(12pk)(12pl)},

and

Sj(C)=k=1p{4Djk2+2Djk(12pj)(12pk)}.

Rj(C) and Sj(C) both measure LD between the jth causal variant and the p SNPs used for inference, and Sk measures LD between SNP k and all observed SNPs. To better understand the relationship described in Eq. S12, let us consider a special case. When s=0 and δj is constant, Eq. S12 can be written

cov[FHOM,y]𝕍[FHOM]=b(1Sm(C)j=1mSj(C)1Spk=1pSk)=b×1pSp1mSm(C)×1mj=1mSj(C)1pk=1pSk, [S13]

where Sm(C)=j=pm2pj(1pj). Therefore, when causal variants are randomly sampled from observed SNPs, 1mSm(C)1pSp and 1mj=1mSj(C)1pk=1pSk, which guarantees unbiasedness.

Derivation of Eq. S7.

To derive Eq. S7, we assumed the following modified version of Eq. S3:

yi=μ+sXiQTL+bFiQTL+giA+giD+εi. [S14]

In Eq. S14 giA and giD are respectively defined as the scaled additive and orthogonal dominance values (25):

giA=j=1m(ajαj)×[xij2pj2pj(1pj)]andgiD=j=1m(djδj)×[xij(1+2pjxij)2pj22pj(1pj)].

Because of the independence between effect sizes and genotypes, we have that

cov[XiQTL,giA]=𝔼[XiQTLgiA]𝔼[XiQTL]𝔼[giA]=0=j,lm𝔼[ajαj]𝔼[alαl]×𝔼(XilQTL[xij2pj2pj(1pj)])=0.

Similarly, we have that cov[XiQTL,giD]=cov[FiQTL,giA]=cov[FiQTL,giD]=0. Therefore,

𝕍[yi]=𝕍[sXiQTL+bFiQTL]+𝕍[giA+giD+εi].

Let us now derive the expression of 𝕍[sXiQTL+bFiQTL]. We note that hj=2pj(1pj):

𝕍[sXiQTL+bFiQTL]=𝕍[j=1mαjxij+bj=1mδjHij]=j=1mαj2𝕍[xij]+δj2𝕍[Hij]2αjδjcov[xij,Hij]=j=1mhj[αj2+δj2(1hj)2αjδj(12pj)].

We consider two cases corresponding to the different configurations presented in this paper:

i) αj=0.

𝕍[sXiQTL+bFiQTL]=𝕍[bFiQTL]=j=1mhj[δj2(1hj)].

When δj=b/2mpj(1pj), we have that

𝕍[bFiQTL]=b2m2j=1m(1hj)hj=b2m(1mj=1m1hj1).

If we assume for example the causal variants to be a random subset of HapMap 3 (HM3) SNPs (1.1 million SNPs), we can therefore use their allele frequencies to approximate 1mj=1m1hj. Using allele frequencies of HM3 SNPs in the UK Biobank, we found that 1mj=1m1hj5 and therefore that

𝕍[bFiQTL]4b2m.

A different set of SNPs could have been used instead of HM3 SNPs. However, we argue here that in any case, 1mj=1m1hj is constant with respect to the number of causal variants m and therefore 𝕍[bFiQTL] would be negligible for a trait with a large number of causal variants. When δj=b/j=1mhj, we have that

𝕍[bFiQTL]=b2(j=1mhj(1hj)(j=1mhj)2)=b2m(1mj=1mhj(1hj)(1mj=1mhj)2).

Using HM3 SNPs, we can similarly approximate 1mj=1mhj(1hj)0.198 and 1mj=1mhj=0.325, which gives

𝕍[bFiQTL]1.88×b2m.

In conclusion, we show here that 𝕍[bFiQTL] is of order 1/m and therefore contributes little to the phenotypic variance of traits with large numbers of causal variants.

𝑖𝑖) δj=0.

𝕍[sXiQTL+bFiQTL]=𝕍[sXiQTL]=j=1mhjαj2.

When αj=s/2mpj, we have (still using HM3 SNPs) that

𝕍[sXiQTL]=s2m2j=1mhj4pj2=s22m2j=1m1pjpj=s22m(1mj=1m1pj1)4.32×s2m.

and similarly, when αj=s/j=1m2pj, we can write that

𝕍[sXiQTL]=s2m(1mj=1mhj(1mj=1m2pj)2)1.41×s2m.

Just like 𝕍[bFiQTL], 𝕍[sXiQTL] is also of order 1/m and hence contributes little to the phenotypic variance.

We now denote aj and dj as

aj=ajαjN(0,σa2)anddj=djδjN(0,σd2).

Let xj(s) and xj(D,s) be defined as

xj(s)=xij2pj2pj(1pj)andxj(D,s)=xijD2pj22pj(1pj)

with xijD,s=xij(1+2pjxij). Hence,

𝕍[ajxj(s)+djxj(D,s)]=𝕍[ajxj(s)]+𝕍[djxj(D,s)]+2cov[ajxj(s),djxj(D,s)]=𝔼[aj2(xj(s))2]𝔼[ajxj(s)]2+𝔼[dj2(xj(D,s))2]𝔼[djxj(D,s)]2+2cov[ajxj(s),djxj(D,s)]=𝔼[aj2]𝔼[(xj(s))2]𝔼[aj]2𝔼[xj(s)]2+𝔼[dj2]𝔼[(xj(D,s))2]𝔼[dj]2𝔼[xj(D,s)]2+2cov[ajxj(s),djxj(D,s)]=σa2+σd2+2cov[ajxj(s),djxj(D,s)]=σa2+σd2+2𝔼[(ajxj(s)).(djxj(D,s))]2𝔼[(ajxj(s))]𝔼[(djxj(D,s))]=σa2+σd2+2𝔼[(ajxj(s)).(djxj(D,s))]=σa2+σd2+2𝔼[ajdj𝔼[xj(s),xj(D,s)]=0,[Zhu et al.(25)]]𝕍[ajxj+djHj]=σa2+σd2.

Now assuming independence between the m causal variants we can derive Eq. S7,

𝕍[y]=𝕍[sXQTL+bFQTL]+m(σa2+σd2)+σe2m(σa2+σd2)+σe2+Km,

where K is constant with respect to the number of causal variants. For large m, 𝕍[y]m(σa2+σd2)+σe2.

Effect of Strong PS.

The contribution of PS reported in this study needs, however, to be put in perspective as our simulations and real data analyses were based upon a relatively homogeneous population within the United Kingdom. For stronger PS (e.g., between European and Asian populations) FUNI-based ID estimates can also be biased. This extreme situation would in general be handled as part of quality control. Still, we quantify below in a simulation study the bias of FUNI-based estimates in such a context.

Description of the simulation study.

Model and notations.

We assume that the data are collected from two populations: population 1 and population 2. For the sake of simplicity, we assume that all m=1,000 causal variants are observed and have the same allele frequency p1 in population 1 and p2 in population 2. We denote Fst as Fst=p1p2 and π as the proportion of samples from population 2 in the data used for inference.

For individual i in population k (k=1 or 2), we simulated a trait yi(k) from the model

yi(k)=j=1m(xij(k)2pj(k)hj(k))aj+j=1m(Hij(k)hj(k)hj(k))djgi(k)+ei(k),

where hj(k)=2pj(k)(1pj(k)) and pj(k) is the MAF of SNP j in population k; i.e., pj(1)=p1 and pj(2)=p2. We defined aj as aj=a×(1)j so that the additive genetic variance equals ma2 without directional effect because the sum of all aj is 0. We also defined dj=b, where b is the depression for complete inbreeding. The residual term ei(k) was simulated from a normal distribution with mean 0 and variance equal to var[gi(k)]/(1H21), where H2 is the heritability of the trait.

Simulation parameters.

In this simulation we fixed H2=0.8, p1=0.3, m=1,000, and the total sample size N=10,000. We varied Fst=p1p2 between 0.1 and +0.1 and the proportion π of samples from population 2 between 0.1 and 0.5.

Simulation results.

Table S4 reports the averaged estimates (over 10 simulation replicates) of inbreeding depression for the different combinations of Fst and π. We observed for large values of Fst that FUNI produced biased estimates of inbreeding depression, but that bias was at most 12% of the true effect. On the contrary, FHOM shows larger biases (on average 50% of b), which is in line with our results presented in the main text.

Table S4.

Averaged estimates (over 10 simulation replicates) of inbreeding depression under different scenarios of strong population stratification

π = 10% π = 25% π = 50%
Fst FUNI b^ (SE) FHOM b^ (SE) FUNI b^ (SE) FHOM b^ (SE) FUNI b^ (SE) FHOM b^ (SE)
−0.10 −0.88 (0.01) −0.23 (0.005) −0.87 (0.009) −0.13 (0.003) −0.93 (0.01) −0.09 (0.004)
−0.05 −0.99 (0.01) −0.59 (0.005) −0.99 (0.006) −0.43 (0.005) −0.99 (0.01) −0.35 (0.005)
0.0 −1.00 (0.01) −1.00 (0.006) −1.00 (0.008) −1.00 (0.007) −1.00 (0.01) −1.00 (0.004)
+0.05 −0.98 (0.01) −0.71 (0.007) −0.99 (0.008) −0.56 (0.006) −0.99 (0.01) −0.47 (0.004)
+0.10 −0.89 (0.01) −0.45 (0.005) −0.89 (0.01) −0.30 (0.004) −0.95 (0.01) −0.24 (0.003)

The true inbreeding depression parameter used to simulate the data is b ⁼ −1. Datasets are simulated as mixtures of two populations with mixture proportions π and 1 − π and allele frequency difference equal to Fst.

LDMS Heterogeneity Test.

Derivation of the test statistic.

We developed and applied a Wald test to assess the statistical significance of a difference between nonstratified ID estimation and LDMS stratified ID estimation.

Let us denote 𝐅 the n×p matrix of inbreeding measure for each individual (1n) and each SNP (1p). The nonstratified measure of inbreeding 𝐅g (subscript g means genome) is defined as the mean inbreeding across all SNPs. Therefore, 𝐅g=1p𝐅1p, where 𝟏p is the p-dimensional vector with all entries equal to 1.

Stratifying the SNPs into K LD and MAF categories defines a linear map of the p individual SNP inbreeding measures into K aggregated inbreeding measures for each of the strata. The matrix of these strata-specific inbreeding measures can be written

𝐅s=𝐅𝐙,

where 𝐙 is a p×K matrix with entries zjk so that zjk equals the proportion of SNPs in the kth category if SNP j belongs to that category or equals 0 otherwise.

Without loss of generality, we assume that the sample means of Fg and the columns of 𝐅s are 0. Using least-squares estimation theory, we can write the closed-form expressions of the the nonstratified (bg^) and the LDMS stratified (bs^) estimators. Assuming 𝐲 is the vector of the phenotypes (with sample mean 0), we have that

bg^=1𝐅g𝐅g𝐅g𝐲andbs^=𝟏K[𝐅s𝐅s]1𝐅s𝐲.

We note that both bg and bs are linear functions of 𝐲. We can therefore use the following matrix relationship:

[bg^bs^]=[1𝐅g𝐅g𝐅g𝟏K[𝐅s𝐅s]1𝐅s]𝐀𝐲𝕍[bg^bs^]=𝐀𝕍[𝐲]𝐀. [S15]

When the individuals in the study are independent and when assuming homoscedasticity of the residuals, we have that 𝕍[𝐲]=σ2In. We therefore deduce that the correlation between bg^ and bs^ is

r^gs=corr(bg^,bs^)=𝟏K[𝐅s𝐅s]1𝐅s𝐅g(𝐅g𝐅g)1𝐅g𝐅g×(𝟏K[𝐅s𝐅s]1𝟏K)=𝟏K[𝐅s𝐅s]1𝐅s𝐅g(𝐅g𝐅g)×(𝟏K[𝐅s𝐅s]1𝟏K). [S16]

Therefore, to test the hypothesis H0:bg=bs vs. H1:bgbs, we can use the test statistic

(bg^bs^)2𝕍(bg^bs^)=(bg^bs^)2σ(bg^)22r^gsσ(bg^)σ(bs^)+σ(bg^)2H0χ12, [S17]

where σ(bg^) and σ(bs^) are respectively the standard errors of bg^ and bs^.

Control of type I error and statistical power.

To check that type I error is well controlled under the null hypothesis, we analyzed the distribution of P values from the comparison of bg^ with bs^ when the causal variants are a random sample of the SNPs. Under this scenario, we expect bg^ and bs^ to coincide as illustrated in our first simulation study (simulation 1, scenario 1). We reused data generated in our first simulation study but simply varied the value of the true ID parameters b between 0, –3, –6, and –9 SDs of the simulated trait. The results presented in Fig. S7 correspond to the comparison of FUNI with FUNILDMS under the assumption that the dominance effect at each SNP is inversely proportional to the variance of allele counts.

As expected, the P-values distribution in this scenario was indistinguishable from the uniform distribution (Kolmogov–Smirnov test P>0.26 for all values of b). In particular, the observed proportion of P values below 5% does not deviate from its expectation of 5% (Fig. S7B). This result confirms that this test can be used safely to detect LD and MAF heterogeneity.

When we simulated LD and MAF heterogeneity by sampling causal variants from high or LD regions of the genome (simulation 1, scenarios 2 and 3), we noticed a deviation from the uniform distribution as illustrated in Fig. S7C. The power calculations presented in Fig. S7D are based on the fixed sample size of n=10,000 used in our simulations.

Dataset S1 (PDF)

Discussion

We comprehensively quantified the behavior of ID estimators based on three commonly used measures of inbreeding. Our study illustrated some of the shortcomings of the most commonly used ROH-based estimates of ID, which not only are biased but also have large SE (approximately three times larger compared with FUNI). This, along with the arbitrary choices underlying the definition of ROHs, leads us to recommend the use of FUNI over FROH. Overall, our results suggest that FUNI-based ID estimates are robust to different assumptions about the distribution of effect sizes, to possible directional effects of minor alleles, and also to population stratification. The contribution of population stratification reported in this study needs, however, to be put in perspective as our simulations and real data analyses were based upon a relatively homogeneous population within the United Kingdom. For stronger population stratification (e.g., between European and Asian populations) FUNI-based ID estimates can also be biased. This somewhat extreme situation, which would in general be handled as part of quality control, is discussed in Supporting Information (Table S4 and Dataset S1).

This study also highlights that differential LD distribution between causal variants and SNPs could bias ID estimates. As previously reported for genomic-relatedness-based restricted maximum-likelihood (GREML) (28) heritability estimates (1921), we demonstrate through simulations that an LDMS approach successfully corrects these biases when estimating ID. More generally, the flexibility of the LDMS approach in terms of numbers and types of MAF/LD strata allows adaptation to any effect size distribution. Indeed, all SNP-based inbreeding measures are defined upon an underlying assumption on the distribution of dominance effects (e.g., when assuming constant dominance effects, the underlying inbreeding measure is FHOM), which, when not verified, creates biases in ID estimates even when causal variants are randomly distributed among observed SNPs. We showed in our simulations using two distributions of dominance effects that such biases are explained by MAF and LD heterogeneity between causal variants and SNPs and therefore can be corrected using the LDMS approach. This result is important as it guarantees an unbiased estimation of ID regardless of the distribution of dominance effects.

Beyond methodological considerations, we confirmed in this study known associations between increased inbreeding and reduced lung function (PEF), cognitive ability (FIS and MTCIM), and fertility (NCF), which were previously reported, however, using different proxy traits (10, 2931). We also replicated the association between inbreeding and decreased height (𝐛UNILDMS: −1.71 phenotypic SD for complete inbreeding; P=0.003) even though it was below the Bonferroni threshold. We did not replicate the association with educational attainment (EA) measured, as the “age when completed full education.” However, when measuring EA as whether or not participants went to college or university as in Joshi et al. (10), we found a strong although nominally significant negative association (odds ratio of 0.04, P=0.02).

We report evidence of ID for HGS, VA, AA and WHR. Although HGS, VA, and AA seem obvious fitness-related traits, these results still require replication. The association with WHR is particularly interesting as it illustrates on real data that one may benefit from using a less variable measure of inbreeding. Joshi et al. (10) reported a positive effect of inbreeding on WHR using FROH; however, even with 200,000 individuals the effect did not reach statistical significance (P=0.09). Although heterogeneity between the cohorts involved in that meta-analysis may explain that apparent lack of statistical power, our theoretical and simulation results predict that if the authors had used a different metric [FUNI instead of FROH(1) to reduce the SE] combined with a LD and MAF stratified inference, then such an effect would have been easily detected. We have not considered in this study the detection and estimation of nonlinear effects of inbreeding because of a lack of statistical power to detect such effects. Nonlinearity is predicted by theory in the presence of epistasis involving dominance (32) and was implied by Szpiech et al. (33), who showed that long runs of homozygosity were enriched for coding variants that are predicted to be deleterious. In conclusion, we have demonstrated that LD and MAF stratified inference based on FUNI as a measure of inbreeding minimizes bias relative to the other ID estimation strategies compared in this study. As illustrated here on real data, we believe that our approach will lead to more discoveries in forthcoming and larger studies.

Materials and Methods

Statistical Models and Notations.

We consider the following model:

yi=μ+j=1majxij+j=1mdjHij+εi, [1]

For individual i, yi is the observed value of the phenotype of interest, and xij is the minor allele count at the jth causal SNP (xij{0,1,2}). We denote pj the minor allele frequency of the jth causal SNP, Hij=xij(2xij) is the indicator of heterozygosity, and εi is a residual term capturing nongenetic effects on the observed phenotype. The additive and dominance effect sizes of the minor allele at the jth causal SNP are respectively denoted aj and dj. We assume independence between the m causal variants, between the genotypes and the effect sizes, and between the genetic and the nongenetic effects. Finally, we assume the effect sizes to be random and such that E[aj]=αj and E[dj]=δj.

Measures of Inbreeding.

We studied three measures of inbreeding. All these measures of inbreeding require individual SNP genotypes and can be used in the absence of any pedigree information. The first inbreeding measure is the excess of homozygosity measure defined here as

FHOM=1k=1pxk(2xk)k=1p2pk(1pk),

where xk is the minor allele count of SNP k, pk is the minor allele frequency, and p is the number of genotyped or imputed SNPs available. FHOM is implemented in PLINK2 (command: –het).

The second measure (FUNI) is based on the correlation between uniting gametes. This measure was defined in Yang et al. (17) as

FUNI=1pk=1pxk2(1+2pk)xk+2pk22pk(1pk).

FUNI is implemented in PLINK2 and GCTA software (command: –ibc).

The last measure is defined as the proportion of the genome within ROH. More specifically, the inbreeding measure FROH was calculated as the cumulated length (in base pairs) of one individual’s genome within ROHs divided by 3×109 (the approximate length of the autosomal genome in base pairs). We used two definitions of ROHs corresponding to those proposed in Joshi et al. (10) (definition 1) and Gazal et al. (15) (definition 2). Inbreeding measures calculated using definition 1 and definition 2 are respectively denoted as FROH(1) and FROH(2). Both definitions used SNPs with MAF>5% as previously reported in Joshi et al. (10) and Gazal et al. (15) (Supporting Information).

UK Biobank Data.

We used baseline data from 152,729 men and women who were genotyped in the first phase of genotyping of the UK Biobank (34). To ensure ancestry homogeneity, we selected individuals who reported to be “British,” “Irish,” “white,” or of “any other white background” and whose coordinates on the first genetic PC were below 0 (Fig. S9). In total, we included 140,720 participants in this analysis. The Northwest Multicentre Research Ethics Committee (MREC) approved the study and all participants in the UK Biobank study provided written informed consent. The first steps of the quality control have been previously described (SI Materials and Methods, URLs). Phasing and imputation were performed using SHAPEIT and IMPUTE2 (SI Materials and Methods, URLs), respectively, as previously described (35). After imputation, we selected 9,493,148 autosomal SNPs with imputation quality r2>0.3, MAF>1%, and Hardy–Weinberg equilibrium test P value >106. Imputed SNPs were then called to the genotypes having the largest posterior probability. Finally, we removed redundancy by LD pruning SNPs with a squared genotype correlation r2>0.9. In total we used 3,857,369 SNPs in this analysis.

Supplementary Material

Supplementary File
pnas.1621096114.sd01.pdf (62.3KB, pdf)

Acknowledgments

This research was supported by the Australian Research Council (DP130102666 and DP160103860), the Australian National Health and Medical Research Council (1078037 and 1107258), the National Institute of Health (NIH Grants R01AG042568 and P01GM099568), and the Sylvia and Charles Viertel Charitable Foundation. This research has been conducted using the UK Biobank Resource under Project 12505.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1621096114/-/DCSupplemental.

References

  • 1.Charlesworth D, Willis JH. The genetics of inbreeding depression. Nat Rev Genet. 2009;10:783–796. doi: 10.1038/nrg2664. [DOI] [PubMed] [Google Scholar]
  • 2.Huang X, et al. Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis. Nat Commun. 2015;6:6258. doi: 10.1038/ncomms7258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Huisman J, Kruuk LEB, Ellis PA, Clutton-Brock T, Pemberton JM. Inbreeding depression across the lifespan in a wild mammal population. Proc Natl Acad Sci USA. 2016;113:3585–3590. doi: 10.1073/pnas.1518046113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Pemberton JM, Ellis PE, Pilkington JG, Bérénos C. Inbreeding depression by environment interactions in a free-living mammal population. Heredity. 2017;118:64–77. doi: 10.1038/hdy.2016.100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.McQuillan R, et al. Evidence of inbreeding depression on human height. PLoS Genet. 2012;8:e1002655. doi: 10.1371/journal.pgen.1002655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Bittles AH, Neel JV. The costs of human inbreeding and their implications for variations at the DNA level. Nat Genet. 1994;8:117–121. doi: 10.1038/ng1094-117. [DOI] [PubMed] [Google Scholar]
  • 7.Najmabadi H, et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature. 2011;478:57–63. doi: 10.1038/nature10423. [DOI] [PubMed] [Google Scholar]
  • 8.Charlesworth B, Charlesworth D. The genetic basis of inbreeding depression. Genet Res. 1999;74:329–340. doi: 10.1017/s0016672399004152. [DOI] [PubMed] [Google Scholar]
  • 9.Morton NE, Crow JF, Muller HJ. An estimate of the mutational damage in man from data on consanguineous marriages. Proc Natl Acad Sci USA. 1956;42:855–863. doi: 10.1073/pnas.42.11.855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Joshi PK, et al. Directional dominance on stature and cognition in diverse human populations. Nature. 2015;523:459–462. doi: 10.1038/nature14618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kardos M, Luikart G, Allendorf FW. Measuring individual inbreeding in the age of genomics: Marker-based measures are better than pedigrees. Heredity. 2015;115:63–72. doi: 10.1038/hdy.2015.17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Keller MC, et al. Runs of homozygosity implicate autozygosity as a schizophrenia risk factor. PLoS Genet. 2012;8:e1002656. doi: 10.1371/journal.pgen.1002656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lencz T, et al. Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia. Proc Natl Acad Sci USA. 2007;104:19942–19947. doi: 10.1073/pnas.0710021104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Keller MC, Visscher PM, Goddard ME. Quantification of inbreeding due to distant ancestors and its detection using dense single nucleotide polymorphism data. Genetics. 2011;189:237–249. doi: 10.1534/genetics.111.130922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Gazal S, et al. Inbreeding coefficient estimation with dense SNP data: Comparison of strategies and application to HapMap III. Hum Hered. 2014;77:49–62. doi: 10.1159/000358224. [DOI] [PubMed] [Google Scholar]
  • 16.Purcell S, et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: A tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gusev A, et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am J Hum Genet. 2014;95:535–552. doi: 10.1016/j.ajhg.2014.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Yang J, et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat Genet. 2015;47:1114–1120. doi: 10.1038/ng.3390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Speed D, Hemani G, Johnson MR, Balding DJ. Improved heritability estimation from genome-wide SNPs. Am J Hum Genet. 2012;91:1011–1021. doi: 10.1016/j.ajhg.2012.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lee SH, et al. Estimation of SNP heritability from dense genotype data. Am J Hum Genet. 2013;93:1151–1155. doi: 10.1016/j.ajhg.2013.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bulik-Sullivan BK, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Robinson MR, et al. Population genetic differentiation of height and body mass index across Europe. Nat Genet. 2015;47:1357–1362. doi: 10.1038/ng.3401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Howrigan DP, et al. Genome-wide autozygosity is associated with lower general cognitive ability. Mol Psychiatry. 2016;21:837–843. doi: 10.1038/mp.2015.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Zhu Z, et al. Dominance genetic variation contributes little to the missing heritability for human complex traits. Am J Hum Genet. 2015;96:377–385. doi: 10.1016/j.ajhg.2015.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Mackenbach JP. Health and deprivation. Inequality and the North. Health Policy. 1988;10:207. [Google Scholar]
  • 27.Wood AR, et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet. 2014;46:1173–1186. doi: 10.1038/ng.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Yang J, et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010;42:565–569. doi: 10.1038/ng.608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Robert A, Toupance B, Tremblay M, Heyer E. Impact of inbreeding on fertility in a pre-industrial population. Eur J Hum Genet. 2009;17:673–681. doi: 10.1038/ejhg.2008.237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bittles AH, Grant JC, Sullivan SG, Hussain R. Does inbreeding lead to decreased human fertility? Ann Hum Biol. 2002;29:111–130. doi: 10.1080/03014460110075657. [DOI] [PubMed] [Google Scholar]
  • 31.Woodley MA. Inbreeding depression and IQ in a study of 72 countries. Intelligence. 2009;37:268–276. [Google Scholar]
  • 32.Lynch M. The genetic interpretation of inbreeding depression and outbreeding depression. Evolution. 1991;45:622–629. doi: 10.1111/j.1558-5646.1991.tb04333.x. [DOI] [PubMed] [Google Scholar]
  • 33.Szpiech Z, et al. Long runs of homozygosity are enriched for deleterious variation. Am J Hum Genet. 2013;93:90–102. doi: 10.1016/j.ajhg.2013.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Allen N, et al. UK Biobank: Current status and what it means for epidemiology. Health Policy Technol. 2012;1:123–126. [Google Scholar]
  • 35.O’Connell J, et al. Haplotype estimation for biobank-scale data sets. Nat Genet. 2016;48:817–820. doi: 10.1038/ng.3583. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File
pnas.1621096114.sd01.pdf (62.3KB, pdf)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES