Skip to main content
Genetics, Selection, Evolution : GSE logoLink to Genetics, Selection, Evolution : GSE
. 2023 Oct 25;55:74. doi: 10.1186/s12711-023-00846-7

Approaching autozygosity in a small pedigree of Gochu Asturcelta pigs

Katherine D Arias 1, Juan Pablo Gutiérrez 2, Iván Fernández 1, Isabel Álvarez 1, Félix Goyache 1,
PMCID: PMC10601182  PMID: 37880572

Abstract

Background

In spite of the availability of single nucleotide polymorphism (SNP) array data, differentiation between observed homozygosity and that caused by mating between relatives (autozygosity) introduces major difficulties. Homozygosity estimators show large variation due to different causes, namely, Mendelian sampling, population structure, and differences among chromosomes. Therefore, the ascertainment of how inbreeding is reflected in the genome is still an issue. The aim of this research was to study the usefulness of genomic information for the assessment of genetic diversity in the highly endangered Gochu Asturcelta pig breed. Pedigree depth varied from 0 (founders) to 4 equivalent discrete generations (t). Four homozygosity parameters (runs of homozygosity, FROH; heterozygosity-rich regions, FHRR; Li and Horvitz’s, FLH; and Yang and colleague’s FYAN) were computed for each individual, adjusted for the variability in the base population (BP; six individuals) and further jackknifed over autosomes. Individual increases in homozygosity (depending on t) and increases in pairwise homozygosity (i.e., increase in the parents’ mean) were computed for each individual in the pedigree, and effective population size (Ne) was computed for five subpopulations (cohorts). Genealogical parameters (individual inbreeding, individual increase in inbreeding, and Ne) were used for comparisons.

Results

The mean F was 0.120 ± 0.074 and the mean BP-adjusted homozygosity ranged from 0.099 ± 0.081 (FLH) to 0.152 ± 0.075 (FYAN). After jackknifing, the mean values were slightly lower. The increase in pairwise homozygosity tended to be twofold higher than the corresponding individual increase in homozygosity values. When compared with genealogical estimates, estimates of Ne obtained using FYAN tended to have low root-mean-squared errors. However, Ne estimates based on increases in pairwise homozygosity using both FROH and FHRR estimates of genomic inbreeding had lower root-mean-squared errors.

Conclusions

Parameters characterizing homozygosity may not accurately depict losses of variability in small populations in which breeding policy prohibits matings between close relatives. After BP adjustment, the performance of FROH and FHRR was highly consistent. Assuming that an increase in homozygosity depends only on pedigree depth can lead to underestimating it in populations with shallow pedigrees. An increase in pairwise homozygosity computed from either FROH or FHRR is a promising approach for characterizing autozygosity.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12711-023-00846-7.

Background

In small populations in which preservation of genetic variability is the goal, possible losses of genetic diversity have been traditionally assessed using genealogical information by computing coefficients of inbreeding (F) [1], defined as the probability that two alleles sampled at random at a locus are identical-by-descendent (IBD), or coefficients of coancestry (f) [2], defined as the probability that two alleles sampled at random in two different individuals are IBD. However, how F and f coefficients are reflected in the genome remains unclear. Although genealogical analyses assume that each individual in the base population has unique alleles at each locus, genomic data are subject to finite sampling and, thus, homozygosity at a locus can occur by chance. In other words, two alleles at a locus can be identical-by-state (IBS) rather than IBD. Therefore, estimating homozygosity due to IBD in an individual (autozygosity sensu, Cotterman [3]) has major difficulties because estimates widely vary from expectations as a consequence of Mendelian sampling [4] and the variance in IBD sharing per chromosome [5]. In this scenario, genomic estimates of homozygosity tend to measure the ‘realized inbreeding’ of an individual rather than the effect of autozygosity [6].

Ascertainment of the relationships between genealogical and genomic estimates of F has usually been approached using either simulated [713] or real [1417] data with deep pedigrees. However, inconsistencies between these two sources of information may be greater in the case of small populations with shallow pedigrees and high proportions of full and half-siblings. Such scenarios could lead to weak correlations between pedigree-based and genomic estimates of inbreeding [8]. Moreover, mating policies can also affect the informativeness of genomic estimators. When matings between close relatives are avoided, allele frequencies tend to remain balanced across years and generations [18, 19]. In such scenarios, parameters that characterize homozygosity may not show clear trends [15].

There is consensus that the use of genomic estimators of inbreeding (homozygosity) may be beneficial to conservation programmes by allowing them to maintain the largest possible amount of genetic diversity [20, 21]. However, their efficiency could be affected by the genotyping platform[10, 19, 22], as well as by the characteristics of the population [13, 2325]. Furthermore, genomic coefficients of inbreeding are highly dependent on the assumptions underlying their definition and do not always provide useful estimates of inbreeding due to inconsistent results in terms of gain and loss of genetic variability [15].

The rate of inbreeding computed using genealogical information is a standard criterion considered in conservation programmes, as it translates to the effective population size (Ne) of a population [22, 26]. It can be efficiently estimated using the individual increase in inbreeding coefficients (ΔFi) [27, 28]. The parameter ΔFi is not affected by changes in mating policy, allows a flexible definition of the reference population in a pedigree, and gives useful estimates of Ne even with shallow pedigrees [28, 29]. However, similar approaches have not been attempted using genomic data.

The Gochu Asturcelta breed is an extremely endangered Celtic–Iberian pig population [30, 31] for which a conservation programme was started with six founders, only four of which provided viable offspring [32]. The start of the recovery programme has been well documented [32, 33]: its first stages were affected by incorrect management practices, including full-sib matings [34], causing a sudden increase in inbreeding. Immediately thereafter, a strict mating policy prohibiting matings between close relatives and prolonging the reproductive career of the founders and their direct descendants as much as possible was implemented to keep founder contributions balanced across generations and to preserve their genetic background in the present population [32]. The mating policies generated a complex pedigree useful for testing different hypotheses on the relationship between pedigree and genomic information and the preservation of genetic diversity [32, 33, 35].

The main objective of this research was to obtain empirical evidence on the usefulness of genomic information for the assessment of genetic diversity in small populations with shallow pedigrees using a sample of Gochu Asturcelta pigs. The performance of four estimators of homozygosity and the influence of genomic variability in the base population will be assessed, as well as the partitioning of genomic diversity into autosomes. An increase in homozygosity will be estimated at the individual level following two approaches: (a) computation of individual increases in homozygosity as an extension of the genealogical approach proposed by Gutiérrez et al. [27] and (b) computation of the increase in homozygosity of an individual over the mean homozygosity of its parents. Based on these estimates of increase in homozygosity, genomic estimates of Ne will be computed and further compared with genealogical estimators (ΔFi and realized Ne). Insights for programmes aimed at the preservation of genetic variability in small animal populations will be discussed.

Methods

SERIDA is associated with the Ethical Committee in Research of the University of Oviedo (Spain), which ensures that all research with biological agents follows Good Laboratory Practices and European and Spanish regulations on biosecurity (Regulation of February 13th, 2014; BOPA no.47, February 26th, 2014). Tissue and hair root samples used in this project were collected by veterinary practitioners working for the Gochu Asturcelta Breeders’ Association (ACGA), with the permission and in the presence of the owners of the animals. For this reason, permission from the Ethical Committee in Research of the University of Oviedo was not required. In all instances, ACGA veterinarians followed standard procedures and relevant national guidelines to ensure appropriate animal care.

Samples and genotyping

The analysed pedigree included 534 genotyped individuals, forming 526 parents–offspring trios, corresponding to 76 families (descendants of the same boar–sow pair) and 95 litters registered between 2000 and 2010. Data include the genotypes of 24 boars, 42 sows, and their offspring. Family size (offspring per family) varied from 1 to 34, with 48 families having 5 offspring or more. Based on pedigree, up to 105 genotyped individuals were noninbred (F = 0). Two genotyped founders and four full-sib individuals (born from 2003 to 2005 and direct descendants from two non-genotyped founders with offspring in the dataset [32]) were considered as the base population (BP); their genotypes were used to calculate genomic estimators of inbreeding relative to the BP.

Individuals were genotyped with Axiom-PorcineHDv1 of Affymetrix (658,692 single nucleotide polymorphisms, SNPs). The software Axiom Analysis Suite v4.0.3 (Thermo Fisher Scientific, Waltham, MA, USA) was used to create standard.ped and.map files. The genotyped SNPs were mapped using the Sscrofa genome build 11.1 [36]. Only SNPs on autosomal chromosomes with known positions were considered. To avoid the presence of null and false alleles, genotypes were only filtered on the basis of Mendelian errors [37]. In total, 503,043 SNPs (151,226 of them monomorphic) with a minimum call rate of 0.97 were retained for further analysis.

Pedigree analyses

The pedigree data were analysed using the program ENDOG v4.8 [38]. The pedigree was described by computing the number of fully traced generations (G) and the equivalent discrete generations (t) for each individual. G is defined as the number of generations separating the offspring from the farthest generation where the two ancestors of the individual are known, while t is the sum of (1/2)n, where n is the number of generations separating the individual from each known ancestor [39].

Inbreeding coefficients (Fi) were computed for each individual i in the pedigree following Meuwissen and Luo [40]. The increase in inbreeding (ΔFi) was computed as proposed by Gutiérrez et al. [28] as ΔFi=1-1-Fiti-1. The realized effective population size was computed for each cohort by averaging the individual increase in inbreeding coefficients of the n individuals included in the cohort as NeFi=1/2ΔFi¯ [27, 29]. As a complementary approach, Ne was also computed from the increase in pairwise coancestry [41] as NeCij=1/(2ΔC¯), with ΔC¯ being the mean value of the increase in pairwise coancestry ΔCij=1-(1-Cij)(ti+tj)/2, where Cij is the coancestry coefficient between individuals i and j and ti and tj are their respective equivalent discrete generations.

Reference subpopulations

For descriptive purposes, most analyses were presented for both the whole genotyped population and the subpopulation of individuals with two or more equivalent discrete generations (t) in their pedigree (t-subset; 480 individuals). Furthermore, different cohorts were defined according to (a) the year of birth (C2007, 112 offspring; C2008, 208 offspring; and C2009, 125 offspring, including 20 individuals born in 2010) and (b) the number of complete generations in their pedigree (two complete generations—CG2, 325 offspring; and three complete generations—CG3, 149 individuals).

Estimates of homozygosity

Four parameters that characterize homozygosity (FROH, FHRR, FLH, and FYAN) were computed:

  1. FROH is the proportion of the genome covered by stretches of homozygous sites, usually referred to as runs of homozygosity (ROH) [42]. FROH for individual i was computed as FROHi=LROHiLAUTO, where LROHi is the length of the genome covered by ROH in individual i and LAUTO is the length of the autosomal genome [42]. Identification of ROH for each individual was carried out using the consecutive runs approach implemented in the package detectRuns in R [43], using the following parameters: a maximum number of five heterozygous SNPs and a maximum number of five missing SNPs in a run; a minimum number of 20 SNPs in a run; a minimum length of 1 kb; and a maximum distance of 1 Mb allowed between consecutive SNPs. These parameters are expected to allow almost complete detection of ROH [44] and to avoid bias caused by recombination events [45]. For descriptive purposes only, the ROH segments identified in each individual were summarized into ROH regions using the intersectBed function of the BedTools version 2.28.0 software [46], and the results were plotted as an idiogram using the package RIdeogram of R [47].

  2. FHRR is the proportion of the genome covered by heterozygosity-rich regions (HRR; formerly known as runs of heterozygosity) [48]. FHRR for individual i was computed as FHRRi=1-LHRRiLAUTO, where LHRRi is the length of the genome covered by HRR in individual i. Identification of HRR for each individual was performed using the consecutive runs approach implemented in the package detectRuns in R [43]. Given that HRR segments are expected to be less frequent and shorter (< 1 Mb) than ROH and that the main parameter leading to their identification is the number of homozygous SNPs allowed in a segment [17, 44, 49], the following parameters were used to detect HRR: a maximum number of five homozygous SNPs in a run; a maximum number of five missing SNPs in a run; a minimum number of 10 SNPs in a run; a minimum length of 1 kb; and a maximum distance of 1 Mb allowed between consecutive SNPs. For descriptive purposes, the HRR segments identified in each individual were summarized into HRR regions using the intersectBed function of the BedTools version 2.28.0 software [46], and the results were plotted as an idiogram using the package RIdeogram of R [47].

  3. FLH is the Li and Horvitz [50] estimator of the deviation of the observed frequency of homozygotes from that expected in the BP under Hardy–Weinberg proportions. The diagonals of the Li and Horvitz [50] relationship matrix were computed using custom R code as FLH=SFNEJ-k=1S1-2pk0(1-pk0)S-k=1S1-2pk0(1-pk0), where S is the number of loci, FNEJ is the proportion of loci homozygous in individual i [51] and pk(0) is the frequency of the reference allele of SNP k in the BP. FLH is equivalent to FIS [52, 53] after correction for the homozygosity in the BP and, therefore, gives information in terms of IBD.

  4. FYAN is equal to the diagonals of the genomic relationship matrix constructed using the Yang et al. [54] approach based on the correlation between uniting gametes. Using custom R code, FYAN was computed as FYAN=1Sk=1Sxk2-(1+2pk(0))xki+2pk(0)22pk(0)(1-pk(0)), where xk is the genotype of the individual for SNP k. Yang et al.’s [54] estimator gives more weight to homozygotes for the minor allele than to homozygotes for the major allele.

The four parameters computed are expected to characterize differences: (a) in the length of homozygous genomic stretches (FROH); (b) in the length of non-random heterozygous genomic segments (FHRR); (c) in expected homozygosity (FLH); and (d) on the diagonal of the genomic relationship matrix (FYAN). Unlike FROH and FHRR, both FLH and FYAN account for allele frequencies in a previously defined BP.

For descriptive purposes, possible trends in homozygosity estimates by pedigree depth (t) were assessed by computing regression coefficients using the package STAT of R [55].

Adjustment of estimates of homozygosity

To improve robustness in the estimates of FROH, FHRR, FLH and FYAN for each individual in the dataset, two consecutive adjustment strategies were applied:

  1. Following Powell et al. [56], individual estimates of homozygosity were adjusted for the mean homozygosity in the BP as Fia=Fi-FBP1-FBP, where Fia is the individual homozygosity estimate adjusted for the BP, Fi is the homozygosity estimate (FROH, FHRR, FLH or FYAN) for individual i and FBP is the mean of the BP for each corresponding homozygosity estimate. Note that this BP adjustment is independent of the use of the allele frequencies in the BP for the computation of FLH and FYAN.

  2. Considering the partitioning of genomic diversity into chromosomes, BP-adjusted estimates were also computed for each autosome and further averaged at the individual level by jackknifing over autosomes [57].

Hereafter, references to unadjusted estimates will be referred to as ‘raw’ estimates, estimates adjusted for the mean homozygosity in the BP will be referred to as ‘BP-adjusted’ estimates, and those computed with additional jackknifing over autosomes as ‘jackknifed’ estimates.

Increases in genomic inbreeding and Ne

For the four parameters used, the increase in homozygosity was computed at the individual level in two ways:

  1. Individual increase in homozygosity, computed as ΔtFi=1-1-Fit, where Fi is either the BP-adjusted or the jackknifed estimate of homozygosity and t is the equivalent discrete generations in the pedigree of the individual.

  2. Increase in pairwise homozygosity, computed as ΔpFijk=Fi-0.5Fj+Fk1-0.5Fj+Fk, where Fi, Fj and Fk are either the BP-adjusted or the jackknifed estimates of homozygosity of offspring i, parent j, and parent k, respectively.

Note that both these approaches are applied for each individual in the dataset as the deviations from either an “ideal” population with t = 0 (individual increase in homozygosity) or the mean genomic inbreeding of the parents of the individual, assumed to be a “reference population” from which each offspring is differentiated (increase in pairwise homozygosity).

Effective population sizes were subsequently computed by averaging ΔtFi and ΔpFijk among all individuals in a cohort and computed as N¯eFi=12ΔtFi¯ and N¯eFijk=12ΔpFijk¯, respectively. Uncertainty of the N¯eFi and N¯eFijk estimates obtained for the two yearly and genealogical cohorts defined were measured via root-mean-squared error (RMSE) as RMSE=Ne-N¯eF2n, where Ne is either the estimate of NeFi or of NeCij obtained from genealogical information, N¯eF is the corresponding estimate of effective population size computed from either ΔtFi or ΔpFijk for FROH, FHRR, FLH or FYAN, and n is the sample size (5, in this case).

Results

A full description of the pedigree of the Gochu Asturcelta pig used is given in association with the estimates of genomic parameters computed for each individual as Additional files (see Additional file 1: Tables S1, S2, S3, S4). Table 1 gives mean values for the genealogical parameters describing the pedigree. The mean pedigree-based inbreeding level of the population was 0.120 ± 0.074, with a mean pedigree depth of 2.8 ± 0.7 equivalent discrete generations (t). These values were only slightly higher (mean Fi = 0.134 ± 0.066; t = 2.9 ± 0.5) for the t-subset, which included 90% of the individuals in the dataset.

Table 1.

Mean (± standard deviation) of genealogical parameters (inbreeding, Fi, individual increase in inbreeding, ΔFi, and equivalent discrete generations, t) and estimates of homozygosity (FROH, FHRR, FLH, and FYAN) and their increase computed from genomic information

graphic file with name 12711_2023_846_Tab1_HTML.jpg

Minimum and maximum values are given in brackets. Genomic parameters are presented as raw values as well as values adjusted for the mean homozygosity in the base population (BP) and values obtained via jackknifing over autosomes. Data are given for the whole population as well as for the individuals with two or more equivalent discrete generations in their pedigree (t-subset). ΔtF values correspond to the individual increase in inbreeding computed for each of the four homozygosity parameters tested, while ΔpF values indicate the corresponding increase in pairwise homozygosity values

Raw homozygosity estimates

In total, 1,411,686 and 2,176,412 ROH and HRR segments, respectively, were identified across the 534 genotyped individuals. These segments were summarized into 3017 ROH areas covering 1.04 Gb and 4646 HRR areas covering 290.7 Mb over the 18 autosomes (Fig. 1).

Fig. 1.

Fig. 1

Ideogram illustrating, per autosome, the distribution of the 3017 and 4646 genomic areas in which ROH (in red) and HRR (in blue) segments were identified across the 534 genotyped Gochu Asturcelta individuals

Figure 2a illustrates the dispersion of the raw estimates of homozygosity by autosome in the individuals belonging to the BP. The raw estimates showed wide dispersion and evident differences in scale among homozygosity parameters. Raw estimates of the four parameters used (FROH, FHRR, FLH and FYAN) computed at the individual level are given in Additional file 1: Table S1 and summarised in Table 1. As expected, before BP adjustment of the estimates for each individual, FROH and FHRR always took positive and high mean values (0.522 and above, [see Additional file 1: Table S3]). However, both FLH and FYAN took significantly lower (FYAN = 0.030 ± 0.086) and even negative (FLH = − 0.041 ± 0.093) mean values. Raw estimates for both FLH and FYAN took negative values in a substantial number of individuals (355 individuals, 66%, and 203 individuals, 38%, respectively), including those belonging to the BP.

Fig. 2.

Fig. 2

Dispersion, over the 18 autosomes, of the four estimates of homozygosity tested (FROH, in dark blue; FHRR, in light blue; FLH, in grey; and FYAN in green) in the six individuals of the Gochu Asturcelta pedigree used as the base population. Plot (a) illustrates the raw mean values, whereas plot (b) illustrates the mean values after adjustment for the mean for each autosome in the base population

BP-adjusted homozygosity estimates

Figure 2b illustrates the effect of adjustment for the mean variability in the BP on the estimates of homozygosity, by autosome, in the BP individuals. BP adjustment gave less dispersed estimates across chromosomes. However, some individuals still had estimates of homozygosity departing from expectations on some chromosomes (namely, Sus scrofa (SSC) chromosome SSC2, SSC12 and SSC13).

Genomic parameters computed after adjustment for the mean homozygosity in the BP are given for each individual in the pedigree in Additional file 1: Table S2. Mean values for genomic parameters after adjustment for variability in BP are in Table 1 for both the whole population and the t-subset. After adjustment, both FROH and FHRR took negative values for some individuals belonging to the first generations of the pedigree (57 individuals, 11%, and 96 individuals, 18%, respectively), whereas the frequency of individuals with negative values for FLH and FYAN was lower (73 individuals, 14%, and 11 individuals, 2%, respectively). However, the means for FROH (0.123 ± 0.095), FHRR (0.111 ± 0.104), FLH (0.099 ± 0.081), and FYAN (0.152 ± 0.075) were always positive. The corresponding means for the t-subset were slightly higher, ranging from 0.112 ± 0.072 (FROH) to 0.163 ± 0.069 (FYAN).

Figure 3 (left column) illustrates the variation in the BP-adjusted individual homozygosity values per t. The four homozygosity parameters showed a large amount of variation. When families with five or more offspring were considered, the within-family coefficient of variation ranged from 11 to 519% for FROH, from 17 to 334% for FHRR, from 10 to 1,366% for FLH, and from 55 to 1,688% for FYAN. Although the homozygosity values did not show clear trends, regression of individual homozygosity estimates on t indicated that FROH, FHRR, FLH and FYAN tended to increase significantly (p < 0.001) with pedigree depth. However, this scenario was not confirmed in the t-subset [see Additional file 1: Table S3]: regression of both FLH and FYAN estimates on t became statistically non-significant (p > 0.10), while those corresponding to FROH and FHRR continued to be significant (p < 0.05).

Fig. 3.

Fig. 3

Individual variation in the four different homozygosity estimates used (FROH, FHRR, FLH, and FYAN). Values computed for each individual in the dataset (in circles) are plotted by pedigree depth (equivalent discrete generations) of the animals. Regression of the individual homozygosity values over pedigree depth (t) is also displayed. The corresponding regression coefficients (b) and their statistical significance (p) are also given. The left column illustrates the variation in the base population-adjusted values, whereas the right column illustrates the variation in the jackknifed values

Jackknifed estimates of homozygosity

Genomic parameters computed via jackknifing over autosomes are given for each individual in the pedigree in Additional file 1: Table S4. The corresponding mean values for the whole population and the t-subset are also in Table 1. After jackknifing, mean values were slightly lower than those obtained adjusting for BP only. A higher mean was obtained for FYAN (0.148 ± 0.072), and a lower mean was obtained for FLH (0.092 ± 0.081), whereas the means for FROH (0.116 ± 0.096) and FHRR (0.104 ± 0.104) had intermediate values. The ranges (maximum and minimum values) of the jackknifed values computed for FROH, FHRR, and FLH were higher than those computed after BP adjustment only. However, the range for FYAN was narrower than that observed after correcting for variability in the BP only.

Figure 3 also illustrates variation in the individual homozygosity values per t after jackknifing over autosomes. Compared with the BP-adjusted estimates, no clear differences in variation could be assessed. However, both regression coefficients and goodness-of-fit (R2) of the regressions of homozygosity estimates on t were slightly higher than those obtained for the BP-adjusted values (Fig. 3; Additional file 1: Table S3). The within-family coefficients of variation (for families with five or more offspring) ranged from 11 to 726% for FROH, from 17 to 723% for FHRR, and from 7 to 1865% for FLH. However, the range of within-family coefficients of variation for FYAN was significantly narrower (ranging from 7 to 178%).

Increases in inbreeding and homozygosity

Table 1 also gives mean values for the genealogical individual increase in inbreeding (ΔFi), as well as for the individual increase in homozygosity (ΔtFi) and the increase in pairwise homozygosity (ΔpFijk) values computed from genomic data. Although they were computed via jackknifing over autosomes or adjusting for the variability in the BP only, mean ΔpFijk values were roughly twofold higher than the corresponding ΔtFi values, except for FYAN, for which the mean values were more similar (ΔpFYAN = 0.071 ± 0.088 and ΔtFYAN = 0.060 ± 0.032, and ΔpFYAN = 0.071 ± 0.083 and ΔtFYAN = 0.058 ± 0.031, respectively). In spite of the similar computation methods applied, mean ΔtFi values computed for FROH, FHRR, and FLH were lower than the genealogical increase in inbreeding (0.066 ± 0.048), which, in turn, was closer to the BP-adjusted and jackknifed mean values of ΔtFYAN. These trends were the same for the t-subset. Figure 4 shows the variation in the genealogical increase in inbreeding and both the individual increase in homozygosity and the increase in pairwise homozygosity by pedigree depth. As expected, for t ≥ 2, the ΔtFi values tended to fit the genealogical reference (ΔFi) better than the ΔpFijk values. Furthermore, while ΔtFi values tended to be lower than the mean genealogical increase in inbreeding, particularly for ΔtFLH, the ΔpFijk values could take higher or lower values than the genealogical reference at different levels of t.

Fig. 4.

Fig. 4

Comparison between the variation of different estimates of individual increase in homozygosity and the genealogical individual increase in inbreeding (dotted line) by pedigree depth (equivalent discrete generations; t). In the left column, the individual increase in homozygosity (red line) and the increase in pairwise homozygosity (green line) computed from the base population-adjusted values of individual homozygosity are given for each of the four homozygosity parameters (FROH, FHRR, FLH, and FYAN) tested. In the right column, the individual increase in homozygosity (blue line) and the increase in pairwise homozygosity (purple line) computed from the values of individual homozygosity obtained via jackknifing over autosomes are given for the same parameters

Consistent with the lower mean homozygosity values computed via jackknifing over autosomes, mean jackknifed ΔtFi values were slightly lower than those computed using BP-adjusted values (Table 1). However, the increase in pairwise homozygosity values showed the opposite pattern, with mean jackknifed ΔpFijk values higher than the corresponding BP-adjusted values, except for FYAN. In practical terms, the mean values for both ΔtFYAN and ΔpFYAN were the same.

Effective population size

Table 2 gives the effective population size computed from genealogical and genomic information for the five yearly and genealogical cohorts defined. Regarding genealogical information, Ne computed from individual increases in inbreeding was slightly higher than that computed from increases in pairwise coancestry, except for C2007 (which was approximately twofold higher) and CG3, for which NeCij (7.8) was higher than NeFi (7.0).

Table 2.

Estimates of effective population size computed using genealogical information (individual increase in inbreeding, NeFi i, and coancestry, NeCij) and genomic information (individual increase in homozygosity and increase in pairwise homozygosity)

graphic file with name 12711_2023_846_Tab2_HTML.jpg

Increases in genomic homozygosity were computed for four different genomic parameters: FROH, FHRR, FLH and FYAN. Genomic-based estimates of Ne were computed from raw homozygosity values as well as from unbiased estimates obtained via jackknifing over autosomes. Estimates were computed for the five yearly and genealogical cohorts defined. Mean equivalent discrete generations (t; standard deviation in brackets) are also given

The Ne values estimated using jackknifed individual increases in homozygosity were always higher than those obtained after adjustment for the BP only. However, the opposite pattern was observed for the Ne estimates based on the increase in pairwise homozygosity. Estimates of Ne computed using ΔpFijk tended to be roughly half of those computed using ΔtFi. In any case, independently of the method for the calculation of increases in homozygosity, estimates of Ne¯FYANijk tended to be nearer to their Ne¯FYANi counterparts.

Ne estimates obtained using FYAN had stable root-mean-squared errors, independent of the approach used for computation of the increase in homozygosity (Table 3). However, when an increase in pairwise homozygosity was considered, Ne¯FROHijk and particularly Ne¯FHRRijk resulted in lower RMSE values.

Table 3.

Root-mean-squared error (RMSE) for the genomic-based Ne estimated for the five yearly and genealogical cohorts defined using the corresponding NeFi and NeCij values as references

NeFi NeCij
BP-adjusted values Jackknifed values BP-adjusted values Jackknifed values
Individual increase in homozygosity
 Ne¯FROHi 3.2 3.3 3.0 3.1
 Ne¯FHRRi 3.4 3.5 3.2 3.3
 Ne¯FLHi 3.7 3.8 3.4 3.5
 Ne¯FYANi 3.0 3.0 2.8 2.8
Increase in pairwise homozygosity
 Ne¯FROHijk 2.6 2.5 2.5 2.5
 Ne¯FHRRijk 2.5 2.5 2.4 2.4
 Ne¯FLHijk 3.0 3.0 2.9 2.9
 Ne¯FYANijk 3.0 3.0 2.9 2.9

Discussion

Three groups of inbreeding coefficients were assessed in this study: (a) genealogical Fi, as a standard indicator of the genetic variability in a pedigree; (b) FROH and FHRR, estimating genomic identity from homozygous and heterozygous genomic segments, respectively; and (c) FLH and FYAN, estimating homozygosity while accounting for the allele frequencies in the BP. The pedigree-based inbreeding coefficients reflect IBD probabilities at an infinite number of unlinked loci over descendants that share a pedigree. However, this assumption does not fit with genomic inbreeding estimates. It is unlikely that individuals of a founder population, for instance, in populations under conservation programmes [58], do not show different degrees of genomic relatedness and, possibly, some degree of structuring. This is why the definition of a BP for adjustment of genomic inbreeding estimates is challenging [13].

Usefulness of estimates of homozygosity

Comparison among homozygosity parameters is not straightforward, not only because of differences in computation but also because they are dependent on the variability in the BP [13, 15]. The probabilistic process of Mendelian segregation results in large variance across loci, among individuals, and at the population level. Consequently, homozygosity parameters have higher variation than expected from genealogies, even in full sibs [19, 48]. The four homozygosity parameters used here were adjusted for the mean homozygosity values in the BP, making the different estimates more comparable (Fig. 2).

The first group of homozygosity parameters tested, FROH and FHRR, are not computed using allele frequencies [59]. Their main advantage is that, before BP adjustment, their definition always yields positive values (ranging from 0 to 1), leading to a straightforward interpretation. However, technical issues, such as marker density and the computational methods used (and the parameters assumed) for identification of either ROH or HRR segments, affect the estimates due to a possible less-efficient identification of shorter segments. This may be of importance for both FROH and FHRR. Since we used SNP array data with sufficient density to capture long homozygous or heterozygous segments, we applied the consecutive runs approach to identify both ROH and HRR stretches. This approach has been reported to have better performance for the identification of short (homozygous or heterozygous) genomic stretches than the well-known sliding window approach [17, 60].

Different studies have suggested that FROH can be an accurate estimator of IBD [13, 16, 6163]. However, in small pedigrees, FROH may depend to a large extent on long ROH segments [6]. ROH patterns in a population are frequently conditioned by either recent demographic events or selection in the populations under study, leading to the occurrence of long ROH segments [63, 64]. Homozygosity estimates are greater when using long ROH segments than when using shorter ROH segments [13]. Therefore, fitting well with the Gochu Asturcelta scenario analysed, FROH better characterizes recent inbreeding than old inbreeding.

This is the first study that uses FHRR to estimate genomic inbreeding. Unlike ROH, the nature of HRR is not well understood. Although recombination and other processes can shape the frequency and distribution of ROH over the genome, ROH are significantly more common in regions with high linkage disequilibrium and low recombination, tending to be identically inherited from the parents and therefore provide a good IBD estimator (see Gibson et al. [65] and many others later). Although the occurrence of HRR has been reported to be statistically non-random and to cluster in specific chromosomal regions [48], it seems to be independent from linkage disequilibrium in the regions [66]. Furthermore, HRR have been recently shown to be independent of the expected heterozygosity in the population [67] and informative about population structure and demographic history [49].

Altogether, the published evidence described above suggests that the evolutionary forces that shape ROH and HRR segments can be different. Furthermore, our results (Fig. 1) confirm that the chromosomal region with ROH and HRR segments do not overlap. Although the usefulness of HRR for monitoring genomic diversity in livestock needs further confirmation, we consider that the information provided by HRR through the different evolutionary forces acting on different chromosomal regions cannot be disregarded and is likely to be, at least, complementary to that provided by ROH. In any case, the increasing availability of sequence data will further clarify the evolutionary and demographic drivers of HRR and the role of these regions in livestock adaptation.

The literature suggests that HRR are short or very short and that they could be subject to balancing or countervailing selection [17, 44, 48, 49]. Our results confirm that HRR segments are short (always shorter than 1 Mb, and 89% of them shorter than 100 kb). Therefore, theoretically, the ability of ROH and HRR to characterize overall homozygosity (gain of homozygous stretches or loss of heterozygous stretches) in an individual may differ. However, our results suggest that after BP adjustment, both FROH and FHRR values are highly consistent (Table 1; Fig. 2).

In contrast with FROH and FHRR, which are putatively useful to characterize the overall homozygosity in an individual or population, the second group of homozygosity parameters, FLH and FYAN, have been reported as useful estimators of homozygosity due to IBS [9, 16, 45, 53, 59]. The results of FLH and FYAN depend on the allele frequencies of a defined reference population [45]. Furthermore, FYAN is heavily weighted by the frequency of rare alleles [13, 15, 53, 59]. Values obtained for either FLH or FYAN can range from -1 to 1, and in our data, individual estimates of FLH and FYAN took negative values with some frequency. Caballero et al. [13] showed that both FLH and FYAN can take lower values than expected due to the excess of heterozygotes expected for markers in finite populations. Therefore, the behaviour of FLH and FYAN cannot be straightforwardly compared to that of FROH and FHRR [15, 53] unless BP adjustment is applied. Although FROH,FHRR, and FLH weight each genotyped allele equally, Yang et al.’s [54] method tends to yield higher homozygosity estimates in individuals that are homozygous for rare alleles [13, 15]. Other methods, particularly FROH, were less efficient in capturing homozygosity of rare alleles [52].

Adjustment of homozygosity estimates

The reasons summarized above make different homozygosity parameters difficult to compare. To overcome this issue, we adjusted the four homozygosity measures according to the mean values in the BP, allowing more comparable behaviours among them. However, estimates of homozygosity are strongly affected by Mendelian sampling and the genomic relatedness between individuals in the BP [8]. The adjustment for variability in the BP does not allow complete correction of the estimates (Fig. 2). In this scenario, outliers can still occur, therefore suggesting that additional strategies (here, ‘jackknifing over autosomes’) should be applied to improve robustness of the estimates.

After correction for the mean homozygosity in the BP, the main differences in behaviour were found for FYAN, confirming that this parameter is highly sensitive to this adjustment. There is no consensus on how to best define the BP. Some authors reported that using the actual allele frequencies in the BP can give inconsistent FYAN values across generations with unrealistic patterns of increases or decreases in variability [15]. Other authors reported that fixing the allele frequencies in the base population to 0.5 [8, 68] allows stronger correlations of genealogical inbreeding estimates with genomic estimates computed from the genomic relationship matrix to be obtained. However, defining the current population as the BP yields variable correlations, even negative, with pedigree-based estimates [13].

Furthermore, lack of information about the real founders in a population can bias estimates of inbreeding parameters. This is particularly important when the aim is to compare estimates of homozygosity and inbreeding because the number of loci genotyped is always finite and the base (or founder) population has a different proportion of homozygous loci assumed to be IBS, whereas when using genealogies, founders are assumed to be unrelated and F is always dependent on the pedigree depth [69, 70]. Although small in size, our BP included two out of four of the founders of the population of Gochu Asturcelta and four direct descendants of the non-genotyped founders [32, 33]. Estimates of inbreeding that account for allele frequencies (FLH and FYAN) are expected to capture relationships due to the existence of common ancestors in a population, tracing the genetic variability of the present population back to the BP [16]. However, the literature suggests that founder allele frequencies in small populations have little meaning: in the absence of mutation, all current copies of an allele may be IBD, and its current frequency simply represents the reproductive success of the founder and its descendants [45] and could be related to the census size of breeders [13]. In our data, homozygosity tended to increase significantly with pedigree depth (t), and BP adjustment resulted in acceptably consistent behaviour among homozygosity parameters (Fig. 3; Table 1). However, this approach may not be sufficient and jackknifing over autosomes suggested that additional adjustments would be necessary to obtain robust estimates; thus, BP-adjusted estimates of homozygosity may be biased upwards.

In general, our results suggest that BP adjustment does not remove differences in performance among homozygosity parameters, therefore suggesting that the genetic variability of a population should be assessed with caution. Of course, allele frequencies may be strongly affected by both editing of genotype data and the presence of allelic dropouts and null alleles in arrays [37], which can bias homozygosity estimates. Removal of rare alleles can cause an apparent increase in inbreeding due to IBS rather than IBD [9, 62]. To avoid this undesirable effect, we edited the SNP array data, filtering only loci that have Mendelian errors only, which have been previously shown to be mainly caused by technical issues and cause spurious diversity [37].

Demographic causes of variation in genomic estimates of homozygosity

In our population, homozygosity did not show clear trends of variation in relation to pedigree depth (Fig. 3). In fact, the accumulation of homozygosity with pedigree depth in inbred populations may be lower than expected when minimum coancestry matings are implemented, as in our case [32]. The Gochu Asturcelta conservation programme applied a strict mating policy to avoid matings between close relatives and to prolong the reproductive career of the founders and their direct descendants as much as possible to keep founder contributions balanced across generations and to preserve their genetic background in the population [32]. Theoretical and empirical evidence suggests that mating policies involving minimum coancestry in undivided populations balance allele frequencies at neutral genetic markers across generations [18, 23], therefore minimizing Mendelian segregation variance [22].

Although genomic estimates of inbreeding reflect the percentage of the genome that is homozygous, pedigree-based estimates are only expectations [20]. This suggests that the ability of homozygosity parameters to characterize the genetic variability of a population should be tested in each particular case, making it difficult to provide general guidelines. The results obtained in a particular population may be shaped by its particular breeding scenario. The literature suggests that between-individual variation in inbreeding estimators is strongly increased by presence of population structure [45] as well as by relatedness within the population, the mating policy applied, and the presence of large families increase [13, 58].

Increases in homozygosity and effective population size

Here, we evaluated two approaches to compute the increase in homozygosity. All estimates had large standard deviations (Table 1) as a consequence of the relatively small sample size and the strong effect of Mendelian sampling on the estimates of genomic inbreeding. The individual increase in homozygosity is conceptually similar to the individual increase in inbreeding (ΔFi) proposed by Gutiérrez et al. [27, 28], which has been used as a standard for comparisons. Although ΔFi allows a flexible definition of the reference population and is independent of mating policies [29, 41], it is based on the assumption that losses in diversity depend on pedigree depth only [71]. However, this assumption cannot be straightforwardly applied to individual increases in homozygosity: although founder individuals and their direct descendants in a pedigree are not inbred, all individuals in a pedigree have different degrees of homozygosity that can partially be explained by IBS. Our data suggest that the accumulation of the pedigree tends to increase homozygosity (Fig. 3). However, matings between relatives may be more important. In this regard, the increase in pairwise homozygosity could be intuitively considered a more accurate estimate of the increase in homozygosity due to IBD. In any case, increases in homozygosity are not constant across the genome [72]. IBD estimates in a limited genomic area or at the genome-wide level exhibit more variance than those computed for chromosomes [5]. To avoid this, jackknifing over autosomes was applied to the BP-adjusted genomic inbreeding estimates.

Although the differences were small, the results obtained after jackknifing suggest that BP-adjusted estimates of individual increases in homozygosity can be overestimates, while BP-adjusted values for increases in pairwise homozygosity can be underestimates (Table 1; Fig. 4). Values obtained for ΔtFi displayed consistent variation with ΔFi, which can be explained by their similar computation methods. The two approaches tested for increases in homozygosity showed consistent behaviour for FROH, FHRR, and FLH. However, the increases in homozygosity computed using FYAN departed from this general behaviour. FYAN is expected to have a smaller sampling variance than other estimates of homozygosity [54]. Our results confirmed this, as ΔtFYAN had lower variation (54.1%) than ΔtFHRR (105.9%).

The usefulness of the two approaches tested for increases in homozygosity to compute Ne has been compared with that of the genealogical “realized” Ne computed from ΔFi but also with that derived from the increase in pairwise coancestry proposed by Cervantes et al. [73]. The last approach is expected to give more stable estimates of Ne in cases of shallow pedigrees due to the larger amount of information used in the computations.

Estimates of Ne obtained from genealogical and genomic information are expected to have large differences [16, 22, 68, 74, 75]. Indeed, criteria based on pedigree information refer to an infinite number of loci [71], while criteria based on observed genomic polymorphism mirror phenomena related to temporal changes in allele frequency at a limited number of loci [22, 76]. Furthermore, genealogical estimates of Ne, frequently used as a reference in the literature, may not reflect the actual Ne. Even with suitable genealogical data, genealogical Ne does not consider natural selection against homozygotes. Therefore, the genetic variability at the genomic level could be higher than expected. Although different genomic methods have been proposed to obtain reliable Ne estimates using half- and full-sib allele frequencies [77] and heterozygote excess [78], computation of reliable estimates of Ne is still a challenge. Mendelian sampling causes large differences in allele frequencies among close relatives (e.g., full-sibs) and at the population level. This may be particularly important in small populations with shallow pedigrees and frequent matings between relatives.

Genomic approaches are expected to give higher estimates of Ne than approaches based on genealogical data [14, 33], probably varying with the actual number of breeders in the sample [13]. This expectation fits well with the estimates of Ne obtained using individual increases in homozygosity. However, this is not so clear for increases in pairwise homozygosity. The assumption that an increase in homozygosity only depends on pedigree depth causes ΔtFi values to underestimate the IBD scenario of the subpopulations analysed when compared with ΔpFi. Our results suggest that pedigree depth itself may influence the accumulation of homozygosity (see Additional file 1: Table S3). However, this effect may be weaker than that of matings between relatives. In any case, jackknifing over autosomes can help avoid under- and overestimation of Ne. For individual increases in homozygosity, estimates of Ne¯FYANi tended to be closer to both NeFi and NeCij estimates. However, this was not the case for the increase in pairwise homozygosity, which was more similar to Ne¯FROHijk and, particularly, to Ne¯FHRRijk. The lower variance of the FYAN values may lead to better adjustment of the influence of pedigree depth on increases in homozygosity.

Estimates of Ne based on ΔpFROH and, particularly, on ΔpFHRR had lower RMSE values in the dataset. If ΔpFijk values are useful for characterizing increases in homozygosity due to IBD, our results suggest that increases in homozygosity would be caused by losses of small HRR genomic segments, which could be more sensitive to mating between relatives than to the increase in the length of long stretches of ROH.

In any case, as expected, estimates of genomic Ne were affected by cohort size: the larger the subpopulation assessed is, the higher the estimates of genomic Ne. In this regard, the values for the cohorts C2008 and CG3 had the highest estimates for a given method. The yearly cohort C2008 included several non-inbred individuals (F = 0), which could lead to higher Ne estimates. However, this did not happen in CG3, therefore confirming some dependency on genomic Ne and sampling size. In the Gochu Asturcelta population, cohort sizes have been reported to be crucial for obtaining reliable estimates of Ne [33]. Our results, obtained using completely different information and methods, confirm this point.

Conclusions

Here, we analysed the ability of different genomic estimates of homozygosity to characterize the genetic background of a livestock population under mating policies aiming at conserving genetic variability. Our study showed that the mating policy applied to the Gochu Asturcelta pig population was successful in maintaining balanced allele frequencies. However, in such scenarios, parameters that characterize homozygosity are limited in their ability to depict trends in losses of variability. In spite of the presence of closely-related individuals from different families that vary in size, the four homozygosity parameters used (FROH, FHRR, FLH, and FYAN) characterized the evolution of the pedigree to some extent. However, different adjustments were necessary to improve robustness of the estimates: although the adjustment of the homozygosity values by the mean of the BP appears indispensable, additional adjustment via jackknifing over autosomes helps to account for within-individual genome variation. Unlike genealogical approaches, the assumption that increases in homozygosity depend on pedigree depth only may not apply to small, shallow pedigrees, and an increase in pairwise homozygosity may better characterize autozygosity. Although increases in homozygosity computed using FYAN seem to have good general behaviour, increases in pairwise homozygosity computed using FROH and FHRR may be particularly useful if efficient identification of short ROH and HRR segments can be ensured.

Supplementary Information

12711_2023_846_MOESM1_ESM.xlsx (413.1KB, xlsx)

Additional file 1: Table S1. Pedigree used and raw individual estimates of homozygosity. Description of the pedigree of the Gochu Asturcelta pigs used and raw estimates of homozygosity parameters computed at the individual level. Table S2: BP-adjusted homozygosity estimates. Description of the BP-adjusted homozygosity values and the corresponding increase in homozygosity parameters computed at the individual level. Table S3: Regression of homozygosity against equivalent discrete generations. Regression of four homozygosity estimates and their corresponding individual increase in homozygosity (Δt) and paired increase in homozygosity (Δp) against equivalent discrete generations (t). Table S4: Jackknifed homozygosity values and their corresponding individual increases in homozygosity (Δt) and paired increases in homozygosity (Δp) values. Description of the jackknifed homozygosity values and the corresponding increase in homozygosity parameters computed at the individual level.

Acknowledgements

The authors kindly thank the members of the Gochu Asturcelta Breeders Association (ACGA; https://www.gochuasturcelta.org/) for their support and kind collaboration. The genotyping service was carried out at CEGEN-PRB3-ISCIII; it was supported by grant PT17/0019 of PE I+D+i 2013-2016, funded by ISCIII and ERDF. The authors are indebted to Dr. Gonçalo Carvalho (Thermo Fisher Scientific) for his willingness and kind contribution to the execution of this project.

Author contributions

FG and IA conceived and planned the project; KDA, FG, and JPG wrote the paper; KDA, FG, and IF performed the data analyses; KDA, FG, and JPG interpreted the models; IA and IF performed sampling and discussed and interpreted genetic data in light of the statistical and breeding evidence; and IA performed the laboratory work. All the authors read and approved the final manuscript.

Funding

This research was partially funded by AEI-FEDER, grant no. PID2019-103951RB/AEI/10.13039/501100011033. K.D.A. was funded by AEI-ESF, grant no. PRE2020-092905. For the purpose of open access, the author has applied a CC BY public copyright licence to any author-accepted manuscript version arising from this submission.

Availability of data and materials

All data necessary to replicate the results presented are provided as Additional Tables (.xlsx). SNP array data are currently under analysis within the “AutoGenome” project framework. However, the dataset used is available from the corresponding author upon reasonable request.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Wright S. Evolution in Mendelian populations. Genetics. 1931;16:97–159. doi: 10.1093/genetics/16.2.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Malécot G. Les mathématiques de l’hérédité. Paris: Masson et Cie; 1948. [Google Scholar]
  • 3.Cotterman CW. A calculus for statistico-genetics. PhD thesis, Ohio State University; 1940.
  • 4.Hill WG, Weir BS. Variation in actual relationship as a consequence of Mendelian sampling and linkage. Genet Res (Camb) 2011;93:47–64. doi: 10.1017/S0016672310000480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Visscher PM, Medland SE, Ferreira MAR, Morley KI, Zhu G, Cornes BK, et al. Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLoS Genet. 2006;2:e41. doi: 10.1371/journal.pgen.0020041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Gómez-Raya L, Rodríguez C, Barragán C, Silió L. Genomic inbreeding coefficients based on the distribution of the length of runs of homozygosity in a closed line of Iberian pigs. Genet Sel Evol. 2015;47:81. doi: 10.1186/s12711-015-0153-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Morales-González E, Fernández J, Pong-Wong R, Toro MÁ, Villanueva B. Changes in allele frequencies when different genomic coancestry matrices are used for maintaining genetic diversity. Genes (Basel) 2021;12:673. doi: 10.3390/genes12050673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wang J. Pedigrees or markers: which are better in estimating relatedness and inbreeding coefficient? Theor Popul Biol. 2016;107:4–13. doi: 10.1016/j.tpb.2015.08.006. [DOI] [PubMed] [Google Scholar]
  • 9.Wang J. Marker-based estimates of relatedness and inbreeding coefficients: an assessment of current methods. J Evol Biol. 2014;27:518–530. doi: 10.1111/jeb.12315. [DOI] [PubMed] [Google Scholar]
  • 10.Kardos M, Luikart G, Allendorf FW. Measuring individual inbreeding in the age of genomics: marker-based measures are better than pedigrees. Heredity (Edinb) 2015;115:63–72. doi: 10.1038/hdy.2015.17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Maltecca C, Tiezzi F, Cole JB, Baes C. Symposium review: Exploiting homozygosity in the era of genomics: selection, inbreeding, and mating programs. J Dairy Sci. 2020;103:5302–5313. doi: 10.3168/jds.2019-17846. [DOI] [PubMed] [Google Scholar]
  • 12.Caballero A, Villanueva B, Druet T. On the estimation of inbreeding depression using different measures of inbreeding from molecular markers. Evol Appl. 2021;14:416–428. doi: 10.1111/eva.13126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Caballero A, Fernández A, Villanueva B, Toro MA. A comparison of marker-based estimators of inbreeding and inbreeding depression. Genet Sel Evol. 2022;54:82. doi: 10.1186/s12711-022-00772-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Silió L, Barragán C, Fernández AI, García-Casco J, Rodríguez MC. Assessing effective population size, coancestry and inbreeding effects on litter size using the pedigree and SNP data in closed lines of the Iberian pig breed. J Anim Breed Genet. 2016;133:145–154. doi: 10.1111/jbg.12168. [DOI] [PubMed] [Google Scholar]
  • 15.Villanueva B, Fernández A, Saura M, Caballero A, Fernández J, Morales-González E, et al. The value of genomic relationship matrices to estimate levels of inbreeding. Genet Sel Evol. 2021;53:42. doi: 10.1186/s12711-021-00635-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rodríguez-Ramilo ST, Elsen JM, Legarra A. Inbreeding and effective population size in French dairy sheep: comparison between genomic and pedigree estimates. J Dairy Sci. 2019;102:4227–4237. doi: 10.3168/jds.2018-15405. [DOI] [PubMed] [Google Scholar]
  • 17.Mulim HA, Brito LF, Pinto LFB, Ferraz JBS, Grigoletto L, Silva MR, et al. Characterization of runs of homozygosity, heterozygosity-enriched regions, and population structure in cattle populations selected for different breeding goals. BMC Genomics. 2022;23:209. doi: 10.1186/s12864-022-08384-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Royo LJ, Álvarez I, Gutiérrez JP, Fernández I, Goyache F. Genetic variability in the endangered Asturcón pony assessed using genealogical and molecular information. Livest Sci. 2007;107:162–169. [Google Scholar]
  • 19.Saura M, Fernández A, Rodríguez MC, Toro MA, Barragán C, Fernández AI, et al. Genome-wide estimates of coancestry and inbreeding in a closed herd of ancient Iberian pigs. PLoS One. 2013;8:e78314. doi: 10.1371/journal.pone.0078314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Toro MA, Villanueva B, Fernández J. Genomics applied to management strategies in conservation programmes. Livest Sci. 2014;166:48–53. [Google Scholar]
  • 21.Fernández J, Toro MA, Gómez-Romano F, Villanueva B. The use of genomic information can enhance the efficiency of conservation programs. Anim Front. 2016;6:59–64. [Google Scholar]
  • 22.Wang J, Santiago E, Caballero A. Prediction and estimation of effective population size. Heredity (Edinb) 2016;117:193–206. doi: 10.1038/hdy.2016.43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Fernández J, Villanueva B, Pong-Wong R, Toro MÁ. Efficiency of the use of pedigree and molecular marker information in conservation programs. Genetics. 2005;170:1313–1321. doi: 10.1534/genetics.104.037325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.de Cara MAR, Fernández J, Toro MA, Villanueva B. Using genome-wide information to minimize the loss of diversity in conservation programmes: using genome-wide information to minimize the loss of diversity. J Anim Breed Genet. 2011;128:456–464. doi: 10.1111/j.1439-0388.2011.00971.x. [DOI] [PubMed] [Google Scholar]
  • 25.Gómez-Romano F, Villanueva B, Rodríguez de Cara MÁ, Fernández J. Maintaining genetic diversity using molecular coancestry: the effect of marker density and effective population size. Genet Sel Evol. 2013;45:38. doi: 10.1186/1297-9686-45-38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Meuwissen THE, Sonesson AK, Gebregiwergis G, Woolliams JA. Management of genetic diversity in the era of genomics. Front Genet. 2020;11:880. doi: 10.3389/fgene.2020.00880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Gutiérrez JP, Cervantes I, Molina A, Valera M, Goyache F. Individual increase in inbreeding allows estimating effective sizes from pedigrees. Genet Sel Evol. 2008;40:359–378. doi: 10.1186/1297-9686-40-4-359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gutiérrez JP, Cervantes I, Goyache F. Improving the estimation of realized effective population sizes in farm animals. J Anim Breed Genet. 2009;126:327–332. doi: 10.1111/j.1439-0388.2009.00810.x. [DOI] [PubMed] [Google Scholar]
  • 29.Cervantes I, Goyache F, Molina A, Valera M, Gutiérrez JP. Application of individual increase in inbreeding to estimate realized effective sizes from real pedigrees. J Anim Breed Genet. 2008;125:301–310. doi: 10.1111/j.1439-0388.2008.00755.x. [DOI] [PubMed] [Google Scholar]
  • 30.Arias KD, Lee H, Bozzi R, Álvarez I, Gutiérrez JP, Fernández I, et al. Ascertaining the genetic background of the Celtic-Iberian pig strain: a signatures of selection approach. J Anim Breed Genet. 2023; 17829268. 10.1111/jbg.12829. [DOI] [PubMed]
  • 31.Menéndez J, Goyache F, Beja-Pereira A, Fernández I, Menéndez-Arias NA, Godinho R, et al. Genetic characterisation of the endangered Gochu Asturcelta pig breed using microsatellite and mitochondrial markers: insights for the composition of the Iberian native pig stock. Livest Sci. 2016;187:162–167. [Google Scholar]
  • 32.Menéndez J, Alvarez I, Fernandez I, Goyache F. Genealogical analysis of the Gochu Asturcelta pig breed: insights for conservation. Czech J Anim Sci. 2016;61:140–149. [Google Scholar]
  • 33.Menéndez J, Álvarez I, Fernandez I, Menéndez-Arias NA, Goyache F. Assessing performance of single-sample molecular genetic methods to estimate effective population size: empirical evidence from the endangered Gochu Asturcelta pig breed. Ecol Evol. 2016;6:4971–4980. doi: 10.1002/ece3.2240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Menéndez J, Álvarez I, Fernández I, de la Roza B, Goyache F. Multiple paternity in domestic pigs under equally probable natural matings—a case study in the endangered Gochu Asturcelta pig breed. Arch Anim Breed. 2015;58:217–220. [Google Scholar]
  • 35.Arias KD, Gutiérrez JP, Fernandez I, Menéndez-Arias NA, Álvarez I, Goyache F. Segregation patterns and inheritance rate of copy number variations regions assessed in a Gochu Asturcelta pig pedigree. Gene. 2023;854:147111. doi: 10.1016/j.gene.2022.147111. [DOI] [PubMed] [Google Scholar]
  • 36.Groenen MAM, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, Rothschild MF, et al. Analyses of pig genomes provide insight into porcine demography and evolution. Nature. 2012;491:393–398. doi: 10.1038/nature11622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Arias KD, Álvarez I, Gutiérrez JP, Fernandez I, Menéndez J, Menéndez-Arias NA, et al. Understanding Mendelian errors in SNP arrays data using a Gochu Asturcelta pig pedigree: genomic alterations, family size and calling errors. Sci Rep. 2022;12:19686. doi: 10.1038/s41598-022-24340-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Gutiérrez JP, Goyache F. A note on ENDOG: a computer program for analysing pedigree information. J Anim Breed Genet. 2005;122:172–176. doi: 10.1111/j.1439-0388.2005.00512.x. [DOI] [PubMed] [Google Scholar]
  • 39.Maignel L, Boichard D, Verrier E. Genetic variability of French dairy breeds estimated from pedigree information. Interbull Bull. 1996;14:49–53. [Google Scholar]
  • 40.Meuwissen T, Luo Z. Computing inbreeding coefficients in large populations. Genet Sel Evol. 1992;24:305–313. [Google Scholar]
  • 41.Cervantes I, Pastor JM, Gutiérrez JP, Goyache F, Molina A. Computing effective population size from molecular data: the case of three rare Spanish ruminant populations. Livest Sci. 2011;138:202–206. [Google Scholar]
  • 42.McQuillan R, Leutenegger A-L, Abdel-Rahman R, Franklin CS, Pericic M, Barac-Lauc L, et al. Runs of homozygosity in European populations. Am J Hum Genet. 2008;83:359–372. doi: 10.1016/j.ajhg.2008.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Biscarini F, Cozzi P, Gaspa G, Marras G. Package ‘detectRUNS’. https://cran.r-project.org/web/packages/detectRUNS/detectRUNS.pdf/. Accessed 8 Sept 2023.
  • 44.Biscarini F, Mastrangelo S, Catillo G, Senczuk G, Ciampolini R. Insights into genetic diversity, runs of homozygosity and heterozygosity-rich regions in Maremmana semi-feral cattle using pedigree and genomic data. Animals (Basel) 2020;10:2285. doi: 10.3390/ani10122285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Thompson EA. Identity by descent: variation in meiosis, across genomes, and in populations. Genetics. 2013;194:301–326. doi: 10.1534/genetics.112.148825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Hao Z, Lv D, Ge Y, Shi J, Weijers D, Yu G, et al. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput Sci. 2020;6:e251. doi: 10.7717/peerj-cs.251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Williams JL, Hall SJG, Del Corvo M, Ballingall KT, Colli L, Ajmone Marsan P, et al. Inbreeding and purging at the genomic Level: the Chillingham cattle reveal extensive, non-random SNP heterozygosity. Anim Genet. 2016;47:19–27. doi: 10.1111/age.12376. [DOI] [PubMed] [Google Scholar]
  • 49.Bizarria Dos Santos W, PimentaSchettini G, Gonçalves Fonseca M, Pereira GL, Loyola Chardulo LA, Machado Neto O, et al. Fine-scale estimation of inbreeding rates, runs of homozygosity and genome-wide heterozygosity levels in the Mangalarga Marchador horse breed. J Anim Breed Genet. 2021;138:161–173. doi: 10.1111/jbg.12508. [DOI] [PubMed] [Google Scholar]
  • 50.Li CC, Horvitz DG. Some methods of estimating the inbreeding coefficient. Am J Hum Genet. 1953;5:107–117. [PMC free article] [PubMed] [Google Scholar]
  • 51.Nejati-Javaremi A, Smith C, Gibson JP. Effect of total allelic relationship on accuracy of evaluation and response to selection. J Anim Sci. 1997;75:1738–1745. doi: 10.2527/1997.7571738x. [DOI] [PubMed] [Google Scholar]
  • 52.Wright S. The genetical structure of populations. Ann Eugen. 1949;15:323–354. doi: 10.1111/j.1469-1809.1949.tb02451.x. [DOI] [PubMed] [Google Scholar]
  • 53.Alemu SW, Kadri NK, Harland C, Faux P, Charlier C, Caballero A, et al. An evaluation of inbreeding measures using a whole-genome sequenced cattle pedigree. Heredity (Edinb) 2021;126:410–423. doi: 10.1038/s41437-020-00383-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010;42:565–569. doi: 10.1038/ng.608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Bolar K. Package ‘STAT’. https://cran.r-project.org/web/packages/STAT/STAT.pdf/ Accessed 8 Sept 2023.
  • 56.Powell JE, Visscher PM, Goddard ME. Reconciling the analysis of IBD and IBS in complex trait studies. Nat Rev Genet. 2010;11:800–805. doi: 10.1038/nrg2865. [DOI] [PubMed] [Google Scholar]
  • 57.Efron B. The Jackknife, the bootstrap and other resampling plans. 6. Philadelphia: Society for Industrial and applied mathematics; 1994. [Google Scholar]
  • 58.Goudet J, Kay T, Weir BS. How to estimate kinship. Mol Ecol. 2018;27:4121–4135. doi: 10.1111/mec.14833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Zhang Q, Calus MP, Guldbrandtsen B, Lund MS, Sahana G. Estimation of inbreeding using pedigree, 50k SNP chip genotypes and full sequence data in three cattle breeds. BMC Genet. 2015;16:88. doi: 10.1186/s12863-015-0227-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Peripolli E, Munari DP, Silva MVGB, Lima ALF, Irgang R, Baldi F. Runs of homozygosity: current knowledge and applications in livestock. Anim Genet. 2017;48:255–271. doi: 10.1111/age.12526. [DOI] [PubMed] [Google Scholar]
  • 62.Ferenčaković M, Sölkner J, Curik I. Estimating autozygosity from high-throughput information: effects of SNP density and genotyping errors. Genet Sel Evol. 2013;45:42. doi: 10.1186/1297-9686-45-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Curik I, Ferenčaković M, Sölkner J. Inbreeding and runs of homozygosity: a possible solution to an old problem. Livest Sci. 2014;166:26–34. [Google Scholar]
  • 64.Purfield DC, McParland S, Wall E, Berry DP. The distribution of runs of homozygosity and selection signatures in six commercial meat sheep breeds. PLoS One. 2017;12:e0176780. doi: 10.1371/journal.pone.0176780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Gibson J, Morton NE, Collins A. Extended tracts of homozygosity in outbred human populations. Hum Mol Genet. 2006;15:789–795. doi: 10.1093/hmg/ddi493. [DOI] [PubMed] [Google Scholar]
  • 66.Selli A, Ventura RV, Fonseca PAS, Buzanskas ME, Andrietta LT, Balieiro JCC, et al. Detection and visualization of heterozygosity-rich regions and runs of homozygosity in worldwide sheep populations. Animals (Basel) 2021;11:2696. doi: 10.3390/ani11092696. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Kenny D, Carthy TR, Murphy CP, Sleator RD, Evans RD, Berry DP. The association between genomic heterozygosity and carcass merit in cattle. Front Genet. 2022;13:789270. doi: 10.3389/fgene.2022.789270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lozada-Soto EA, Maltecca C, Lu D, Miller S, Cole JB, Tiezzi F. Trends in genetic diversity and the effect of inbreeding in American Angus cattle under genomic selection. Genet Sel Evol. 2021;53:50. doi: 10.1186/s12711-021-00644-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Álvarez I, Royo LJ, Gutiérrez JP, Fernández I, Arranz JJ, Goyache F. Relationship between genealogical and microsatellite information characterizing losses of genetic variability: empirical evidence from the rare Xalda sheep breed. Livest Sci. 2008;115:80–88. [Google Scholar]
  • 70.Fernández J, Meuwissen THE, Toro MA, Mäki-Tanila A. Management of genetic diversity in small farm animal populations. Animal. 2011;5:1684–1698. doi: 10.1017/S1751731111000930. [DOI] [PubMed] [Google Scholar]
  • 71.Leroy G, Mary-Huard T, Verrier E, Danvy S, Charvolin E, Danchin-Burge C. Methods to estimate effective population size using pedigree data: examples in dog, sheep, cattle and horse. Genet Sel Evol. 2013;45:1. doi: 10.1186/1297-9686-45-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Gossmann TI, Woolfit M, Eyre-Walker A. Quantifying the variation in the effective population size within a genome. Genetics. 2011;189:1389–1402. doi: 10.1534/genetics.111.132654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Cervantes I, Gutiérrez JP, Molina A, Goyache F, Valera M. Genealogical analyses in open populations: the case of three Arab-derived Spanish horse breeds. J Anim Breed Genet. 2009;126:335–347. doi: 10.1111/j.1439-0388.2008.00797.x. [DOI] [PubMed] [Google Scholar]
  • 74.Granado-Tajada I, Rodríguez-Ramilo ST, Legarra A, Ugarte E. Inbreeding, effective population size, and coancestry in the Latxa dairy sheep breed. J Dairy Sci. 2020;103:5215–5226. doi: 10.3168/jds.2019-17743. [DOI] [PubMed] [Google Scholar]
  • 75.Charlesworth B. Effective population size and patterns of molecular evolution and variation. Nat Rev Genet. 2009;10:195–205. doi: 10.1038/nrg2526. [DOI] [PubMed] [Google Scholar]
  • 76.Toro MA, Meuwissen THE, Fernández J, Shaat I, Mäki-Tanila A. Assessing the genetic diversity in small farm animal populations. Animal. 2011;5:1669–1683. doi: 10.1017/S1751731111000498. [DOI] [PubMed] [Google Scholar]
  • 77.Wang J. A new method for estimating effective population sizes from a single sample of multilocus genotypes. Mol Ecol. 2009;18:2148–2164. doi: 10.1111/j.1365-294X.2009.04175.x. [DOI] [PubMed] [Google Scholar]
  • 78.Pudovkin AI, Zaykin DV, Hedgecock D. On the potential for estimating the effective number of breeders from heterozygote-excess in progeny. Genetics. 1996;144:383–387. doi: 10.1093/genetics/144.1.383. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12711_2023_846_MOESM1_ESM.xlsx (413.1KB, xlsx)

Additional file 1: Table S1. Pedigree used and raw individual estimates of homozygosity. Description of the pedigree of the Gochu Asturcelta pigs used and raw estimates of homozygosity parameters computed at the individual level. Table S2: BP-adjusted homozygosity estimates. Description of the BP-adjusted homozygosity values and the corresponding increase in homozygosity parameters computed at the individual level. Table S3: Regression of homozygosity against equivalent discrete generations. Regression of four homozygosity estimates and their corresponding individual increase in homozygosity (Δt) and paired increase in homozygosity (Δp) against equivalent discrete generations (t). Table S4: Jackknifed homozygosity values and their corresponding individual increases in homozygosity (Δt) and paired increases in homozygosity (Δp) values. Description of the jackknifed homozygosity values and the corresponding increase in homozygosity parameters computed at the individual level.

Data Availability Statement

All data necessary to replicate the results presented are provided as Additional Tables (.xlsx). SNP array data are currently under analysis within the “AutoGenome” project framework. However, the dataset used is available from the corresponding author upon reasonable request.


Articles from Genetics, Selection, Evolution : GSE are provided here courtesy of BMC

RESOURCES