Abstract
Biologically informed single nucleotide polymorphisms (SNPs) impact genomic prediction accuracy of the target traits. Our previous genomics, proteomics, and transcriptomics work identified candidate genes related to puberty and fertility in Brahman heifers. We aimed to test this biological information for capturing heritability and predicting heifer fertility traits in another breed i.e., Tropical Composite. The SNP from the identified genes including 10 kilobases (kb) region on either side were selected as biologically informed SNP set. The SNP from the rest of the Bos taurus genes including 10-kb region on either side were selected as biologically uninformed SNP set. Bovine high-density (HD) complete SNP set (628,323 SNP) was used as a control. Two populations—Tropical Composites (N = 1331) and Brahman (N = 2310)—had records for three traits: pregnancy after first mating season (PREG1, binary), first conception score (FCS, score 1 to 3), and rebreeding score (REB, score 1 to 3.5). Using the best linear unbiased prediction method, effectiveness of each SNP set to predict the traits was tested in two scenarios: a 5-fold cross-validation within Tropical Composites using biological information from Brahman studies, and application of prediction equations from one breed to the other. The accuracy of prediction was calculated as the correlation between genomic estimated breeding values and adjusted phenotypes. Results show that biologically informed SNP set estimated heritabilities not significantly better than the control HD complete SNP set in Tropical Composites; however, it captured all the observed genetic variance in PREG1 and FCS when modeled together with the biologically uninformed SNP set. In 5-fold cross-validation within Tropical Composites, the biologically informed SNP set performed marginally better (statistically insignificant) in terms of prediction accuracies (PREG1: 0.20, FCS: 0.13, and REB: 0.12) as compared to HD complete SNP set (PREG1: 0.17, FCS: 0.10, and REB: 0.11), and biologically uninformed SNP set (PREG1: 0.16, FCS: 0.10, and REB: 0.11). Across-breed use of prediction equations still remained a challenge: accuracies by all SNP sets dropped to around zero for all traits. The performance of biologically informed SNP was not significantly better than other sets in Tropical Composites. However, results indicate that biological information obtained from Brahman was successful to predict the fertility traits in Tropical Composite population.
Keywords: cattle, fertility, genomic prediction, multi-omics, multitrait meta-analysis
Multi-omics data can be used to select single nucleotide polymorphism (SNP) that are biologically relevant. These SNP can be used to aid genomic selection strategies.
Introduction
Early puberty in beef cattle underpins higher first conception rates. First conception and related reproductive traits, which are easy to record, can be useful in beef production systems to perform genetic selection. Adoption of genomic selection strategies has contributed with a 3- to 4-fold increase in the genetic gain for traits with low heritability, such as fertility traits in dairy cattle (Weller et al., 2017). Reproductive traits in beef cattle are often low to moderately heritable (Hawken et al., 2012; Zhang et al., 2014; Ali et al., 2019). The complex and polygenic natures of reproductive traits were reported in cattle and pigs (Fortes et al., 2013; Zak et al., 2017). The complexity of fertility traits emerges from the molecular networking of multiple loci plus the impact of environmental factors (Chen et al., 2008).
Theory shows that the accuracy of genomic prediction is improved by higher trait heritability and larger reference population size (Goddard, 2009). The accuracy of genomic prediction also improves when the reference population consists of a combination of different breeds or populations (Brondum et al., 2011; de Haas et al., 2012; Porto-Neto et al., 2015; Song et al., 2019). It can be deduced that genomic prediction is challenging for fertility traits with low heritability but can be improved by increasing the size of the reference population and combining different populations.
Another aspect to increase the accuracy of genomic prediction is the use of customized sets of single nucleotide polymorphism (SNP; Xiang et al., 2021). The commonly used genomic best linear unbiased prediction (GBLUP) method assumes that all loci have small effects on traits under investigation. While assuming this, GBLUP ignores the fact that biological information retained by specific regions of the genome governs the expression of the phenotypes (Abdollahi-Arpanahi et al., 2017). Multiple studies have demonstrated that the use of informative variants, in linkage disequilibrium (LD) with causal mutations, and ignoring non-informative variants may improve the accuracy of genomic predictions (Sarup et al., 2016; van den Berg et al., 2016; Raymond et al., 2018; Xiang et al., 2019). In addition to genomic prediction, the trait-specific informative SNP also impact heritability estimates (Sarup et al., 2016; Tahir et al., 2022). Various SNP selection strategies have been proposed to improve the accuracy of genomic predictions, such as selecting SNP with large effects for specific traits (Weigel et al., 2009), SNP linked to trait-specific gene sets or pathways (Koufariotis et al., 2014; Edwards et al., 2016; Abdollahi-Arpanahi et al., 2017; Rezende et al., 2019), SNP with presumed functional roles (Koufariotis et al., 2014), and selection of SNP based on machine learning approaches (Li et al., 2018; Yin et al., 2020). There is no current consensus on what the best method for customizing SNP sets for fertility traits is.
Our group previously published genome-wide association, transcriptomics, and proteomics studies that identified candidate genes related to puberty and fertility in Brahman heifers (Nguyen, 2018; Tahir et al., 2019, 2021). In the current study, we used the biological information generated by these previous studies to perform genomic predictions for three female reproductive traits: pregnancy status after the first joining (PREG1, binary), first conception score (FCS, score 1 to 3), and rebreeding score (REB, score 1 to 3.5) in Tropical Composite heifer population. The hypothesis was that the use of SNP located in genomic regions which are biologically relevant to heifer puberty and fertility might result in capturing the genetic variation and producing better genomic prediction accuracies for puberty-based fertility traits in across-breed scenario. To test this hypothesis, we compared three SNP sets: a biologically informed SNP set, a biologically uninformed SNP set, and the high-density (HD) 777K SNP set. In doing so, we investigated the usefulness of the biological information obtained from the studies on Brahman breed to capture genetic variation, estimate heritabilities, and predict phenotypes in another breed Tropical Composite. We compared the SNP sets in two scenarios: 1) across breed use of biological information: 5-fold cross-validation within Tropical Composite heifer population using SNP sets that were selected based on Brahman studies; and 2) across breed use of prediction equations: application of prediction equations (SNP effects) from Brahman to Tropical Composites and vice versa to predict GEBV.
Materials and Methods
Prior biological knowledge from previous multi-omics experiments
Previously, our group used genomics, proteomics, and transcriptomics approaches to identify candidate genes related to puberty and fertility in Brahman heifers. The genomics approach applied multitrait meta-analysis on three early reproductive traits PREG1, FCS, and REB to identify candidate genes related to fertility in Brahman female cattle (Tahir et al., 2021). The proteomics approach used mass-spectrometry technique for hypothalamic, pituitary, and ovarian tissues of pre- vs. postpubertal Brahman heifers to identify proteins/genes related to puberty (Nguyen, 2018; Tahir et al., 2019). The transcriptomics approach used hypothalamic, pituitary, and ovarian tissues of pre- vs. postpubertal Brahman heifers to identify genes related to puberty through RNA-sequencing technique (Nguyen, 2018). A comprehensive list of candidate genes (number of genes = 5,306) was identified in these studies. We named this list as multi-omics gene list. In contrast to multi-omics gene list, we created a list of nonmulti-omics gene (number of genes = 21,927) which included all Bos taurus genes except the multi-omics gene list. Based on these gene lists, different SNP sets were preselected to be used in genetic parameter estimation and genomic prediction of the traits selected for this study.
Preselection of SNP sets
Three SNP sets were selected to test their ability to explain genetic variance of the traits used in this study and to predict these traits. The first set comprised of imputed genotypes from bovine HD complete SNP retained after quality control filtering: MAF > 0.05; and call rate > 0.9. This SNP set was termed as “HD complete SNP set”. From the HD complete SNP set, the SNP within the multi-omics genes including 10 kb up and down stream genomic regions were selected as “biologically informed SNP set”. Similarly, from the HD complete SNP set, the SNP within nonmulti-omics genes including 10 kb up and downstream genomic regions were selected as “biologically uninformed SNP set”. The details about these SNP sets are provided in Table 1. The three SNP sets were used to estimate genetic parameters and to perform genomic predictions for three traits PREG1, FCS, and REB.
Table 1.
Description of SNP sets used for genomic prediction of the heifer reproductive traits measured early in life
| SNP Set | SNP Markers | Average MAF1 | Description |
|---|---|---|---|
| HD complete SNP Set | 669,662 | 0.295 | All BovineHD SNP (MAF > 0.05; call rate > 0.9) |
| Biologically informed SNP set | 110,040 | 0.295 | Within 10 kb around multi-omics genes |
| Biologically uninformed SNP set | 228,308 | 0.294 | Within 10 kb around non-multi-omics genes |
1Minor allele frequency.
Genotype and phenotype datasets
Genotype and phenotype records used in this study were obtained from “Female Fertility PhenoBank” project (L.GEN.1710), a research project funded by Meat and Livestock Australia (MLA). This project maintains a database of reproductive phenotypes recorded in female cattle of different tropically adapted beef breeds, typical of northern Australia. We obtained data of two breeds i.e., Brahman and Tropical Composite from Female Fertility PhenoBank. The cattle populations and scoring criteria of the traits used in this study are described in Table 2.
Table 2.
Scoring criteria and number of animals per trait in each population
| Trait | Score | Scoring criteria | Tropical composite number of animals | Brahman number of animals | |||||
|---|---|---|---|---|---|---|---|---|---|
| CRC4 | NT5 | Total | CRC | KAM6 | NT | Total | |||
| PREG11 | 1 | Not pregnant after first joining | 138 | 167 | 305 | 261 | 211 | 128 | 600 |
| 2 | Pregnant after first joining | 737 | 289 | 1026 | 671 | 997 | 51 | 1719 | |
| Total | 875 | 456 | 1331 | 932 | 1208 | 179 | 2319 | ||
| FCS2 | 1 | Never conceived up to 36-m age | 136 | – | 136 | 230 | 199 | – | 429 |
| 2 | Conceived in 29 to 36-m age | 19 | – | 19 | 41 | 395 | – | 436 | |
| 3 | Conceived before 29-mo age | 847 | – | 847 | 658 | 492 | – | 1150 | |
| Total | 1002 | – | 1002 | 929 | 1086 | – | 2015 | ||
| REB3 | 1 | Not pregnant after the first two joining | – | 22 | 22 | 26 | 116 | 11 | 153 |
| 2 | Pregnant in second joining, not in first | 133 | 144 | 277 | 235 | 198 | 117 | 550 | |
| 3 | Pregnant in first joining, not in second | 196 | 109 | 305 | 387 | 103 | 16 | 506 | |
| 4 | Pregnant twice, after first two joining | 541 | 170 | 711 | 284 | 7 | 53 | 326 | |
| Total | 870 | 445 | 1315 | 932 | 424 | 179 | 1535 | ||
1Pregnancy status after first joining.
2First conception score.
3Rebreeding score.
4Beef CRC population.
5Kamilaroi population.
6NT DITT population.
Tropical composite population
Since the biologically informed and uninformed SNP sets were based on biological information obtained from Brahman cattle, we planned to test the effectiveness of these SNP sets in capturing genetic variance, estimating heritabilities, and predicting phenotypes in an independent population of a different breed i.e., Tropical Composite. This led to a selection of genotype and phenotype records of 1,350 Tropical Composite cows from the Cooperative Research Centre for Beef Genetic Technologies (Beef CRC; Hawken et al., 2012) and the Northern Territory Department of Industry, Tourism, and Trade (NT DITT; Schatz and Hearnden, 2008) herds. The phenotype records for early reproductive traits PREG1 and REB were available from both herds in this population, while the records of FCS were available with the Beef CRC herd only (Table 2).
Brahman population
Records of 2,319 Brahman cows belonging to the Beef CRC, NT DITT herds, and a commercial Brahman breeder herd (Kamilaroi) were available from the Female Fertility PhenoBank. The same population was used in our previous study on genomics approach (described above) to identify genes related to traits PREG1, FCS, and REB. The purpose of selecting this population in this study was to perform across breed validation of SNP effects from one breed to the other for the estimation of GEBV.
Genotypes and imputation
A combined reference panel of 1,180 Brahman and Tropical Composite cattle, genotyped with the Bovine-HD (777K) SNP chip, was used to impute medium density SNP genotypes (Bovine 50K SNP chip) of Tropical Composite population to HD level (777K). Similarly, a reference panel of 546 Brahman animals, genotyped with the Bovine-HD (777K) SNP chip, was used to impute medium density genotypes (Bovine 50K SNP chip) of Brahman population to HD level. A combination of Eagle v2.4.1 (Loh et al., 2016) and Minimac3 (Das et al., 2016) software was used for imputation. After quality control filtering (allelic R2 > 0.4, MAF > 0.05), a total of 669,662 SNP for PREG1, 669,793 SNP for REB, and 671,280 SNP for FCS were available in the Tropical Composite heifers. Similarly, in Brahman population, 587,900 SNP for PREG1, 584,510 SNP for FCS, and 584,344 SNP for REB were available after imputation.
Training and validation groups in the tropical composite population
5-fold cross-validation approach was adapted to determine the accuracy of genomic predictions for each trait in the Tropical Composite population. 5-fold cross-validation provides the advantage of large training groups and avoids high variation in accuracies of genomic prediction (Erbe et al., 2013). For this purpose, five training and five validation groups were selected from the Tropical Composite population. The records in the Tropical Composite population were randomized (Silva et al., 2016; Toghiani et al., 2017), and five equal subsets were made. The training and validation groups were made in a way that every training group consisted of four out of five subsets and the fifth subset was declared as a validation group. This way, five training and validation groups (TR1, TR2, TR3, TR4, TR5 and Val1, Val2, Val3, Val4, Val5) were created for each trait. The composition of validation groups showing the representation of scores for all traits is provided in the Supplementary Figure S1. The training and validation groups were used to determine genetic parameters and the accuracy of genomic predictions, respectively.
Fixed effects
The individual datasets that were combined to create Tropical Composite population came from independently raised herds, therefore, each dataset had its own fixed effects. The Beef CRC dataset had fixed effects: farm location (animals raised together on the same farm), cow crop (birth year), and birth month class (August to November = class A; December to April = class B). The NT DITT dataset had the fixed effects: cow crop and birth month class. Similarly, in the Brahman population, dataset from each herd had its own fixed effects. The Beef CRC dataset had farm location, cow crop, and birth month class as fixed effects. The NT DITT dataset had cow crop and birth month class as fixed effects. Kamilaroi had cow crop as fixed effect. In addition to the above-mentioned fixed effects, “datasets” themselves were also used as fixed effects. All fixed effects in each individual dataset were combined to make a combined fixed effect, which was used in the final model.
Another factor to consider was different breeds within the Tropical Composite population. Tropical Composite is a generic term for stable crossbreeds from different breeds in Australia. Animals from the individual herds within the Tropical Composite population had undefined multiple Tropical Composite breeds (Figure 1). To address breed differences, Tropical Composite genotypes were subjected to principal component analysis using software GCTA and the top two principal components were used as covariates in the final model. The variance explained by the first and second principal components was 5% and 2%, respectively.
Figure 1.
Principal component analysis of breeds from different populations used in this study. CRC_BB, Cooperative Research Centre Brahman cows; CRC_TC, Cooperative Research Centre Tropical Composite cows; Kamilaroi_BB, Kamilaroi herd Brahman cows; NT DITT_BB, Northern Territory Department of Industry, Tourism, and Trade Brahman cows; NT DITT_TC, Northern Territory Department of Industry, Tourism, and Trade Tropical Composite cows.
Estimation of genetic parameters and GBLUP analysis
To investigate the first scenario, 5-fold cross-validation in the Tropical Composite population, the training groups were used to estimate additive genomic breeding values and SNP effects using GBLUP methodology implemented in GCTA software (Yang et al., 2011). The model used was as follows (Zhang et al., 2014):
| (1) |
where, X is the incidence matrix of fixed effect, β is the estimate of fixed effects, Z is the incidence matrix of random polygenic effects, g is the estimate of random additive polygenic effects distributed as (g) ∼ N(0, Gσg2), where G is the genomic relationship matrix (GRM) based on set of SNP markers included in the respective SNP sets. Residual is represented as e and distributed as (e) ~ N(0, Iσe2). The same model was used to estimate heritability of the traits using complete records of the Tropical Composite population.
The GBLUP SNP effects estimated from the training groups were used to calculate the GEBV of the validation groups, as follows (Toghiani et al. 2017):
| (2) |
where, Z is the incidence matrix designating the additive allele score of jth SNP for ith individual, and a is the estimated SNP effect corresponding to the jth SNP.
In addition to model (1), another model including multiple GRM (biologically informed and biologically uninformed) was also implemented. The purpose of this model was to estimate the additive genetic variance explained by biologically informed and biologically uninformed SNP sets, when fitted together in the model. Following was the multi-GRM model used:
where, X is the incidence matrix for fixed effect, β is the estimate of fixed effects, Z is the incidence matrix for random polygenic effects, gbi is the estimate of the parameter with distribution (gbi) ∼ N(0, Gbiσgbi2) that captured the additive genetic variance explained by the biologically informed SNP set, and gbu is the estimate of the parameter with distribution (gbu) ∼ N(0, Gbuσgbu2) that captured the additive genetic variance by the biologically uninformed SNP set. Gbi and Gbu are the genomic relationship matrices that were constructed using the biologically informed and uninformed SNP sets, respectively. Residual is represented as e and distributed as (e) ~ N(0, Iσe2).
Across-breed validation of SNP effects
To investigate the second scenario, across breed validation of SNP effect, complete records of both populations were used to estimate SNP effects for all SNP sets and traits using model (1). The SNP effects from one breed were used to predict GEBV of the other breed.
Accuracy of genomic prediction
The accuracy of genomic prediction was calculated as the correlation coefficient between GEBV and phenotypes adjusted for fixed effects. In case of 5-fold cross-validation in Tropical Composite population, the average of accuracies from five validation groups was considered as the accuracy of genomic prediction for the traits.
Comparison with Bovine XT-50 genotype array
A genotyping array “XT-50” was recently reported to have variants pleiotropic for 34 traits of economic importance including fertility in cattle (Xiang et al., 2021). We compared the genomic regions tagged by biologically informed SNP set in our study with the XT-50 genotype array. Since the XT-50 array included sequence variants, we expected limited commonalities between biologically informed SNP set and the XT-50 genotype array. To make a fair comparison, the extended regions of 10 kb around variants of the XT-50 array were used to extract HD SNP from the Bovine-HD SNP set. This approach resulted in a list of HD SNP in vicinity of the XT-50 variants that were comparable to the biologically informed SNP set of our study.
Results
In the Tropical Composite population, using the HD complete SNP set, PREG1 came up with the highest heritability estimate of 0.16 followed by REB with an estimate of 0.11, and FCS with an estimate of 0.08. When comparing the SNP sets within traits, the biologically informed SNP set produced relatively higher estimates of heritability for PREG1 (0.18) and FCS (0.12) than the HD complete SNP set and biologically uninformed SNP set; however, the differences in estimates were statistically insignificant. For REB, the heritability estimate by the HD complete SNP set was the highest as 0.11 (Table 3).
Table 3.
Estimates of heritability and accuracy of genomic prediction for heifer reproductive traits measured early in life in Bos indicus derived cattle in two across breed scenarios
| SNP sets | h 2 1 | SE2 | Accuracy (r)3 | Accuracy (r) | Accuracy (r) |
|---|---|---|---|---|---|
| (TC) | 5-Fold validation TC4 |
SNP effects of BB5 on TC |
SNP effects of TC on BB |
||
| PREG16 | |||||
| Biologically informed SNP set | 0.18 | 0.05 | 0.20 (0.012) | -0.03 | 0.002 |
| HD complete SNP set | 0.16 | 0.05 | 0.17 (0.01) | -0.001 | 0.02 |
| Biologically uninformed Set | 0.13 | 0.05 | 0.16 (0.01) | 0.01 | 0.02 |
| FCS7 | |||||
| Biologically informed SNP set | 0.12 | 0.05 | 0.13 (0.018) | -0.04 | −0.03 |
| HD complete SNP set | 0.08 | 0.05 | 0.10 (0.017) | −0.02 | −0.002 |
| Biologically uninformed Set | 0.08 | 0.05 | 0.10 (0.017) | −0.01 | −0.006 |
| REB8 | |||||
| Biologically informed SNP set | 0.09 | 0.04 | 0.12 (0.018) | −0.04 | −0.03 |
| HD complete SNP set | 0.11 | 0.05 | 0.11 (0.023) | −0.02 | −0.02 |
| Biologically uninformed Set | 0.10 | 0.05 | 0.11 (0.024) | −0.02 | −0.03 |
1Heritability of traits in Tropical Composite population.
2Standard error of heritability.
3Accuracy of genomic prediction: correlation between GEBV and adjusted phenotypes. Parentheses: standard error of accuracy of genomic prediction.
4Tropical Composite population.
5Brahman population.
6Pregnancy after first joining.
7First conception score.
8Rebreeding score.
In multiple GRM-based GBLUP analysis, when GRM based on the biologically informed and uninformed SNP sets were used together in the model, the biologically informed SNP set captured all observed heritability for PREG1 and FCS while biologically uninformed SNP set could not capture any proportion of the heritability for these traits. In the case of REB, the heritability was split between biologically informed and uninformed SNP sets with biologically uninformed SNP set explaining more proportion of heritability than the biologically informed SNP set. The detailed estimates of genetic variance and heritability by all SNP sets in multiple GRM GBLUP models for all traits are provided in Table 4.
Table 4.
Heritability and genetic variance explained by biologically informed and uninformed SNP sets in multiple GRM GBLUP model
| Trait | MGRM1 used in model | Genetic variance (SE2) | Heritability (SE) |
|---|---|---|---|
| PREG13 | Biologically informed SNP set | 0.032 (0.01) | 0.23 (0.1) |
| Biologically uninformed SNP set | 0 (0.01) | 0 (0.1) | |
| FCS4 | Biologically informed SNP set | 0.077 (0.05) | 0.16 (0.1) |
| Biologically uninformed SNP set | 0 (0.05) | 0 (0.1) | |
| REB5 | Biologically informed SNP set | 0.011 (0.04) | 0.03 (0.09) |
| Biologically uninformed SNP set | 0.030 (0.04) | 0.08 (0.09) |
1Multiple genomic relationship matrix.
2Standard error.
3Pregnancy status after first joining.
4First conception score.
5Rebreeding score.
Overall, the accuracy of genomic prediction was low for all traits which is typical for low heritable fertility traits. In the first scenario, 5-fold cross-validation within the Tropical Composite population, biologically informed SNP set allowed estimation of relatively more accurate GEBV for all traits when compared to the HD complete SNP set or the biologically uninformed SNP set; however, these differences were nominal and not statistically significant. The accuracy of genomic prediction by biologically informed SNP set was 0.20 for PREG1, 0.13 for FCS, and 0.12 for REB. The HD complete SNP set produced genomic prediction accuracy of 0.17 for PREG1, 0.10 for FCS, and 0.11 for REB. Biologically uninformed SNP set produced almost similar accuracies of genomic prediction for all traits as did the HD complete SNP set. The details of genomic prediction accuracies by all SNP sets for all traits are provided in Table 3 and Figure 2.
Figure 2.
Accuracies of genomic prediction in Tropical Composite heifer population through 5-fold cross validation. BIN: Biologically informed SNP set included SNP within 10 kb region of the genes identified in puberty and fertility related studies in Brahman cattle through multi-omics techniques. HDC: HD complete SNP set included all SNP from BovineHD SNP chip after quality control filtering. BUN: Biologically uninformed SNP set included SNP within 10 kb of all Bos taurus genes except the genes used in selecting biologically informed SNP set. PREG1, pregnancy after first joining, FCS, first conception score, REB, rebreeding score.
In the second scenario, the application of SNP effects from one breed to the other, the accuracies of genomic prediction for all traits were drastically reduced for all SNP sets when the GEBV of a population were calculated using SNP effects from the other population. All SNP sets resulted in the accuracy of genomic prediction for all traits around zero (Table 3).
In order to compare the genomic regions tagged by biologically informed SNP set and XT-50 genotyping array, we identified 184K HD SNP in extended genomic regions (10 kb) around the variants of XT-50 genotype array. The comparison of these SNP with biologically informed SNP set of the current study (110K) revealed 100K SNP in common.
Discussion
The purpose of this study was to investigate the effectiveness of fertility-related biological information, discovered in the Brahman breed, to estimate genetic parameters of early fertility traits and predict these in another breed i.e., Tropical Composite. We compared a biologically informed SNP set to the HD complete SNP set and to a biologically uninformed SNP set as means to incorporate (or not) biological information from previous studies (Nguyen et al., 2017; Nguyen, 2018; Tahir et al., 2019, 2021) into genomic predictions for heifer fertility traits measured early in life. The idea was to test the effectiveness of the biological information with the challenge of two scenarios: 1) determining the effectiveness of biologically informed SNP set (obtained from Brahman breed) for genetic parameter estimation and genomic prediction of puberty-based reproductive traits in another breed i.e., Tropical Composite through 5-fold cross-validation, and 2) determining the effectiveness of biologically informed SNP set for genomic prediction through application of SNP effects from one breed on other.
Overall, the accuracies of genomic prediction for all traits by all SNP sets were low in this study. Accuracy of genomic prediction is dependent on the heritability of the trait (Hayes and Goddard, 2010; Zhang et al., 2014). The low accuracy of genomic prediction is expected when the heritability of the traits is low which is true in the case of fertility traits (Cammack et al., 2009; Toghiani et al., 2017). The relationship of low heritability and low accuracies of genomic prediction was reflected in the results of this study where estimates of heritability were found to be 0.16, 0.11, and 0.08, and the accuracy of genomic prediction was found to be 0.17, 0.11, and 0.10 for PREG1, REB, and FCS, respectively, using HD complete SNP set.
For all traits, the accuracy of genomic prediction by biologically informed SNP set was slightly higher (statistically insignificant) than the HD complete SNP set and biologically uninformed SNP set in the Tropical Composite population using 5-fold validation approach in this study. A report on genomic prediction of production traits in pigs explained that dilution of SNP panels with less informative SNP (SNP not associated with target trait) can decrease the predictive ability of the SNP panel (Sarup et al., 2016). Similarly, Ling et al. (2021) described the effect of QTL-linked and nonlinked SNP on the genomic prediction of the traits and reported that inclusion of non-linked SNP in the prediction equation decreased the accuracy of genomic prediction. These studies indicate the importance of prioritizing informative genomic regions and SNP for improved genomic prediction. In the current study, the selection of biologically informed SNP set was based on genomic regions related to puberty and fertility in Brahman breed (studies cited above). The results indicate that the biologically informed SNP set was also effective in predicting fertility traits in another breed Tropical Composite. It is possible that the genomic regions tagged by the biologically informed SNP set were enriched for SNP that were informative for fertility traits in both breeds, hence were useful in genomic prediction in across-breed scenario.
The existence of trait-specific biologically informative SNP in specific genomic regions can also be supported by further results of this study which indicate that the biologically informed SNP set having 110K SNP produced slightly higher (statistically insignificant) estimates of heritability for PREG1 and FCS and nominally better prediction accuracies for all traits. The nominal improvements were observed even though the HD complete SNP set and the uninformed SNP set had a higher number of SNP (>228K compared to 110K). Since the biologically informed SNP were also part of the HD complete SNP set, statistically similar estimates of heritability and prediction accuracy were expected. Even though the biologically informed and the uninformed SNP sets were different, their performance in terms of capturing heritability and predicting traits might be similar because of LD between SNP across these sets. We note that LD can occur within nearby genomic regions and inter-chromosomally (Kulminski, 2011). We accounted for any LD possibility by fitting simultaneously the GRM from biologically informed and uninformed SNP sets in a GBLUP model. Through this model, we were able to partition the contribution of biologically informed and uninformed SNP set towards the heritability of the studied traits. The results indicated that the biologically informed SNP set captured all of the observed heritability for PREG1 and FCS when fitted together with the GRM of biologically uninformed SNP. However, this scenario was different for REB, where 73% of heritability was explained by the biologically uninformed SNP set while 27% was explained by the biologically informed SNP set. We believe that REB might be a more complex trait than PREG1 and FCS because it is influenced by postpartum anestrous interval in addition to puberty (Kinder and Werth, 1991; Tahir et al., 2021). We observed small differences in heritability estimates by biologically informed SNP set when fitted alone and together with biologically uninformed SNP set in the models. These differences might be due to interaction between biologically informed and uninformed SNP sets when fitted as covariates with each other.
Another point to observe is that regardless of molecular biology and SNP function, all the SNP sets could estimate the relationships in our study population, generating useful kinship matrices. This might be the reason why SNP sets were not significantly different in performance. It has been documented that as few as 50 to 100 SNP genotyping might be sufficient for parentage tests in cattle (Van Eenennaam et al., 2007). All our SNP sets had a much larger number of markers than the number mentioned in the cited study.
Although the accuracies of genomic prediction by biologically informed SNP set were not significantly higher than the HD complete SNP set and biologically uninformed SNP set in this study, these were comparable to the other similar studies in cattle. A study on US Jersey and Holstein bulls tested SNP with presumed functional role and top SNP significantly associated (P-value =<0.05) with the trait “sire conception rate” for genomic prediction accuracy. Using linear kernel and multikernal Gaussian models, they reported 2% to 5% increase in accuracy of genomic prediction with top SNP, while SNP with presumed functional roles produced comparable results to the SNP set having all markers (Abdollahi-Arpanahi et al., 2017). In the above-cited study, the SNP with presumed biological function were the markers present within all genes that were part of GO terms enriched in their previous studies while the current study selected the biologically informed SNP from the specific genes that were identified in previous puberty and fertility-related studies. Daetwyler et al. (2019) used prioritized biologically informed variants from whole-genome sequence of cattle to use for genomic prediction of milk and fertility traits in Australian Red, Holstein, and Jersey breeds and found an average increase in accuracy of genomic prediction by 2.5% across purebreds and all traits. A study on Danish Jersey cattle used biologically informed variants related to milk and fertility traits from previous studies to perform genomic prediction and reported increase in genomic prediction accuracy for milk traits but not for fertility traits (Liu et al., 2020). Warburton et al. (2020) used three techniques (GWAS, meta-analysis, and conditional and joint analysis) to preselect top variants associated with a fertility trait “age at first corpus luteum” in Bos indicus-influenced cattle. The preselected variants were combined with standard 6K, 50K, and 800K genotype panels to predict another fertility trait “reproductive maturity score”. When compared with control 6K, 50K, and 800K SNP panels, they found increase in the accuracy of genomic prediction by ~0% to 9%, ~0% to 2%, and ~0% to 4% through Bayes-R approach when top variants identified through different techniques were combined with standard 6K, 50K, and 800K SNP panels, respectively. A recent study reported that use of the XT-50 genotyping array resulted in more accurate genomic predictions for multiple milk-related traits in cattle populations across different countries (Xiang et al., 2021). Using a multibreed Australian cow population as a reference to predict New Zealand purebred and cross-bred cows for three milk traits, the average increase in accuracy of genomic prediction by XT-50 panel compared to standard 50K panel was 9%, 11%, and 7.9% for pure Holstein, pure Jersey, and cross-bred cows, respectively. The increase in prediction accuracies achieved with biologically informed/preselected SNP sets in our study and other studies cited above, was not significantly higher when compared to standard SNP sets. However, a recent report (Reverter et al., 2022) highlights the impact of even small increases in accuracies of genomic predictions. According to Reverter et al. (2022), a nominal increase of approximately 0.05 in prediction accuracy can translate into an increase in discrimination power between the top and bottom quartiles of predicted EBV that can be useful to the industry.
The comparison of biologically informed SNP set from the current study with the XT-50 study by Xiang et al. (2021) revealed overlapping genomic regions between the two marker sets. The XT-50 panel used by Xiang et al. (2021) included ~50-K sequence level pleiotropic variants that were prioritized due to their biological relevance with 34 traits including fertility traits (daughter pregnancy rate and productive life) using multiple statistical techniques on data of ~44,000 Australian dairy cattle. Interestingly, out of 110-K SNP in the biologically informed SNP set of our study, 100-K SNP were in common with HD SNP obtained from the extended genomic regions (10 kb) around XT-50 variants. This large proportion of SNP in common with XT-50 based SNP set indicates that the biologically informed SNP in the current study tagged the same genomic regions as did the XT-50 genotyping array. This finding suggests that the traits of economic importance investigated in both studies are controlled by similar genomic regions. Further investigation on use of biologically informed SNP set from the current study can highlight the potential of this SNP set to predict other traits of economic importance in beef cattle.
In the scenario, when the SNP effects from one breed were used to calculate GEBV of the other breed, the accuracy of genomic predictions for all traits was drastically reduced to around zero by all SNP sets. Hayes et al. (2009) reported that the accuracy of genomic predictions was decreased to around zero when the prediction estimates of Holstein cattle were used to predict GEBV of Jersey cattle and vice versa. Additional studies also contextualize the challenge of across-breed predictions (Erbe et al., 2012; Kachman et al., 2013; Calus et al., 2014). A simple explanation can be that an SNP in LD with a QTL in one breed might not be in LD with the same QTL in the other breed. The inconsistent LD between SNP and QTL in across breed scenario results in drastic decrease in accuracy of genomic prediction. Further investigations with focus on understanding LD structures of different populations and prioritization of the informative SNP across breed are needed to overcome the problem of low accuracy of genomic prediction arising from the use of prediction equations from one breed to the other.
Conclusion
The results of this study indicate that the genomic regions that were identified as relevant to puberty and fertility traits in Brahman heifers through multi-omics studies were also informative in terms of capturing heritability and predicting heifer fertility traits in Tropical Composite heifers similar to a standard HD SNP chip. Overall, the results reported in this study suggest that genomic prediction for complex female reproductive traits can potentially be improved by customizing the SNP panel for genomic predictions using multi-omics data. Future strategies for customization of SNP for fertility traits might include several GWAS and other omics studies performed in different breeds, to create a panel that is more likely to work for any breed. Given the past decade of research, we are now in a position to aggregate biological information from multiple studies that have targeted female reproduction traits, and this can inform future genomic selection strategies.
Supplementary Material
Acknowledgments
We thank Meat Livestock Australia (MLA) for funding the project “Female Fertility Phenobank (L.GEN 1710)” and Commonwealth Scientific and Industrial Research Organization (CSIRO) for providing PhD Top-up scholarship and computational resources for data analysis.
Glossary
Abbreviations
- FCS
first conception score
- GBLUP
Genomic best linear unbiased prediction
- GEBV
Genomic estimated breeding value
- GRM
genomic relationship matrix
- GWAS
genome-wide association study
- HD
high-density
- h2
heritability of the trait
- LD
linkage disequilibrium
- MAF
minor allele frequency
- PREG1
pregnancy at first mating opportunity
- QTL
quantitative trait loci
- REB
rebreeding score
- SNP
single nucleotide polymorphism
Contributor Information
Muhammad S Tahir, School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia.
Laercio R Porto-Neto, Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia.
Toni Reverter-Gomez, Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia.
Babatunde S Olasege, School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia.
Mirza R Sajid, Department of Statistics, University of Gujrat, 50700 Punjab, Pakistan.
Kimberley B Wockner, Queensland Department of Agriculture and Fisheries, Brisbane 4072, QLD, Australia.
Andre W L Tan, School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia.
Marina R S Fortes, School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia.
Conflict of Interest Statement
The authors declare no real or perceived conflicts of interest.
Literature Cited
- Abdollahi-Arpanahi, R., Morota G., and Penagaricano F.. . 2017. Predicting bull fertility using genomic data and biological information. J. Dairy Sci. 100:9656–9666. doi: 10.3168/jds.2017-13288 [DOI] [PubMed] [Google Scholar]
- Ali, I., Suhail S. M., and Shafiq M.. . 2019. Heritability estimates and genetic correlations of various production and reproductive traits of different grades of dairy cattle reared under subtropical condition. Reprod. Domest. Anim. 54:1026–1033. doi: 10.1111/rda.13458 [DOI] [PubMed] [Google Scholar]
- van den Berg, I., Boichard D., Guldbrandtsen B., and Lund M. S.. . 2016. Using sequence variants in linkage disequilibrium with causative mutations to improve across-breed prediction in dairy cattle: a simulation study. G3: Genes 6:2553–2561. doi: 10.1534/g3.116.027730 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brondum, R. F., Rius-Vilarrasa E., Stranden I., Su G., Guldbrandtsen B., Fikse W. F., and Lund M. S.. . 2011. Reliabilities of genomic prediction using combined reference data of the Nordic Red dairy cattle populations. J. Dairy Sci. 94:4700–4707. doi: 10.3168/jds.2010-3765 [DOI] [PubMed] [Google Scholar]
- Calus, M. P. L., Huang H. Y., Vereijken A., Visscher J., Napel J., and Windig J. J.. . 2014. Genomic prediction based on data from three layer lines: a comparison between linear methods. Genet. Sel. Evol. 46:1–13. doi: 10.1186/s12711-014-0057-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cammack, K. M., Thomas M. G., and Enns R. M.. . 2009. Reproductive traits and their heritabilities in beef cattle. Prof. Anim. Sci. 25:517–528. doi: 10.15232/S1080-7446(15)30753-1 [DOI] [Google Scholar]
- Chen, Y. Q., Zhu J., Lum P. Y., Yang X., Pinto S., MacNeil D. J., Zhang C. S., Lamb J., Edwards S., Sieberts S. K., . et al. 2008. Variations in DNA elucidate molecular networks that cause disease. Nature 452:429–435. doi: 10.1038/nature06757 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daetwyler, H. D., Xiang R., Yuan Z., Bolormaa S., Vander Jagt C. J., Hayes B. J., van der Werf J. H. J., Pryce J. E., Chamberlain A. J., and Macleod I. M.. . 2019. Integration of functional genomics and phenomics into genomic prediction raises its accuracy in sheep and dairy cattle. Proc. Assoc. Adv. Anim. Breeding 23:11–14. [Google Scholar]
- Das, S., Forer L., Schönherr S., Sidore C., Locke A. E., Kwong A., Vrieze S. I., Chew E. Y., Levy S., McGue M., . et al. 2016. Next-generation genotype imputation service and methods. Nat. Genet. 48:1284–1287. doi: 10.1038/ng.3656 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edwards, S. M., Sorensen I. F., Sarup P., Mackay T. F. C., and Sorensen P.. . 2016. Genomic prediction for quantitative traits is improved by mapping variants to gene ontology categories in Drosophila melanogaster. Genetics 203:1871–1883. doi: 10.1534/genetics.116.187161 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erbe, M., Hayes B. J., Matukumalli L. K., Goswami S., Bowman P. J., Reich C. M., Mason B. A., and Goddard M. E.. . 2012. Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels. J. Dairy Sci. 95:4114–4129. doi: 10.3168/jds.2011-5019 [DOI] [PubMed] [Google Scholar]
- Erbe, M., Pimentel E. C. G., Sharifi A. R., and Simianer H.. . 2013. Assessment of cross-validation strategies for genomic prediction in cattle. Proc. WCGALP. Göttingen: Georg-August-University. No. 0553. [Google Scholar]
- Fortes, M. R., Deatley K. L., Lehnert S. A., Burns B. M., Reverter A., Hawken R. J., Boe-Hansen G., Moore S. S., and Thomas M. G.. . 2013. Genomic regions associated with fertility traits in male and female cattle: advances from microsatellites to high-density chips and beyond. Anim. Reprod. Sci. 141:1–19. doi: 10.1016/j.anireprosci.2013.07.002 [DOI] [PubMed] [Google Scholar]
- Goddard, M. 2009. Genomic selection: prediction of accuracy and maximisation of long term response. Genetica 136:245–257. doi: 10.1007/s10709-008-9308-0 [DOI] [PubMed] [Google Scholar]
- de Haas, Y., Calus M. P., Veerkamp R. F., Wall E., Coffey M. P., Daetwyler H. D., Hayes B. J., and Pryce J. E.. . 2012. Improved accuracy of genomic prediction for dry matter intake of dairy cattle from combined European and Australian data sets. J. Dairy Sci. 95:6103–6112. doi: 10.3168/jds.2011-5280 [DOI] [PubMed] [Google Scholar]
- Hawken, R. J., Zhang Y. D., Fortes M. R., Collis E., Barris W. C., Corbet N. J., Williams P. J., Fordyce G., Holroyd R. G., Walkley J. R., . et al. 2012. Genome-wide association studies of female reproduction in tropically adapted beef cattle. J. Anim. Sci. 90:1398–1410. doi: 10.2527/jas.2011-4410 [DOI] [PubMed] [Google Scholar]
- Hayes, B., and Goddard M.. . 2010. Genome-wide association and genomic selection in animal breeding. Genome 53:876–883. doi: 10.1139/G10-076 [DOI] [PubMed] [Google Scholar]
- Hayes, B. J., Bowman P. J., Chamberlain A. C., Verbyla K., and Goddard M. E.. . 2009. Accuracy of genomic breeding values in multi-breed dairy cattle populations. Genet. Sel. Evol. 41:51. doi: 10.1186/1297-9686-41-51 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kachman, S. D., Spangler M. L., Bennett G. L., Hanford K. J., Kuehn L. A., Snelling W. M., Thallman R. M., Saatchi M., Garrick D. J., Schnabel R. D., . et al. 2013. Comparison of molecular breeding values based on within- and across-breed training in beef cattle. Genet. Sel. Evol. 45:30. doi: 10.1186/1297-9686-45-30 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kinder, J. E., and Werth L. A.. . 1991. Management strategies for improving the re-breeding of the cow. Proc. Range beef cow symposium. Colorado, US:Fort Collins; p. 249. [Google Scholar]
- Koufariotis, L., Chen Y. P., Bolormaa S., and Hayes B. J.. . 2014. Regulatory and coding genome regions are enriched for trait associated variants in dairy and beef cattle. BMC Genomics 15:436. doi: 10.1186/1471-2164-15-436 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kulminski, A. M. 2011. Complex phenotypes and phenomenon of genome-wide inter-chromosomal linkage disequilibrium in the human genome. Exp. Gerontol. 46:979–986. doi: 10.1016/j.exger.2011.08.010 [DOI] [PubMed] [Google Scholar]
- Li, B., Zhang N., Wang Y. -G., George A. W., Reverter A., and Li Y.. . 2018. Genomic prediction of breeding values using a subset of SNPs identified by three machine learning methods. Front. Genet. 9:237. doi: 10.3389/fgene.2018.00237 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ling, A. S., Hay E. H., Aggrey S. E., and Rekaya R.. . 2021. Dissection of the impact of prioritized QTL-linked and -unlinked SNP markers on the accuracy of genomic selection. BMC Genom. Data 22:26. doi: 10.1186/s12863-021-00979-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu, A., Lund M. S., Boichard D., Karaman E., Fritz S., Aamand G. P., Nielsen U. S., Wang Y., and Su G.. . 2020. Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data. Heredity 124:37–49. doi: 10.1038/s41437-019-0246-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loh, P. R., Danecek P., Palamara P. F., Fuchsberger C., Reshef Y. A., Finucane H. K., Schoenherr S., Forer L., McCarthy S., Abecasis G. R., . et al. 2016. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48:1443–1448. doi: 10.1038/ng.3679 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen, L. T. 2018. Using transcriptomic and proteomic analysis to characterize the biological basis of puberty in Brahman heifers. Thesis (Ph.D.). Australia: The University of Queensland. [Google Scholar]
- Nguyen, L. T., Reverter A., Canovas A., Venus B., Islas-Trejo A., Porto-Neto L. R., Lehnert S. A., Medrano J. F., Moore S. S., and Fortes M. R.. . 2017. Global differential gene expression in the pituitary gland and the ovaries of pre- and postpubertal Brahman heifers. J. Anim. Sci. 95:599–615. doi: 10.2527/jas.2016.0921 [DOI] [PubMed] [Google Scholar]
- Porto-Neto, L. R., Barendse W., Henshall J. M., McWilliam S. M., Lehnert S. A., and Reverter A.. . 2015. Genomic correlation: harnessing the benefit of combining two unrelated populations for genomic selection. Genet. Sel. Evol. 47:84. doi: 10.1186/s12711-015-0162-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raymond, B., Bouwman A. C., Schrooten C., Houwing-Duistermaat J., and Veerkamp R. F.. . 2018. Utility of whole-genome sequence data for across-breed genomic prediction. Genet. Sel. Evol. 50:27. doi: 10.1186/s12711-018-0396-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reverter, A., Alexandre P. A., Li Y., Hine B. C., Duff C. J., Ingham A. B., and Porto-Neto L. R.. . 2022. Genomic prediction accuracy: how low can we go? In: World congress on genetics applied to livestock production, 25-005. Rotterdam Netherlands:Wageningen Academic Publisher. [Google Scholar]
- Rezende, F. M., Nani J. P., and Penagaricano F.. . 2019. Genomic prediction of bull fertility in US Jersey dairy cattle. J. Dairy Sci. 102:3230–3240. doi: 10.3168/jds.2018-15810 [DOI] [PubMed] [Google Scholar]
- Sarup, P., Jensen J., Ostersen T., Henryon M., and Sorensen P.. . 2016. Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs. BMC Genet. 17:1–16. doi: 10.1186/s12863-015-0322-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schatz, T. J., and Hearnden M. N.. . 2008. Heifer fertility on commercial cattle properties in the Northern Territory. Aust. J. Exp. Agric. 48:940–944. doi: 10.1071/ea08057 [DOI] [Google Scholar]
- Silva, R. M. O., Fragomeni B. O., Lourenco D. A. L., Magalhães A. F. B., Irano N., Carvalheiro R., Canesin R. C., Mercadante M. E. Z., Boligon A. A., Baldi F. S., . et al. 2016. Accuracies of genomic prediction of feed efficiency traits using different prediction and validation methods in an experimental Nelore cattle population. J. Anim. Sci. 94:3613–3623. doi: 10.2527/jas.2016-0401 [DOI] [PubMed] [Google Scholar]
- Song, H. L., Ye S. P., Jiang Y. F., Zhang Z., Zhang Q., and Ding X. D.. . 2019. Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs. Genet. Sel. Evol. 51:1–13. doi: 10.1186/s12711-019-0500-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tahir, M. S., Nguyen L. T., Schulz B. L., Boe-Hansen G. A., Thomas M. G., Moore S. S., Lau L. Y., and Fortes M. R. S.. . 2019. Proteomics recapitulates ovarian proteins relevant to puberty and fertility in brahman heifers (Bos indicus L.). Genes (Basel) 10:923. doi: 10.3390/genes10110923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tahir, M. S., Olasege B. S., Wockner K., Tan W. L. A., Porto-Neto L. R., and Fortes M. R. S.. . 2022. Effect of informative polymorphism on accuracy of genomic prediction and heritability. 08:004. In: Calus M., editor. World congress on genetics applied to livestock production. Rotterdam, Netherlands: Wageningen Academic Publishers. [Google Scholar]
- Tahir, M. S., Porto-Neto L. R., Gondro C., Shittu O. B., Wockner K., Tan A. W. L., Smith H. R., Gouveia G. C., Kour J., and Fortes M. R. S.. . 2021. Meta-analysis of heifer traits identified reproductive pathways in Bos indicus Cattle. Genes 12:768. doi: 10.3390/genes12050768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Toghiani, S., Hay E., Sumreddee P., Geary T. W., Rekaya R., and Roberts A. J.. . 2017. Genomic prediction of continuous and binary fertility traits of females in a composite beef cattle breed. J. Anim. Sci. 95:4787–4795. doi: 10.2527/jas2017.1944 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Eenennaam, A. L., Weaber R. L., Drake D. J., Penedo M. C. T., Quaas R. L., Garrick D. J., and Pollak E. J.. . 2007. DNA-based paternity analysis and genetic evaluation in a large, commercial cattle ranch setting. J. Anim. Sci. 85:3159–3169. doi: 10.2527/jas.2007-0284 [DOI] [PubMed] [Google Scholar]
- Warburton, C. L., Engle B. N., Ross E. M., Costilla R., Moore S. S., Corbet N. J., Allen J. M., Laing A. R., Fordyce G., and Lyons R. E.. . 2020. Use of whole-genome sequence data and novel genomic selection strategies to improve selection for age at puberty in tropically-adapted beef heifers. Genet. Sel. Evol. 52:1–13. doi: 10.1186/s12711-020-00547-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weigel, K. A., de los Campos G., Gonzalez-Recio O., Naya H., Wu X. L., Long N., Rosa G. J., and Gianola D.. . 2009. Predictive ability of direct genomic values for lifetime net merit of Holstein sires using selected subsets of single nucleotide polymorphism markers. J. Dairy Sci. 92:5248–5257. doi: 10.3168/jds.2009-2092 [DOI] [PubMed] [Google Scholar]
- Weller, J. I., Ezra E., and Ron M.. . 2017. Invited review: a perspective on the future of genomic selection in dairy cattle. J. Dairy Sci. 100:8633–8644. doi: 10.3168/jds.2017-12879 [DOI] [PubMed] [Google Scholar]
- Xiang, R., Berg I. V. D., MacLeod I. M., Hayes B. J., Prowse-Wilkins C. P., Wang M., Bolormaa S., Liu Z., Rochfort S. J., Reich C. M., . et al. 2019. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc. Natl. Acad. Sci. U.S.A. 116:19398–19408. doi: 10.1073/pnas.190415911 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiang, R., MacLeod I. M., Daetwyler H. D., de Jong G., O’Connor E., Schrooten C., Chamberlain A. J., and Goddard M. E.. . 2021. Genome-wide fine-mapping identifies pleiotropic and functional variants that predict many traits across global cattle populations. Nat. Commun. 12:860–860. doi: 10.1038/s41467-021-21001-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang, J., Lee S. H., Goddard M. E., and Visscher P. M.. . 2011. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88:76–82. doi: 10.1016/j.ajhg.2010.11.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yin, L., Zhang H., Zhou X., Yuan X., Zhao S., Li X., and Liu X.. . 2020. KAML: improving genomic prediction accuracy of complex traits using machine learning determined parameters. Genome Biol. 21:146. doi: 10.1186/s13059-020-02052-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zak, L. J., Gaustad A. H., Bolarin A., Broekhuijse M., Walling G. A., and Knol E. F.. . 2017. Genetic control of complex traits, with a focus on reproduction in pigs. Mol. Reprod. Dev. 84:1004–1011. doi: 10.1002/mrd.22875 [DOI] [PubMed] [Google Scholar]
- Zhang, Y. D., Johnston D. J., Bolormaa S., Hawken R. J., and Tier B.. . 2014. Genomic selection for female reproduction in Australian tropically adapted beef cattle. Anim. Prod. Sci. 54:16–24. doi: 10.1071/AN13016 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


