Abstract
Sheep milk is the most important feed resource for newborn lambs and an important food resource for humans. Sheep milk production and ingredients are influenced by genetic and environmental factors. In this study, we implemented selection signature analysis using Illumina Ovine SNP50 BeadChip data of 78 meat Lacaune and 103 milk Lacaune sheep, which have similar genetic backgrounds, from the Sheep HapMap project to identify candidate genes related to ovine milk traits. Since different methods can detect different variation types and complement each other, we used a haplotype-based method (hapFLK) to implement selection signature analysis. The results revealed six selection signature regions showing signs of being selected (): chromosomes 1, 2, 3, 6, 13 and 18. In addition, 38 quantitative trait loci (QTLs) related to sheep milk performance were identified in selection signature regions, which contain 334 candidate genes. Of those, SUCNR1 (succinate receptor 1) and PPARGC1A (PPARG coactivator 1 alpha) may be the most significant genes that affect sheep milking performance, which supply a significant indication for future studies to investigate candidate genes that play an important role in milk production and quality.
1. Introduction
Sheep have been raised for milk for thousands of years, which is much longer than cow milk production (Zervas and Tsiplakou, 2011). Sheep milk and its products are widely consumed in some parts of the world, especially in the Mediterranean. Sheep milk has a high degree of similarity to human milk in total fatty acid composition, which makes it a good raw material for infant formula production (Martin et al., 2016). In the sheep industry, prolific sheep usually cannot lactate enough milk for lambs, which could decrease lamb survival rate (Bradford, 1985). A milk replacer is sometimes used as alternative, but it is costly and labor-intensive. Therefore, it is important to identify genes related to sheep milk and then genetically improve sheep milk performance, which will obtain a better profitability in the sheep industry and diversify human milk resources.
Specialized strains of livestock have been cultivated by humans in long-standing husbandry practices; artificial and natural selection have imposed detectable selection signatures within genomes. These selection signatures can provide deep insights into selection mechanism and further uncover the causal genes related to relevant phenotypes. In sheep, selection signature analyses of closely related populations with divergent production purposes were successfully implemented in milk traits (Moioli et al., 2013), tail types (Moradi et al., 2012; Moioli et al., 2015; Yuan et al., 2017), gastrointestinal nematode-resistant traits (McRae et al., 2014) and reproductive traits (El-Halawany et al., 2016).
In order to find genes associated with ovine milk traits, researchers have conducted a selection signature analysis between five non-milk sheep breeds and five milk sheep breeds, and they have identified some milk-related genes such as ABCG2 and SPP1 (Gutierrez-Gil et al., 2014). However, this previous study still had some limitations in experimental design. First, the sheep breeds that they have chosen have different characteristics not only in milk performance but also in other phenotypic differences, such as in wool traits, which could lead to some false positives of the candidate genes. Meanwhile, this study analyzed milk Lacaune and meat Lacaune genome only using site frequency-based methods, although it has been suggested that the regression-based selection mapping approach is more accurate than that of haplotype-based analysis methods (e.g., extended haplotype homozygosity, EHH; integrated haplotype score, iSH) (Wiener and Pong-Wong, 2011). However, different methods can detect different variation types and complement each other, which could accurately and comprehensively reveal the selection signature that exists within the genome. With meat and milk Lacaune from the Sheep HapMap project (http://www.sheephapmap.org/hapmap.php, last access: 6 August 2019), we used a haplotype-based method (hapFLK) and considered population stratification to conduct the selection signatures searching within the sheep genome. We further identified genes related to sheep milk and then provided a potential theoretical basis for sheep breeding.
2. Materials and methods
2.1. Experimental data pre-processing
The Illumina Ovine SNP50 BeadChip data of 78 meat-purpose Lacaune sheep and 103 milk-purpose Lacaune sheep were downloaded from the Sheep HapMap project database (http://www.sheephapmap.org/, last access: 6 August 2019). The detailed description of these data was well done by Kijas et al. (2012). To facilitate subsequent gene annotation, the PLINK 1.07 software (Purcell et al., 2007) was used to upgrade the map file to match the sheep genome Oar_v3.1 and implement data quality control. Single-nucleotide polymorphisms (SNPs) were excluded from the subsequent analysis: (1) call rate 90 %, (2) minor allele frequency (MAF) 0.01 and (3) significantly deviated from Hardy–Weinberg equilibrium ().
2.2. Population structure analysis
The filtered SNPs were pruned using the indep-pairwise option (plink – file input – indep-pairwise 50 5 0.1) in PLINK 1.07 software (Purcell et al., 2007) to avoid the strong influence of SNP clusters in principal component analysis (PCA) and relatedness analysis. PCA identifies the principal components representing the population structure based on genetic correlations (shared identity-by-state segments) among individuals. The PCA was implemented using the snpStats R package (https://www.rdocumentation.org/packages/snpStats, last access: 6 August 2019). To verify PCA results, ADMIXTURE (Alexander et al., 2009) was implemented. ADMIXTURE estimates ancestry in a model-based manner from large autosomal SNP genotype datasets, and it includes a cross-validation procedure that allows the user to identify the number of presumed ancestral populations () for which the model has best predictive accuracy. In this study, was set from 1 to 5, and a 10-fold cross-validation procedure was performed.
2.3. Genome scans for selection signatures using hapFLK
The hapFLK statistics detect selection signatures based on differences of haplotype frequencies between populations (Fariello et al., 2013). Considering the population stratification of samples, the hapFLK method, which is based on haplotype frequency and considers population stratification, was used in this experiment. We used hapFLK software to compute the hapFLK statistic and kinship matrix assuming 10 clusters in the fastPHASE model and used 98 Australia Poll Merino samples as an outgroup (http://www.sheephapmap.org/hapmap.php, last access: 6 August 2019). Then the hapFLK statistic was computed as the average across 20 expectation–maximization (EM) iterations to fit the LD model. However, the hapFLK statistic does not strictly follow any of the existing statistical distributions. To investigate the distribution of the hapFLK statistics, we plot a histogram of the hapFLK statistic. Then, we standardized hapFLK following Eq. (1):
1 |
where SD (raw hapFLK) is standard deviation of raw hapFLK. Thus, the standardized hapFLK ( scores) roughly follows a standardized normal distribution. Finally, we computed the value for each SNP according to this standardized normal distribution.
2.4. Gene annotation
Candidate regions identified by hapFLK were annotated using ovine reference genome (Oar_v3.1). Gene function was annotated using the National Center for Biotechnology Information Gene (http://www.ncbi.nlm.nih.gov/gene/, last access: 6 August 2019), which was used for Gene Ontology (GO) analysis. The sheep QTL database (http://www.animalgenome.org/cgi-bin/QTLdb/OA/index, last access: 2 May 2019) was used to find whether the known milk-related QTLs are located in selection signature regions. In addition, the online database OMIM (http://www.ncbi.nlm.nih.gov/omim/, last access: 6 August 2019) and genomic information from other species, including humans, mice and bovines, were used to predict gene function.
2.5. GO and KEGG enrichment analysis
To extract biological meanings from the list of candidate genes, GO enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed using the OmicShare tools (https://www.omicshare.com/tools/, last access: 6 August 2019). For GO enrichment, all candidate genes were mapped to GO terms in the Gene Ontology database. Gene numbers were calculated for every term, and significantly enriched GO terms in candidate genes compared to the genome background were defined by a hypergeometric test. The value calculated from hypergeometric distribution follows Eq. (2):
2 |
where is the number of all GO-annotated genes; is the number of candidate genes in ; is the number of particular GO-term-annotated genes in ; is the number of particular GO-term-annotated genes identified by selection signature in . For pathway enrichment analysis, significantly enriched pathways in candidate genes compared to the genome background were also defined by a hypergeometric test. The calculated value went through fast discovery rate (FDR) correction following Eq. (3):
3 |
where is the raw value, is the number of tests and rank is the rank for the specific raw value. Taking FDR 0.05, GO terms and pathways meeting this condition were defined as significantly enriched for candidate genes.
3. Results and discussions
3.1. Population genetic structure
A total of 46 781 SNPs and 171 individuals were selected for further analysis after quality control. After implementing LD pruning, 13 935 SNPs with low LD were used for PCA and ADMIXTURE analysis. The PCA results showed that all animals can be divided into two groups by the first principal component (PC1): milk and non-milk sheep (Fig. 1a). The first two principal components (PC1 and PC2) can divide these samples into four subgroups (Fig. 1a) and explained 2.8 % and 1.7 % of the variance respectively (Fig. 1b). The results from the ADMIXTURE analysis showed that the least amount of cross-validation error occurred when (Fig. 1c) indicating that was the optimal modeling choice. Therefore, these samples could be appropriately divided into four subgroups (Fig. 1d), which was consistent with PCA.
3.2. Genome scans for selection using hapFLK
The hapFLK histogram shows that the hapFLK statistic approximately follows a normal distribution (Fig. 2, top right), which is similar to the previous study (Kijas, 2014; Yuan et al., 2017). Therefore, the values could be calculated from the normal distribution. Negative values plotted in genomic order revealed six regions under strong selection (Fig. 2). The genomic location, size, peak SNP and peak genes in the selection regions identified using Oar_v3.1 (Jiang et al., 2014) were summarized in Table S1 in the Supplement. The average size of selective regions was 7.97 Mb ranging from 4.33 to 17.00 Mb. These six selected regions in this study were compared with the six convergence candidate regions (CCRs) identified by Gutierrez Gil et al. (2014). However, only an overlapping region (Chr6: 38.64–43.02 Mb) was found, and the majority of selection regions are not overlapping. A similar situation also appeared in the sheep-tail-type selection signature analyses (Moradi et al., 2012; Moioli et al., 2015). There are several reasons that may explain this result: (1) only one breed was analyzed in the current study, while 10 breeds were incorporated in a previous study (Gutierrez-Gil et al., 2014); (2) the methods between this study and Gutierrez-Gil et al. (2014) were different, which can detect different variants; (3) the low SNP density in both the current study and Gutierrez-Gil et al. (2014) might lead to a low statistical power (Simianer et al., 2014). These results suggested that separate populations selected for similar breeding goals have the low repeatability of selection signature analysis results.
3.3. Genes and functional annotations
To understand the function of these selection regions, we mapped them to the sheep genome Oar_v3.1 and sheep QTL database (http://www.animalgenome.org/cgi-bin/QTLdb/OA/index, last access: 2 May 2019). As a result, 38 QTLs related to milk traits (Table 1) and 334 candidate genes (Table S1) were identified in selected regions. These 38 milk-related QTLs were enriched in all milk-related QTLs in the sheep QTL database with a hypergeometric test of (significantly different from what is expected by chance). Significantly, the majority of milk-related QTLs (27 out of 38) were located at chromosome 1 (Chr1: 228.88–245.88 Mb). In detail, these 27 QTLs were linked to the percentage of milk fat and milk protein as well as yield of milk protein and milk fat in sheep (Sutera et al., 2019; Hao et al., 2019) suggesting that selection region on chromosome 1 may play an important role in sheep milk performance. In chromosome 13 selection region (Chr13: 43.57–53.55 Mb), two copy number variation regions (CNVRs) (Chr13: 48.83–49.71 Mb, Chr13: 49.01–49.71 Mb) were identified to be significantly () associated with milk yield in Valle del Belice sheep (Di Gerlando et al., 2019). However, there was no known milk-related QTL in chromosome 18 selection region (Chr18: 37.95–42.60 Mb). This may be the relatively poor annotation of the current sheep QTL database (release 38), and this will be addressed when more milk-related QTLs are identified in the future.
Table 1.
Region | Chromosome | Start | End | Size | Sheep QTL |
---|---|---|---|---|---|
(Mb) | (Mb) | (Mb) | |||
1 | 1 | 228.88 | 245.88 | 17.00 | 170 224 (Chr1: 233736829–233736869, MF) (Sutera et al., 2019) |
170 226 (Chr1: 233736829–233736869, PP) (Sutera et al., 2019) | |||||
169 252 (Chr1: 236278074–236278174, PY) (Hao et al., 2019) | |||||
169 251 (Chr1: 236278074–236278174, FY) (Hao et al., 2019) | |||||
169 400 (Chr1: 236299569–236299669, FY) (Hao et al., 2019) | |||||
169 150 (Chr1: 236964320–236964420, FY) (Hao et al., 2019) | |||||
169 182 (Chr1: 237198394–237198494, FY) (Hao et al., 2019) | |||||
169 181 (Chr1: 237198394–237198494, MY) (Hao et al., 2019) | |||||
169 180 (Chr1: 237198394–237198494, PY) (Hao et al., 2019) | |||||
169 524 (Chr1: 237368505–237368605, FY) (Hao et al., 2019) | |||||
169 208 (Chr1: 237476668–237476768, MY) (Hao et al., 2019) | |||||
169 207 (Chr1: 237476668–237476768, PY) (Hao et al., 2019) | |||||
169 206 (Chr1: 237476668–237476768, FY) (Hao et al., 2019) | |||||
169 442 (Chr1: 237702646–237702746, FY) (Hao et al., 2019) | |||||
169 168 (Chr1: 237899096–237899196, MY) (Hao et al., 2019) | |||||
169 167 (Chr1: 237899096–237899196, PY) (Hao et al., 2019) | |||||
169 166 (Chr1: 237899096–237899196, FY) (Hao et al., 2019) | |||||
169 446 (Chr1: 240960592–240960692, MY) (Hao et al., 2019) | |||||
169 445 (Chr1: 240960592–240960692, FY) (Hao et al., 2019) | |||||
169 280 (Chr1: 241703508–241703608, PY) (Hao et al., 2019) | |||||
169 279 (Chr1: 241703508–241703608, FY) (Hao et al., 2019) | |||||
169 499 (Chr1: 242397140–242397240, FY) (Hao et al., 2019) | |||||
169 388 (Chr1: 242789285–242789385, FY) (Hao et al., 2019) | |||||
169 387 (Chr1: 242789285–242789385, PY) (Hao et al., 2019) | |||||
169 386 (Chr1: 242789285–242789385, MY) (Hao et al., 2019) | |||||
169 551 (Chr1: 243458586–243458686, FY) (Hao et al., 2019) | |||||
|
|
|
|
|
169 144 (Chr1: 243778419–243778519, FY) (Hao et al., 2019) |
2 | 2 | 34.95 | 39.28 | 4.33 | 169 594 (Chr2: 37635669–37635769, MY) (Hao et al., 2019) |
13 911 (Chr2: 37102076–37260066, PP) (Gutierrez-Gil et al., 2009) | |||||
57 738 (Chr2: 32023745–207420807, PP) (Garcia-Gamez et al., 2013) | |||||
|
|
|
|
|
13 915 (Chr2: 8804882–248905321, MF) (Gutierrez-Gil et al., 2009) |
3 |
3 |
91.37 |
97.96 |
6.59 |
57 740 (Chr3: 97143203–97187127, MY) (Garcia-Gamez et al., 2013) |
4 | 6 | 38.64 | 43.94 | 5.30 | 169 477 (Chr6: 41850279–41850379, FY) (Hao et al., 2019) |
13 818 (Chr6: 43152047–43302377, MY) (Arnyasi et al., 2009) | |||||
13 819 (Chr6: 43152047–43302377, MLACT) (Arnyasi et al., 2009) | |||||
13 820 (Chr6: 43152047–43302377, MY) (Arnyasi et al., 2009) | |||||
|
|
|
|
|
13 821 (Chr6: 43152047–43302377, MLACT) (Arnyasi et al., 2009) |
5 |
13 |
43.57 |
53.55 |
9.98 |
169 479 (Chr13: 45264465–45264465, FY) (Hao et al., 2019) |
6 | 18 | 37.95 | 42.60 | 4.65 | – |
Note: milk fat percentage – MF; milk protein percentage – PP; milk protein yield – PY; milk fat yield – FY; milk yield – MY; milk lactose yield – MLACT.
Of those candidate genes, succinate receptor 1 (Chr1: 234.11–234.16 Mb, SUCNR1) has been reported as close to the SNP rs417079368 (Chr1: 233.59 Mb), which was significantly () associated with milk fat percentage and protein percentage in Valle del Belice sheep (Sutera et al., 2019). Its ligand, succinate, plays an important role not only in adenosine triphosphate generation (Littlewood-Evans et al., 2016) but also in signalling transduction by binding to and activating its specific receptor, SUCNR1 (also known as G-protein-coupled receptor-91, GPR91) (Mu et al., 2017). SUCNR1 is expressed in multi-tissues and organs in sheep, such as the omentum, spleen, liver and mammary gland (Clark et al., 2017). Additionally, the expression of SUCNR1 is related to milk protein trait in sheep (Suarez-Vega et al., 2016). PPARGC1A (Chr6: 43.23–43.33 Mb, PPARG coactivator 1 alpha) located near a multi-effect milk-related QTL region (Chr6: 43.15–43.30 Mb) (Arnyasi et al., 2009). Many scholars have shown that this gene was associated with milk production, milk fat percentage, and other milk-related properties (Khatib et al., 2007; Weikard et al., 2005; Schennink et al., 2009; Cong et al., 2016).
In order to further extract known biological meanings from these 334 candidate genes, GO and KEGG enrichment analyses were implemented using the OmicShare tools (https://www.omicshare.com/tools/, last access: 6 August 2019). GO analysis has shown that these candidate genes participated in 47 second-level GO categories (Fig. 3) and significant (FDR 0.05) enrichment in 12 GO terms (Table S1). The highest numbers of genes of second-level GO categories of biological processes (BPs), cellular components (CCs) and molecular function (MFs) are cellular process (GO: 0009987, biological process, 221 genes), cell (GO: 0005623, cellular components, 241 genes), cell part (GO: 0044464, cellular components, 241 genes) and binding (GO: 0005488, molecular function, 189 gene). Previously, genes involved in these second-level GO terms showed strong associations with ruminant milk productivities. For example, it has been reported that genes located in bovine milk yield QTL regions are preferred to enrichment in cellular process (GO: 000987), cell (GO: 0005623), cell part (GO: 0044464) and cellular process (GO: 0009987) (Salih and Adelson, 2009). Further, genes involved in cellular process (GO: 0009987) were found to be associated with fat yield, milk yield, protein yield and fertility index in Nordic red cattle (Iso-Touru et al., 2016). Also, miRNA target genes of goat mammary gland were enriched in cellular processes (GO: 000987, biological process), cell (GO: 0005623, cellular component), cell part (GO: 0044464, cellular component) and cellular process (GO: 0005488, molecular function) (Ji et al., 2012). All these findings support the candidate genes identified in the current study contribute to fundamental physiology of dairy sheep.
The most significant GO term of CC is cytoplasm (GO: 0005737, 6.93E10-8, FDR ). The most significant GO term of MF was the G-protein coupled nucleotide receptor activity (GO: 0001608, , FDR ) and G-protein coupled purinergic nucleotide receptor activity (GO: 0045028, , FDR ). The finding that the cytoplasm GO term (GO: 0005737) was enriched in our gene set is interesting. Previous studies have reported that candidate genes associated with milk protein composition traits in a Chinese Holstein population were significantly (FDR 0.0247) enriched in cytoplasm (GO: 0005737) (Zhou et al., 2019). Apart from GO analysis, KEGG analysis showed that candidate genes could be annotated to 36 KEGG classes (Fig. S1) and could participate in 173 pathways. The highest number of genes of KEGG categories was signal transduction (29 genes). However, no significant pathways were found.
4. Conclusions
Based on haplotype-based methods, the current study has six significant selection regions, which contains 38 known QTLs associated with milk yield. The identified six selection regions harbored 334 candidate genes. Some of the key candidate genes such as SUCNR1 and PPARGC1A may play an important role in sheep milk performance. The findings from this study can be useful to optimize breeding programs to improve the milk-related traits after further functional studies and validation of the association in other independent populations.
Supplement
Acknowledgements
The authors gratefully acknowledge Ruidong Xiang for language editing.
Contributor Information
Fadi Li, Email: lifd@lzu.edu.cn.
Xiangpeng Yue, Email: lexp@lzu.edu.cn.
Data availability
The data of the paper are available from Sheep HapMap project database (, International Sheep Genomics Consortium, 2019).
Author contributions
FL and XP designed the experiment and edited the paper. ZY analyzed the data and wrote the paper. WL discussed the results.
Competing interests
The authors declare that they have no conflict of interest.
Financial support
This research has been supported by the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (grant no. 2015BAD03B05), the Program for Changjiang Scholars and Innovative Research Team in University (grant no. IRT13019), and the China Agriculture Research System (grant no. CARS-38).
Review statement
This paper was edited by Steffen Maak and reviewed by Majid Khansefid and one anonymous referee.
References
- Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–1664. doi: 10.1101/gr.094052.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arnyasi M, Komlosi I, Lien S, Czegledi L, Nagy S, Javor A. Searching for DNA markers for milk production and composition on chromosome 6 in sheep. J Anim Breed Genet. 2009;126:142–147. doi: 10.1111/j.1439-0388.2008.00764.x. [DOI] [PubMed] [Google Scholar]
- Bradford GE. Selection for litter size – Genetics of Reproduction in Sheep – Chapter 1. Genet Reproduct Sheep. 1985;41:3–18. [Google Scholar]
- Clark EL, Bush SJ, McCulloch MEB, Farquhar IL, Young R, Lefevre L, Pridans C, Tsang HG, Wu C, Afrasiabi C, Watson M, Whitelaw CB, Freeman TC, Summers KM, Archibald AL, Hume DA. A high resolution atlas of gene expression in the domestic sheep (Ovis aries) PLoS Genet. 2017;13:e1006997. doi: 10.1371/journal.pgen.1006997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cong L, Sun D, Zhang S, Yang S, Alim MA, Qin Z, Li Y, Lin L. Genetic effects of FASN, PPARGC1A, ABCG2 and IGF1 revealing the association with milk fatty acids in a Chinese Holstein cattle population based on a post genome-wide association study. BMC Genet. 2016;17:e110. doi: 10.1186/s12863-016-0418-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Di Gerlando R, Sutera AM, Mastrangelo S, Tolone M, Portolano B, Sottile G, Bagnato A, Strillacci MG, Sardina MT. Genome-wide association study between CNVs and milk production traits in Valle del Belice sheep. PLoS One. 2019;14:e0215204. doi: 10.1371/journal.pone.0215204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- El-Halawany N, Zhou X, Al-Tohamy AF, El-Sayd YA, Shawky AE, Michal JJ, Jiang Z. Genome-wide screening of candidate genes for improving fertility in Egyptian native Rahmani sheep. Anim Genet. 2016;47:513. doi: 10.1111/age.12437. [DOI] [PubMed] [Google Scholar]
- Fariello MI, Boitard S, Naya H, SanCristobal M, Servin B. Detecting signatures of selection through haplotype differentiation among hierarchically structured populations. Genetics. 2013;193:929–941. doi: 10.1534/genetics.112.147231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garcia-Gamez E, Gutierrez-Gil B, Suarez-Vega A, de la Fuente LF, Arranz JJ. Identification of quantitative trait loci underlying milk traits in Spanish dairy sheep using linkage plus combined linkage disequilibrium and linkage analysis approaches. J Dairy Sci. 2013;96:6059–6069. doi: 10.3168/jds.2013-6824. [DOI] [PubMed] [Google Scholar]
- Gutierrez-Gil B, Arranz JJ, Pong-Wong R, Garcia-Gamez E, Kijas J, Wiener P. Application of selection mapping to identify genomic regions associated with dairy production in sheep. PLoS One. 2014;9:e94623. doi: 10.1371/journal.pone.0094623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gutierrez-Gil B, El-Zarei MF, Alvarez L, Bayon Y, de la Fuente LF, San Primitivo F, Arranz JJ. Quantitative trait loci underlying milk production traits in sheep. Anim Genet. 2009;40:423–434. doi: 10.1111/j.1365-2052.2009.01856.x. [DOI] [PubMed] [Google Scholar]
- Hao L, Xiaolin W, Richard GTJ, Bauck S, Thomas DL, Murphy TW, Rosa GJM. Genome-wide association study of milk production traits in a crossbred dairy sheep population using three statistical models. Wisconsin: Department of Animal Sciences, University of Wisconsin; 2019. [DOI] [PubMed] [Google Scholar]
- The Illumina Ovine SNP50 BeadChip data of 78 meat-purpose and 103 milk-purpose Lacaune sheep. International Sheep Genomics Consortium; 2019. [last access: 6 August 2019]. https://www.sheephapmap.org/ [Google Scholar]
- Iso-Touru T, Sahana G, Guldbrandtsen B, Lund MS, Vilkki J. Genome-wide association analysis of milk yield traits in Nordic Red Cattle using imputed whole genome sequence variants. BMC Genet. 2016;17:e55. doi: 10.1186/s12863-016-0363-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ji Z, Wang G, Xie Z, Zhang C, Wang J. Identification and characterization of microRNA in the dairy goat (Capra hircus) mammary gland by Solexa deep-sequencing technology. Mol Biol Rep. 2012;39:9361–9371. doi: 10.1007/s11033-012-1779-5. [DOI] [PubMed] [Google Scholar]
- Jiang Y, Xie M, Chen W, Talbot R, Maddox JF, Faraut T, Wu C, Muzny DM, Li Y, Zhang W, Stanton JA, Brauning R, Barris WC, Hourlier T, Aken BL, Searle SMJ, Adelson DL, Bian C, Cam GR, Chen Y, Cheng S, DeSilva U, Dixen K, Dong Y, Fan G, Franklin IR, Fu S, Guan R, Highland MA, Holder ME, Huang G, Ingham AB, Jhangiani SN, Kalra D, Kovar CL, Lee SL, Liu W, Liu X, Lu C, Lv T, Mathew T, McWilliam S, Menzies M, Pan S, Robelin D, Servin B, Townley D, Wang W, Wei B, White SN, Yang X, Ye C, Yue Y, Zeng P, Zhou Q, Hansen JB, Kristensen K, Gibbs RA, Flicek P, Warkup CC, Jones HE, Oddy VH, Nicholas FW, McEwan JC, Kijas J, Wang J, Worley KC, Archibald AL, Cockett N, Xu X, Wang W, Dalrymple BP. The sheep genome illuminates biology of the rumen and lipid metabolism. Science. 2014;344:1168–1173. doi: 10.1126/science.1252806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khatib H, Zaitoun I, Wiebelhaus-Finger J, Chang YM, Rosa GJ. The association of bovine PPARGC1A and OPN genes with milk composition in two independent Holstein cattle populations. J Dairy Sci. 2007;90:2966–2970. doi: 10.3168/jds.2006-812. [DOI] [PubMed] [Google Scholar]
- Kijas JW. Haplotype-based analysis of selective sweeps in sheep. Genome. 2014;57:433–437. doi: 10.1139/gen-2014-0049. [DOI] [PubMed] [Google Scholar]
- Kijas JW, Lenstra JA, Hayes B, Boitard S, Porto Neto LR, San Cristobal M, Servin B, McCulloch R, Whan V, Gietzen K, Paiva S, Barendse W, Ciani E, Raadsma H, McEwan J, Dalrymple B. Genome-wide analysis of the world's sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol. 2012;10:e1001258. doi: 10.1371/journal.pbio.1001258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Littlewood-Evans A, Sarret S, Apfel V, Loesle P, Dawson J, Zhang J, Muller A, Tigani B, Kneuer R, Patel S, Valeaux S, Gommermann N, Rubic-Schneider T, Junt T, Carballido JM. GPR91 senses extracellular succinate released from inflammatory macrophages and exacerbates rheumatoid arthritis. J Exp Med. 2016;213:1655–1662. doi: 10.1084/jem.20160061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin CR, Ling PR, Blackburn GL. Review of Infant Feeding: Key Features of Breast Milk and Infant Formula. Nutrients. 2016;8:e279. doi: 10.3390/nu8050279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McRae KM, McEwan JC, Dodds KG, Gemmell NJ. Signatures of selection in sheep bred for resistance or susceptibility to gastrointestinal nematodes. BMC Genomics. 2014;15:e637. doi: 10.1186/1471-2164-15-637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moioli B, Scatà MC, Steri R, Napolitano F, Catillo G. Signatures of selection identify loci associated with milk yield in sheep. BMC Genet. 2013;14:e76. doi: 10.1186/1471-2156-14-76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moioli B, Pilla F, Ciani E. Signatures of selection identify loci associated with fat tail in sheep. J Anim Sci. 2015;93:4660–4669. doi: 10.2527/jas.2015-9389. [DOI] [PubMed] [Google Scholar]
- Moradi MH, Nejati-Javaremi A, Moradi-Shahrbabak M, Dodds KG, McEwan JC. Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition. BMC Genet. 2012;13:e10. doi: 10.1186/1471-2156-13-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mu X, Zhao T, Xu C, Shi W, Geng B, Shen J, Zhang C, Pan J, Yang J, Hu S, Lv Y, Wen H, You Q. Oncometabolite succinate promotes angiogenesis by upregulating VEGF expression through GPR91-mediated STAT3 and ERK activation. Oncotarget. 2017;8:13174–13185. doi: 10.18632/oncotarget.14485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salih H, Adelson DL. QTL global meta-analysis: are trait determining genes clustered? BMC Genomics. 2009;10:e184. doi: 10.1186/1471-2164-10-184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schennink A, Bovenhuis H, Leon-Kloosterziel KM, van Arendonk JA, Visker MH. Effect of polymorphisms in the FASN, OLR1, PPARGC1A, PRL and STAT5A genes on bovine milk-fat composition. Anim Genet. 2009;40:909–916. doi: 10.1111/j.1365-2052.2009.01940.x. [DOI] [PubMed] [Google Scholar]
- Simianer H, Ma Y, Qanbari S. Statistical Problems in Livestock Population Genomics. World Congress on Genetics Applied To Livestock Production; 17–22 August 2014; Vancouver, Canada. 2014. [Google Scholar]
- Suarez-Vega A, Gutierrez-Gil B, Arranz JJ. Transcriptome expression analysis of candidate milk genes affecting cheese-related traits in 2 sheep breeds. J Dairy Sci. 2016;99:6381–6390. doi: 10.3168/jds.2016-11048. [DOI] [PubMed] [Google Scholar]
- Sutera AM, Riggio V, Mastrangelo S, Di Gerlando R, Sardina MT, Pong-Wong R, Tolone M, Portolano B. Genome-wide association studies for milk production traits in Valle del Belice sheep using repeated measures. Anim Genet. 2019;50:311–314. doi: 10.1111/age.12789. [DOI] [PubMed] [Google Scholar]
- Weikard R, Kuhn C, Goldammer T, Freyer G, Schwerin M. The bovine PPARGC1A gene: molecular characterization and association of an SNP with variation of milk fat synthesis. Physiol Genom. 2005;21:1–13. doi: 10.1152/physiolgenomics.00103.2004. [DOI] [PubMed] [Google Scholar]
- Wiener P, Pong-Wong R. A regression-based approach to selection mapping. J Hered. 2011;102:294–305. doi: 10.1093/jhered/esr014. [DOI] [PubMed] [Google Scholar]
- Yuan Z, Liu E, Liu Z, Kijas JW, Zhu C, Hu S, Ma X, Zhang L, Du L, Wang H, Wei C. Selection signature analysis reveals genes associated with tail type in Chinese indigenous sheep. Anim Genet. 2017;48:55–66. doi: 10.1111/age.12477. [DOI] [PubMed] [Google Scholar]
- Zervas G, Tsiplakou E. The effect of feeding systems on the characteristics of products from small ruminants. Small Ruminant Res. 2011;101:140–149. doi: 10.1016/j.smallrumres.2011.09.034. [DOI] [Google Scholar]
- Zhou C, Li C, Cai W, Liu S, Yin H, Shi S, Zhang Q, Zhang S. Genome-Wide Association Study for Milk Protein Composition Traits in a Chinese Holstein Population Using a Single-Step Approach. Front Genet. 2019;10:e72. doi: 10.3389/fgene.2019.00072. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data of the paper are available from Sheep HapMap project database (, International Sheep Genomics Consortium, 2019).