Abstract
Background
Crossbreeding is an important way to improve production beef cattle performance. Pinan cattle is a new hybrid cattle obtained from crossing Piedmontese bulls with Nanyang cows. After more than 30 years of cross-breeding, Pinan cattle show a variety of excellent characteristics, including fast growth, early onset of puberty, and good meat quality. In this study, we analyzed the genetic diversity, population structure, and genomic region under the selection of Pinan cattle based on whole-genome sequencing data of 30 Pinan cattle and 169 published cattle genomic data worldwide.
Results
Estimating ancestry composition analysis showed that the composition proportions for our Pinan cattle were mainly Piedmontese and a small amount of Nanyang cattle. The analyses of nucleotide diversity and linkage disequilibrium decay indicated that the genomic diversity of Pinan cattle was higher than that of European cattle and lower than that of Chinese indigenous cattle. De-correlated composite of multiple selection signals, which combines four different statistics including θπ, CLR, FST, and XP-EHH, was computed to detect the signatures of selection in the Pinan cattle genome. A total of 83 genes were identified, affecting many economically important traits. Functional annotation revealed that these selected genes were related to immune (BOLA-DQA2, BOLA-DQB, LSM14A, SEC13, and NAALADL2), growth traits (CYP4A11, RPL26, and MYH10), embryo development (REV3L, NT5E, CDX2, KDM6B, and ADAMTS9), hornless traits (C1H21orf62), and climate adaptation (ANTXR2).
Conclusion
In this paper, we elucidated the genomic characteristics, ancestry composition, and selective signals related to important economic traits in Pinan cattle. These results will provide the basis for further genetic improvement of Pinan cattle and reference for other hybrid cattle related studies.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12864-022-08645-y.
Keywords: Pinan cattle, WGS, Genetic diversity, Population Structure, DCMS
Background
Cattle are one of the most important domestic animals for human beings. According to the origin of cattle, domestic cattle can be simply divided into two subspecies: Bos taurus and Bos indicus [1]. Or according to geographical region, it can also be divided into five groups: European taurine, Eurasian taurine, East Asian taurine, Chinese indicine, and Indian indicine [2]. The earliest domesticated cattle were mainly used for draft purposes. With the development of society, except for a few developing countries, cattle have been developed in the direction of specialization through continuous breeding and improvement. Today, there are hundreds of cattle breeds in the world, and China alone has more than 50 native breeds, including five excellent breeds (Yanbian, Luxi, Qinchuan, Jinnan, and Nanyang cattle). Because the cattle in China used to be mainly used for farming, the production performance is not as good as the specialized beef cattle breeds in European and American countries. To compensate for the perceived lower beef production potential of local cattle, these cattle are often crossed with exotic breeds to develop new breeds. Piedmontese cattle, which are native to Italy, are internationally recognized as terminal male due to their double muscle genes and have been introduced to more than 20 countries for crossbreeding. Grading up is often used to improve the performance of cattle. A new breed called Pinan cattle has been developed in Henan Province of China by introducing Piedmontese cattle as male parents and grading up with local Nanyang cattle. It is generally used for production after crossing three generations. Pinan cattle has many advantages, such as the early onset of puberty, fast-growth speed, and good meat quality, which greatly improved the breeding efficiency and meet the demands of people.
Whole-genome sequencing (WGS) has been used to find a large number of variations that can be used as molecular genetic markers. The application of WGS plays an important role in the study of human diseases, the breeding of animals and plants, as well as the origin and domestication of species. WGS studies have yielded many important results in the field of livestock, including domestication, adaptive mechanisms of indigenous breeds, candidate genes responsible for important economic traits, and historical population dynamics [3, 4]. In earlier years, it was mainly European commercial cattle that were studied using whole-genome sequencing [5]. Later, researchers also began to look at native varieties to find genomic variations associated with desired traits in native varieties [6–10], which could help improve breeding.
In the current study, we sequenced the genomes of 30 Pinan cattle to detect genetic variation, population structure, and selective sweep. Our result will lay a foundation for further studies on the genetic basis of important economic traits, provide ideas and basis for future improvement in Pinan cattle, and provide a reference for related studies in other hybrid breeds.
Results
Sequencing and SNP detection
A total of 5,402,336,411 clean reads were generated after genome sequencing in 30 Pinan cattle samples. The clean reads were aligned to the latest Bos taurus reference genome (ARS-UCD1.2) with an average depth of 9.67X using Sentieon [11] software. We identified 23,077,494 biallelic SNPs in 30 Pinan cattle using GATK. Among these SNPs, functional annotation showed that SNPs mainly existed in the intergenic region (59.6%) and intronic region (37.4%). Exons accounted for 0.77% of the total SNPs, including 71,193 non-synonymous SNPs and 99,811 synonymous SNPs (Table S3).
To place Pinan cattle into a global context, we also detected the single nucleotide polymorphisms in five "core" cattle populations [2], and two Chinese cattle breeds (Qinchuan and Jiaxian Red cattle). Among these breeds, Chinese indicine (37,087,338) has the highest number of SNPs, followed by crossbreed Jiaxian cattle (29,315,624), Qinchuan cattle (26,270,077), Nanyang cattle (25,468,742), Pinan cattle (23,077,494) and Indian indicine (23,005,288). The number of SNP in Bos taurus was lower than that in indicine cattle and hybrid cattle. The results of this SNP distribution are consistent with the observations of other studies [2, 8].
Population structure analysis
To explore the genetic relationships between Pinan cattle and other cattle breeds around the world, estimating ancestry, building neighbor-joining (NJ) trees and principal component analysis (PCA) were performed (Fig. 1). The first and second PCs explained 6.23% and 2.58% of the variation in the entire genomic data, respectively. The first PC separated Bos taurus and Bos indicus. The analysis revealed clear geographical patterns of cattle distribution. The proportions of individuals in Pinan cattle inferred by the ADMIXTURE are presented in Fig. 1 A. The case with a small CV error value (Table S4) was selected for drawing. When K = 2, these different cattle breeds can be divided into two groups: Bos taurus and Bos indicus; when K = 4, European taurine and Zebu cattle have a single genomic composition. We can see that the ancestry proportions inherited from Bos taurus are greater than that from Bos indicus in Pinan cattle. To be more precise, Pinan cattle composition is more similar to Eurasian taurine which indicates that the genetic influence of Piedmontese cattle was greater than that of Nanyang cattle. The results of the NJ tree (Fig. 1B and Figure S1) and PCA (Fig. 1C) analysis were similar. All "core" cattle groups form independent clusters. The individuals of the Pinan cattle are also mostly clustered together and close to the Piedmontese cattle.
Genetic diversity and linkage disequilibrium
To elucidate the genomic characteristics in Pinan cattle, nucleotide diversity was calculated (Fig. 2A). The results showed that Chinese indicine had the highest nucleotide diversity, followed by Nanyang, Indian, and Qinchuan cattle. The lowest nucleotide diversity was found in European cattle. The nucleotide diversity of our crossbreed Pinan cattle is lower than that of female parent Nanyang cattle but higher than that of male parent Piedmontese cattle. From the perspective of linkage disequilibrium (LD) (Fig. 2B), although the LD of each variety decreased rapidly, there were some differences among varieties. At distances between markers (> 50 kb), Indian indicine and Chinese indicine had the lowest LD level. The highest LD level in Jersey, followed by Pinan. East Asian taurine and European taurine had a medium LD level. Overall, the LD level of indicine breeds was lower than that of taurine breeds, consistent with previous studies [12]. It is worth noting that Pinan cattle had a higher LD level than either of its parents.
De-correlated composite of multiple signals
Within-population and cross-population statistics were combined separately in a single score using the De-correlated composite of multiple signals (DCMS) statistic [13]. After calculation of the DCMS statistics, the p-value was fitted to a normal distribution and corrected for multiple testing (FDR < 0.05). The analysis identified 92 candidate regions (Table S5) of the genome containing 83 genes. We identified several of the most significant genomic regions on BTA 9: 63.80–64.09 M and BTA 19:27.50 M-28.19 M (Fig. 3). All regions were annotated and some candidate genes associated with important traits were found (Table 1). Including hornless traits (C1H21orf62), growth traits (CYP4A11, RPL26, and MYH10), Disease-related traits (MACC1 and SNX14), climate adaptation (ANTXR2), lipid metabolism (ANGPTL4), embryo development (REV3L, NT5E, CDX2, KDM6B, and ADAMTS9), immune response (BOLA-DQA2, BOLA-DQB, LSM14A, SEC13, and NAALADL2), sperm flagella (DNAH2), milk yield (ODF4, ARHGEF15, and MYH10).
Table 1.
Region (Mb) | q-value | Candidate gene | Trait | Citation |
---|---|---|---|---|
1: 27.40–27.90 | 0.027351 | PAXBP1, C1H21orf62 | hornless | [14] |
1: 82.84–82.89 | 0.041116 | FAM131A, EIF4G1, PSMD2, CLCN2 | disease-relate | [15] |
1: 92.80–92.85 | 0.042031 | NAALADL2 | embryo development | [16] |
3: 00.18–99.23 | 0.042031 | CYP4A11 | growth traits | [17] |
4: 28.90–28.95 | 0.016919 | MACC1 | colorectal carcinoma | [18] |
6: 37.14–37.19 | 0.046225 | MED28, LAP3, FAM184B | disease-relate, milk production | [19, 20] |
6: 94.72–94.79 | 0.035810 | ANTXR2 | climate adaptation | [21] |
7: 17.00–17.09 | 0.021124 | ANGPTL4, RAB11B, MARCH2, HNRNPM, | lipid metabolism | [22] |
7: 18.44–18.55 | 0.012144 | CAPS, VMAC, NDUFA11, RANBP3, FUT6, NRTN, PRR22 | disease-relate | [23] |
7: 18.74–18.79 | 0.011793 | LOC112447361 | ||
8: 60.72–60.77 | 0.044877 | MELK | disease-relate | [24] |
9: 38.98–39.07 | 0.000525 | REV3L | embryo development | [25] |
9: 63.80–64.07 | 0.008056 | SYNCRIP, SNX14, NT5E | disease-related, immune suppressor | [26–28] |
9: 104.28–104.33 | 0.015762 | LOC100300342 | ||
12: 11.26–11.31 | 0.003701 | WBP4 | disease-related | [29] |
12: 32.06–32.11 | 0.029085 | URAD, CDX2, FLT3 | embryo development, hematopoiesis | [30, 31] |
12: 71.96–72.01 | 0.020448 | LOC100337076 | ||
12: 72.44–72.49 | 0.024298 | LOC112449072 | ||
13: 43.42–43.47 | 0.020407 | LOC112441538, LOC112449293 | ||
15: 48.48–48.57 | 0.015635 | LOC781483, LOC781444, LOC781403 | ||
16: 8.92–8.97 | 0.022505 | LOC789494 | ||
16: 44.32–44.41 | 0.008253 | GPR157, LOC107131287 | neuronal differentiation | [32] |
18: 44.60–44.67 | 0.013972 | LSM14A | antiviral immune response | [33] |
19: 27.08–27.13 | 0.015391 | NLGN2, SPEM2, LOC112442746, TMEM102, TMEM256, SPEM1, FGF11, CHRNB1 | disease-related | [34–38] |
19: 27.50–28.19 | 0.015021 | TMEM88, NAA38, CYB5D, DNAH2, KDM6B, RANGRF, SLC25A35, ARHGEF15, ODF4, KRBA2, RPL26, RNF222, NDEL1, MYH10 | disease-related, growth traits, sperm flagella, milk yield | [39–47] |
21: 6.20–6.50 | 0.046249 | CERS3 | Congenital Ichthyosis | [48] |
22: 36.84–36.91 | 0.022639 | ADAMTS9 | embryo development | [49] |
22: 54.36–54.41 | 0.021701 | SEC13, ATP2B2 | immune response | [50, 51] |
23: 25.58–25.63 | 0.004795 | BOLA-DQA2, BOLA-DQB | immune response | [52] |
25: 13.46–13.51 | 0.020943 | PARN | disease-related | [53] |
29: 27.34–27.39 | 0.006772 | LOC787625 | ||
29: 27.42–27.47 | 0.000102 | LOC787694 |
Discussion
The characterization of population structure and genetic diversity can help us to evaluate bovine genetic resources and play an important role in utilization and conservation. In this study, the population genetic structure of Pinan Cattle was studied in the background with different cattle breeds. As can be seen from the ADMIXTURE analysis (Fig. 1), the ancestral contributions of Pinan cattle were mainly from Piedmontese.
Although Pinan cattle are closer to the Piedmontese lineage, the number of genetic variations inherited from Nanyang cattle is much higher than that inherited from Piedmontese cattle. The nucleotide diversity of Pinan cattle is lower than that of Chinese indicine and other native Chinese cattle breeds, but about twice that of European cattle breeds such as Angus, Hereford, and Piedmontese. The higher genomic diversity of Pinan cattle may be due to the influence of female parent Nanyang cattle. Because Nanyang cattle is a hybrid between Bos taurus and Bos indicus, it has high genomic diversity. In addition, cattle breeds with higher genetic diversity tend to have lower LD decay, such as Qinchuan cattle. Interestingly, the LD decay level of Pinan cattle was higher than that of Nanyang cattle and Piedmontese, which may be due to the greater selection intensity experienced by the Pinan herd than Nanyang cattle and Piedmontese.
In addition, we found important selective scanning markers in Pinan cattle. Over the past 30 years, Pinan cattle have been used for intensive beef breeding, leading to genetic improvement of production traits, especially for their early onset of puberty. Several genes (REV3L, NT5E, CDX2, KDM6B, ADAMTS9) in the Pinan cattle genome show selectivity for embryo development. To better understand the selection pressure, we explored the biological function of these genes. REV3L is involved in the DNA repair process, and it was upregulated in cattle embryos derived from depleted cells [54]. Loss of REV3L resulted in chromosome instability and embryogenesis failure in mice [25]. Genome-wide association study has shown that REV3L is associated with embryonic development in Nellore cattle [55]. NT5E produces a protein that encodes an enzyme that converts adenosine monophosphate into adenosine [28]. The study has shown that the NT5E gene affects the cleavage rate of cattle [56]. In addition, two non-synonymous SNP in NT5E can be useful for improving Inosine 5’-monophosphate which contributes to the umami taste in beef in Japanese Black beef [57]. CDX2 is a transcription factor that is localized in the nucleus of the trophectoderm [58]. Studies have shown that CDX2 is essential for early development and gene expression involved in the differentiation of inner cell mass and trophectoderm lineages in cattle embryos [30, 59]. KDM6B, also known as JMJD3, is a histone demethylase expressed in bovine cleavage embryos [60]. KDM6B is transiently upregulated around embryo developmental stages when the embryo genome is activated [61]. Another study has shown that H3K27me3 demethylase activity mediated by KDM6B is required for normal bovine embryo development and reprogramming gene expression during the embryonic genome [41]. ADAMTS9 is widely expressed during mouse embryo development [49]. Other studies have shown that ADAMTS9 is necessary for ovarian development and may promote the rupture for follicular rupture in zebrafish [62, 63]. A study has shown that ADAMTS proteases play a role in remodeling the bovine ovulatory follicle in preparation for ovulation and the formation of the corpus luteum [64]. Interestingly, a GWAS study reported that ADAMTS9 was associated with chest size in goats [65]. This suggests it may also play an important role in cattle. These genes may be an important reason for the early maturity and faster growth of Pinan cattle than Nanyang cattle.
Native Chinese cattle tend to be more resistant and adaptable than foreign breeds. In a selective sweep of Pinan cattle populations, we identified several immune system-related genes (BOLA-DQA2, BOLA-DQB, LSM14A, and NAALADL2). BOLA-DQA2 and BOLA-DQB both belong to the Bovine Leukocyte Antigen (BOLA) class II genes which are involved in the immune response. For example, BOLA-DQA2 plays an important role in resistance to mastitis in dairy cows [52]. LSM14A plays a critical role in antiviral immune responses [33, 66]. NAALADL2 is reported to be responsible for immune homeostasis [67]. They may have a similar effect in cattle but there are currently few studies on that. In addition, we found a positive selection gene (ANTXR2) associated with climate adaptation [21]. These positive selection genes are likely to improve the survival of Pinan cattle.
In addition, we found several genes (CYP4A11, RPL26, and MYH10) related to growth traits. CYP4A11 is involved in fatty acid metabolism, blood pressure regulation, and kidney tubule absorption of ions [68]. The study has shown that CYP4A11 affects growth and fat deposition in cattle [17]. RPL26, a ribosomal protein gene, is reported to be related to residual feed intake in Holstein cows [69]. GWAS analysis has shown that one missense variant in RPL26 is associated with average daily gain. There is a report shows that MYH10 is strongly associated with milk production and body conformation traits of dairy cattle [70]. GWAS study also shows that MYH10 is a candidate gene for yearling weight in Chinese Simmental beef cattle. In addition, two other positive selection genes ODF4 and ARHGEF15 have been reported to be involved in the milk yield of dairy goats. This suggests they may have a similar role in cattle.
Conclusions
In this study, genomic variation of Pinan cattle was studied comprehensively using WGS data. The study on the population structure and genomic diversity of Pinan cattle will provide the basis for the genetic evaluation of this breed and point out the direction for the development of reasonable breeding strategies for future breeding and improvement. In addition, we have identified a series of candidate genes that may have important effects on survival, growth, early maturity, and meat quality traits of this variety. These results will provide a basis for further genomic studies of beef cattle and provide a reference for genomic studies of imported crossbred beef cattle in China or other parts of the world.
Methods
Samples and sequencing
We collected blood samples from 30 Pinan cattle in Xinye County, Henan Province using EDTA-K2 anticoagulant tubes. After dry ice storage, genomic DNA was extracted in the laboratory using the standard phenol–chloroform method [71]. The integrity of genomic DNA was verified by agarose gel electrophoresis. And we used Nanodrop to measure the purity of DNA. DNA samples containing more than 1.5 µg were used to build libraries. A paired-end library with an average insert length of 500 bp and an average read length of 150 bp was constructed for each individual. Sequencing was performed using Illumina Hiseq 2000 platform at the Novogene Bioinformatics, Beijing, China.
To better compare the genetic diversity of Pinan cattle with other cattle breeds in the world, we collected additional samples based on the five "core" groups proposed by Chen et al., [2] as well as the data of 30 Jiaxian cattle and 30 Qinchuan cattle of local breeds in China. The samples include European varieties (Angus, Hereford, Red Angus, Gelbvieh, Limousin, Simmental, Piedmontese, Korean breed (Hanwoo), Chinese breed (Tibetan, Qinchuan, Leiqiong, Guangfeng, Ji’an Wannan and India-Pakistan cattle breeds ((Tharparkar, Sahiwal, Hariana, Nelore, and Gir) (Table S2). A total of 199 samples were used in this study.
Reads mapping, SNP calling, and annotation
After obtaining the fastq file of the raw data, Trimmomatic software (v0.36) was used to trim sequence reads with the parameters: “LEADING:20, TRAILING:20, SLIDINGWINDWOE: 3:15, AVGQUAL:20, MINLEN:35, TOPHRED33”. The clean reads were aligned to the Bos taurus reference genome ARS-UCD1.2 by the “sentieon bwa mem” module with default parameters from sentieon software (versionV202010) [11]. After that “sentieon driver” module was used to filter duplicate reads and used to perform Base Quality Score Recalibration with default parameters. Single nucleotide polymorphisms (SNPs) were detected by the Genome Analysis Toolkit (GATK, version 4.1.8.1) [72]. The raw SNPs were called using the “SelectVariants” module of GATK. After SNPs were called, the “VariantFiltration” module was used to filter SNPs with the parameters “QualByDepth (QD) < 2.0, Quality (QUAL) < 30.0, StrandOddsRatio (SOR) > 3.0, FisherStrand (FS) > 60.0, RMSMappingQuality (MQ) < 40.0, MappingQualityRankSumTest (MQRankSum) < -12.5, and (ReadPosRankSumTest) ReadPosRankSum < -8.0”. Finally, a total of 53,174,296 biallelic SNPs were obtained using bcftools (version 1.8) with the parameters: F_MISSING < 0.1 & MAC > 2. After that, SNPs were filtered using VCFtools v0.1.16 with the parameters: –maf 0.05 and –max-missing 0.9. Based on the Bos taurus reference genome annotation file(https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/002/263/795/GCF_002263795.1_ARS-UCD1.2/GCF_002263795.1_ARS-UCD1.2_genomic.gff), Annovar [73] software was used to annotate the functions of each SNP.
Population genetic analysis
PLINK (version 1.9) [74] software was used to remove those sites with high LD with the parameter (–indep-pair-wise 50 5 0.2), and the remaining SNP sites were used for performing ADMIXTURE analysis. PCA was performed using the SmartPCA program in the Eigensoft V5.0 package [75]. ADMIXTURE v1.3 [76] was used for population structure analysis, the kinship set is from 2 to 8. The matrix of pairwise genetic distances supplied by plink was used for constructing an unrooted evolutionary tree. The visualization of evolutionary was done by MEGA7 v7.0.26 [77] and FigTree v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/).
Selective sweep identification
In order to detect selective signals in the Pinan cattle, we used different strategies to scan the genomes of Pinan cattle. The nucleotide diversity (θπ) and the composite likelihood ratio (CLR) [78] method were used in the Pinan cattle population. First, the SNP loci with allele frequency less than 0.05 were removed. After that, we used VCFtools v0.1.16 [79] to estimate the nucleotide diversity by a sliding window approach in which the windows were 50 kb and the step size is 20 kb. The SweepFinder2 [80] was used for calculating the CLR with a grid size of 50 kb. In addition, fixation index (FST) and cross-population extended haplotype homozygosity (XP-EHH) were used for comparing the genomic difference between Pinan and Piedmontese cattle. We used VCFtools to analyze FST with a 50 kb window and a 20 kb step. Within each population, haplotype phasing was performed using BEAGLE 5.0 [81], and selscan V1.1 [82] was used to calculate the XP-EHH statistics for each population. For XP-EHH selection scans, our test statistics are the average of standardized XP-EHH scores for each 50 kb region.
DCMS estimation
In this study, we combined four statistics including θπ, CLR, FST, and XPEHH into a single DCMS framework as described in Ma et al. [13]. The analysis steps were carried out by referring to other studies [10, 83–85]. The DCMS statistic was calculated for each 50 kb window using the R MINOTAUR package [86]. The genome-wide P-value was calculated by the stat_to_pvalue function of the R MINOTAUR package, and appropriate unilateral tests were performed, i.e., a left tailed test for the θπ and CLR (two.tailed = FALSE, right.tailed = FALSE) and a right tailed test for the FST and XPEHH (two.tailed = FALSE, right.tailed = TRUE). Then, the n × n correlation matrix between the statistics is calculated using the covNAMcd function (alpha = 0.75, nsamp = 50,000) of the R rrcovNA package [87]. The matrix was used as the input of the DCMS function of the R MINOTAUR package to calculate the DCMS value of the whole genome. After calculating the DCMS value, these DCMS values were transformed to a normal distribution with the robust linear model using the rlm function (dcms ~ 1) of the R MASS package [88]. The output of the fitting model is the input of the pnorm R function to calculate the p-value of DCMS statistics. To control for multiple test false discovery rate (FDR) in the rejected null hypothesis, the p.adjust R function (p = dcms_pvalue, method = ”BH”, n = length(dcms_pvalue)) was adopted to convert the p-value of DCMS into the corresponding q-value according to Benjamini and Hochberg methods.
Candidate gene annotation
The region with DCMS q-value lower than 0.05 was regarded as the statistically significant region, and the genes contained in the significant region were annotated through the reference genome annotation file based on the ARS-UCD 1.2 reference genome. The function of these candidate genes was further determined through a literature review.
Supplementary Information
Acknowledgements
We would like to thank Yu Jiang for his technical support.
Abbreviations
- SNP
Single nucleotide polymorphism
- WGS
Whole-genome sequencing
- LD
Linkage disequilibrium
- PCA
Principal component analysis
- NJ
Neighbor-joining
- θπ
Nucleotide diversity
- CLR
Composite likelihood ratio
- FST
Fixation index
- XP-EHH
Cross-population extended haplotype homozygosity
- GO
Gene Ontology
- KEGG
Kyoto Encyclopedia of Genes and Genomes
Authors’ contributions
YH and SZ conceived and designed the experiments. SZ and ZY performed the statistical analysis and data upload. Xinmiao L and PY performed the sample DNA extraction. NC and XX provided suggestions for the revision of the manuscript. YJ provided technical assistance. ZZ, XL, LX, SL, QS, EW, and BR contributed to the sample collections. YJ provides a data analysis platform. HC, CL, and YH provided the laboratories for DNA extraction and statistical analysis. SZ drafted the manuscript. All authors read and approved the final manuscript.
Funding
This work was supported by the China Agriculture Research System of MOF and MARA (Grant No. CARS-37), Henan Beef Cattle Industrial Technology System (No. S2013-08), Science-Technology Foundation for innovation and creativity of Henan Academy of Agricultural Sciences (2020CX09).
Availability of data and materials
Sequences are available from GenBank with the Bioproject accession number PRJNA698276 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA698276).
Declarations
Ethics approval and consent to participate
All cattle were handled following the guidelines established by the Council for Animal Welfare of China. The protocols for sample collection and animal handling have been approved by the Faculty of Animal Policy and Welfare Committee of Northwest A&F University (FAPWCNWAFU, Protocol number, NWAFAC 1008).
The study was carried out in compliance with the ARRIVE guidelines.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Shunjin Zhang and Zhi Yao contributed equally.
References
- 1.Decker JE, McKay SD, Rolf MM, Kim J, Molina Alcalá A, Sonstegard TS, Hanotte O, Götherström A, Seabury CM, Praharani L. Worldwide patterns of ancestry, divergence, and admixture in domesticated cattle. PLoS Genet. 2014;10(3):e1004254. doi: 10.1371/journal.pgen.1004254. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chen N, Cai Y, Chen Q, Li R, Wang K, Huang Y, Hu S, Huang S, Zhang H, Zheng Z. Whole-genome resequencing reveals world-wide ancestry and adaptive introgression events of domesticated cattle in East Asia. Nat Commun. 2018;9(1):1–13. doi: 10.1038/s41467-017-02088-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bai Y, Sartor M, Cavalcoli J. Current status and future perspectives for sequencing livestock genomes. J Anim Sci Biotechnol. 2012;3(1):1–6. doi: 10.1186/2049-1891-3-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Georges M, Charlier C, Hayes B. Harnessing genomic information for livestock improvement. Nat Rev Genet. 2019;20(3):135–156. doi: 10.1038/s41576-018-0082-2. [DOI] [PubMed] [Google Scholar]
- 5.Gutiérrez-Gil B, Arranz JJ, Wiener P. An interpretive review of selective sweep studies in Bos taurus cattle populations: identification of unique and shared selection signals across breeds. Front Genet. 2015;6:167. doi: 10.3389/fgene.2015.00167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shen J, Hanif Q, Cao Y, Yu Y, Lei C, Zhang G, Zhao Y. Whole genome scan and selection signatures for climate adaption in yanbian cattle. Front Genet. 2020;11:94. doi: 10.3389/fgene.2020.00094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bhati M, Kadri NK, Crysnanto D, Pausch H. Assessing genomic diversity and signatures of selection in Original Braunvieh cattle using whole-genome sequencing data. BMC Genomics. 2020;21(1):1–14. doi: 10.1186/s12864-020-6446-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Xia X, Zhang S, Zhang H, Zhang Z, Chen N, Li Z, Sun H, Liu X, Lyu S, Wang X. Assessing genomic diversity and signatures of selection in Jiaxian Red cattle using whole-genome sequencing data. BMC Genomics. 2021;22(1):1–11. doi: 10.1186/s12864-020-07340-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tijjani A, Utsunomiya YT, Ezekwe AG, Nashiru O, Hanotte O. Genome sequence analysis reveals selection signatures in endangered trypanotolerant West African Muturu cattle. Front Genet. 2019;10:442. doi: 10.3389/fgene.2019.00442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yurchenko AA, Daetwyler HD, Yudin N, Schnabel RD, Vander Jagt CJ, Soloshenko V, Lhasaranov B, Popov R, Taylor JF, Larkin DM. Scans for signatures of selection in Russian cattle breed genomes reveal new candidate genes for environmental adaptation and acclimation. Sci Rep. 2018;8(1):1–16. doi: 10.1038/s41598-018-31304-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Weber JA, Aldana R, Gallagher BD, Edwards JS. Sentieon DNA pipeline for variant detection-Software-only solution, over 20x faster than GATK 3.3 with identical results. Peer J Pre Prints. 2016;4:e1672v1672. [Google Scholar]
- 12.Porto-Neto LR, Kijas JW, Reverter A. The extent of linkage disequilibrium in beef cattle breeds using high-density SNP genotypes. Genet Sel Evol. 2014;46(1):1–5. doi: 10.1186/1297-9686-46-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ma Y, Ding X, Qanbari S, Weigend S, Zhang Q, Simianer H. Properties of different selection signature statistics and a new strategy for combining them. Heredity (Edinb) 2015;115(5):426–436. doi: 10.1038/hdy.2015.42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Liu WB, Liu J, Liang C, Guo X, Bao P, Chu M, Ding X, Wang H, Zhu X, Yan P. Associations of single nucleotide polymorphisms in candidate genes with the polled trait in D atong domestic yaks. Anim Genet. 2014;45(1):138–141. doi: 10.1111/age.12081. [DOI] [PubMed] [Google Scholar]
- 15.Chartier-Harlin M-C, Dachsel JC, Vilariño-Güell C, Lincoln SJ, Leprêtre F, Hulihan MM, Kachergus J, Milnerwood AJ, Tapia L, Song M-S. Translation initiator EIF4G1 mutations in familial Parkinson disease. Am J Hum Genet. 2011;89(3):398–406. doi: 10.1016/j.ajhg.2011.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Nandolo W, Mészáros G, Wurzinger M, Banda LJ, Gondwe TN, Mulindwa HA, Nakimbugwe HN, Clark EL, Woodward-Greene MJ, Liu M. Detection of copy number variants in African goats using whole genome sequence data. BMC Genomics. 2021;22(1):1–15. doi: 10.1186/s12864-021-07703-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yang M, Lv J, Zhang L, Li M, Zhou Y, Lan X, Lei C, Chen H. Association study and expression analysis of CYP4A11 gene copy number variation in Chinese cattle. Sci Rep. 2017;7(1):1–8. doi: 10.1038/s41598-016-0028-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Stein U, Walther W, Arlt F, Schwabe H, Smith J, Fichtner I, Birchmeier W, Schlag PM. MACC1, a newly identified key regulator of HGF-MET signaling, predicts colon cancer metastasis. Nat Med. 2009;15(1):59–67. doi: 10.1038/nm.1889. [DOI] [PubMed] [Google Scholar]
- 19.Zheng X, Ju Z, Wang J, Li Q, Huang J, Zhang A, Zhong J, Wang C. Single nucleotide polymorphisms, haplotypes and combined genotypes of LAP3 gene in bovine and their association with milk production traits. Mol Biol Rep. 2011;38(6):4053–4061. doi: 10.1007/s11033-010-0524-1. [DOI] [PubMed] [Google Scholar]
- 20.He X, Huang Q, Qiu X, Liu X, Sun G, Guo J, Ding Z, Yang L, Ban N, Tao T. LAP3 promotes glioma progression by regulating proliferation, migration and invasion of glioma cells. Int J Biol Macromol. 2015;72:1081–1089. doi: 10.1016/j.ijbiomac.2014.10.021. [DOI] [PubMed] [Google Scholar]
- 21.Flori L, Moazami-Goudarzi K, Alary V, Araba A, Boujenane I, Boushaba N, Casabianca F, Casu S, Ciampolini R, Coeur D'Acier A. A genomic map of climate adaptation in Mediterranean cattle breeds. Mol Ecol. 2019;28(5):1009–1029. doi: 10.1111/mec.15004. [DOI] [PubMed] [Google Scholar]
- 22.Mamedova L, Robbins K, Johnson B, Bradford B. Tissue expression of angiopoietin-like protein 4 in cattle. J Anim Sci. 2010;88(1):124–130. doi: 10.2527/jas.2009-2258. [DOI] [PubMed] [Google Scholar]
- 23.Berger I, Hershkovitz E, Shaag A, Edvardson S, Saada A, Elpeleg O. Mitochondrial complex I deficiency caused by a deleterious NDUFA11 mutation. Ann Neurol. 2008;63(3):405–408. doi: 10.1002/ana.21332. [DOI] [PubMed] [Google Scholar]
- 24.Jiang P, Zhang D. Maternal embryonic leucine zipper kinase (MELK): a novel regulator in cell cycle control, embryonic development, and cancer. Int J Mol Sci. 2013;14(11):21551–21560. doi: 10.3390/ijms141121551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lange SS, Tomida J, Boulware KS, Bhetawal S, Wood RD. The polymerase activity of mammalian DNA pol ζ is specifically required for cell and embryonic viability. PLoS Genet. 2016;12(1):e1005759. doi: 10.1371/journal.pgen.1005759. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Santangelo L, Giurato G, Cicchini C, Montaldo C, Mancone C, Tarallo R, Battistelli C, Alonzi T, Weisz A, Tripodi M. The RNA-binding protein SYNCRIP is a component of the hepatocyte exosomal machinery controlling microRNA sorting. Cell Rep. 2016;17(3):799–808. doi: 10.1016/j.celrep.2016.09.031. [DOI] [PubMed] [Google Scholar]
- 27.Scott EY, Woolard KD, Finno CJ, Murray JD. Cerebellar abiotrophy across domestic species. Cerebellum. 2018;17(3):372–379. doi: 10.1007/s12311-017-0914-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Martín-Satué M, Lavoie EG, Fausther M, Lecka J, Aliagas E, Kukulski F, Sévigny J. High expression and activity of ecto-5′-nucleotidase/CD73 in the male murine reproductive tract. Histochem Cell Biol. 2010;133(6):659–668. doi: 10.1007/s00418-010-0704-z. [DOI] [PubMed] [Google Scholar]
- 29.Yamazaki K, Umeno J, Takahashi A, Hirano A, Johnson TA, Kumasaka N, Morizono T, Hosono N, Kawaguchi T, Takazoe M. A genome-wide association study identifies 2 susceptibility loci for Crohn's disease in a Japanese population. Gastroenterology. 2013;144(4):781–788. doi: 10.1053/j.gastro.2012.12.021. [DOI] [PubMed] [Google Scholar]
- 30.Sakurai N, Takahashi K, Emura N, Fujii T, Hirayama H, Kageyama S, Hashizume T, Sawai K. The necessity of OCT-4 and CDX2 for early development and gene expression involved in differentiation of inner cell mass and trophectoderm lineages in bovine embryos. Cellular Reprogram (Formerly" Cloning and Stem Cells") 2016;18(5):309–318. doi: 10.1089/cell.2015.0081. [DOI] [PubMed] [Google Scholar]
- 31.Gilliland DG, Griffin JD. The roles of FLT3 in hematopoiesis and leukemia. Blood J Am Soc Hemat. 2002;100(5):1532–1542. doi: 10.1182/blood-2002-02-0492. [DOI] [PubMed] [Google Scholar]
- 32.Takeo Y, Kurabayashi N, Nguyen MD, Sanada K. The G protein-coupled receptor GPR157 regulates neuronal differentiation of radial glial progenitors through the Gq-IP3 pathway. Sci Rep. 2016;6(1):1–12. doi: 10.1038/srep25180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Liu T-T, Yang Q, Li M, Zhong B, Ran Y, Liu L-L, Yang Y, Wang Y-Y, Shu H-B. LSm14A plays a critical role in antiviral immune responses by regulating MITA level in a cell-specific manner. J Immunol. 2016;196(12):5101–5111. doi: 10.4049/jimmunol.1600212. [DOI] [PubMed] [Google Scholar]
- 34.Sun C, Cheng M-C, Qin R, Liao D-L, Chen T-T, Koong F-J, Chen G, Chen C-H. Identification and functional characterization of rare mutations of the neuroligin-2 gene (NLGN2) associated with schizophrenia. Hum Mol Genet. 2011;20(15):3042–3051. doi: 10.1093/hmg/ddr208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Rato MG, Bexiga R, Nunes SF, Vilela CL, Santos-Sanches I. Human group A streptococci virulence genes in bovine group C streptococci. Emerg Infect Dis. 2010;16(1):116. doi: 10.3201/eid1601.090632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zheng H, Stratton CJ, Morozumi K, Jin J, Yanagimachi R, Yan W. Lack of Spem1 causes aberrant cytoplasm removal, sperm deformation, and male infertility. Proc Natl Acad Sci. 2007;104(16):6852–6857. doi: 10.1073/pnas.0701669104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wu X, Li M, Li Y, Deng Y, Ke S, Li F, Wang Y, Zhou S. Fibroblast growth factor 11 (FGF11) promotes non-small cell lung cancer (NSCLC) progression by regulating hypoxia signaling pathway. J Transl Med. 2021;19(1):1–14. doi: 10.1186/s12967-020-02683-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Agerholm JS, McEvoy FJ, Menzi F, Jagannathan V, Drögemüller C. A CHRNB1 frameshift mutation is associated with familial arthrogryposis multiplex congenita in Red dairy cattle. BMC Genomics. 2016;17(1):1–10. doi: 10.1186/s12864-016-2832-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Yx Ge, Wang Ch, Fy Hu, Lx P. Min J, Niu Ky, Zhang L, Li J, Xu T: New advances of TMEM88 in cancer initiation and progression, with special emphasis on Wnt signaling pathway. J Cell Physiol. 2018;233(1):79–87. doi: 10.1002/jcp.25853. [DOI] [PubMed] [Google Scholar]
- 40.Gao Y, Tian S, Sha Y, Zha X, Cheng H, Wang A, Liu C, Lv M, Ni X, Li Q. Novel bi-allelic variants in DNAH2 cause severe asthenoteratozoospermia with multiple morphological abnormalities of the flagella. Reprod Biomed Online. 2021;42(5):963–972. doi: 10.1016/j.rbmo.2021.01.011. [DOI] [PubMed] [Google Scholar]
- 41.Chung N, Bogliotti YS, Ding W, Vilarino M, Takahashi K, Chitwood JL, Schultz RM, Ross PJ. Active H3K27me3 demethylation by KDM6B is required for normal development of bovine preimplantation embryos. Epigenetics. 2017;12(12):1048–1056. doi: 10.1080/15592294.2017.1403693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cataldo S, Annoni GA, Marziliano N. The perfect storm? Histiocytoid cardiomyopathy and compound CACNA2D1 and RANGRF mutation in a baby. Cardiol Young. 2015;25(1):174–176. doi: 10.1017/S1047951113002382. [DOI] [PubMed] [Google Scholar]
- 43.Haitina T, Lindblom J, Renström T, Fredriksson R. Fourteen novel human members of mitochondrial solute carrier family 25 (SLC25) widely expressed in the central nervous system. Genomics. 2006;88(6):779–790. doi: 10.1016/j.ygeno.2006.06.016. [DOI] [PubMed] [Google Scholar]
- 44.Mucha S, Mrode R, Coffey M, Kizilaslan M, Desire S, Conington J. Genome-wide association study of conformation and milk yield in mixed-breed dairy goats. J Dairy Sci. 2018;101(3):2213–2225. doi: 10.3168/jds.2017-12919. [DOI] [PubMed] [Google Scholar]
- 45.Higgins MG, Fitzsimons C, McClure MC, McKenna C, Conroy S, Kenny DA, McGee M, Waters SM, Morris DW. GWAS and eQTL analysis identifies a SNP associated with both residual feed intake and GFRA2 expression in beef cattle. Sci Rep. 2018;8(1):1–12. doi: 10.1038/s41598-018-32374-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sasaki S, Mori D, Toyo-Oka K, Chen A, Garrett-Beal L, Muramatsu M, Miyagawa S, Hiraiwa N, Yoshiki A, Wynshaw-Boris A. Complete loss of Ndel1 results in neuronal migration defects and early embryonic lethality. Mol Cell Biol. 2005;25(17):7812–7827. doi: 10.1128/MCB.25.17.7812-7827.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhuang Z, Xu L, Yang J, Gao H, Zhang L, Gao X, Li J, Zhu B. Weighted single-step genome-wide association study for growth traits in Chinese Simmental beef cattle. Genes. 2020;11(2):189. doi: 10.3390/genes11020189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Radner FP, Marrakchi S, Kirchmeier P, Kim G-J, Ribierre F, Kamoun B, Abid L, Leipoldt M, Turki H, Schempp W. Mutations in CERS3 cause autosomal recessive congenital ichthyosis in humans. PLoS Genet. 2013;9(6):e1003536. doi: 10.1371/journal.pgen.1003536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Jungers KA, Le Goff C, Somerville RP, Apte SS. Adamts9 is widely expressed during mouse embryo development. Gene Expr Patterns. 2005;5(5):609–617. doi: 10.1016/j.modgep.2005.03.004. [DOI] [PubMed] [Google Scholar]
- 50.Carayol J, Sacco R, Tores F, Rousseau F, Lewin P, Hager J, Persico AM. Converging evidence for an association of ATP2B2 allelic variants with autism in male subjects. Biol Psychiat. 2011;70(9):880–887. doi: 10.1016/j.biopsych.2011.05.020. [DOI] [PubMed] [Google Scholar]
- 51.Moreira TG, Zhang L, Shaulov L, Harel A, Kuss SK, Williams J, Shelton J, Somatilaka B, Seemann J, Yang J. Sec13 regulates expression of specific immune factors involved in inflammation in vivo. Sci Rep. 2015;5(1):1–14. doi: 10.1038/srep17655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Hou Q, Huang J, Ju Z, Li Q, Li L, Wang C, Sun T, Wang L, Hou M, Hang S. Identification of splice variants, targeted microRNAs and functional single nucleotide polymorphisms of the BOLA-DQA2 gene in dairy cattle. DNA Cell Biol. 2012;31(5):739–744. doi: 10.1089/dna.2011.1402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Twomey AJ, Berry DP, Evans RD, Doherty ML, Graham DA, Purfield DC. Genome-wide association study of endo-parasite phenotypes using imputed whole-genome sequence data in dairy and beef cattle. Genet Sel Evol. 2019;51(1):1–17. doi: 10.1186/s12711-019-0457-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Srirattana K, St John JC. Manipulating the mitochondrial genome to enhance cattle embryo development. G3. 2017;7(7):2065–2080. doi: 10.1534/g3.117.042655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Melo TPD, De Camargo GMF, De Albuquerque LG, Carvalheiro R. Genome-wide association study provides strong evidence of genes affecting the reproductive performance of Nellore beef cows. PLoS One. 2017;12(5):e0178551. doi: 10.1371/journal.pone.0178551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Cochran SD, Cole JB, Null DJ, Hansen PJ. Single nucleotide polymorphisms in candidate genes associated with fertilizing ability of sperm and subsequent embryonic development in cattle. Biol Reprod. 2013;89(3):69. doi: 10.1095/biolreprod.113.111260. [DOI] [PubMed] [Google Scholar]
- 57.Uemoto Y, Ohtake T, Sasago N, Takeda M, Abe T, Sakuma H, Kojima T, Sasaki S. Effect of two non-synonymous ecto-5′-nucleotidase variants on the genetic architecture of inosine 5′-monophosphate (IMP) and its degradation products in Japanese Black beef. BMC Genomics. 2017;18(1):1–15. doi: 10.1186/s12864-017-4275-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Wydooghe E, Vandaele L, Beek J, Favoreel H, Heindryckx B, De Sutter P, Van Soom A. Differential apoptotic staining of mammalian blastocysts based on double immunofluorescent CDX2 and active caspase-3 staining. Anal Biochem. 2011;416(2):228–230. doi: 10.1016/j.ab.2011.05.033. [DOI] [PubMed] [Google Scholar]
- 59.Zhenhua G, Rajput SK, Folger JK, Di L, Knott JG, Smith GW. Pre-and peri-/post-compaction follistatin treatment increases in vitro production of cattle embryos. PLoS ONE. 2017;12(1):e0170808. doi: 10.1371/journal.pone.0170808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Canovas S, Cibelli JB, Ross PJ. Jumonji domain-containing protein 3 regulates histone 3 lysine 27 methylation during bovine preimplantation development. Proc Natl Acad Sci. 2012;109(7):2400–2405. doi: 10.1073/pnas.1119112109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Glanzner WG, Rissi VB, de Macedo MP, Mujica LKS, Gutierrez K, Bridi A, de Souza JRM, Gonçalves PBD, Bordignon V. Histone 3 lysine 4, 9, and 27 demethylases expression profile in fertilized and cloned bovine and porcine embryos. Biol Reprod. 2018;98(6):742–751. doi: 10.1093/biolre/ioy054. [DOI] [PubMed] [Google Scholar]
- 62.Liu DT, Hong WS, Chen SX, Zhu Y. Upregulation of adamts9 by gonadotropin in preovulatory follicles of zebrafish. Mol Cell Endocrinol. 2020;499:110608. doi: 10.1016/j.mce.2019.110608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Carter NJ, Roach ZA, Byrnes MM, Zhu Y. Adamts9 is necessary for ovarian development in zebrafish. Gen Comp Endocrinol. 2019;277:130–140. doi: 10.1016/j.ygcen.2019.04.003. [DOI] [PubMed] [Google Scholar]
- 64.Fortune J, Willis E, Bridges P, Yang C. The periovulatory period in cattle: progesterone, prostaglandins, oxytocin and ADAMTS proteases. Anim Reprod. 2009;6(1):60. [PMC free article] [PubMed] [Google Scholar]
- 65.Yoshida GM, Yáñez JM. Multi-trait GWAS using imputed high-density genotypes from whole-genome sequencing identifies genes associated with body traits in Nile tilapia. BMC Genomics. 2021;22(1):1–13. doi: 10.1186/s12864-020-07341-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Li Y, Chen R, Zhou Q, Xu Z, Li C, Wang S, Mao A, Zhang X, He W, Shu H-B. LSm14A is a processing body-associated sensor of viral nucleic acids that initiates cellular antiviral response in the early phase of viral infection. Proc Natl Acad Sci. 2012;109(29):11770–11775. doi: 10.1073/pnas.1203405109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Onouchi Y. Genetics of Kawasaki Disease-What We Know and Don’t Know. Circ J. 2012;76(7):1581–6. doi: 10.1253/circj.CJ-12-0568. [DOI] [PubMed] [Google Scholar]
- 68.Liu M, Fang L, Liu S, Pan MG, Seroussi E, Cole JB, Ma L, Chen H, Liu GE. Array CGH-based detection of CNV regions and their potential association with reproduction and other economic traits in Holsteins. BMC Genomics. 2019;20(1):1–10. doi: 10.1186/s12864-018-5379-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Hou Y, Bickhart DM, Chung H, Hutchison JL, Norman HD, Connor EE, Liu GE. Analysis of copy number variations in Holstein cows identify potential mechanisms contributing to differences in residual feed intake. Funct Integr Genomics. 2012;12(4):717–723. doi: 10.1007/s10142-012-0295-y. [DOI] [PubMed] [Google Scholar]
- 70.Li B, VanRaden P, Null D, O'Connell J, Cole J. Major quantitative trait loci influencing milk production and conformation traits in Guernsey dairy cattle detected on Bos taurus autosome 19. J Dairy Sci. 2021;104(1):550–560. doi: 10.3168/jds.2020-18766. [DOI] [PubMed] [Google Scholar]
- 71.Hogan B, Costantini F, Lacy E. Manipulating the mouse embryo: a laboratory manual. 1986. [Google Scholar]
- 72.Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, Jordan T, Shakir K, Roazen D, Thibault J. From FastQ data to high-confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43(1):11.10.11–11.10.33. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164–e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, De Bakker PI, Daly MJ. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2(12):e190. doi: 10.1371/journal.pgen.0020190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Alexander DH, Lange K. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics. 2011;12(1):1–6. doi: 10.1186/1471-2105-12-246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Res. 2005;15(11):1566–1575. doi: 10.1101/gr.4252305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.DeGiorgio M, Huber CD, Hubisz MJ, Hellmann I, Nielsen R. SweepFinder2: increased sensitivity, robustness and flexibility. Bioinformatics. 2016;32(12):1895–1897. doi: 10.1093/bioinformatics/btw051. [DOI] [PubMed] [Google Scholar]
- 81.Ayres DL, Darling A, Zwickl DJ, Beerli P, Holder MT, Lewis PO, Huelsenbeck JP, Ronquist F, Swofford DL, Cummings MP. BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics. Syst Biol. 2012;61(1):170–173. doi: 10.1093/sysbio/syr100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Szpiech ZA, Hernandez RD. selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Mol Biol Evol. 2014;31(10):2824–2827. doi: 10.1093/molbev/msu211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Peripolli E, Reimer C, Ha N-T, Geibel J, Machado MA. Panetto JCdC, do Egito AA, Baldi F, Simianer H, da Silva MVGB: Genome-wide detection of signatures of selection in indicine and Brazilian locally adapted taurine cattle breeds using whole-genome re-sequencing data. BMC Genomics. 2020;21(1):1–16. doi: 10.1186/s12864-020-07035-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Illa SK, Mukherjee S, Nath S, Mukherjee A. Genome-wide scanning for signatures of selection revealed the putative genomic regions and candidate genes controlling milk composition and coat color traits in Sahiwal cattle. Front Genet. 2021;12:699422. doi: 10.3389/fgene.2021.699422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Ghoreishifar SM, Eriksson S, Johansson AM, Khansefid M, Moghaddaszadeh-Ahrabi S, Parna N, Davoudi P, Javanmard A. Signatures of selection reveal candidate genes involved in economic traits and cold acclimation in five Swedish cattle breeds. Genet Sel Evol. 2020;52(1):1–15. doi: 10.1186/s12711-020-00571-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Verity R, Collins C, Card DC, Schaal SM, Wang L, Lotterhos KE. minotaur: A platform for the analysis and visualization of multivariate results from genome scans with R Shiny. Mol Ecol Resour. 2017;17(1):33–43. doi: 10.1111/1755-0998.12579. [DOI] [PubMed] [Google Scholar]
- 87.Todorov V, Templ M, Filzmoser P. Detection of multivariate outliers in business survey data with incomplete information. Adv Data Anal Classif. 2011;5(1):37–56. doi: 10.1007/s11634-010-0075-2. [DOI] [Google Scholar]
- 88.Venables WN, Ripley BD. Modern applied statistics with S. 4th ed. New York: Springer; 2002.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequences are available from GenBank with the Bioproject accession number PRJNA698276 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA698276).