Abstract
Simple Summary
Whole-genome resequencing is an important tool to reveal the in-depth genomic characteristics of a genome. Adaptability traits are key to the survival of the south Chinese zebu cattle. However, the potential genetic information behind these remarkable traits still remains uncertain and needs to be addressed. In the current study, we utilized a total of 15 local south Chinese cattle samples (Leiqiong (LQ), Wannan (WN), Wenshan (WS)) from one of our previous studies mapped to the old reference genome (Btau_5.0.1) and remapped them to the latest reference genome (ARS-UCD1.2) to explore potential single nucleotide polymorphisms (SNPs) and insertions-deletions (InDels) responsible for some important immune related traits. The present study emphasizes and illustrates the genetic diversity, extending our previous study. The InDel annotation show that WS cattle had more enriched genes associated with immune functions than the other two breeds. Our findings provide valuable resources for further investigation of the functions of SNP- and InDels-related genes and help to determine the molecular basis of adaptive mutations in Chinese zebu cattle.
Abstract
We report genome characterization of three native Chinese cattle breeds discovering ~34.3 M SNPs and ~3.8 M InDels using whole genome resequencing. On average, 10.4 M SNPs were shared amongst the three cattle breeds, whereas, 3.0 M, 4.9 M and 5.8 M were specific to LQ, WN and WS breeds, respectively. Gene ontology (GO)analysis revealed four immune response-related GO terms were over represented in all samples, while two immune signaling pathways were significantly over-represented in WS cattle. Altogether, we found immune related genes (PGLYRP2, ROMO1, FYB2, CD46, TSC1) in the three cattle breeds. Our study provides insights into the genetic basis of Chinese indicine adaptation to the tropic and subtropical environment, and provides a valuable resource for further investigations of genetic characteristics of the three breeds.
Keywords: cattle, SNP, InDel, whole-genome resequencing
1. Introduction
The domestication of cattle was one of the most significant happenings around 10,000 years ago, assisting mankind with increasing meat, milk and leather supplies, throughout the world [1]. Domestic cattle adapted to various diverse environmental conditions during the natural selection, such as Brahman cattle adapted to harsh tropical conditions whereas Yakutian cattle adapted to the subarctic environment [2,3]. The genetic selection of cattle, e.g., Holstein and Beef master, led to the production of higher milk/meat than local cattle breeds [4,5]. At present, more than 800 cattle breeds have been identified in the world [6], and these cattle breeds constitute an important world heritage and a unique genome resources.
China has a vast territory, and harbors 53 domestic cattle breeds [7]. According to the previous studies [8,9,10], Chinese cattle can be geographically classified into three categories: the northern group distributed in the north of China (Bos taurus), the central group located in the middle and lower areas of the Yellow River and the Huaihe River (a mixture of Bos taurus and Bos indicus) and the southern group in the south of China (Bos indicus).
Considerable progress has been made through high-throughput sequencing to obtain cattle whole-genome sequences, which offer extremely promising approaches for screening the molecular targets of disease and resistance. Recently, a large number of genome-wide resequencing data for different cattle breeds have been published, including 11 indigenous Pakistani cattle breeds [11], Brahman cattle [12] and Indian cattle [13,14]. These studies have enriched the sequencing data of different breeds and also provided information for studying the genetic diversity of different domestic cattle breeds.
Until now, Chinese indicine cattle has been lacking behind in its detailed gene characterization. In the present study, we performed whole-genome resequencing of 15 individuals of three indigenous indicine breeds, including Wenshan cattle (WS, n = 7), Wannan cattle (WN, n = 5) and Leiqiong (LQ, n = 3) cattle. The aim of our study is to provide a valuable resource for further investigations of the genetic mechanisms underlying traits of interest in Chinese indicine cattle.
2. Materials and Methods
2.1. Whole-Genome Data
We used whole-genome data of three Chinese cattle breeds (n = 15) from our previous study (Table 1) [13]. DNA was extracted from the ear tissues of each individual using the standard phenol-chloroform method. Two libraries with insert sizes of 500 bp were constructed for each individual and sequenced using the HiSeq 2000 platform (Illumina, Beijing, China).
Table 1.
Breed | Sample ID | SRR ID | Total Reads | Aligned Reads Rate (%) | Duplication Rate (%) | Average Read Depth |
---|---|---|---|---|---|---|
WS | WS1 | SRR6024561 | 132599505 | 131966081 (99.52%) | 5.24% | 6.7367× |
WS2 | SRR6024562 | 210348316 | 208908938 (99.32%) | 7.57% | 10.9981× | |
WS3 | SRR6024569 | 213171893 | 209992479 (98.51%) | 7.56% | 11.0636× | |
WS4 | SRR6024575 | 220677796 | 219684659 (99.55%) | 7.58% | 10.6536× | |
WS5 | SRR6024576 | 185678168 | 184843226 (99.55%) | 6.60% | 9.3981× | |
WS6 | SRR6024577 | 220875686 | 219906218 (99.56%) | 7.17% | 11.5984× | |
WS7 | SRR6024578 | 124034762 | 123493668 (99.56%) | 5.40% | 6.5332× | |
WN | WN4 | SRR5507199 | 453240314 | 443641290 (97.88%) | 10.12% | 22.9631× |
WN8 | SRR5507198 | 453327732 | 448362891 (98.9%) | 9.91% | 23.1603× | |
WN9 | SRR5507195 | 189321051 | 187411280 (98.99%) | 5.63% | 9.6483× | |
WN10 | SRR5507196 | 203146450 | 201321637 (99.1%) | 6.00% | 10.1569× | |
WN11 | SRR5507197 | 229615463 | 228096174 (99.34%) | 6.33% | 11.7816× | |
LQ | LQ5 | SRR5507190 | 229615463 | 228096174 (99.34%) | 6.33% | 11.5007× |
LQ12 | SRR5507189 | 219526027 | 217652138 (99.15%) | 6.14% | 11.1222× | |
LQ15 | SRR5507188 | 208719494 | 207367555 (99.35%) | 5.91% | 10.6481× |
2.2. Read Mapping and SNP Calling
The reads were mapped to the latest reference genome (ARS-UCD1.2, Bos taurus, Breed Hereford) using BWA-mem [15], according to the default parameters. The Genome Analysis Toolkit (GATK, version 3.8) was further employed for SNP calling, followed by mark Duplicate by Picard [16]. GATK, “variant Filtration” was implemented for all SNPs as follows: (1) variant confidence/quality by depth (QD) <2; (2) RMS mapping quality (MQ) >40.0; (3) Phred-scaled p-value using Fisher’s exact test to detect strand bias (FS) <60; (4) z-score from the Wilcoxon rank sum test of Alt vs. Ref read position bias (ReadPosRankSum) >−8; (5) z-score form the Wilcoxon rank sum test of Alt vs. Ref read mapping qualities (MQRankSum) >12.5; and (6) variants with SOR (symmetric odds ratio of 2 × 2 contingency table to detect strand bias) >3.0.
2.3. Identification of InDels
For all the 15 cattle samples, the InDels were extracted by GATK [16] in the “variant Filtration” within 1 bp to 30 bp window with the following parameters: (1) variant confidence/quality by depth (QD) <2; (2) phred-scaled p-value using Fisher’s exact test to detect strand bias (FS) >200; (3) z-score from the Wilcoxon rank sum test of Alt vs. Ref read position bias (ReadPosRankSum) <−20; (4) Likelihood-based test for the consanguinity among samples (InbreedingCoeff) <−0.8.
2.4. Variant Functional Annotation and GO Enrichment
The SNP/InDels were classified as HIGH, MODERATE, LOW and MODIFIER as per their functionality using SnpEff [17]. Only HIGH and MODERATE impact SNPs/InDels were kept for further analysis, including SNPs with (HIGH: protein truncation or triggering loss/gain of function; MODERATE: missense variant and splice variant that could change protein effectiveness); whereas the InDels include (HIGH: frameshift variant or splice donor variant; MODERATE: disruptive inframe insertion) [11]. For all the individuals, the variants were filtered as >5 SNPs/gene, whereas >5 InDels/gene were identified per breed [3]. The SnpEff functional class vocabulary assigned to both SNPs and InDels were Untranslated Region(UTR) variant include 3 prime UTR and 5 prime UTR, Downstream and upstream gene variant, intergenic region, intragenic, intron, non-coding transcript exon and non-coding transcript variant, splice variant (splice acceptor, splice donor and splice region), start lost, stop gained, stop lost and stop retained. Functional classes exclusively used for InDels were conserved in-frame deletion and insertion, disruptive in-frame deletion and insertion, bidirectional gene fusion, frameshift, conservative-disruptive inframe InDel, whereas missense, initiator codon and synonymous were only assigned to SNPs.
The Gene Ontology online tool (http://geneontology.org/) was used to identify over-represented biological process using the filtered variants (Bos taurus reference genome build 9913, Released 2019-4-17) [18,19]. Fisher’s exact test with calculate false discovery rate testing was executed and a p-value of less than 0.05 was chosen as an inclusion criterion for functional categories.
3. Results
3.1. Read Alignment
Three breeds (LQ, n = 3; WN, n = 5; WS, n = 7) were distributed in Haikou city, Guangdong province; Jinde county, Anhui province; and Guangnan county, Yunnan province from southern China. The average genome coverage to the reference genome was 99.17% (ranging from 97.88% to 99.56%) with an elevated in-depth mapping coverage of 11.86 folds (ranging from 6.53 to 21.36) and 6.9% duplication rate.
3.2. Identification of SNPs and InDels
After quality filtering, ~34.3 million SNPs were identified across all 15 samples, in relation to the latest taurine reference genome (GCF_002263795.1) (Table S1). The SNPs density was also detected to be approximately 13791 SNPs per million bp (MB) in all samples, whereas 7703, 9275 and 9412 SNPs/MB were calculated in LQ, WN, WS, respectively. With the transition (Ts) versus transversion (Tv) ratio ranges from 2.375 to 2.390 (Tables S1 and S2). Compared with NCBI dbSNP bovine, a total of 9.33 million novel SNPs was identified, whereas, approximately 14.8, 17.8 and 18.2 million SNPs were previously identified and annotated in LQ, WN and WS cattle (Table 2, Figure S1A). The number of InDels (mostly ≤3 bp) was 2,153,542; 2,586,758; and 2,471,063 in LQ, WN and WS cattle, respectively (Figure 1, Table S3). Among all InDels, 2,294,379 (59.82%) are deletions, whereas 1,453,791 (58.83%),; 1,508,944 (58.33%); and 1,257,000 (58.37%) are breedwise deletions of LQ, WN and WS cattle, respectively (Figure S1B, Table S1).
Table 2.
Fields | LQ | WN | WS |
---|---|---|---|
Total number | 19,178,051 | 23,091,150 | 23,431,130 |
3 prime UTR | 390,813 | 481,642 | 500,217 |
5 prime UTR | 110,542 | 137,007 | 146,969 |
Downstream gene | 4,673,996 | 5,721,422 | 5,884,594 |
Initiator codon | 25 | 31 | 39 |
Intergenic region | 10,333,593 | 12,409,887 | 12,561,984 |
Intragenic | 13,622,957 | 16,715,251 | 17,111,225 |
Intron | 85,061,834 | 100,821,955 | 105,226,537 |
Missense | 138,168 | 173,641 | 212,294 |
Non coding transcript exon | 172,858 | 212,090 | 223,022 |
Non coding transcript | 32,436,546 | 39,244,542 | 39,287,003 |
Splice acceptor | 717 | 900 | 887 |
Splice donor | 857 | 1152 | 1129 |
Splice region | 59,437 | 74,604 | 79,809 |
Start lost | 246 | 273 | 329 |
Stop gained | 1108 | 1253 | 1476 |
Stop lost | 211 | 273 | 263 |
Stop retained | 221 | 246 | 258 |
Synonymous | 278,880 | 348,230 | 528,134 |
Upstream gene | 4,538,595 | 5,552,247 | 5,651,124 |
Novel | 4,354,976 | 5,288,934 | 5,226,594 |
Known | 14,823,074 | 17,802,215 | 18,204,535 |
In this study, 10,446,053 SNPs were shared amongst the three breeds, whereas, 3,046,955 (15.89%), 4861786 (21.05%) and 5,773,919 (24.64%) were specific to LQ, WN and WS cattle. Furthermore, 1,065,517 InDels were shared among all samples, whereas, 432,710; 662,590; and 668,308 number of InDels were private to LQ, WN and WS breeds, respectively (Figure 2).
3.3. Functional Annotation of SNPs and InDels
Functional classes of the SNPs identified in this study are shown in Table 2. The numbers of functionally annotated SNPs were slightly higher than those of the detected SNPs, the reason for which lies with the possible presence of multiple annotations to a single SNPs. As expected, most of the SNPs discovered in this study map to intergenic 24,610,698 (9.01%) and intronic regions 151,807,389 (55.61%) and are potentially neutral; 8,664,911 (3.17%) and 8,403,001 (3.08%) SNPs are positioned in a 1000 bp downstream and upstream regions from the genes set, respectively. These SNPs classified as HIGH and MODERATE impact, include 300,763 (0.11%) missense mutation and 119,959 (0.04%) splice mutation, whereas many other SNPs, such as stop-gain, start-loss and stop-loss (2284, 465 and 400, respectively) were also detected in the current study.
Functional annotation depicted all samples, the location of 2,037,409 (6.72%) InDels in intergenic and 16,689,877 (55.07%) in intronic regions, a large number of InDels exist in untranslated regions (103,089 InDels in 3 prime UTR and 19,146 InDel in 5 prime UTR). A total of 4018 and 2688 InDels were annotated as Disruptive and Conservative inframe InDels respectively. Additionally, 12,894 InDels induced frameshift mutations, 14655 InDels affected splice-sites (splice regions, splice donor and splice acceptor variants), and two InDels resulted in premature stop codons. At the breed level, most variants were identified in intergenic region, intragenic region and intron, while a small number of variants had a bearing on protein translation, such as, disruptive inframe InDel, split variants and frameshift variants (Table 3).
Table 3.
Fields | LQ | WN | WS |
---|---|---|---|
Total number | 2,153,542 | 2,586,758 | 2,471,063 |
3 prime UTR | 54,892 | 67,465 | 65,837 |
5 prime UTR | 9904 | 12,229 | 12,421 |
Bidirectional gene fusion | 80 | 83 | 139 |
Conservative inframe deletion | 561 | 814 | 1021 |
Conservative inframe insertion | 434 | 546 | 781 |
Disruptive inframe deletion | 1039 | 1292 | 1735 |
Disruptive inframe insertion | 483 | 583 | 955 |
Downstream gene | 585,215 | 719,779 | 693,994 |
Frameshift variant | 4168 | 5516 | 9559 |
Gene fusion | 198 | 178 | 232 |
Intergenic region | 1152,941 | 1,382,145 | 1,318,068 |
Intragenic | 1,485,117 | 1,817,239 | 1,775,709 |
Intron | 9,407,833 | 11,108,642 | 10,992,364 |
Non coding transcript exon | 17,425 | 21,355 | 21,201 |
Non coding transcript | 3,688,414 | 4,441,858 | 4,170,309 |
Splice acceptor | 384 | 451 | 471 |
Splice donor | 368 | 412 | 477 |
Splice region | 6396 | 7787 | 8542 |
Start lost | 58 | 64 | 54 |
Stop gained | 57 | 73 | 131 |
Stop lost | 32 | 73 | 61 |
Upstream gene | 54,6024 | 666,968 | 642,273 |
3.4. GO Analysis of the SNPs and InDels
The GO enrichment analysis of 1260 filtered genes identified 58 significant GO terms in all 15 cattle (Table 4, Table S4). The GO enrichment analysis revealed four GO categories associated with sensory perception processes (GO:0007608~sensory perception of smell; GO:0050911~detection of chemical stimulus involved in sensory perception of smell; GO:0007606~sensory perception of chemical stimulus; GO:0050907~detection of chemical stimulus involved in sensory perception). Five GO biological processes related to metabolic process (GO:0006956~complement activation; GO:0006959~humoral immune response; GO:0006958~complement activation, classical pathway; GO:0002253~activation of immune response) were found. Four immune responses related GO terms were also identified (GO:0006956~complement activation; GO:0006959~humoral immune response; GO:0006958~complement activation, classical pathway; GO:0002253~activation of immune response).
Table 4.
GO Biological Process Complete | Count | Fold Enrichment | FDR |
---|---|---|---|
Smell | |||
Sensory perception of smell (GO:0007608) | 104 | 3.05 | 2.78 × 10−18 |
Detection of chemical stimulus involved in sensory perception of smell (GO:0050911) | 101 | 3.03 | 5.15 × 10−18 |
Sensory perception of chemical stimulus (GO:0007606) | 104 | 2.96 | 4.27 × 10−18 |
Detection of chemical stimulus involved in sensory perception (GO:0050907) | 101 | 2.99 | 6.22 × 10−18 |
Immune responses | |||
Complement activation (GO:0006956) | 9 | 7.5 | 3.92 × 10−3 |
Humoral immune response (GO:0006959) | 11 | 4.72 | 1.62 × 10−2 |
Complement activation, classical pathway (GO:0006958) | 6 | 9.21 | 3.29 × 10−2 |
Activation of immune response (GO:0002253) | 16 | 3.01 | 4.58 × 10−2 |
Metabolic process | |||
Metabolic process (GO:0008152) | 114 | 0.57 | 2.55 × 10−10 |
Organic substance metabolic process (GO:0071704) | 98 | 0.56 | 2.16 × 10−9 |
Cellular metabolic process (GO:0044237) | 87 | 0.54 | 2.92 × 10−9 |
Primary metabolic process (GO:0044238) | 95 | 0.57 | 5.67 × 10−8 |
Organonitrogen compound metabolic process (GO:1901564) | 64 | 0.55 | 4.18 × 10−5 |
GO enrichment analysis of 1207, 1493, and 1937 filtered genes associated with frameshift InDel in LQ, WN and WS cattle, respectively. The GO enrichment analysis revealed16, 24, and 19 GO terms to be associated with biological processes in LQ, WN and WS cattle, respectively (Tables S5–S8). The GO enrichment analysis showed 14 significantly enriched GO terms, shared by the three cattle breeds. Three noticeably enriched GO terms (GO:0023052~signaling; GO:0007165~signal transduction; GO:0007154~cell communication) were shared by WS and WN (Table S8). Amongst three breeds, two GO terms related to immune response (GO:0002252~immune effector process; GO:0006956~complement activation) were enriched in WS cattle, alone. On the other hand, in the WN breed, a large number of genes were significantly associated with metabolic process functions, such as GO:0051171~regulation of nitrogen compound metabolic process; GO:0080090~regulation of primary metabolic process; GO:0019222~regulation of metabolic process; GO:0060255~regulation of macromolecule metabolic process; GO:0031323~regulation of cellular metabolic process; GO:0008152~metabolic process; GO:0010605~negative regulation of macromolecule metabolic process. In LQ cattle, two of the GO terms associated with L-lysine transport were enriched.
4. Discussion
We herein carefully examined the whole-genome sequences of three aboriginal breeds in China. The TS/TV ratio could evaluate the value of the resequencing error, which was used to assess the quality of the SNPs. In our study, TS/TV ratio ranges from 2.375 to 2.39, which is similar to previous studies (Tables S1 and S2) [20,21]. The homozygous and heterozygous SNP ratio in each breed indicate the normalization of the population structure.
The three breeds we studied are native to southern China regions with a subtropical climate. Compared with Bos taurus, the indicine has the presence of a hump, loose skin and shorter and thinner hair, and they all have the characteristics of adapt to the hot climate and resisting diseases [22]. Breed-shared SNP could possibly be helpful in further research common function or phenotype of Bos indicus. From all 29 autosomes, we identified more than 34.3 million SNPs and ~3.84 million InDels (Table 2 and Table 3), of which approximately 30.43% and 27.78% shared in all 15 cattle. We also found that 3.05 million (15.89%) in LQ, 4.86 million (21.05%) in WN and 5,773,919 (24.64%) in WS were private SNPs in our SNPs set (Figure 2). These breed specific SNPs provided conditions for breed characterization of the further research.
In our study, WS has the greatest number of SNPs, and WN has the greatest number of InDels. Interestingly, LQ has the minimum number of in both SNPs and InDels (Table 2 and Table 3). The lengths of the InDels ranged from −30 bp (deletion) to 30 bp (insertion). However, the small InDels (≥3 bp) account for 83.07% of the total InDels, which is comparable to the previous results [11,21].
Our survey of three geographical distinct indicine cattle breeds (LQ, WS, WN) showed that each of them has similar characteristic. GO analysis revealed that a lot of immune-related gene were shared by all samples. Among them, the PGLYRP2 gene is an important gene involved in bacterial infection immune response [23]. Studies have shown that this gene is related to somatic cells count in milk [24] and immune response to advantageous and harmful gut bacteria [25]. These results suggest that the PGLYRP2 gene may be associated with bovine gut bacteria and milk quality. ROMO1 gene encodes a mitochondrial membrane protein that has the effect on increasing intracellular reactive oxygen species [26]. Recent studies have shown that the ROMO1 gene product are highly expressed in cancer cells and triggers sustained inflammatory response [27]. In addition, PTNR22 are associated with immune diseases [28,29]. STK11 gene is a tumor suppressor gene [30]. FYB2 gene, also known as ARAP gene, encodes a T cell adaptor protein mediating cell adhesion [31]. These immunity-related genes were shared between three breeds of cattle. In the present study, three breeds enriched by common immune enrichment pathway, suggests that Chinese indicine may have some common mechanisms towards adaption to the environment in Southern China. In InDels, the immune response-related genes were only enriched in WS cattle. Among them, CD46, a type 1 transmembrane glycoprotein, whose main function is to regulate complement activation [32]. We found 55 InDels related to the CD46 gene in WS cattle, suggesting that WS cattle may have well antiviral infection characteristics [33,34]. EMP2 has been shown to promote angiogenesis in vitro and in vivo [35]. TSC1 gene encodes the growth inhibitory protein hamartin and increase gene expression contributing to cardiovascular health [36]. These genetic mutations may be important factors in WS cattle adaption to the local environment. In addition, we identified some InDels related to metabolism in WN cattle, and other genes related to lysine transport in the LQ breed.
5. Conclusions
Our study used resequencing data from three cattle breeds to provide detailed genomic information, including SNPs and InDels. Amongst them, WS cattle contained the greatest number of SNPs, which might have resulted in parallel to the maximum number of WS cattle in the current study. However, WN cattle dominated in terms of InDels. The breed-specific genetic variants are crucial for maintaining herds’ genetic diversity and development of its breeding strategies. In our present study, a total of ~3.04 M, ~4.86 M and ~5.77 M SNPs were identified specific to LQ, WN and WS cattle, while, the number of InDels ranged from ~0.432 M, ~0.662 M and ~0.668 M respectively. The Gene Ontology of SNPs enriched immune pathways, revealing PGLYRP2, ROMO1, PTNR22, STK11 and FYB2 genes. InDels on the other hand, depicted over representation of immune related GO terms in WS, while L-lysine transport and the metabolism showed over-representation only in LQ and WN respectively. The current study unveils the genetic characteristics of three important southern Chinese zebu cattle breeds, providing the genome resources for further study.
Acknowledgments
We would like to thank Yu Jiang for his good suggestions and technical support.
Supplementary Materials
The following are available online at https://www.mdpi.com/2076-2615/9/9/596/s1: Figure S1: Total/novel SNPs count and deletions/insertions count in three breeds of Chinese indicine. (a) The identified novel and known SNPs for each breed. (b) The identified deletions and insertions for each breed. Table S1: Summary of SNP and InDel Calling. Table S2: Summary of identified SNPs for individual samples. Table S3: summary of InDels length in three cattle breeds. Table S4: GO enrichment result for the genes containing SNPs >5 from HIGH and MODERATE data set in all cattle. Significantly enriched GO-terms are presented. Table S5: GO enrichment result for the genes containing >5 from frameshift InDels set in Leiqiong cattle. Significantly enriched GO-terms are presented. Table S6: GO enrichment result for the genes containing InDel >5 from frameshift InDels set in Wannan cattle. Significantly enriched GO-terms are presented. Table S7: GO enrichment result for the genes containing InDel >5 from frameshift InDels set in Wenshan cattle. Significantly enriched GO-terms are presented. Table S8: Gene Ontology (GO) reveals high or moderate effects >5 InDels, specific term for each breed or overlapping term between any two breeds.
Author Contributions
Conceptualization, F.Z. and C.L.; methodology, N.C.; software, Y.H.; validation, B.H.; formal analysis, N.C.; investigation, K.Q.; resources, K.Q.; data curation, J.Z.; writing-original draft preparation, F.Z.; writing-review and editing, H.C. and Q.H.; visualization, X.L.; supervision, F.Z. and Y.J.; project administration, R.D. and C.L.; funding acquisition, B.H. and C.L.
Funding
This work was supported by Natural Science Foundation of China (No. 31872317), the Program of National Beef Cattle and Yak Industrial Technology System (No. CARS-37), the Program of Yunling Scholar and the Young and Middle-aged Academic Technology Leader Backup Talent Cultivation Program in Yunnan Province, China (No. 2018HB045).
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Daetwyler H.D., Capitan A., Pausch H., Stothard P., Van Binsbergen R., Brondum R.F., Liao X., Djari A., Rodriguez S., Grohs C. Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nat. Genet. 2014;46:858–865. doi: 10.1038/ng.3034. [DOI] [PubMed] [Google Scholar]
- 2.Dikmen S., Mateescu R.G., Elzo M.A., Hansen P.J. Determination of the optimum contribution of Brahman genetics in an Angus-Brahman multibreed herd for regulation of body temperature during hot weather. J. Animal Sci. 2018;96:2175–2183. doi: 10.1093/jas/sky133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Weldenegodguad M., Popov R., Pokharel K., Ammosov I., Ming Y., Ivanova Z., Kantanen J. Whole-Genome Sequencing of Three Native Cattle Breeds Originating from the Northernmost Cattle Farming Regions. Front. Genet. 2019;9:390369. doi: 10.3389/fgene.2018.00728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ma L., Sonstegard T.S., Cole J.B., Vantassell C.P., Wiggans G.R., Crooker B.A., Tan C., Prakapenka D., Liu G.E., Da Y. Genome changes due to artificial selection in U.S. Holstein cattle. BMC Genomics. 2019;20:128. doi: 10.1186/s12864-019-5459-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Xu Y., Jiang Y., Shi T., Cai H., Lan X., Zhao X., Plath M., Chen H. Whole-genome sequencing reveals mutational landscape underlying phenotypic differences between two widespread Chinese cattle breeds. PLoS ONE. 2017;12:e0183921. doi: 10.1371/journal.pone.0183921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Liao X., Peng F., Forni S., McLaren D., Plastow G., Stotharda P. Whole genome sequencing of Gir cattle for identifying polymorphisms and loci under selection. Genome. 2013;56:592–598. doi: 10.1139/gen-2013-0082. [DOI] [PubMed] [Google Scholar]
- 7.Zhang Y. Animal Genetic Resources in China-Bovines. China Agriculture Press; Beijing, China: 2011. (In Chinese) [Google Scholar]
- 8.Li R., Zhang X.M., Campana M.G., Huang J.P., Chang Z.H., Qi X., Shi H., Su B., Zhang R.F., Lan X. Paternal origins of Chinese cattle. Anim. Genet. 2013;44:446–449. doi: 10.1111/age.12022. [DOI] [PubMed] [Google Scholar]
- 9.Zhang R., Cheng M., Li X., Chen F., Zheng J., Wang X., Meng Q. Y-SNPs Haplotype Diversity in Four Chinese Cattle Breeds. Anim. Biotechnol. 2013;24:288–292. doi: 10.1080/10495398.2013.800531. [DOI] [PubMed] [Google Scholar]
- 10.Gao Y., Gautier M., Ding X., Zhang H., Wang Y., Wang X., Faruque O., Li J., Ye S., Gou X. Species composition and environmental adaptation of indigenous Chinese cattle. Sci. Rep. 2017;7:16196. doi: 10.1038/s41598-017-16438-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Iqbal N., Liu X., Yang T., Huang Z., Hanif Q., Asif M., Khan Q.M., Mansoor S. Genomic variants identified from whole-genome resequencing of indicine cattle breeds from Pakistan. PLoS ONE. 2019;14:e0215065. doi: 10.1371/journal.pone.0215065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Khalkhali-Evrigh R., Hafezian S.H., Hedayat-Evrigh N., Farhadi A., Bakhtiarizadeh M.R. Genetic variants analysis of three dromedary camels using whole genome sequencing data. PLoS ONE. 2018;13:e0204028. doi: 10.1371/journal.pone.0204028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chen N., Cai Y., Chen Q., Li R., Wang K., Huang Y., Hu S., Huang S., Zhang H., Zheng Z. Whole-genome resequencing reveals world-wide ancestry and adaptive introgression events of domesticated cattle in East Asia. Nat. Commun. 2018;9:2337. doi: 10.1038/s41467-018-04737-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Xu L., Liu Y., Bickhart D.M., Li J., Liu G.E. Analysis of Population-Genetic Properties of Copy Number Variations. Copy Number Variants. 2018:179–186. doi: 10.1007/978-1-4939-8666-8_14. [DOI] [PubMed] [Google Scholar]
- 15.Li H., Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mckenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A.M., Garimella K., Altshuler D., Gabriel S.B., Daly M.J. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cingolani P., Platts A.E., Wang L.L., Coon M., Nguyen T., Wang L., Land S., Lu X., Ruden D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.The Gene Ontology Consortium The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. 2019;47:D330–D338. doi: 10.1093/nar/gky1055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Choi J.-W., Liao X., Park S., Jeon H.-J., Chung W.-H., Stothard P., Park Y.-S., Lee J.-K., Lee K.-T., Kim S.-H., et al. Massively parallel sequencing of Chikso (Korean brindle cattle) to discover genome-wide SNPs and InDels. Mol. Cells. 2013;36:203–211. doi: 10.1007/s10059-013-2347-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Choi J.-W., Choi B.-H., Lee S.-H., Lee S.-S., Kim H.-C., Yu D., Chung W.-H., Lee K.-T., Chai H.-H., Cho Y.-M., et al. Whole-Genome Resequencing Analysis of Hanwoo and Yanbian Cattle to Identify Genome-Wide SNPs and Signatures of Selection. Mol. Cells. 2015;38:466–473. doi: 10.14348/molcells.2015.0019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cardoso C.C., Peripolli V., Amador S., Brandao E.G., Esteves G.I.F., Sousa C.M.Z., Franca M.F.M.S., Goncalves F.G., Barbosa F.A., Montalvao T.C. Physiological and thermographic response to heat stress in zebu cattle. Livest Sci. 2015;182:83–92. doi: 10.1016/j.livsci.2015.10.022. [DOI] [Google Scholar]
- 23.Li X., Wang S., Wang H., Gupta D. Differential expression of peptidoglycan recognition protein 2 in the skin and liver requires different transcription factors. J. Biol. Chem. 2006;281:20738–20748. doi: 10.1074/jbc.M601017200. [DOI] [PubMed] [Google Scholar]
- 24.Wang H.L., Li Z., Chen L., Yang J., Wang L.J., He H., Niu F.B., Liu Y., Guo J.Z., Liu X. Polymorphism in PGLYRP-2 gene by PCR-RFLP and its association with somatic cell score and percentage of fat in Chinese Holstein. Genet. Mol. Res. 2013;12:6743–6751. doi: 10.4238/2013.December.13.7. [DOI] [PubMed] [Google Scholar]
- 25.Royet J., Gupta D., Dziarski R. Peptidoglycan recognition proteins: Modulators of the microbiome and inflammation. Nat. Rev. Immunol. 2011;11:837–851. doi: 10.1038/nri3089. [DOI] [PubMed] [Google Scholar]
- 26.Kim J., Lee S.B., Park J.K., Yoo Y.D. TNF-alpha-induced ROS production triggering apoptosis is directly linked to Romo1 and Bcl-X(L) Cell Death Differ. 2010;17:1420–1434. doi: 10.1038/cdd.2010.19. [DOI] [PubMed] [Google Scholar]
- 27.Kim H.J., Jo M.J., Kim B.R., Kim J.L., Jeong Y.A., Na Y.J., Park S.H., Lee S.Y., Lee D.H., Kim B. Overexpression of Romo1 is an unfavorable prognostic biomarker and a predictor of lymphatic metastasis in non-small cell lung cancer patients. OncoTargets Ther. 2018;11:4233–4246. doi: 10.2147/OTT.S161587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gong L., Liu B., Wang J., Pan H., Qi A., Zhang S., Wu J., Yang P., Wang B. Novel missense mutation in PTPN22 in a Chinese pedigree with Hashimoto’s thyroiditis. BMC Endocr. Disord. 2018;18:76. doi: 10.1186/s12902-018-0305-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kyogoku C., Langefeld C.D., Ortmann W., Lee A.T., Selby S.A., Carlton V.E.H., Chang M., Ramos P.S., Baechler E.C., Batliwalla F. Genetic association of the R620W polymorphism of protein tyrosine phosphatase PTPN22 with human SLE. Am. J. Hum. Genet. 2004;75:504–507. doi: 10.1086/423790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Li Y., Hu S., Wang J., Chen S., Jia X., Lai S. Molecular cloning, polymorphism, and expression analysis of the LKB1/STK11 gene and its association with non-specific digestive disorder in rabbits. Mol. Cell. Biochem. 2018;449:127–136. doi: 10.1007/s11010-018-3349-1. [DOI] [PubMed] [Google Scholar]
- 31.Jung S.H., Yoo E.H., Yu M.J., Song H.M., Kang H.Y., Cho J., Lee J.R. ARAP, a Novel Adaptor Protein, Is Required for TCR Signaling and Integrin-Mediated Adhesion. J. Immunol. 2016;197:942–952. doi: 10.4049/jimmunol.1501913. [DOI] [PubMed] [Google Scholar]
- 32.Alzamel N., Bayrou C., Decreux A., Desmecht D. Soluble forms of CD46 are detected in Bos taurus plasma and neutralize BVDV, the bovine pestivirus. Comp. Immunol. Microb. 2016;49:39–46. doi: 10.1016/j.cimid.2016.09.001. [DOI] [PubMed] [Google Scholar]
- 33.Krey T., Himmelreich A., Heimann M., Menge C. Function of Bovine CD46 as a Cellular Receptor for Bovine Viral Diarrhea Virus Is Determined by Complement Control Protein 1. J. Virol. 2006;80:3912–3922. doi: 10.1128/JVI.80.8.3912-3922.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wang X., Zhong J., Gao Y., Ju Z., Huang J. A SNP in intron 8 of CD46 causes a novel transcript associated with mastitis in Holsteins. BMC Genomics. 2014;15:630. doi: 10.1186/1471-2164-15-630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Qin Y., Takahashi M., Sheets K., Soto H., Tsui J., Pelargos P., Antonios J.P., Kasahara N., Yang I., Prins R.M. Epithelial membrane protein-2 (EMP2) promotes angiogenesis in glioblastoma multiforme. J. Neuro-Oncol. 2017;134:1–12. doi: 10.1007/s11060-017-2507-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhang H.M., Diaz V., Walsh M.E., Zhang Y. Moderate lifelong overexpression of tuberous sclerosis complex 1 (TSC1) improves health and survival in mice. Sci. Rep. 2017;7:834. doi: 10.1038/s41598-017-00970-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.