Abstract
Zhoushan cattle are an endangered cattle breed in the Zhoushan islands in China. Since Zhoushan cattle have been bred in isolation, they show unique characteristics, such as dark black coat colour. However, no studies have been conducted on the genome of Zhoushan cattle. Here, we performed whole-genome sequencing of seven individuals of Zhoushan cattle and nine cattle in Wenling, geographically close to the Zhoushan islands. By integrating our data and publicly-available data, we found that Zhoushan cattle are genetically highly similar to Bos indicus cattle in south-eastern China. Furthermore, by identifying the genomic regions shared between Zhoushan cattle and Angus cattle, a Bos taurus breed, we found that the p.F195L mutation in melanocyte-stimulating hormone receptor (MC1R) could be associated with their dark black coat colour. Taken together, our results provide a valuable resource for characterising the uniqueness of Zhoushan cattle.
Subject terms: Evolutionary biology, Phylogeny, Animal breeding, Evolutionary genetics, Phylogenetics, Population genetics, Conservation biology
Introduction
Chinese cattle are distributed throughout the country and are among the most critical components of world cattle due to their large number and abundant breed types1–3. Chinese cattle have long been used as draught animals, enhancing their pronounced merits in parasite resistance, utilisation of roughage-based diets, and tolerance to environmental challenges. Since cattle with a yellow coat colour are the overall majority in China, Chinese cattle have long been collectively called yellow cattle4. Likewise, black cattle are the minority.
Zhoushan cattle are a unique Chinese cattle breed that has been bred in the Zhoushan islands5. The Zhoushan islands are in the East China Sea, which is physically isolated away from the Chinese mainland by the sea. The Chinese mainland areas near the Zhoushan islands include Wenling and Shanghai. Zhoushan cattle have been bred on the Zhoushan islands for at least 300 years1,5. Zhoushan cattle show a dark black coat colour, curving and twisted horns, and large body size2,5. Zhoushan cattle have been widely used to cultivate paddy fields, but recently they have been replaced by mechanical cultivation and are in danger of extinction5. The population size was estimated to be approximately 5000 in 1980, and now the number has decreased further2.
Modern domesticated cattle consist of two major lineages, Bos indicus and Bos taurus6. These cattle are derived from independent domestications of the same progenitor species, Bos primigenius6. There are distinct differences in physical characteristics between these two lineages. B. taurus is a breed of cattle that originated from cattle in Europe6. The body of B. taurus is smooth with no protrusions and has tightened skin. Some typical breeds of B. taurus, such as Black Angus and Kobe cattle have a dark black coat. Although some non-black Angus breeds are known, all Angus breeds are derived from the cattle breed with black coat colour. Conversely, B. indicus is a breed of cattle that originated from cattle in South Asia3,7. B. indicus shows distinctive physical characteristics, such as a hump on the back, excess skin across the entire ventral midline, especially around the neck (throatlatch), chest (dewlap), and navel, larger ears, and shorter coat hair to cope with warmer climates than Europe8. These traits are not prominent in B. taurus. It was suggested that B. taurus was brought into East Asia from West Asia during the late Neolithic period, and B. indicus later was imported from India to East Asia9,10. Recent genome-wide analysis of Chinese cattle revealed that B. taurus is mainly located in east China and that B. indicus is primarily located in south China3. In north-central China, there are various hybrids between B. taurus and B. indicus3.
The origin of Zhoushan cattle is poorly understood because there are few records of their origin. Since Zhoushan cattle has not been genetically studied, the mainland cattle genetically closest to Zhoushan cattle are still unknown. In particular, it is fascinating to investigate whether the breed that is genetically closest to Zhoushan cattle is B. taurus or B. indicus or a hybrid thereof. Furthermore, Zhoushan cattle look like both B. taurus and B. indicus; considering that Zhoushan cattle have a hump on the back, some people propose that Zhoushan cattle are B. indicus5. Others suggest that Zhoushan cattle are B. taurus because the dewlap is less prominent, they show the dark black coat colour similar to Kobe cattle in Japan, and the Zhoushan islands are geographically close to Japan across the sea. If we identify the breed that is genetically closest to Zhoushan cattle, we could infer their origin. However, thus far, there has been no research on Zhoushan cattle's genome. Here, we performed whole-genome sequencing of seven individuals of Zhoushan cattle and nine Wenling cattle individuals, which are a local breed in Wenling and show typical characteristics of B. indicus. Wenling is a region of the Chinese mainland located south of Zhoushan islands and close to Zhoushan islands. Combined with the publicly-available genomic data of other cattle breeds, we performed population analyses with Zhoushan cattle and other domesticated cattle.
Results
Whole-genome sequencing of Zhoushan cattle and Wenling cattle populations
We collected seven individuals of Zhoushan cattle (Fig. 1a, upper panel). We also collected nine individuals of Wenling cattle (Fig. 1a, lower panel). Wenling cattle have a prominent hump on the back, dewlap, and larger ears, suggesting that its genetic background is largely B. indicus (Fig. 1a, lower panel). We performed whole-genome sequencing of these samples. To resolve their phylogenetic positions and interrelationships within domesticated cattle, we combined our data of 16 cattle individuals with publicly-available whole-genome sequencing data of five individuals from the Angus breed, a typical B. taurus in Europe, and 33 individuals from nine breeds with genetic backgrounds similar to B. indicus3, giving a total of 54 individuals (Fig. 1b, c; Table S1). We performed read trimming and aligned the trimmed reads to the UOA_Brahman_1 assembly of the cattle genome11. This assembly represents the maternal haplotype of an F1 hybrid of Brahman cattle (dam) and Angus (sire)11. After variant calling and filtering, we identified 32,970,327 single-nucleotide polymorphisms (SNPs) and 3,331,322 small indels. Based on this genomic variant information, we conducted the population genomic analyses.
Genetic relationship between Zhoushan cattle and other domesticated cattle
To reveal the phylogenetic positions and interrelationships of Zhoushan and other domesticated cattle, we performed population genomic analyses on 54 cattle individuals. First, we calculated the pairwise evolutionary distance between individuals and generated a neighbour-joining (NJ) tree to reconstruct the phylogenetic relationships between individuals of Zhoushan and other domesticated cattle (Fig. 1d). In the NJ tree, cattle clustered consistently with their geographical location (Fig. 1d). Angus individuals formed a sister group to all other individuals, including Zhoushan cattle, Wenling cattle, and other B. indicus (Fig. 1d). The individuals of Zhoushan and Wenling cattle formed monophyletic groups and were sisters to each other (Fig. 1d). The cattle in Guangfeng formed another monophyletic group and were sisters to both Zhoushan and Wenling cattle (Fig. 1d). Cattle in Wannan, Ji'an, and Leiqiong formed a single group, sister to the cattle of Zhoushan, Wenling, and Guangfeng (Fig. 1d). Zhoushan, Wenling, Guangfeng, Wannan, and Ji'an are geographically close to each other (Fig. 1b, c). The cattle of Dianzhong and Wenshan, which are in the south part of China, were distant from them (Fig. 1d). Cattle in Pakistan and India were located near the root of the phylogenetic tree (Fig. 1d). The branch lengths of Zhoushan cattle were shorter than other B. indicus cattle, suggesting the reduced genetic diversity of Zhoushan cattle (Fig. 1d).
To estimate the relatedness between Zhoushan and other domesticated cattle, we performed unsupervised clustering analysis with ADMIXTURE v1.3.0 software (https://dalexander.github.io/admixture/index.html)12. At K = 2, Angus cattle were distinct from all other cattle (Fig. 2a). At K = 3, Zhoushan and Wenling cattle were newly segregated from other cattle, suggesting that these two cattle breeds are genetically close to each other (Fig. 2a). The cattle of Guangfeng, Wannan, Ji'an, Leiqiong, and Wenshan had intermediate genetic structures between Zhoushan cattle and Dianzhong cattle (Fig. 2a). At K = 4, Zhoushan cattle and Wenling cattle were separated from each other (Fig. 2a).
To infer the population structure of cattle individuals analysed in this study, we conducted principal component analysis (PCA). The top three principal components accounted for 21.1% of the total variance (Fig. 2b). In the first component of PCA, Angus individuals were separated from all other cattle (Fig. 2b). Additionally, cattle of Wenling, Guangfeng, Wannan, Ji'an, and Leiqiong formed a cluster (dotted ellipse in Fig. 2b). In the second component of PCA, individuals of Zhoushan cattle were separated from all other cattle (Fig. 2b). In the third principal component, Wenling cattle individuals were separated from all other cattle (Fig. 2b).
We estimated the trends of the effective population size of Zhoushan and Wenling cattle over the past 100 generations (Fig. 2c). Both populations showed decreasing trends of effective population sizes (Fig. 2c). The effective population size of Zhoushan cattle was estimated to be smaller than that of Wenling cattle, suggesting the effect of island isolation on the genetic diversity of Zhoushan cattle (Fig. 2c).
Detection of candidate genes associated with dark black coat colour of Zhoushan cattle
To identify putative genes associated with the dark black coat colour of Zhoushan cattle, we searched genomic regions where the same mutations were shared between Zhoushan cattle and Angus cattle. To achieve this, we calculated the average fixation index (Fst) values in 40 kb windows with 10 kb steps (Fig. 3a). We identified four peaks of Fst at chromosomes 2, 4, 8, and 18 (Fig. 3a). Among these peaks, the highest peak of Fst was identified in the region from 51.05 to 51.35 Mbp on chromosome 18 (Fig. 3a, b). This region contains 18 genes (Fig. 3c). We searched for genes that have mutations altering the amino acid sequence and have been reported to be involved in the regulation of coat colour. Among these 18 genes, only the gene of melanocyte-stimulating hormone receptor (MC1R) is known to involved in the regulation of coat colour13–15. Therefore, we regarded MC1R as a strong candidate gene associated with the dark black coat colour of Zhoushan and Angus cattle (Fig. 3c). This gene is located in the region between 51,094,227 bp and 51,095,177 bp on chromosome 18. MC1R is expressed in the skin melanocyte and plays a crucial role in regulating animal coat colour formation16. Mutations of MC1R have been reported to be associated with black coat colour in some animals, such as cattle17, sheep16, pigs18, reindeer19, and geese20. In the protein-coding region of MC1R, we identified one missense mutation (c.583T > C, p.F195L) and one synonymous mutation (c.663C > T) (Figs. 3d, 4a). The missense mutation is located in the fifth transmembrane region of MC1R (Fig. 4b). All seven Zhoushan cattle were homozygous for the missense mutation (Figs. 3d, 4a). Four of five Angus individuals were homozygous for the missense mutation, and the remaining one was heterozygous for the missense mutation (Figs. 3d, 4a). Conversely, only 19% (8/42) and 33% (14/42) of B. indicus individuals were homozygous or heterozygous, respectively, for the missense mutation (Figs. 3d, 4a). The remaining 48% (20/42) of individuals of B. indicus were homozygous for the wild-type allele (Figs. 3d, 4a). We also found that the p.F195L mutation is also present in MC1R of Black Angus (accession number: ABX83563.1) in the NCBI Protein database (Fig. S1). Furthermore, we identified 15 upstream variants and three downstream variants in the intergenic regions between neighbouring genes (Table S2).
To characterise the missense mutation of MC1R (c.583T > C, p.F195L) found in Zhoushan and Angus cattle, we estimated the degree of evolutionary conservation of the 195th phenylalanine of MC1R. We obtained various MC1R orthologs of vertebrates from eight eutherian mammals, two marsupial mammals, four reptiles, two birds, two amphibians, one lobe-finned fish, one polypterus fish, four teleost fish, and two cartilaginous fish (Table S3). We aligned these 26 sequences with MC1R of Zhoushan cattle and B. indicus (Fig. 4c). This analysis revealed that the 195th phenylalanine of MC1R is highly conserved among vertebrates (Fig. 4c).
Furthermore, we verified whether any larger structural variants are spanning the MC1R region (chr18:51,058,185–51,148,307 bp) of Zhoushan cattle and Angus. If there are large structural variants in this region for these breeds, we should see regions where the read depth distributions are different among the groups. We assessed the integrated read depth distributions of Wenling cattle (n = 9), Zhoushan cattle (n = 7) and Angus (n = 5) (Fig. 5a). The read depth distribution was very similar among the three groups suggesting that there are not large structural variants spanning the MC1R region in these breeds (Fig. 5a). We also collected the sequence reads mapped to this region, and performed BreakDancer to detect structural variants21. However, no structural variants were detected in this region in any breeds. Moreover, we compared the reference genome sequence in MC1R region of the UOA_Brahman_1 assembly and that of the UOA_Angus_1 assembly11. The UOA_Brahman_1 assembly represents the maternal haplotype of an F1 hybrid of Brahman cattle (dam) and Angus (sire), and the UOA_Angus_1 assembly represents its paternal haplotype11. The results showed that the genome sequence in the MC1R region are highly preserved between these two assemblies (Fig. 5b).
Finally, we deduced the origin of the MC1R haplotype in Zhoushan cattle. We collected the SNPs located in the MC1R region (chr18:51,058,185–51,148,307 bp) from all individuals and performed admixture analysis using these SNPs. The result showed that Zhoushan cattle and Angus shared highly similar genetic components (Fig. 5c). However, the other individuals of B. indicus showed genetic components that differed from both Zhoushan cattle and Angus (Fig. 5c). These results suggest that the MC1R haplotype in Zhoushan cattle is derived from B. taurus, even though the genome of Zhoushan cattle as a whole is that of B. indicus.
Discussion
This study conducted whole-genome sequencing of seven individuals of Zhoushan cattle and nine Wenling cattle individuals. By analysing our data and publicly-available data together, we showed that Zhoushan cattle are genetically similar to Wenling cattle, followed by cattle in Guangfeng, Wannan, and Ji'an (Figs. 1d, 2a, b). Since these cattle's genetic background is B. indicus3, the genetic background of Zhoushan cattle is also largely B. indicus. Wenling, Guangfeng, Wannan, and Ji'an are the south-eastern region of China and geographically close to the Zhoushan islands. Since the cattle that are genetically close to the Zhoushan cattle are located geographically close to it, it is possible that the ancestral population of Zhoushan cattle was introduced from the mainland region near the Zhoushan islands. According to previous studies, the ancestral population of Zhoushan cattle might be Tangjiao cattle in Shanghai5. Shanghai is a city located north of Zhoushan islands and is geographically very close to the Zhoushan islands (Fig. 1c). Although Tangjiao cattle died out after the late 1980s, there are pictures of Tangjiao cattle showing its dark black coat colour and similar morphological appearance to Zhoushan cattle1,5. Whole-genome sequencing of residual genomic DNA of Tangjiao cattle from remaining hair or other tissues would further elucidate the origin of Zhoushan cattle.
Zhoushan cattle was introduced from these mainland areas across the sea to the Zhoushan islands hundreds of years ago and bred in isolation from the mainland until today1,5. Although the genetic background of Zhoushan cattle is similar to that of cattle in nearby mainland regions, including Wenling, Guangfeng, Wannan, Ji'an, and Leiqiong, Zhoushan cattle still show a distinct genetic background from these cattle (Figs. 1d, 2a, b). This is probably the result of island isolation for hundreds of years. We also estimated that the effective population size of Zhoushan cattle is decreasing (Fig. 2c). This is consistent with the previous report that island isolation is associated with reducing genetic diversity in animals22. Random genetic drift may cause the unique genetic population structure and the decreasing trend of Zhoushan cattle due to the small population size. Zhoushan cattle can be a valuable study model to analyse the genetic effects of island isolation on mammals' genomic compositions.
The colour of hair, skin, and eyes in animals mainly depends on the quantity, quality, and distribution of two types of melanin granules, eumelanin and pheomelanin, produced by melanocytes23,24. Eumelanin is black to brown, and pheomelanin is yellow to reddish23,24. We identified MC1R as a candidate gene responsible for the dark coat colour of Zhoushan cattle. MC1R is a G protein-coupled receptor and is expressed in the cell membrane of the skin melanocyte. When MC1R is activated by the melanocyte-stimulating hormone from the pituitary gland, MC1R initiates a downstream signalling cascade that leads to black pigment eumelanin production in the melanocytes13,14. We identified the p.F195L mutation in MC1R. So far, there are many reports about mutations of MC1R associated with black coat colour in some animals, such as cattle17,25, sheep16, pig18, reindeer19, and geese20. For example, the dominant missense mutation of p.L99P results in black coat colour, whereas a frameshift mutation of c.310delG, producing a prematurely terminated MC1R protein (p.Gly104ValfsX53), produces a red coat colour in cattle17,25. The mutation of p.F195L is a nonpolar-to-nonpolar conversion and is located in the fifth transmembrane region of MC1R (Fig. 4b). Since this residue is highly evolutionary conserved among vertebrates (Fig. 4c), p.F195L might lead to activation of MC1R. We also identified 15 upstream variants and three downstream variants in the intergenic regions between neighbouring genes. Another possibility is that these intergenic mutations alter the promoter and/or enhancer activity of MC1R, resulting in upregulation of MC1R gene expression. In either case, mutations identified in this study also can be excellent markers for selective breeding for dark black coat colour.
Recently, the mutation of p.F195L was reported as one of six candidate missense mutations associated with the black body colour of cattle26. They compared the protein-coding sequences of MC1R in Tharparkar and Karan Fries cattle (Tharparker × Holstein Friesian)26. According to the authors, Tharparkar cattle has a predominantly white coat colour, and Karan Fries cattle has a white and black coat colour26. They performed molecular cloning of MC1R genes from both cattle and identified six missense mutations, including p.F195L in Karan Fries cattle. Since Holstein Friesian cattle, a B. taurus breed, has a white and black coat colour, these mutations can be associated with the generation of the black coat area on Karan Fries cattle26. Notably, we only identified p.F195L among these six mutations by the unbiased whole-genome analysis. Therefore, the mutation of p.F195L may be a novel causative mutation of MC1R in Zhoushan and some B. taurus cattle with a dark coat colour.
Chinese cattle are one of the most important world cattle compositions because of their large number and abundant breed types. However, a number of local breeds, including Zhoushan cattle, are on the verge of extinction1,5. Zhoushan cattle have a unique history of being bred in the Zhoushan islands isolated from the mainland for hundreds of years5. Zhoushan cattle are closely related to the local culture and history in the Zhoushan islands and exhibit biologically interesting features, including a dark black coat colour and horns with various morphologies. In this study, we analysed the genetic relationship between Zhoushan cattle and other cattle and showed the genetic closeness of Zhoushan cattle to B. indicus bred in the nearby mainland regions. We also identified MC1R as a candidate gene responsible for the dark black coat colour of Zhoushan cattle. All these strategies and results provide us with valuable resources for developing strategies to conserve this unique cattle breed. Similarly, there are many outstanding local cattle breeds in China, and attention should be paid to their preservation. Conservation genomics is an effective strategy for preserving them.
In summary, we performed whole-genome sequencing of seven individuals of Zhoushan cattle and nine Wenling cattle individuals. Combined with the publicly-available genomic data of other cattle breeds, we identified that Zhoushan cattle is genetically similar to B. indicus in south-eastern China. Among them, Zhoushan cattle are most relative to Wenling cattle, suggesting that Zhoushan cattle's ancestor was B. indicus cattle bred in the mainland region close to the Zhoushan islands and introduced into the Zhoushan islands from there. Moreover, we identified MC1R as a candidate gene responsible for the dark black coat colour of Zhoushan cattle. Our results provide valuable resources for inferring the origin of Zhoushan cattle, characterising the uniqueness of Zhoushan cattle, and developing better breeding programs to preserve this unique and culturally important cattle breed.
Methods
Sample collection and whole-genome sequencing
We sampled a total of 16 cattle, including seven Zhoushan cattle and nine Wenling cattle. DNA was extracted from the ear tissues of each individual. DNA was extracted from the tissue using CTAB. Then, DNA quality and quantity were determined with an ND-2000 (NanoDrop Technologies). Only high-quality DNA samples (OD260/280 = 1.8–2.0, OD260/230 ≥ 2.0) were used to construct sequencing libraries. A DNA-seq library was prepared following the TruSeqTM Nano DNA Library Prep Kit from Illumina (San Diego, CA) using 1 μg of DNA that was fragmented by fragmentation buffer. Fragmented DNA was subjected to end-repair, phosphorylation, and 'A' base addition according to Illumina's library construction protocol. After quantification by TBS380, the paired-end DNA-seq library was sequenced with the Illumina HiSeq Xten/NovaSeq 6000 (2 × 150 bp read length).
Read trimming and mapping to the reference genome
We trimmed the raw sequencing reads using Trimmomatic v0.38 (LEADING:30 TRAILING:30 SLIDINGWINDOW:4:25 MINLEN:50)27. We downloaded the UOA_Brahman_1 assembly of the cattle genome from Ensembl Release 101 (www.ensembl.org) and indexed it using BWA v0.7.16a-r118128. The filtered reads were aligned to the UOA_Brahman_1 assembly by BWA-MEM with default parameters. We also downloaded the publicly-available whole-genome sequencing data of cattle breeds and analyzed them (Table S1). Unfortunately, body colour information of the five Angus individuals which we analyzed is not provided.
Identification of genomic variants
We performed variant calling using samtools mpileup with default parameters29. We filtered out variants with a call rate < 80% and minor allele frequency < 5%. For variant annotations, the genome annotation of B. indicus annotation was obtained from the Ensembl Release 101 database. SNPs were annotated with SnpEff v4.230. Finally, we retained 32,970,327 SNPs and 3,331,322 small indels as the initial dataset for the downstream analysis. The variant information was visualized with the Integrative Genomics Viewer31. In order to explore large structural variants in the MC1R region, we used samtools view to extract the reads mapped to the MC1R region (chr18:51,058,185–51,148,307 bp) and ran breakdancer-max v1.4.5. with parameter ‘-r 20’21.
Population genomic analysis
The genomic variant data in VCF format was converted to PLINK binary format using PLINK v1.90b4.532. PCA was performed with PLINK. The three-dimensional scatter plot was generated with the R rgl package. To build a NJ phylogenetic tree, we calculated pairwise genome-wide identical-by-state (IBS) distances based on the SNPs using PLINK. Based on the pairwise distance matrix (1-IBS), a NJ tree was constructed using MEGA733. The admixture analysis was performed with ADMIXTURE v1.3.0 software (https://dalexander.github.io/admixture/index.html)12. Cross-validation (CV) errors were estimated for each K-value. The K-value with the lowest CV error was regarded as optimal for assessing the admixture level in each sample. The effective population size was calculated for each group using SNeP v1.1 with the maximum number of SNPs per chromosome at 10,00034. Average Fst values were calculated using a 40 kb window with 10 kb step using PLINK. Average Fst values greater than 0.6 were regarded as significant.
Multiple sequence alignment of MC1R orthologs
The protein sequences of MC1R orthologs were downloaded from the Ensembl or NCBI Protein database. The identification numbers of all the sequences are listed in Table S3. These protein sequences were aligned by clustal omega with the default parameters35. The resulting multiple sequence alignments were verified and visualised by jalview36.
Genome alignment
In order to compare the MC1R regions of the UOA_Angus_1 assembly (chr18:49,477,288–49,566,766 bp) and that of the UOA_Brahman_1 assembly (chr18:51,058,185–51,148,307 bp), we extracted the genome sequences of the MC1R regions from both assemblies. We aligned the sequence of the UOA_Angus_1 assembly to that of the UOA_Brahman_1 assembly by minimap2 and LASTZ37. The results were plotted by the plot function of R v3.5.2.
Generation of the geographical maps
The geographical maps were generated by using R packages of maps v3.3.0 (https://cran.r-project.org/web/packages/maps) and mapdata v2.3.0 (https://cran.r-project.org/web/packages/mapdata).
Ethics declarations
All procedures were performed in accordance with the guidelines of the Regulations for the Administration of Laboratory Animals (Decree No. 2 of the State Science and Technology Commission of the People's Republic of China, November 14, 1988) and were approved by the Animal Ethics Committee of Zhejiang Ocean University (Zhoushan, China). The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Supplementary Information
Acknowledgements
This work was financially supported by the Fundamental Research Funds for Zhejiang Provincial Universities and Research Institutes (22016000319021), Grant-in-Aid for Scientific Research (17H06399 to A.O., 20K21452 to A.O., 19H03420 to Y.O., and 19K22426 to Y.O.) from the Japan Society for the Promotion of Science. We thank the Institute of Zootechnics and Veterinary of Wenling for sampling collection and picture affording.
Author contributions
L.J. and A.O. conceived this research and designed the experiment. L.J. and L.P. performed experiments. T.K., L.J., R.I., Y.O., A.O., Q.Z., C.C., I.T., L.H.N., H.T., and H.P. analyzed data. T.K., L.J., Y.O., and A.O. wrote the manuscript. All authors reviewed the manuscript.
Data availability
The raw reads were deposited in the DDBJ Sequence Read Archive (DRA) under accession number PRJDB10918.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Lihua Jiang and Tetsuo Kon.
Contributor Information
Lihua Jiang, Email: florencejiang@126.com.
Atsushi Ogura, Email: aogu@whelix.info.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-96896-2.
References
- 1.Porter V, Alderson L, Hall S, Sponenberg P. Mason’s World Encyclopedia of Livestock Breeds and Breeding. CAB International; 2016. [Google Scholar]
- 2.Youlin J. Report on Domestic Animal Genetic Resources in China. China Agriculture Publishers; 2004. [Google Scholar]
- 3.Chen N, et al. Whole-genome resequencing reveals world-wide ancestry and adaptive introgression events of domesticated cattle in East Asia. Nat. Commun. 2018;9:2337. doi: 10.1038/s41467-018-04737-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Jiang, Y. Where is the Chinese yellow cattle from?. Nat. Ecol. Evol.https://natureecoevocommunity.nature.com/posts/34309-where-is-the-chinese-yellow-cattle-from. (2018).
- 5.Chen Y-C. Four interesting endangered breeds of animals in China. Anim. Genet. Resour. Inf. 1995;16:29–35. doi: 10.1017/S101423390000047X. [DOI] [Google Scholar]
- 6.McTavish EJ, Decker JE, Schnabel RD, Taylor JF, Hillis DM. New World cattle show ancestry from multiple independent domestication events. Proc. Natl. Acad. Sci. U. S. A. 2013;110:E1398–E1406. doi: 10.1073/pnas.1303367110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mei C, et al. Genetic architecture and selection of Chinese cattle revealed by whole genome resequencing. Mol. Biol. Evol. 2018;35:688–699. doi: 10.1093/molbev/msx322. [DOI] [PubMed] [Google Scholar]
- 8.Utsunomiya YT, et al. Genomic clues of the evolutionary history of Bos indicus cattle. Anim. Genet. 2019;50:557–568. doi: 10.1111/age.12836. [DOI] [PubMed] [Google Scholar]
- 9.Cai D, et al. The origins of Chinese domestic cattle as revealed by ancient DNA analysis. J. Archaeol. Sci. 2014;41:423–434. doi: 10.1016/j.jas.2013.09.003. [DOI] [Google Scholar]
- 10.Chen S, et al. Zebu cattle are an exclusive legacy of the South Asia neolithic. Mol. Biol. Evol. 2010;27:1–6. doi: 10.1093/molbev/msp213. [DOI] [PubMed] [Google Scholar]
- 11.Koren S, et al. De novo assembly of haplotype-resolved genomes with trio binning. Nat. Biotechnol. 2018 doi: 10.1038/nbt.4277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–1664. doi: 10.1101/gr.094052.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Herraiz C, Garcia-Borron JC, Jiménez-Cervantes C, Olivares C. MC1R signalling. Intracellular partners and pathophysiological implications. Biochim. Biophys. Acta Mol. Basis Dis. 2017;1863:2448–2461. doi: 10.1016/j.bbadis.2017.02.027. [DOI] [PubMed] [Google Scholar]
- 14.Swope VB, Abdel-Malek ZA. MC1R: Front and center in the bright side of dark Eumelanin and DNA repair. Int. J. Mol. Sci. 2018;19:2667. doi: 10.3390/ijms19092667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hulsman Hanna LL, Sanders JO, Riley DG, Abbey CA, Gill CA. Identification of a major locus interacting with MC1R and modifying black coat color in an F2 Nellore-Angus population. Genet. Sel. Evol. 2014;46:4. doi: 10.1186/1297-9686-46-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Yang G-L, et al. Mutations in MC1R gene determine black coat color phenotype in Chinese sheep. Sci. World J. 2013;2013:675382. doi: 10.1155/2013/675382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Klungland H, Våge DI, Gomez-Raya L, Adalsteinsson S, Lien S. The role of melanocyte-stimulating hormone (MSH) receptor in bovine coat color determination. Mamm. Genome. 1995;6:636–639. doi: 10.1007/BF00352371. [DOI] [PubMed] [Google Scholar]
- 18.Mao H, Ren J, Ding N, Xiao S, Huang L. Genetic variation within coat color genes of MC1R and ASIP in Chinese brownish red Tibetan pigs. Anim. Sci. J. 2010;81:630–634. doi: 10.1111/j.1740-0929.2010.00789.x. [DOI] [PubMed] [Google Scholar]
- 19.Våge DI, Nieminen M, Anderson DG, Røed KH. Two missense mutations in melanocortin 1 receptor (MC1R) are strongly associated with dark ventral coat color in reindeer (Rangifer tarandus) Anim. Genet. 2014;45:750–753. doi: 10.1111/age.12187. [DOI] [PubMed] [Google Scholar]
- 20.Huang J, et al. Sequence variation of melanocortin 1 receptor (MC1R) gene and association with plumage color in domestic geese. J. Poult. Sci. 2014;51:270–274. doi: 10.2141/jpsa.0130066. [DOI] [Google Scholar]
- 21.Chen K, et al. BreakDancer: An algorithm for high-resolution mapping of genomic structural variation. Nat. Methods. 2009;6:677–681. doi: 10.1038/nmeth.1363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wang S, et al. Population size and time since island isolation determine genetic diversity loss in insular frog populations. Mol. Ecol. 2014;23:637–648. doi: 10.1111/mec.12634. [DOI] [PubMed] [Google Scholar]
- 23.Nasti TH, Timares L. MC1R, eumelanin and pheomelanin: Their role in determining the susceptibility to skin cancer. Photochem. Photobiol. 2015;91:188–200. doi: 10.1111/php.12335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ito S, Wakamatsu K. Quantitative analysis of eumelanin and pheomelanin in humans, mice, and other animals: A comparative review. Pigment Cell Res. 2003;16:523–531. doi: 10.1034/j.1600-0749.2003.00072.x. [DOI] [PubMed] [Google Scholar]
- 25.Matsumoto, H. et al. MC1R c.310G>‐ and c.871G > A determine the coat color of Kumamoto sub‐breed of Japanese Brown cattle. Anim. Sci. J.91, (2020). [DOI] [PubMed]
- 26.Goud TS, Upadhyay RC, Onteru SK, Pichili VBR, Chadipiralla K. Identification and sequence characterization of melanocortin 1 receptor gene (MC1R) in Bos indicus versus (Bos taurus X Bos indicus) Anim. Biotechnol. 2020;31:283–294. doi: 10.1080/10495398.2019.1585866. [DOI] [PubMed] [Google Scholar]
- 27.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 2011;27:2987–2993. doi: 10.1093/bioinformatics/btr509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cingolani P, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin) 2014;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Robinson JT, et al. Integrative genomics viewer. Nat. Biotechnol. 2011;29:24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kumar S, Stecher G, Tamura K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Barbato M, Orozco-terWengel P, Tapio M, Bruford MW. SNeP: A tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front. Genet. 2015;6:109. doi: 10.3389/fgene.2015.00109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sievers F, Higgins DG. Clustal Omega, accurate alignment of very large numbers of sequences. Methods Mol. Biol. 2014;1079:105–116. doi: 10.1007/978-1-62703-646-7_6. [DOI] [PubMed] [Google Scholar]
- 36.Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–1191. doi: 10.1093/bioinformatics/btp033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Li H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ihaka R, Gentleman R. R: A language for data analysis and graphics. J. Comput. Graph. Stat. 1996;5:299. [Google Scholar]
- 39.Omasits U, Ahrens CH, Müller S, Wollscheid B. Protter: Interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics. 2014;30:884–886. doi: 10.1093/bioinformatics/btt607. [DOI] [PubMed] [Google Scholar]
- 40.Kon T, Furukawa T. Origin and evolution of the Rax homeobox gene by comprehensive evolutionary analysis. FEBS Open Bio. 2020;10:657–673. doi: 10.1002/2211-5463.12832. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw reads were deposited in the DDBJ Sequence Read Archive (DRA) under accession number PRJDB10918.