Skip to main content
iScience logoLink to iScience
. 2026 Mar 24;29(4):115465. doi: 10.1016/j.isci.2026.115465

The genome landscape of Hong Kong feral cattle as a unique genetic resource

Xiaoyu Luo 1,7, Xiaoran Lu 1,7, Yan Ren 2, Yifan Cao 1, Xuewei Liu 1, Mario Barbato 3,4, Paolo Ajmone-Marsan 4, John L Williams 2,4, Rick Tearle 2, Richard AL Brown 5, Michael P Reichel 5,6, Chuzhao Lei 1, Ningbo Chen 1,, Wai Yee Low 2,8,∗∗
PMCID: PMC13091034  PMID: 42006315

Summary

In the Hong Kong Special Administrative Region of China, there is a feral cattle population that has not been well characterized genetically. In this study, we used high-coverage (∼30×) whole-genome sequencing from 30 Hong Kong feral (HKF) cattle and compared them to 116 individuals from four representative populations worldwide. Our analyses revealed that the HKF cattle have high genetic diversity in the face of a declining effective population size, suggesting their substantial and yet untapped genetic potential. We also identified introgression events that occurred prior to the divergence between HKF cattle and other East Asian indicine populations, which shaped the adaptation of HKF cattle in Asian agro-ecologies. Moreover, we identified positive selection in HKF cattle for environmental adaptation, particularly in traits related to heat tolerance, bone strength, and coat color. Our findings provide insights into the genetic origin and unique adaptation of HKF cattle.

Subject areas: Phylogenetics, Genomic analysis, Genomics

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • HKF cattle represent a unique gene pool of indigenous East Asian populations

  • HKF cattle harbor substantial yet untapped genetic potential

  • Introgression shaped HKF cattle genomes before splitting from other East Asian populations

  • Selected genomic loci associated with environmental adaptation were found in HKF cattle


Phylogenetics; Genomic analysis; Genomics

Introduction

Cattle are one of the main livestock species worldwide and have played an important role in agriculture and society since their domestication. They were introduced into new regions through human migrations, where exposure to novel ecological conditions imposed strong selective pressures on their genomes over a brief period.1 The evolutionary consequences of these pressures were further shaped by human management. Populations under intensive management often exhibited restricted genetic and phenotypic diversity, whereas unmanaged populations typically retained higher diversity and were better adapted to local environments.2

In the Hong Kong Special Administrative Region of China, there are about 1,000 feral cattle3 that inhabit urban environments, which represents a unique context for feral populations worldwide, but little is known about their genetic background and adaptive changes. Previous studies indicated that the early geographical history of Hong Kong region was attributed to the Lingnan region in southern China.4 In this region, cattle likely arrived by a coastal route during the eastward spread of domesticated indicine cattle.5 Subsequent admixture with local wild or domesticated banteng and gaur led to increased genetic diversity.5,6,7 Due to the effects of geographical isolation, the Hong Kong cattle population may be distinct from mainland populations. In the 1970s, rapid urbanization and economic development in Hong Kong led to the abandonment of cattle farming, and as a result, the Hong Kong cattle started to roam freely and are now considered a feral population. They are therefore exposed to disease, have free mate choice, and other pressures associated with surviving in natural and sometimes urban environments. Interestingly, despite being a small population, the Hong Kong feral (HKF) cattle displayed high phenotypic diversity in terms of horn shape and orientation, coat color, hump size and shape, tail length, and conformation. Hence, the HKF population offers a unique opportunity to study the effects on the genome and to characterize the needs and diversity of this unique population.

One previous study based on single-nucleotide polymorphism (SNP) array data showed no evidence of gene flow from any of the Asian local breeds to the HKF cattle.8 Meanwhile, a study based on Y chromosome and mitochondrial DNA data reported the pattern of introgression of the indicine genome across southern China.5 Notably, both studies support the introgression from wild bovine species into the genome of Asian and HKF cattle. The findings of these studies indicate that the evolutionary history of HKF cattle has been significantly influenced by introgression. Furthermore, due to biased sampling, different methodological perspectives, and limited genetic material, previous studies have not examined the genetic history and adaptive evolution of HKF cattle. In this study, by analyzing whole-genome sequencing (WGS) data of 30 HKF cattle, we revealed the genetic origin, introgression from wild species, adaptive evolution, and the genetic basis underlying the phenotypic characteristics of HKF cattle. Our findings provide insights into the genetic history of HKF cattle and enhance our understanding of adaptive evolution in feral cattle.

Results

Genome sequencing and SNP variant calling

Individual genomes of 30 HKF cattle were sequenced to an average 29.89× coverage (Table S1). Genotypes for these were jointly called with publicly available genomes of four cattle populations, including European taurine (Hereford, Angus, and Finncattle), Northeast Asian taurine (Hanwoo and Fuzhou), East Asian indicine (EAI) (Weizhou and Leiqong), and South Asian indicine (SAI) (Sahiwal, Thawalam, and Dhanni) (Figure 1A). In total, all 146 samples were aligned to the Bos taurus reference assembly ARS-UCD1.2 with an average mapping rate of 99.50%. Variants were called and filtered using a customized pipeline (see the STAR Methods section), resulting in the retention of ∼57.06 million bi-allelic SNPs. Functional annotation revealed that the majority of SNPs were located in intergenic (58.59%) and intronic regions (38.23%), while the rest were in the regions upstream and downstream (1.25%) of open reading frames and untranslated regions (1.01%). Exons contained 0.92% of the SNPs, with 496,136 synonymous SNPs and 11,957 nonsynonymous SNPs (Table S2; Figure S1).

Figure 1.

Figure 1

Population structure and relationships of HKF cattle to other cattle populations

(A) Geographic map of sample locations and other cattle populations.

(B) PCA of selected cattle populations.

(C) NJ tree of the relationships between the cattle populations.

(D) The ancestral components inferred using ADMIXTURE for K from 2 to 4.

(E) Genetic diversity and population differentiation across five populations. The values inside the circles represent nucleotide diversity, while the values on the connecting lines between populations indicate the population FST.

In the HKF cattle population, we compared the SNPs detected between WGS and SNP50k array data.8 After pruning for missing data, we retained 33,720 SNPs in the array, of which 16,334 were monomorphic whereas 17,386 are low-frequency variants (minor allele frequency [MAF] ≤ 0.05) (Table S3). In contrast, WGS captured genetic variation comprehensively across the allele frequency spectrum. For comparison with the WGS data, 30,813 variants could be lifted over to ARS-UCD1.2 coordinates. A total of 15,614 SNPs were common to both datasets. We then calculated heterozygosity (Ho) and inbreeding coefficient (F) for the WGS data and compared them with the published full SNP panel (Ho = 0.144 ± 0.01, F = 0.57 ± 0.03) and ancestral SNP panel (Ho = 0.268 ± 0.02, F = 0.35 ± 0.05).2 Our WGS data showed higher heterozygosity (Ho = 0.288 ± 0.02) and lower inbreeding coefficient (F = 0.25 ± 0.05).

Genetic structure and differentiation

To explore the genetic relationships among HKF and other cattle populations, we performed principal-component analysis (PCA), neighbor-joining (NJ) tree, and ADMIXTURE analyses of the autosomal SNP genotype data. Principal component (PC)1 explained 14.03% of total variance and distinguished B. taurus from B. indicus, while the second and third PCs, which accounted for 2.64% and 3.80% of total variance, clustered HKF with EAI and distinguished HKF from SAI (Figures 1B, S2A, and S2B). Similar results were observed in the NJ tree (Figure 1C). To further investigate the ancestry of the populations, we used ADMIXTURE to calculate delta K for each K from 1 to 4, where K represents the assumed number of ancestral populations. K = 2 divided B. taurus and B. indicus ancestry; when K = 3, HKF shared genome ancestry with EAI; and when K = 4, HKF showed distinct genetic patterns compared to the other four populations (Figure 1D). In addition to autosomal analysis, we examined the Y chromosome to investigate the male histories of HKF cattle. The Y3 haplogroup belongs to B. indicus, and the HKF populations shared the Y3A sub-haplogroup with the EAI populations, whereas the sub-haplogroup Y3B was predominantly found in SAI (Figure S3). Consistent with SNP array results showing HKF as a distinct cluster with highly divergent SAI ancestry, our WGS analysis confirmed its close genetic relationship with EAI ancestry.

Fixation index (FST) also showed that HKF cattle had the greatest similarity to EAI compared with the other populations (Figure S4). We calculated nucleotide diversity (π) of the populations to measure the degree of polymorphism (Figure 1E). HKF cattle showed the highest level of nucleotide diversity, followed closely by EAI.

Inbreeding and demographic history of HKF cattle

Inbreeding in HKF populations was evaluated by screening runs of homozygosity (ROH) across the genome. In terms of population structure, short, medium, and long ROH reflect ancient haplotypes predating continental migration, background relatedness, and recent parental relatedness, respectively.9,10 Among the three populations with indicine ancestry (EAI, SAI, and HKF), the HKF cattle had the most ROH, regardless of the length of ROH analyzed (Figures 2A–2D). The European taurine (EUT), which has experienced reproductive barriers and intensive artificial selection, had the highest number of cumulative long ROH among all five populations (Figure 2D). The number of long ROH in HKF was second only to the inbred EUT, indicating that HKF has a greater inbreeding risk than other indicine populations. Analysis of long ROH specific to HKF revealed a total of 270 genes, including a region on BTA13 (22.66–23.16 Mb) harboring genes associated with energy balance (SKIDA111), growth traits (MLLT1012), and heat resistance (DNAJC113,14). Haplotype analysis revealed distinct patterns relative to other groups, indicating potential genetic differentiation and selective pressure in HKF cattle (Figure S5). Cumulative ROH length was highly correlated with ROH number in all three ROH length classes (short, medium, and long; R > 0.9; p < 0.01) (Figure S6).

Figure 2.

Figure 2

Characterization of inbreeding and demographic history in the HKF population

(A–D) Distribution of ROH segments ordered and colored by size class in all populations. Cumulative size of short (B), medium (C), and long ROH (D) segments across all cattle populations based on the Gaussian mixture model. Differences between populations were tested by Wilcoxon rank-sum tests. Asterisks denote ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, and ∗∗∗∗p < 0.0001.

(E) Demographic history inferred from the MSMC2 method. The large gray-shaded boxes illustrate the early Holocene Optimum, the last glacial maximum (LGM), and the second Pleistocene glacial period.

(F) Effective population sizes for the HKF population were inferred using SMC++,15 computed on 15 groups of 10 randomly selected samples.

(G) The ratio of homozygous LoF alleles in all populations.

The HKF had the highest medium ROH among the populations studied (Figure 2C), which suggests that its effective population size is small. To further investigate the changes in effective population size over time in the HKF population, we estimated the historical Ne using the multiple sequentially Markovian coalescent (MSMC2) method,16 which revealed that the HKF experienced a great population decline at 20–30 kya (Figure 2E). The population size history of HKF was similar to that of EAI but differed from that of the taurine populations. The recent effective population size of HKF was estimated using SMC++,15 which revealed that HKF Ne was lowest 7–9 kya, after which the population size remained constant (Figure 2F).

As the HKF cattle have a high level of inbreeding, loss-of-function (LoF) mutations may have accumulated at high frequency in this population, which is concerning for conservation genetics. The LoF mutations were screened across the entire genome of the five populations to investigate putatively derived deleterious mutations. The HKF cattle had a lower homozygous deleterious mutation load than SAI (Figure 2G).

Historical introgression

Previous studies have reported introgression from banteng and gaur into HKF cattle,8 and suggested admixture from banteng and gaur in EAI cattle.5 These findings indicated a significant contribution of wild cattle to the indicine populations in East Asia. In this study, SNPs derived from high-coverage WGS were used to calculate D statistics17 and RFMix18 to verify and quantify introgression from banteng and gaur into HKF cattle. The results showed that the proportions of banteng and gaur ancestries ranged from 10.74% to 11.81% and from 9.70% to 11.08%, respectively, in the HKF genomes. The proportions of banteng and gaur ancestries in the HKF genome were greater than those in the EAI genome (Figures 3A and 3B; Tables S4 and S5). We further explored the historical gene flow and introgression events using qpgraph and TreeMix.17,19 When testing without admixture, the topologies based on qpgraph and TreeMix were consistent with the topology of the NJ tree (Figures S7 and S8), showing that the common ancestors of HKF and EAI cattle diverged from SAI cattle. When modeling one admixture event, qpgraph and TreeMix analysis indicated an introgression from banteng and gaur into the common ancestor of HKF and EAI cattle (Figures 3C and S8). We therefore explored the possible introgression times from banteng and gaur into HKF cattle using fastsimcoal.20 The estimated banteng introgression occurred ∼8.9 kya (95% confidence interval [CI]: 6.3–22.2 kya) with an admixture fraction of ∼10% in HKF cattle, while gaur introgression was estimated to have occurred ∼9.9 kya (95% CI: 5.8–22.7 kya) with an admixture fraction of ∼9% (Figure 3D). This is consistent with the qpgraph, suggesting that banteng and gaur introgression occurred prior to the divergence of HKF from EAI cattle.

Figure 3.

Figure 3

Introgression ratio and history of HKF cattle

(A) D statistics with the form D (W, X; Y, Z) for banteng/gaur and five populations.

(B) Pairwise D statistics assessing allele sharing between HKF cattle and banteng/gaur, and comparisons of allele sharing between EAI breed cattle and banteng/gaur.

(C) Origin history with 1 admixture event inferred by qpgraph.

(D) Demographic model evaluated by fastsimcoal. The 95% confidence intervals are shown in parentheses.

Effects of introgression on adaptation

To evaluate the effects of introgression on adaptation of HKF cattle, we used the U50 statistic to detect adaptive introgressed genes. We found 1,403 genes in the HKF genomes to be of banteng origin and 1,553 genes to be of gaur origin, with 1,234 genes shared by both banteng and gaur (Figures S9A and S9B). These overlapping genes were highly enriched in Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) pathways (corrected p value <0.05) of oxygen binding, transport and carrier activity (HBM, MB, HBQ1, HBA, HBZ, and HBA1), and bitter taste receptor activity (TAS2R7, TAS2R16, T2R12, TAS2R8, and TAS2R38) (Table S6). In addition, we observed that the most introgressed region on BTA9 (FAM135A) harbored genes relevant to coat color,21 lipase activity, and intramuscular fat deposition.22,23 The geographic distribution of haplotypes in global cattle populations revealed that the introgressed gaur and banteng haplotypes had the highest frequency in HKF and EAI cattle. This finding was further supported by the FST analysis (Figure S9C), phylogenetic analysis (Figure S9E), and the extent to which the haplotypes were shared across populations (Figure S9D).

Adaptive selection of HKF cattle to environmental stresses

Our genome-wide analysis showed that HKF cattle exhibit a genetic pattern that is distinct from other cattle. To identify signatures of positive selection in the feral population, we compared the genomes of HKF cattle with other populations of cattle using the Di statistic.24 A total of 653,861 SNPs (0.1%) had significant allele frequency differences, which may be targets of local adaptation (Figure 4A). Among these, strong signals of differentiation were obtained in the regions containing genes related to neurological development (TRIO),25 thyroid development (PDE8B),26 and autoimmune regulation (WDR41).27 A significant selection signature was observed for ADRA1A gene on BTA8 (Figure S10), which is associated with efficient recovery from heat stress.28 The ADRA1A gene had the strongest differentiation between HKF and EAI cattle for the FST and θπ ratios (Figure 4B). Evidence of selection on this gene in HKF cattle was provided by high FST values and low genetic diversity (Figure 4C). In addition, we detected 12 Di-SNPs in the ADRA1A gene, which are putative functional polymorphisms (Figure 4D; Table S7). These variants affect the enhancer and promoter activity of ADRA1A, thereby potentially influencing its expression. Notably, the SNP rs517553622 (C > T) located in the promoter of ADRA1A was predicted to enhance the binding of KLF17 transcription factor, which is involved in the regulation of immune responses (Table S8).29 Another strong selection signature was located on BTA5, covering the B4GALNT3 gene, which contributes to bone health and supports survival in the wild (Figure 4A).30 It had high FST between HKF and other indicine cattle, as well as a likely recent selection signal in HKF cattle seen from the θπ analysis (Figure 4E). The haplotypes of the B4GALNT3 gene in HKF cattle are distinct from other cattle populations (Figures 4F and 4G).

Figure 4.

Figure 4

Candidate selection signature variants in HKF cattle

(A) Di-HKF are the 0.1% most significant Di-SNPs identified when comparing HKF versus other populations.

(B) Distribution of the FST (x axis) and θπ ratio (y axis) between HKF and EAI cattle based on 50 kb windows with 20 kb steps. The loci with the highest values are highlighted by a dashed circle on BTA8.

(C) The FST and θπ estimates for SNPs in the selected region on BTA8 between HKF and other indicine cattle based on 5 kb windows with 2 kb steps.

(D) Overlap of HKF SNPs in ADRA1A with open chromatin, enhancer, and promoter markers.

(E) The FST and θπ estimates for SNPs in B4GALNT3 gene on BTA5 between HKF and other indicine cattle based on 5 kb windows with 2 kb steps.

(F) SNPs with MAF >0.05 were used to construct haplotype patterns. The major allele at each SNP position in HKF cattle is colored magenta.

(G) Haplotypes of the relationships among all populations on the B4GALNT3 genes. Phylogenetic tree constructed from SNPs in B4GALNT3 genes.

Copy number variation calling and identification of copy number variation regions

Our mapped sequence data had an average depth of 29.89× per sample, which was sufficient for detecting copy number variations (CNVs) based on read depth.31 After CNV calling and filtering, a total of 8,120 CNV regions (CNVRs) were identified on autosomes (Table S9), which consisted of 6,371 deletions and 1,749 duplications compared with the ARS-UCD 1.2 reference genome (Figure 5A). KEGG and GO analyses revealed enrichment of genes involved in biological processes relevant to olfactory receptor activity, the immune system, and response to autoimmune thyroid processes (corrected p value <0.05) (Figure 5B; Table S10). Among these, a set of genes associated with the immune response, including BLA-DQB, TNFSF10, RAET1G, BOLA-DQB, CD1A, MIC1, IL32, BOLA-DQA5, BOLA-DQA2, and BOLA-NC1, may affect the response of HKF cattle to parasitic pathogens.

Figure 5.

Figure 5

Detection and functional annotation of CNVRs

(A) Counts of each CNVR type in HKF cattle, with purple and pink bars representing deletion and duplication, respectively.

(B) Enrichment analysis based on the CNVR-related genes.

(C) Photographs of example HKF cattle used in this study, showing the black and yellow color phenotypes (adapted from Barbato et al.8).

(D) Manhattan plot of VST statistics between black and yellow color phenotypes in HKF cattle.

(E) Estimated copy numbers of CNVRs on BTA13 (CNVR-NINL) between yellow and black color phenotypes.

(F) Estimated copy numbers of CNVRs on BTA15 (CNVR-DEXI and CNVR-CIITA) between yellow and black color phenotypes.

To investigate the impact of candidate CNVRs on coat color of HKF cattle, we calculated the VST between individuals with black and yellow coats. A total of 143 CNVRs were found to have a VST value greater than or equal to the highest 1% of values (Figures 5C and 5D). Among these, a total of 51 genes were annotated as differentiated CNVRs. Four of these CNVRs were present in three genes that were associated with coat color (NINL, DEXI, and CIITA). We compared the copy numbers of overlapping CNVRs for black and yellow HKF cattle individuals. In black individuals, CNVR-NINL, CNVR-DEXI, and CNVR-CIITA had higher copy numbers than in yellow individuals (Figures 5E and 5F). We hypothesize that these genes may play important roles in the coat color phenotype of HKF cattle.

Discussion

HKF cattle exist primarily as feral animals in Hong Kong, China, and the population size has been declining.32 Our study used high-depth WGS data to investigate the genetic structure, demographic history, introgression time frames, and genetic architecture of adaptive traits of HKF cattle. The use of WGS data is an advantage over our previous work on HKF cattle that relied on BovineSNP50 SNP chip8 because it reduces ascertainment bias. The BovineSNP50 chip33,34 is designed based on B. taurus and, as such, is not representative of the genetic diversity of HKF cattle that have substantial indicine genetics. Moreover, our dataset allowed for the detection of CNVs that were missed in previous studies relying primarily on genotyping arrays.8

Here, we showed that the ancestral contributions of HKF cattle came from EAI cattle, a pattern that was supported by the distribution of Y chromosome uniparental lineages. We also found that the HKF cattle exhibit distinct genetic patterns and internal uniformity, likely shaped by prolonged geographic isolation. We quantified inbreeding in HKF cattle by identifying ROH. Our analysis revealed that ROH length and number in HKF cattle were significantly higher than in other indicine cattle. The high degree of inbreeding in HKF cattle may be partly explained by limited gene flow with mainland populations during Hong Kong’s 151 years as a British colony (1842–1941 and 1945–1997). Interestingly, we observed fewer homozygous mutations that were predicted to be deleterious (LoF variants) in HKF cattle compared to the other cattle groups included in this study. Many predicted deleterious alleles were found in the heterozygous state. When combined with the small effective population size of HKF cattle, they pose a significant threat to future population’s viability. It is possible that inbreeding in HKF cattle has resulted in the purging of deleterious alleles through purifying selection. Moreover, associative overdominance, whereby heterozygous loci mask deleterious effects, may also contribute to this pattern. Both mechanisms warrant further investigation to clarify their roles.

We found evidence of adaptive introgression of banteng and gaur ancestry in HKF cattle, consistent with EAI cattle.5 The banteng and gaur introgression occurred before the divergence between HKF and EAI cattle. The high proportion of the banteng/gaur ancestry in these populations may be due to their geographic location, which served as the primary entry point of indicine cattle into East Asia.5 The banteng and gaur genes introgressed into HKF were enriched in biological pathways of oxygen binding, transport and carrier activity, and bitter taste receptor activity. In particular, the bitter taste receptor activity pathway included several target genes (TAS2R7, TAS2R16, T2R12, TAS2R8, and TAS2R38), which are associated with local adaptation to different food sources. Preference for different food sources due to taste is considered a strong driving force in ecological speciation.35,36,37 The oxygen binding pathway has been shown to be associated with disease and parasite tolerance in EAI cattle.5,38 Generally, energy homeostasis and color variation can influence the adaptability and survival of species. In the HKF population, FAM135A, that is introgressed from banteng, was found to significantly influence coat color,21 potentially contributing to the adaptation of HKF cattle to tropical and subtropical environments. FAM135A has been found to be associated with the lipid metabolism pathway and maintenance of homeostasis.22,23

Genomic adaptations to diverse environments are increasingly recognized as vital for conservation and food security, with rapid evolution shaping population productivity and viability in the face of climate change.39 Our analyses identified unique and clear signals of recent selection in HKF cattle, which reflect adaptation to a free-ranging environment. One of these regions contained the gene ADRA1A, which is associated with heat stress recovery.40 ADRA1A is a member of the G protein-coupled receptor superfamily that enhances adipocyte thermogenesis41 and changes feeding behavior due to heat stress.42 Variants in ADRA1A identified in this study revealed SNPs in predicted promoter and enhancer regions from assay for transposase-accessible chromatin using sequencing (ATAC-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) data. Notably, the SNP in the promoter was predicted to enhance the binding of KLF17, a transcription factor involved in immune regulation.29 These non-coding variants may play a role in regulating its expression and facilitating the adaptation of HKF cattle to hot tropical ecosystems in the absence of human heat-stress mitigation management. Although the open chromatin region data came from Hereford cattle, they provide a basis for the tentative identification of conserved regulatory regions.43 Further studies are needed to account for potential differences in HKF cattle for more accurate inference of regulatory regions. We also identified a selection signal for B4GALNT3 located at BTA5 in HKF cattle, with a haplotype that was different from the other four populations analyzed. We observed a mutation in B4GALNT3 that affects protein structure. B4GALNT3 is associated with circulating sclerostin in humans,44 and it may increase bone mass, reducing fracture risk.30 The positive selection signals in the genes ADRA1A and B4GALNT3 may reflect the adaptation of HKF cattle to heat stress and environmental stress.

CNVs can affect gene function by altering gene dosage and structure and thus may play a role in phenotypic evolution.45,46,47,48 Our analysis of the number of CNVs in HKF cattle revealed that compared to the reference genome, deletions significantly exceeded the expansions. Pathway annotation of the expanded CNVRs identified in HKF cattle revealed a significant enrichment of genes related to the immune system. This suggests that HKF cattle may possess genetic features related to adaptive immune function that are potentially associated with survival and reproduction in their specific environments. The genes BOLA-DQB, BOLA-DQA5, BOLA-DQA2, and BOLA-NC1 located on BTA 23, which are part of the bovine major histocompatibility complex (MHC) class II gene family, are in regions under selection and play key roles in immune response and adaptation to tropical conditions.49 Importantly, HKF cattle have not been subjected to recent artificial selection for production and morphological traits, which has led to phenotypic diversity in populations. In this study, we observed that the NINL gene differed between the two coat colors of HKF cattle. NINL has been shown to be involved in microtubule organization, leading to the dispersion of lysosomes toward the cell periphery, thereby affecting the distribution of melanosomes.50,51 The expression of NINL is higher in Asian skin, which may regulate the distribution of melanosomes, contributing to the distinct skin color differences compared to Caucasians.52 We hypothesize that NINL may have a role in the coat color phenotypic variations seen in HKF cattle. DEXI and CIITA have been shown to be associated with black color in camels.53

In this study, we explored the unique interactions between the demography and genetics of HKF cattle to better understand the evolutionary history of this population. We provided evidence that the ancestral contributions of HKF cattle came from EAI cattle. We identified wild introgression into HKF cattle, likely a result of their geographic position as a gateway for indicine migration into East Asia. At the genomic level, we were able to find several genetic variations that may be adaptations to the environment, potentially related to heat tolerance, survival, and disease resistance. Practically, HKF cattle have developed into a genetically distinct population clearly differentiated from their domestic ancestors. These results indicate how animals free from human management have to reshape their genome to cope with a harsher environment as they are unprotected from biotic and abiotic stress. This adaptation is at the expense of diversity caused by purifying selection and increased inbreeding due to mating of relatives from families having higher fitness under natural conditions.

Limitations of the study

This study provides insights into the genetic origin and unique adaptation of HKF cattle. One limitation lies in the lack of breed-specific open chromatin data, which restricts the direct assessment of selection on non-coding regulatory regions. Without such data, inferences about regulatory adaptation are indirect and may not fully reflect the breed’s unique regulatory landscape. Generating open chromatin datasets in relevant tissues would allow a more precise characterization of regulatory selection. Another limitation is that the functional effects of candidate loci, particularly those associated with coat color, remain untested. Targeted in vivo and in vitro studies will be necessary to validate these loci and test the hypotheses proposed here. Additionally, potential sex-specific effects cannot be excluded, and future studies should incorporate sex as an important variable.

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Wai Yee Low (wai.low@adelaide.edu.au).

Materials availability

This study did not generate new unique reagents.

Data and code availability

  • The newly generated whole-genome sequence data from this study have been deposited in the National Genomics Data Center (NGDC), China National Center for Bioinformation, under the Bioproject accession number PRJCA040422. The details of aforementioned data and other downloaded publicly available data used in this study are provided in Table S1.

  • All analysis code has been deposited in the GitHub repository and is publicly available as of the date of publication. Accession numbers are listed in the key resources table.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Acknowledgments

This work was supported by National Key R&D Programs of China (grant no. 2021YFF1001000), the National Natural Science Foundation of China (32341054 and 32522096), and the China Agriculture Research System of MOF and MARA (CARS-37). W.Y.L., J.L.W., and R.T. were supported by the Davies Bequest through the Davies Livestock Research Center of the University of Adelaide. We thank the High-Performance Computing (HPC) of Northwest A&F University (NWAFU) for providing the computing resources. We gratefully acknowledge all participants who contributed to this study. The cow illustration used in the graphical abstract was provided by BioGDP.com (https://biogdp.com/).54

Author contributions

W.Y.L. and N.C. designed and supervised the project. X. Luo and X. Lu performed the majority of analysis. Y.R., X. Liu, and Y.C. were involved in data visualization. X. Luo, N.C., Y.R., W.Y.L., and C.L. wrote, reviewed, and edited the manuscript with input from all authors. M.P.R. and R.A.L.B. collected the samples and revised the manuscript. J.L.W. initiated the study, coordinated sample collection, and revised the manuscript. P.A.-M., M.B., and R.T. contributed ideas for analysis and revised the manuscript.

Declaration of interests

The authors declare no competing interests.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Biological samples

Hong Kong feral cattle (HKF) Hong Kong Special Administrative Region of China N/A

Deposited data

Resequencing data of Hong Kong feral cattle (HKF) This study; National Genomics Data Center (NGDC) PRJCA040422
Resequencing data of East Asian indicine cattle (EAI) National Center for Biotechnology Information search database (NCBI) PRJNA658727; PRJNA379859; PRJNA283480
Resequencing data of South Asian indicine cattle (SAI) National Center for Biotechnology Information search database (NCBI) PRJNA658727; PRJNA379859
Resequencing data of European taurine cattle (EUT) National Center for Biotechnology Information search database (NCBI); European Bioinformatics Institute (EBI) PRJNA176557; PRJEB28185
Resequencing data of Northeast Asian taurine cattle (NEAT) National Center for Biotechnology Information search database (NCBI) PRJNA210519; PRJNA1085861
Resequencing data of Buffalo cattle National Center for Biotechnology Information search database (NCBI) PRJNA547460
Resequencing data of Banteng and Gaur National Center for Biotechnology Information search database (NCBI); European Bioinformatics Institute (EBI) PRJNA325061; PRJNA658727; PRJEB31621; PRJNA427536

Software and algorithms

Burrow-Wheeler Aligner https://github.com/lh3/bwa v.0.7.13
SAMtools https://github.com/samtools/ v.1.9
Genome Analysis Toolkit (GATK) https://github.com/broadinstitute/gatk/releases v.3.8.1.0
ANNOVAR https://github.com/WGLab/doc-ANNOVAR v2016-02-01
PLINK https://www.cog-genomics.org/plink/ v.1.9
EIGENSOFT https://github.com/argriffing/eigensoft v4.2
ADMIXTURE https://dalexander.github.io/admixture/ v1.3.0
MEGA https://www.megasoftware.net/ v11.0
VCFtools https://vcftools.github.io/index.html v0.1.16
Mclust R package https://mclust-org.github.io/mclust/ v6.1.1
BEDTools https://github.com/arq5x/bedtools2 v.2.25.0
SnpEff http://pcingola.github.io/SnpEff/ v4.3
MSMC2 https://github.com/stschiff/msmc2 V2.1.0
SMC++ https://github.com/popgenmethods/smcpp v1.15.4
RepeatMasker http://www.repeatmasker.org v.4.0.5
Trim Galore https://github.com/FelixKrueger/TrimGalore v.0.6.10
Bowtie2 https://bowtie-bio.sourceforge.net/bowtie2 v.2.4.5
Picard http://broadinstitute.github.io/picard v.2.20.2
Macs2 https://github.com/macs3-project/MACS v.2.2.7.1
MEME https://github.com/cinquin/MEME v5.5.2
RFMix https://github.com/slowkoni/rfmix v2.02
ADMIXTOOLS https://github.com/DReichLab/AdmixTools v7.0.2
TreeMix https://github.com/carolindahms/TreeMix v1.13
Fastsimcoal https://speciationgenomics.github.io/fastsimcoal2/ v2.6
CNVnator https://github.com/abyzovlab/CNVnator v.0.4.1
Kobas http://bioinfo.org/kobas v3.0
Genome analysis pipeline https://github.com/xxiaoyull/cattle-genome-analysis N/A

Experimental model and study participant details

Whole-blood samples were collected from 30 cattle in Hong Kong, China. The animals sampled are part of a feral cattle population of approximately 1,000 individuals, with samples mainly collected from Sai Kung, Lantau Island, and the East and North New Territories of Hong Kong.8 Both male and female animals were included, and sex information was recorded for each individual (Table S1). All animal work was done in compliance with Animals (Control of Experiments) Ordinance Chapter 340, the Laws of Hong Kong (License Nos 16–47 to 50, DH/HA&P/8/2/5 Pt 5), and with City University of Hong Kong Ethical Committee approval. Animals were maintained under standard local husbandry conditions prior to sampling.

Method details

DNA re-sequencing data

Genomic DNA was extracted from whole-blood samples of 30 feral cattle (see Experimental Model and Study Participant Details). DNA libraries were prepared and sequenced according to the standard protocols of BGI using a BGI-SEQ500 sequencer. Paired-end sequencing (2x150bp) was done with the target coverage of 30X. We checked the sequencing quality with FastQC v0.11.3 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). In addition, we collected 124 WGS data from the NCBI Sequence Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra/), for 30 European taurine (Hereford, Angus, and Finncattle), 30 Northeast Asian taurine (Hanwoo and Fuzhou), 26 East Asian indicine (Weizhou and Leiqong) and 30 South Asian indicine (Sahiwal, Thawalam and Dhanni), four banteng, two of gaur, and two swamp buffalo (Table S1).

Variant calling

All reads were mapped to the Bos taurus reference genome ARS-UCD1.2 using BWA-MEM v0.7.13-r1126.55 SAMtools v1.956 and Picard tools v2.20.2 (http://broadinstitute.github.io/picard) were used to sort the BAM file and filter duplicate reads. All genomic data were uniformly processed following the approach of a previous study.5 Variants were identified using ‘HaplotypeCaller’, ‘GenotypeGVCFs’, ‘SelectVariants’ and ‘VariantFiltration’ in the Genome Analysis Toolkit (GATK v3.8-1-0-gf15c1c3ef).57 Only biallelic alleles were retained for downstream analyses. Additionally, we performed functional annotation on all autosomes using ANNOVAR v2016-02-01.58

Comparison of findings between SNP array and WGS

Previously, the genetic composition of 21 HKF cattle was investigated with Illumina BovineSNP50v2 BeadChip.8 We downloaded the SNP data of these 21 animals to compare allele frequency spectrum and overlapping SNPs with our WGS data. There were only two animals in common between the two datasets. The genomic coordinates of the SNP array markers were determined according to the ARS-UCD1.2 genome assembly (liftover coordinates available at https://www.animalgenome.org/repository/cattle/UMC_bovine_coordinates/). Minor allele frequency, heterozygosity and inbreeding coefficients were computed using PLINK v1.9.59

Population structure and genetic differentiation analysis

For the population structure analysis, we filtered the SNPs with MAF < 0.01 and high linkage disequilibrium (LD) using PLINK v1.9.59 The LD filtering parameter was ‘--indep-pairwise 50 10 0.1’, corresponding to an r2 threshold of 0.1. This gave us 3,504,555 SNPs for principal component analysis (PCA) using EIGENSOFT v4.2.60 Using the same dataset, genetic admixture analysis was performed with ADMIXTURE v1.3.0.61 For the phylogenetic analysis, we selected SNPs with MAF < 0.01, resulting in 46,167,659 SNPs for tree construction. We calculated a matrix of pairwise genetic distances and constructed the neighbor-joining (NJ) phylogenetic tree with MEGA v.11.0.62

For 84 male samples, we obtained Y chromosomes with an average sequencing depth of 8.39x. All clean reads were aligned to the Btau_5.0.1 Y chromosome reference sequence (GCF_000003205.7), and SNPs with more than 10% missing genotypes were filtered out. A total of 2823 SNPs passed the quality control. Phylogenetic trees were then inferred using ML methods performed with MEGA v.11.0.62

FST were calculated using the ‘smartpca’ command from EIGENSOFT v4.260 with parameters ‘fstonly’. The nucleotide diversity of each breed was assessed by VCFtools v0.1.1663 with parameters: '--window-pi 50000 --window-pi-step 20000’.

Runs of homozygosity

ROH were calculated using PLINK v1.9 based on phased and imputed SNPs. To characterize the distribution of ROH lengths, we applied a Gaussian mixture model implemented in the ‘mclust’ R package per population.64 The ROH segments were classified into short, medium and long size classes following the recommended approach.9,65 The boundaries of ROH were calculated to compare distribution patterns across different populations.

To evaluate specific ROH in HKF cattle, we first defined common ROH within a population as those present in more than 50% of individuals. We then used BEDTools v.2.25.066 to compare ROH across multiple populations, retaining regions present in HKF but absent in other populations as specific ROH to HKF cattle.

Screening of loss-of-function mutations

We estimated the LoF mutations in all population genomes using SnpEff v4.3.67 We used the swamp buffalo genomes to infer ancestral alleles in the HKF genome to screen derived alleles. Here, we considered ‘stop_gained’, ‘stop_lost’, ‘start_lost’, ‘splice_donor_variant’, and ‘splice_acceptor_variant’ as LoF mutations. We counted the number of LoF mutations per individual in the homozygous and heterozygous states and calculated the proportion of homozygous-derived SNPs with the following formula: 2 × homozygous sites/(2 × homozygous sites + heterozygous sites).

Estimation of effective population size

To infer effective population sizes (Ne), we applied both MSMC216 and SMC++15 analyses. MSMC2 was applied to all populations using two deep-coverage individuals per group, while SMC++ was computed on 15 groups of randomly selected 10 samples in the HKF population. In accordance with the recommendation, we masked low-complexity regions of the genome identified by RepeatMasker v.4.0.5 (http://www.repeatmasker.org), and focused on biallelic SNPs. For demographic inference, we used a generation time of six years and a per-generation mutation rate of 1.26 × 10-8.

Selective sweep analysis

We calculated the Di statistic24,68 to identify SNPs with highly differentiated allele frequencies between HKF and other cattle populations. For each SNP, Di measures the standardized deviation of pairwise FST values from the mean across populations, with higher values indicating greater population differentiation. We then selected the top 0.1% of Di values for further downstream functional analysis. We also used a FST and θπ ratio to detect selection sweeps within 50 kb windows and 20 kb steps in HKF cattle. The genomic selection regions considered as important were those identified using both methods.

ATAC-seq, ChIP-seq and transcription factor footprinting analyses

A total of ATAC-seq and ChIP-seq data from seven tissues of two animals (n=2) were obtained from NCBI. The raw reads were filtered using Trim Galore v.0.6.10 (https://github.com/FelixKrueger/TrimGalore) and mapped to the ARS-UCD1.2 reference genome using Bowtie2 v.2.4.5.69 Duplicates and low-quality alignments were removed with Picard v.2.20.2 (http://broadinstitute.github.io/picard). The ATAC-seq peaks were called using macs2 v.2.2.7.1.70 The two duplicates of the same tissue were merged using BEDTools v.2.25.0.66 The same pipeline was used for processing the ChIP-seq data. Transcription factor binding sites were predicted using FIMO from the MEME suite,68,71 with motifs obtained from the JASPAR CORE 2024 database.72

Introgression analysis

We used the D statistics17 and RFMix v2.0218 methods to characterize potential banteng or gaur introgression into HKF cattle. To avoid bias in the introgression ratio, we selected five pure South Asian indicine cattle (SAI), retrieved from a previous publication,5 for subsequent D statistics calculations using ADMIXTOOLS v7.0.2.17 RFMix v2.0218 was used to determine introgressed ratios based on phased data. In addition, we inferred a population-level phylogeny and migration events using qpgraph in admixtools2 (https://uqrmaie1.github.io/admixtools/articles/admixtools.html) and TreeMix.19 We used the “find_graphs” function to find the best-fitting admixture graph, with the swamp buffalo set as the outgroup.73,74 We further explored demographic models and estimated the timing of introgression based on the qpgraph results using fastsimcoal v2.6.20 For the demographic model, we performed 100 optimization runs with ‘-n100000’ and ‘-L20’ to obtain the best-fitting parameters yielding the highest likelihood.

To detect regions of adaptive introgression, we used U50SAI, HKF cattle, banteng/gaur (1%, 50%, and 100%) based on 50 kb windows with 20 kb steps. For each window, we counted the number of SNPs where a specific allele was fixed (100% frequency) in banteng/gaur, its frequency in SAI cattle was less than 1% and greater than 50% in HKF cattle. Windows in the top 5% of such allele counts that met these criteria were considered as candidate introgressed regions.

Copy number variations calling and detection

The CNVnator v.0.4.1 software was used to call CNVs relative to the ARS-UCD1.2 reference assembly. The bin size was set to 250bp for each individual. All detected events had a length >1 kb, a p-value based on t-test statistics <0.001 and a fraction of reads with zero mapping quality (q0) < 0.5. CNVRs were defined using TCAG-WGS-CNV-workflow (https://github.com/bjtrost/TCAG-WGS-CNV-workflow). We considered CNVs with at least 50% reciprocal overlap as CNVRs.

To detect genetic variants involved in the coat color phenotype of HKF cattle, we calculated VST by comparing black and yellow HKF cattle. CNVRs with VST values in the top 1% of the empirical distribution were identified as putative selective sweeps associated with coat color.75,76

Annotation and functional enrichment analysis

We obtained candidate genes by annotating candidate selective sweep regions, introgressed regions and CNVRs. These candidate genes were analyzed using kobas 3.077 to explain their biological functions (corrected P value < 0.05). The statistical significance of enriched terms was assessed using the hypergeometric test or Fisher’s exact test, and the Benjamini and Hochberg method was applied to control the false discovery rate (FDR).78

Quantification and statistical analysis

All statistical details are provided in the corresponding figure legends. For comparisons of cumulative ROH length among populations, pairwise statistical analyses were performed using the Wilcoxon rank-sum test. Statistical significance is indicated by asterisks: ∗P < 0.05, ∗∗P < 0.01, ∗∗∗P < 0.001, and ∗∗∗∗P < 0.0001. Demographic models were inferred using fastsimcoal, and parameter uncertainty was assessed using 95% confidence intervals.

Published: March 24, 2026

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.isci.2026.115465.

Contributor Information

Ningbo Chen, Email: ningbochen@nwafu.edu.cn.

Wai Yee Low, Email: wai.low@adelaide.edu.au.

Supplemental information

Document S1. Figures S1–S10
mmc1.pdf (1.1MB, pdf)
Table S1. Summary of sequencing data, related to STAR Methods
mmc2.xlsx (24KB, xlsx)
Table S2. Functional classification of the detected SNPs, related to Figure S1
mmc3.xlsx (10.4KB, xlsx)
Table S3. Comparison of allele frequency between SNP50 array and WGS of HKF cattle, related to STAR Methods
mmc4.xlsx (9.8KB, xlsx)
Table S4. The results of the 4-population test (qpDstat) in populations, related to Figure 3
mmc5.xlsx (10.2KB, xlsx)
Table S5. The results of the 4-population test (qpDstat) in individuals, related to Figure 3
mmc6.xlsx (17.7KB, xlsx)
Table S6. Results from the enrichment analysis of genes under introgressed analysis in Hong Kong feral cattle

Corrected P value of <0.05 were considered statistically significantly enriched. Related to Figure S9

mmc7.xlsx (17.4KB, xlsx)
Table S7. Twelve mutations affecting ADRA1A enhancer/promoter activity, related to Figure 4
mmc8.xlsx (11KB, xlsx)
Table S8. Summary of predicted motif information, related to Figure 4
mmc9.xlsx (9.3KB, xlsx)
Table S9. The statistics of CNV count, related to Figure 5
mmc10.xlsx (9.3KB, xlsx)
Table S10. Results from the enrichment analysis of genes under CNV analysis in Hong Kong feral cattle

Corrected P value of <0.05 were considered statistically significantly enriched. Related to Figure 5

mmc11.xlsx (14.6KB, xlsx)

References

  • 1.Felius M., Beerling M.-L., Buchanan D., Theunissen B., Koolmees P., Lenstra J. On the History of Cattle Genetic Resources. Diversity. 2014;6:705–750. [Google Scholar]
  • 2.Pitt D., Bruford M.W., Barbato M., Orozco-terWengel P., Martínez R., Sevane N. Demography and rapid local adaptation shape Creole cattle genome diversity in the tropics. Evol. Appl. 2019;12:105–122. doi: 10.1111/eva.12641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.AFCD Stray Cattle Management Plan. Agriculture, Fisheries and Conservation Department. 2022. https://www.afcd.gov.hk/english/quarantine/cattlebuffalo.html
  • 4.Wang C. From Isolated Fishing Village to Cosmopolitan City: Geographical Changes of Hong Kong in Modern Times. Journal of Education, Humanities and Social Sciences. 2024;35:131–136. doi: 10.54097/sm9q6r74. [DOI] [Google Scholar]
  • 5.Chen N., Xia X., Hanif Q., Zhang F., Dang R., Huang B., Lyu Y., Luo X., Zhang H., Yan H., et al. Global genetic diversity, introgression, and evolutionary adaptation of indicine cattle revealed by whole genome sequencing. Nat. Commun. 2023;14:7803. doi: 10.1038/s41467-023-43626-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wu D.D., Ding X.D., Wang S., Wójcik J.M., Zhang Y., Tokarska M., Li Y., Wang M.S., Faruque O., Nielsen R., et al. Pervasive introgression facilitated domestication and adaptation in the Bos species complex. Nat. Ecol. Evol. 2018;2:1139–1145. doi: 10.1038/s41559-018-0562-y. [DOI] [PubMed] [Google Scholar]
  • 7.Moran B.M., Payne C., Langdon Q., Powell D.L., Brandvain Y., Schumer M. The genomic consequences of hybridization. eLife. 2021;10 doi: 10.7554/eLife.69016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Barbato M., Reichel M.P., Passamonti M., Low W.Y., Colli L., Tearle R., Williams J.L., Ajmone Marsan P. A genetically unique Chinese cattle population shows evidence of common ancestry with wild species when analysed with a reduced ascertainment bias SNP panel. PLoS One. 2020;15 doi: 10.1371/journal.pone.0231162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Pemberton T.J., Absher D., Feldman M.W., Myers R.M., Rosenberg N.A., Li J.Z. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet. 2012;91:275–292. doi: 10.1016/j.ajhg.2012.06.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Razali R.M., Rodriguez-Flores J., Ghorbani M., Naeem H., Aamer W., Aliyev E., Jubran A., Qatar Genome Program Research Consortium. Clark A.G., Fakhro K.A., Mokrab Y. Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes. Nat. Commun. 2021;12:5929. doi: 10.1038/s41467-021-25287-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gulisija D., Gonzalez-Reymundez A., Fenton J.I., de Los Campos G., Bray M.S., Vazquez A.I. Uncovering covariance patterns across energy balance traits enables the discovery of new obesity-related genes. Obesity. 2025;33:1184–1194. doi: 10.1002/oby.24291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Yang P., Zhang Z., Xu J., Qu K., Lyv S., Wang X., Cai C., Li Z., Wang E., Xie J., et al. The Association of the Copy Number Variation of the MLLT10 Gene with Growth Traits of Chinese Cattle. Animals. 2020;10 doi: 10.3390/ani10020250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chevalier M., Rhee H., Elguindi E.C., Blond S.Y. Interaction of murine BiP/GRP78 with the DnaJ homologue MTJ1. J. Biol. Chem. 2000;275:19620–19627. doi: 10.1074/jbc.M001333200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sahi C., Kominek J., Ziegelhoffer T., Yu H.Y., Baranowski M., Marszalek J., Craig E.A. Sequential duplications of an ancient member of the DnaJ-family expanded the functional chaperone network in the eukaryotic cytosol. Mol. Biol. Evol. 2013;30:985–998. doi: 10.1093/molbev/mst008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Terhorst J., Kamm J.A., Song Y.S. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genet. 2017;49:303–309. doi: 10.1038/ng.3748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Schiffels S., Wang K. MSMC and MSMC2: The Multiple Sequentially Markovian Coalescent. Methods Mol. Biol. 2020;2090:147–166. doi: 10.1007/978-1-0716-0199-0_7. [DOI] [PubMed] [Google Scholar]
  • 17.Patterson N., Moorjani P., Luo Y., Mallick S., Rohland N., Zhan Y., Genschoreck T., Webster T., Reich D. Ancient admixture in human history. Genetics. 2012;192:1065–1093. doi: 10.1534/genetics.112.145037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Maples B.K., Gravel S., Kenny E.E., Bustamante C.D. RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference. Am. J. Hum. Genet. 2013;93:278–288. doi: 10.1016/j.ajhg.2013.06.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Pickrell J.K., Pritchard J.K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 2012;8 doi: 10.1371/journal.pgen.1002967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Excoffier L., Dupanloup I., Huerta-Sánchez E., Sousa V.C., Foll M. Robust demographic inference from genomic and SNP data. PLoS Genet. 2013;9 doi: 10.1371/journal.pgen.1003905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Baltazar-Soares M., Karell P., Wright D., Nilsson J.Å., Brommer J.E. Genomic basis of melanin-associated phenotypes suggests colour-specific environmental adaptations in tawny owls. Mol. Ecol. 2024;33 doi: 10.1111/mec.17247. [DOI] [PubMed] [Google Scholar]
  • 22.Poleti M.D., Regitano L.C.A., Souza G.H.M.F., Cesar A.S.M., Simas R.C., Silva-Vignato B., Oliveira G.B., Andrade S.C.S., Cameron L.C., Coutinho L.L. Longissimus dorsi muscle label-free quantitative proteomic reveals biological mechanisms associated with intramuscular fat deposition. J. Proteonomics. 2018;179:30–41. doi: 10.1016/j.jprot.2018.02.028. [DOI] [PubMed] [Google Scholar]
  • 23.Zhang W., Wu C., Ni R., Yang Q., Luo L., He J. Formimidoyltransferase cyclodeaminase prevents the starvation-induced liver hepatomegaly and dysfunction through downregulating mTORC1. PLoS Genet. 2021;17 doi: 10.1371/journal.pgen.1009980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Akey J.M., Ruhe A.L., Akey D.T., Wong A.K., Connelly C.F., Madeoy J., Nicholas T.J., Neff M.W. Tracking footprints of artificial selection in the dog genome. Proc. Natl. Acad. Sci. USA. 2010;107:1160–1165. doi: 10.1073/pnas.0909918107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Barbosa S., Greville-Heygate S., Bonnet M., Godwin A., Fagotto-Kaufmann C., Kajava A.V., Laouteouet D., Mawby R., Wai H.A., Dingemans A.J.M., et al. Opposite Modulation of RAC1 by Mutations in TRIO Is Associated with Distinct, Domain-Specific Neurodevelopmental Disorders. Am. J. Hum. Genet. 2020;106:338–355. doi: 10.1016/j.ajhg.2020.01.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Taylor P.N., Porcu E., Chew S., Campbell P.J., Traglia M., Brown S.J., Mullin B.H., Shihab H.A., Min J., Walter K., et al. Whole-genome sequence-based analysis of thyroid function. Nat. Commun. 2015;6:5681. doi: 10.1038/ncomms6681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.McAlpine W., Sun L., Wang K.W., Liu A., Jain R., San Miguel M., Wang J., Zhang Z., Hayse B., McAlpine S.G., et al. Excessive endosomal TLR signaling causes inflammatory disease in mice with defective SMCR8-WDR41-C9ORF72 complex function. Proc. Natl. Acad. Sci. USA. 2018;115:E11523–E11531. doi: 10.1073/pnas.1814753115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.de Freitas A.C., Reolon H.G., Abduch N.G., Baldi F., Silva R.M.O., Lourenco D., Fragomeni B.O., Paz C.C.P., Stafuzza N.B. Proteomic identification of potential biomarkers for heat tolerance in Caracu beef cattle using high and low thermotolerant groups. BMC Genom. 2024;25:1079. doi: 10.1186/s12864-024-11021-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Pérez-Stuardo D., Frazão M., Ibaceta V., Brianson B., Sánchez E., Rivas-Pardo J.A., Vallejos-Vidal E., Reyes-López F.E., Toro-Ascuy D., Vidal E.A., Reyes-Cerpa S. KLF17 is an important regulatory component of the transcriptomic response of Atlantic salmon macrophages to Piscirickettsia salmonis infection. Front. Immunol. 2023;14 doi: 10.3389/fimmu.2023.1264599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Movérare-Skrtic S., Voelkl J., Nilsson K.H., Nethander M., Luong T.T.D., Alesutan I., Li L., Wu J., Horkeby K., Lagerquist M.K., et al. B4GALNT3 regulates glycosylation of sclerostin and bone mass. EBioMedicine. 2023;91 doi: 10.1016/j.ebiom.2023.104546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sudmant P.H., Kitzman J.O., Antonacci F., Alkan C., Malig M., Tsalenko A., Sampas N., Bruhn L., Shendure J., 1000 Genomes Project. Eichler E.E. Diversity of human copy number variation and multicopy genes. Science. 2010;330:641–646. doi: 10.1126/science.1197005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Massei G., Koon K.K., Benton S., Brown R., Gomm M., Orahood D.S., Pietravalle S., Eckery D.C. Immunocontraception for managing feral cattle in Hong Kong. PLoS One. 2015;10 doi: 10.1371/journal.pone.0121598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Decker J.E., Pires J.C., Conant G.C., McKay S.D., Heaton M.P., Chen K., Cooper A., Vilkki J., Seabury C.M., Caetano A.R., et al. Resolving the evolution of extant and extinct ruminants with high-throughput phylogenomics. Proc. Natl. Acad. Sci. USA. 2009;106:18644–18649. doi: 10.1073/pnas.0904691106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Matukumalli L.K., Lawley C.T., Schnabel R.D., Taylor J.F., Allan M.F., Heaton M.P., O'Connell J., Moore S.S., Smith T.P.L., Sonstegard T.S., Van Tassell C.P. Development and characterization of a high density SNP genotyping assay for cattle. PLoS One. 2009;4 doi: 10.1371/journal.pone.0005350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Berner D., Thibert-Plante X. How mechanisms of habitat preference evolve and promote divergence with gene flow. J. Evol. Biol. 2015;28:1641–1655. doi: 10.1111/jeb.12683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kopp M., Servedio M.R., Mendelson T.C., Safran R.J., Rodríguez R.L., Hauber M.E., Scordato E.C., Symes L.B., Balakrishnan C.N., Zonana D.M., van Doorn G.S. Mechanisms of Assortative Mating in Speciation with Gene Flow: Connecting Theory and Empirical Research. Am. Nat. 2018;191:1–20. doi: 10.1086/694889. [DOI] [PubMed] [Google Scholar]
  • 37.Barbato M., Hailer F., Orozco-terWengel P., Kijas J., Mereu P., Cabras P., Mazza R., Pirastru M., Bruford M.W. Genomic signatures of adaptive introgression from European mouflon into domestic sheep. Sci. Rep. 2017;7:7623. doi: 10.1038/s41598-017-07382-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Brown C.G. Dynamics and impact of tick-borne diseases of cattle. Trop. Anim. Health Prod. 1997;29:1S–3S. doi: 10.1007/BF02632905. [DOI] [PubMed] [Google Scholar]
  • 39.Stockwell C.A., Hendry A.P., Kinnison M.T. Contemporary evolution meets conservation biology. Trends Ecol. Evol. 2003;18:94–101. doi: 10.1016/S0169-5347(02)00044-7. [DOI] [Google Scholar]
  • 40.Reolon H.G., Abduch N.G., de Freitas A.C., Silva R.M.d.O., Fragomeni B.d.O., Lourenco D., Baldi F., de Paz C.C.P., Stafuzza N.B. Proteomic changes of the bovine blood plasma in response to heat stress in a tropically adapted cattle breed. Front. Genet. 2024;15 doi: 10.3389/fgene.2024.1392670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Rahbani J.F., Scholtes C., Lagarde D.M., Hussain M.F., Roesler A., Dykstra C.B., Bunk J., Samborska B., O'Brien S.L., Tripp E., et al. ADRA1A-Galpha(q) signalling potentiates adipocyte thermogenesis through CKB and TNAP. Nat. Metab. 2022;4:1459–1473. doi: 10.1038/s42255-022-00667-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Cross A.J., Keel B.N., Brown-Brandl T.M., Cassady J.P., Rohrer G.A. Genome-wide association of changes in swine feeding behaviour due to heat stress. Genet. Sel. Evol. 2018;50:11. doi: 10.1186/s12711-018-0382-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kern C., Wang Y., Xu X., Pan Z., Halstead M., Chanthavixay G., Saelao P., Waters S., Xiang R., Chamberlain A., et al. Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research. Nat. Commun. 2021;12:1821. doi: 10.1038/s41467-021-22100-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Zheng J., Maerz W., Gergei I., Kleber M., Drechsler C., Wanner C., Brandenburg V., Reppe S., Gautvik K.M., Medina-Gomez C., et al. Mendelian Randomization Analysis Reveals a Causal Influence of Circulating Sclerostin Levels on Bone Mineral Density and Fractures. J. Bone Miner. Res. 2019;34:1824–1836. doi: 10.1002/jbmr.3803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Harris R.M., Hofmann H.A. Seeing is believing: Dynamic evolution of gene families. Proc. Natl. Acad. Sci. USA. 2015;112:1252–1253. doi: 10.1073/pnas.1423685112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Bickhart D.M., Hou Y., Schroeder S.G., Alkan C., Cardone M.F., Matukumalli L.K., Song J., Schnabel R.D., Ventura M., Taylor J.F., et al. Copy number variation of individual cattle genomes using next-generation sequencing. Genome Res. 2012;22:778–790. doi: 10.1101/gr.133967.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Choi J.W., Lee K.T., Liao X., Stothard P., An H.S., Ahn S., Lee S., Lee S.Y., Moore S.S., Kim T.H. Genome-wide copy number variation in Hanwoo, Black Angus, and Holstein cattle. Mamm. Genome. 2013;24:151–163. doi: 10.1007/s00335-013-9449-z. [DOI] [PubMed] [Google Scholar]
  • 48.Low W.Y., Tearle R., Liu R., Koren S., Rhie A., Bickhart D.M., Rosen B.D., Kronenberg Z.N., Kingan S.B., Tseng E., et al. Haplotype-resolved genomes provide insights into structural variation and gene content in Angus and Brahman cattle. Nat. Commun. 2020;11:2071. doi: 10.1038/s41467-020-15848-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Maiorano A.M., Cardoso D.F., Carvalheiro R., Júnior G.A.F., de Albuquerque L.G., de Oliveira H.N. Signatures of selection in Nelore cattle revealed by whole-genome sequencing data. Genomics. 2022;114 doi: 10.1016/j.ygeno.2022.110304. [DOI] [PubMed] [Google Scholar]
  • 50.Casenghi M., Meraldi P., Weinhart U., Duncan P.I., Körner R., Nigg E.A. Polo-like kinase 1 regulates Nlp, a centrosome protein involved in microtubule nucleation. Dev. Cell. 2003;5:113–125. doi: 10.1016/s1534-5807(03)00193-x. [DOI] [PubMed] [Google Scholar]
  • 51.Wu X., Sakamoto T., Zhang F., Sellers J.R., Hammer J.A., 3rd In vitro reconstitution of a transport complex containing Rab27a, melanophilin and myosin Va. FEBS Lett. 2006;580:5863–5868. doi: 10.1016/j.febslet.2006.09.047. [DOI] [PubMed] [Google Scholar]
  • 52.Yin L., Coelho S.G., Ebsen D., Smuda C., Mahns A., Miller S.A., Beer J.Z., Kolbe L., Hearing V.J. Epidermal gene expression and ethnic pigmentation variations among individuals of Asian, European and African ancestry. Exp. Dermatol. 2014;23:731–735. doi: 10.1111/exd.12518. [DOI] [PubMed] [Google Scholar]
  • 53.Bitaraf S.M., Zare H.J., Banabazi M.H., Faraz A., Esmaeilkhanian S., Naderi A.S., Salim N., Teimoori A., Bitaraf A., Zadehrahmani M., et al. Identification of Candidate Genes for Pigmentation in Camels Using Genotyping-by-Sequencing. Animals. 2022;12 doi: 10.3390/ani12091095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Jiang S., Li H., Zhang L., Mu W., Zhang Y., Chen T., Wu J., Tang H., Zheng S., Liu Y., et al. Generic Diagramming Platform (GDP): a comprehensive database of high-quality biomedical graphics. Nucleic Acids Res. 2025;53:D1670–D1676. doi: 10.1093/nar/gkae973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Li H., Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26:589–595. doi: 10.1093/bioinformatics/btp698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.McKenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A., Garimella K., Altshuler D., Gabriel S., Daly M., DePristo M.A. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Wang K., Li M., Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38 doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Chang C.C., Chow C.C., Tellier L.C., Vattikuti S., Purcell S.M., Lee J.J. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Patterson N., Price A.L., Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2 doi: 10.1371/journal.pgen.0020190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Alexander D.H., Lange K. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinf. 2011;12:246. doi: 10.1186/1471-2105-12-246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Tamura K., Stecher G., Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol. 2021;38:3022–3027. doi: 10.1093/molbev/msab120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Danecek P., Auton A., Abecasis G., Albers C.A., Banks E., DePristo M.A., Handsaker R.E., Lunter G., Marth G.T., Sherry S.T., et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Scrucca L., Fop M., Murphy T.B., Raftery A.E. mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models. Rom. Jahrb. 2016;8:289–317. [PMC free article] [PubMed] [Google Scholar]
  • 65.Szpiech Z.A., Xu J., Pemberton T.J., Peng W., Zöllner S., Rosenberg N.A., Li J.Z. Long runs of homozygosity are enriched for deleterious variation. Am. J. Hum. Genet. 2013;93:90–102. doi: 10.1016/j.ajhg.2013.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Quinlan A.R. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr. Protoc. Bioinform. 2014;47:11.12.1–11.12.34. doi: 10.1002/0471250953.bi1112s47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Cingolani P., Platts A., Wang L.L., Coon M., Nguyen T., Wang L., Land S.J., Lu X., Ruden D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Feng Y., Xie N., Inoue F., Fan S., Saskin J., Zhang C., Zhang F., Hansen M.E.B., Nyambo T., Mpoloka S.W., et al. Integrative functional genomic analyses identify genetic variants influencing skin pigmentation in Africans. Nat. Genet. 2024;56:258–272. doi: 10.1038/s41588-023-01626-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Zhang Y., Liu T., Meyer C.A., Eeckhoute J., Johnson D.S., Bernstein B.E., Nusbaum C., Myers R.M., Brown M., Li W., Liu X.S. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9 doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Bailey T.L., Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1994;2:28–36. [PubMed] [Google Scholar]
  • 72.Rauluseviciute I., Riudavets-Puig R., Blanc-Mathieu R., Castro-Mondragon J.A., Ferenc K., Kumar V., Lemma R.B., Lucas J., Chèneby J., Baranasic D., et al. JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2024;52:D174–D182. doi: 10.1093/nar/gkad1059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Hou J., Guan X., Xia X., Lyu Y., Liu X., Mazei Y., Xie P., Chang F., Zhang X., Chen J., et al. Evolution and legacy of East Asian aurochs. Sci. Bull. 2024;69:3425–3433. doi: 10.1016/j.scib.2024.09.016. [DOI] [PubMed] [Google Scholar]
  • 74.Chen S., Ren L., Gao Y., Dong G., Sheng G., Han J., Liu X., Chen N., Chen F. Evidence of hybridization of cattle and aurochs on the Tibetan Plateau ∼3750 years ago. Sci. Bull. 2024;69:2825–2828. doi: 10.1016/j.scib.2024.06.035. [DOI] [PubMed] [Google Scholar]
  • 75.Yan H., Chen J., Lei C., Zhang F., He S., Song W., Ma H., Li S., Li G., Yongcuo L., et al. Genetic Diversity, Adaptation, Wild Introgression, and Coat Color Mutation of Golden Yak. Animal Research and One Health. 2026:1–13. doi: 10.1002/aro2.70049. [DOI] [Google Scholar]
  • 76.Liu Y.-K., Fu W.-W., Wang Z.-Y., Pei S.-W., Li K.-H., Wu W.-W., Le M.-Z., Yue X.-P. Genomic insights into the genetic diversity, lateral gaits and high-altitude adaptation of Chakouyi (CKY) horses. J Genet Genomics. 2025;52:1001–1010. doi: 10.1016/j.jgg.2024.11.008. [DOI] [PubMed] [Google Scholar]
  • 77.Bu D., Luo H., Huo P., Wang Z., Zhang S., He Z., Wu Y., Zhao L., Liu J., Guo J., et al. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis. Nucleic Acids Res. 2021;49:W317–W325. doi: 10.1093/nar/gkab447. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Zhao Q., Xu J., Shi Z., Zhang Y., Du X., Zhai Y., Xu J., Liu F., Zhang Q. Genome-wide Pleiotropy Analysis Reveals Shared Genetic Associations between Type 2 Diabetes Mellitus and Subcortical Brain Volumes. Research. 2025;8 doi: 10.34133/research.0688. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Figures S1–S10
mmc1.pdf (1.1MB, pdf)
Table S1. Summary of sequencing data, related to STAR Methods
mmc2.xlsx (24KB, xlsx)
Table S2. Functional classification of the detected SNPs, related to Figure S1
mmc3.xlsx (10.4KB, xlsx)
Table S3. Comparison of allele frequency between SNP50 array and WGS of HKF cattle, related to STAR Methods
mmc4.xlsx (9.8KB, xlsx)
Table S4. The results of the 4-population test (qpDstat) in populations, related to Figure 3
mmc5.xlsx (10.2KB, xlsx)
Table S5. The results of the 4-population test (qpDstat) in individuals, related to Figure 3
mmc6.xlsx (17.7KB, xlsx)
Table S6. Results from the enrichment analysis of genes under introgressed analysis in Hong Kong feral cattle

Corrected P value of <0.05 were considered statistically significantly enriched. Related to Figure S9

mmc7.xlsx (17.4KB, xlsx)
Table S7. Twelve mutations affecting ADRA1A enhancer/promoter activity, related to Figure 4
mmc8.xlsx (11KB, xlsx)
Table S8. Summary of predicted motif information, related to Figure 4
mmc9.xlsx (9.3KB, xlsx)
Table S9. The statistics of CNV count, related to Figure 5
mmc10.xlsx (9.3KB, xlsx)
Table S10. Results from the enrichment analysis of genes under CNV analysis in Hong Kong feral cattle

Corrected P value of <0.05 were considered statistically significantly enriched. Related to Figure 5

mmc11.xlsx (14.6KB, xlsx)

Data Availability Statement

  • The newly generated whole-genome sequence data from this study have been deposited in the National Genomics Data Center (NGDC), China National Center for Bioinformation, under the Bioproject accession number PRJCA040422. The details of aforementioned data and other downloaded publicly available data used in this study are provided in Table S1.

  • All analysis code has been deposited in the GitHub repository and is publicly available as of the date of publication. Accession numbers are listed in the key resources table.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES