Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2017 Jul 5;7:4708. doi: 10.1038/s41598-017-04830-2

Whole Mitogenomes Reveal the History of Swamp Buffalo: Initially Shaped by Glacial Periods and Eventually Modelled by Domestication

S Wang 1,#, N Chen 1,#, M R Capodiferro 2,#, T Zhang 3, H Lancioni 4, H Zhang 5, Y Miao 6, V Chanthakhoun 7, M Wanapat 8, M Yindee 9, Y Zhang 10, H Lu 3, L Caporali 11, R Dang 1, Y Huang 1, X Lan 1, M Plath 1, H Chen 1, J A Lenstra 12, A Achilli 2,✉,#, C Lei 1,✉,#
PMCID: PMC5498497  PMID: 28680070

Abstract

The newly sequenced mitochondrial genomes of 107 Asian swamp buffalo (Bubalus bubalis carabensis) allowed the reconstruction of the matrilineal divergence since ~900 Kya. Phylogenetic trees and Bayesian skyline plots suggest a role of the glacial periods in the demographic history of swamp buffalo. The ancestral swamp-buffalo mitogenome is dated ~232 ± 35 Kya. Two major macro-lineages diverged during the 2nd Pleistocene Glacial Period (~200–130 Kya), but most (~99%) of the current matrilines derive from only two ancestors (SA1′2 and SB) that lived around the Last Glacial Maximum (~26–19 Kya). During the late Holocene optimum (11–6 Kya) lineages differentiated further, and at least eight matrilines (SA1, SA2, SB1a, SB1b, SB2a, SB2b, SB3 and SB4) were domesticated around 7–3 Kya. Haplotype distributions support an initial domestication process in Southeast Asia, while subsequent captures of wild females probably introduced some additional rare lineages (SA3, SC, SD and SE). Dispersal of domestic buffaloes created local population bottlenecks and founder events that further differentiated haplogroup distributions. A lack of maternal gene flow between neighboring populations apparently maintained the strong phylogeography of the swamp buffalo matrilines, which is the more remarkable because of an almost complete absence of phenotypic differentiation.

Introduction

Water buffalo (Bubalus bubalis) is one of the most important livestock species in several Asian countries and is used for the production of milk and meat and for draft power in rice cultivation. The domestic water buffalo in Asia is generally divided in two major subspecies, the dairy river buffalo and the draft swamp buffalo, which differ in morphology, behavior and number of chromosomes1, 2. The river buffalo is found in the Indian subcontinent, South Asia and the Mediterranean area (Italy, Egypt and the Balkans), and sporadically in Australia and South America, whereas the swamp buffalo is kept in Northeast India, China (southern regions and Yangtze valley) and Southeast Asia1, 3. Both types of water buffalo descend from the wild Asian buffalo (Bubalus arnee)4, which had a widely distribution range in eastern Indian, Sri Lanka and Southeast Asia until the beginning of XIX century58. Lau et al.7 hypothesized that the wild Asian buffalo originated in mainland of Southeast Asia and spread north toward China and west toward the Indian subcontinent, where the river type was probably domesticated.

Mitochondrial DNA (mtDNA), Y-chromosomal and nuclear microsatellite data showed a deep genetic divergence of swamp and river buffalo2, 57, 916, which indicate two independent domestications5, 17. Domestication of river buffalo most likely took place in the Indian subcontinent15, whereas swamp buffalo was proposed to originate from the border region between south China and north Indochina6, 17. However, the fragmentary buffalo mtDNA sequences from rDNA, COII, and cytochrome b loci reported to date57, 18, confound quantitative inferences of population history. So far two swamp buffalo mitogenomes (NC006295/AY702618 and JN632607) and one river buffalo mitogenome (AF547270) were deposited in GenBank. In this study, we report the mitogenomes from an additional 107 Southeast-Asian swamp buffaloes, covering most of its current geographic distribution with as outgroup one mitogenome from a Chinese river buffalo in order to establish the haplogroup phylogeny and reconstruct the demographic history of the swamp buffalo.

Results

Sequence variation of swamp buffalo mitogenomes

The 109 swamp buffalo mitogenomes with a length of 16340 to 16363 bps belong to 87 different haplotypes (Ht.s) and are divided into 21 haplogroups or subhaplogroups (Supplementary Dataset S1). The swamp haplotypes (Hd: 0.992) contain 362 polymorphic sites (π: 0.422) with a pairwise nucleotide difference of 69.0 ± 16.5 and a synonymous/non-synonymous ration of 4.63. Similar values have been reported for other livestock and human mitochondrial genomes1921. Seventy-five swamp haplotypes are observed once, while the most frequent haplotypes HT22 and HT55 occurred five and seven times, respectively. Forty-eight haplotypes with 59 sequences belong to lineage SA and 34 haplotypes with 44 sequences belong to lineage SB. The remaining 5 haplotypes belong to the rare haplogroups SC (2), SD (2) or SE (1). The two river buffalo mitogenomes belong to clades R1 and R21, 6.

Swamp Buffalo Mitochondrial Phylogeny

A maximum parsimony (MP) tree based on the 111 water buffalo mitogenomes (89 haplotypes) confirms two distinct branches, river and swamp buffalo, which were separated by 300 substitutions (Supplementary Dataset S2). The river buffalo branch includes 2 haplotypes belonging to lineages R1 and R2. The remaining 87 swamp buffalo haplotypes cluster into five divergent haplogroups (Hg.s), namely SA, SB, SC, SD and SE, with an overwhelming representation of SA (54.1%) and SB (40.4%). We largely confirm previous phylogenies6, 10, but also reveal the novel haplogroups SA3 and SB4 and many different subhaplogroups. Lineages SA and SB are divided into three (SA1 to SA3; SA1′2 as an ancestral node) and four (SB1-SB4; SB2′3′4 as an ancestral node) sublineages, respectively. A single control-region transition at position 16066 defines a major (32.1%) star-like subclade SA1a. For the three rare lineages SC (1.8%), SD (2.8%) and SE (0.9%), the complete mtDNA sequences confirm that the split-off of lineage SC preceded the SA-SB divergence and that SD is a sister clade of SB, but indicate that also SE and SA are sister clades. Maximum likelihood (ML) and Bayesian evolutionary analysis of sampling trees (Beast) retrieved remarkably similar tree topologies (Supplementary Figs S1 and S2).

Molecular Clocks and Age Estimates

Previously reported estimates of the divergence time between river and swamp range from 10 Kya to 1.7 Mya2, 5, 7, 911, 1316, 18, 22, 23, partially because different mtDNA segments were analyzed and different evolution rates were applied. In the present study, we phylogenetically compared 111 water buffalo mitogenomes with one African buffalo (Syncerus caffer; NC020617), while rooting our MP tree with one Bos taurus (V00654.1) and one ancient Bos primigenius (GU985279) mitogenome. The divergence pattern evaluated on all 114 bovine mitogenomes (Supplementary Fig. S3) confirms saturation of the D-loop divergence, emphasizing that sequencing of whole mitogenomes is essential for quantification.

We first considered the synonymous mutations for a ML estimation of the molecular age of phylogenetic nodes (Fig. 1). Using a fossil age estimate of the Bovini tribe of 8.8 Mya24 and the age (6.7 Ky) of the ancient Bos primigenius mtDNA25 we calculated a rate of 3.75 ± 0.47 × 10−5 synonymous substitutions per nucleotide per Ky for 3790 amino acid codons equalling 1 synonymous substitution every ~7.030 Ky. This yields divergence times of ~913 ± 78 Kya for the water buffalo mitogenome, ~82 ± 18 Kya for the only two river buffalo R1 and R2 haplotypes and ~232 ± 35 Kya for the swamp buffalo haplogroups (Table 1 and Fig. 1). At least 12 different mtDNA ancestral haplotypes of the current swamp buffaloes were present in Southeast Asia during the early Neolithic (11–6 Kya), which overlaps with the initial phase of domestication26. These haplotypes were the ancestors of the eight (sub)haplogroups SA1, SA2, SB1a, SB1b, SB2a, SB2b, SB3 and SB4, still common in modern herds, and of the rare SA3, SC, SD and SE.

Figure 1.

Figure 1

Phylogeny of complete mtDNAs from 111 buffalo mitogenomes. The topology was inferred by maximum parsimony (Supplementary Dataset S2). A maximum likelihood time scale, based on synonymous substitutions, is indicated below the tree. The standard error for the major nodes are represented by dot lines, further details are available in Table 1. Samples are indicated by 89 different haplotype IDs (Supplementary Dataset S1). The insert shows the geographic distribution of major haplogroups based on these complete mtDNAs. Samples from China have been divided into three regions (Yangtze Valley, Southwest China and Southeast China). The map has been drawn by hand in Adobe Photoshop (v. 8.0; http://www.adobe.com).

Table 1.

Age estimates of major buffalo branches based on different mitochondrial datasets.

Node N ML (synonymous subst) ML (only coding region) ML (all substitutions)a Beast (all substitutions)a
T(ky) SEb(ky) T(ky) SEb(ky) T(ky) SEb(ky) T(ky) SEb(ky)
A.W.M.c 111 912.6 78.3 1044.3 71.4 721.6 59.1 672.0 101.7
River 2 81.9 18.3 109.3 20.7 68.6 11.6 63.7 13.2
Swamp 109 231.8 35.3 280.1 28.8 204.4 20.0 194.2 31.4
SC 2 0.0 59.1 3.8 3.8 1.5 1.5 2.9 1.4
SABDE 107 189.6 27.2 222.9 25.0 176.6 17.7 166.6 26.8
SAE 60 129.3 21.2 152.7 20.9 129.1 15.5 120.6 20.7
SE 1 n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a.
SA 59 33.0 9.2 40.5 9.9 37.5 7.3 35.9 7.9
SA12 56 18.8 6.5 23.9 7.4 18.4 4.5 18.5 4.6
SA1 37 6.4 4.2 8.1 5.4 7.2 1.8 8.4 2.2
SA1a 35 6.4 1.5 8.1 1.7 5.5 0.9 6.7 1.5
SA1a1 3 5.2 1.6 6.8 1.8 4.8 1.0 4.1 1.2
SA1a2 6 4.4 1.7 5.7 1.9 4.5 1.1 4.2 1.0
SA1a3 9 3.3 2.4 4.4 2.5 3.9 1.2 4.2 1.0
SA2 21 3.3 1.5 3.6 1.6 5.1 1.2 7.0 1.8
SA3 1 n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a.
SB′D 47 175.5 27.1 210.0 25.4 157.7 16.9 146.4 24.3
SD 3 0.0 44.6 0.0 44.8 0.9 1.0 3.2 1.3
SB 44 25.8 7.0 35.1 8.3 30.7 5.9 31.1 6.8
SB1 18 11.4 6.2 12.2 4.9 8.6 2.4 8.9 2.7
SB1a 15 6.4 4.4 8.0 2.7 5.8 1.4 6.0 1.4
SB1a1 11 3.3 2.0 4.2 2.1 3.7 1.2 4.5 1.0
SB1a2 2 6.4 4.5 6.5 2.4 4.9 1.4 3.8 1.0
SB1b 3 0.0 13.0 0.0 13.1 1.2 1.4 3.5 1.3
SB234 26 19.8 5.7 26.1 6.7 23.3 4.7 23.4 5.3
SB2 4 19.8 10.6 26.1 12.5 15.7 4.5 10.8 4.0
SB2a 2 3.1 3.1 6.7 4.8 6.0 2.9 3.9 1.4
SB2b 2 6.8 6.8 7.0 7.0 9.9 4.5 5.2 2.3
SB3 19 6.9 3.4 7.2 3.5 4.7 1.8 6.6 1.7
SB3a 16 4.1 3.1 4.6 3.0 2.4 1.0 5.0 1.1
SB3a1 10 1.2 0.9 1.8 1.1 1.4 0.6 3.9 0.8
SB4 3 4.1 2.9 8.7 4.2 5.1 2.2 4.7 1.7

aThe entire genome was partitioned into coding and control region.

bThe 95% Confidence Interval (CI) corresponds to 1.96 times the value of the Standard Error (SE) reported here.

cAncestral Water-Buffalo Mitogenome.

Time estimates were confirmed by (i) using all open reading frame mutations, (ii) considering all mutations partitioned in coding and control regions and evaluated with both ML and Beast (Table 1). However, slightly deleterious mutations within the open reading frames may lead to overestimations of younger clades (Supplementary Fig. S4)19, 21. The overall mutation rate was estimated at 2.11 ± 0.34 × 10−8 substitutions per nucleotide per year (1 mutation every ~2900 years) over the entire mitogenome. As for the river buffalo internal variation, a better estimate will be obtained by analyzing more mitogenomes.

Estimating Past and Present Demographic Trends

Bayesian skyline plot (BSP) of swamp buffalo mitogenomes shows three major changes in the effective female population size: i) a slight decrease between about 200 and 130 Kya; ii) a more recent decrease starting around 25–20 Kya and much steeper during the early Neolithic (11–6 Kya); and iii) a rapid increase from 3 Kya (Fig. 2). This recent increase explains the star-like topology of some haplogroups (e.g. SA1, SA2, SB1a, SB3 and SB4; Fig. 1) and is very similar to previously analysis of water buffalo and other bovine domestic species27.

Figure 2.

Figure 2

Bayesian skyline plot showing the swamp buffalo population size trend. The Y axis indicates the effective number of females, as inferred from our mitogenome dataset considering a generation time of six years49. The black solid line is the median estimate and the blue shading shows the 95% highest posterior density limits.

An analysis of the geographic distribution of swamp haplogroups in Southeast Asia, based on the control-region data currently available in previous literature6 or deposited in GenBank (Supplementary Table S1) confirms the prevalence of SA or SB mtDNAs (>99%) and a geographic differentiation of subhaplogroups with contrasting geographic distributions of the subhaplogroups SA2, SB1, SB2 and SB3 (Supplementary Fig. S5). The rare haplogroup SC was previously reported to occur in Thailand, Bangladesh and sporadically in Southwest China, while SD and SE were found only in Thailand6. We confirmed the presence of SC in the Southwestern Chinese Dehong population, but the haplotypes SC and SE were also found in the Yibin and Poyanghu breeds from the Yangtze Valley (inset in Fig. 1).

Discussion

We sequenced the complete mitogenome of the swamp buffalo in order to reconstruct the phylogenetic relationships of the mitochondrial haplotypes and to obtain a time scale for the phylogeny. Most of the major haplogroups were already identified in previous studies2, 5, 7, 911, 1316, 18, 22, 23, but we recognized the novel haplogroups SA3 and SB4 and defined the subhaplogroups SA1a, SA1a1, SA1a2, SA1a3, SB1a, SB1a1, SB1a2, SB1b, SB2a, SB2b, SB3a, SB3a1, SD1 and SD2 (Supplementary Dataset S1 and S2).

During the glacial periods, drastic changes in ecological and climatic seasons had consecutive major effects on the distribution of plants and animals28, 29. As part of the Indo-Pacific Warm Pool, Southeast Asia was relatively warm during the Last Glacial Maximum (~26–19 Kya)30 with a temperature decrease of only 2.5 °C vs 5–10 °C globally31, 32. This made Southeast Asia a major global biodiversity hotspot33 in which many species survived (in glacial refuges) the fluctuations of temperature and forest coverage during the Pleistocene (for the latter, see Supplementary Fig. S6)28.

Estimations of divergence times are inherently imprecise34, but the accuracy of our data has been optimized by using data from the aurochs as internal calibration point, this in addition to a fossil age of the Bovini tribe of 7–11 Mya24. According to our estimate, the divergence of the swamp and river types of the water buffalo took place almost at the beginning of a glacial period (~900 to 860 Kya). Remarkably, from the swamp matrilineal diversity that must have formed until 200 Kya only the minor haplogroups SC and the ancestor of all other haplogroups (SA′B′D′E) have survived. We propose to divide the last 200 ka into five phases, correlating glacial periods and estimates of demographic/phylogenetic history (Figs 1 and 2 and Supplementary S6).

  1. During 2nd Pleistocene Glacial Period (~200 to 130 Kya) the first decline of population was observed in the BSP while two macro-haplogroups (SA′E and SB′D) diverged (Figs 1 and 2).

  2. The first phase of the last Pleistocene glacial period (~110 to 50 Kya) was still comparatively moderate, the population size remained almost unchanged and only one divergence event (SA-SE) has been identified.

  3. During the second phase of the last Pleistocene glacial period (~50 to 11 Kya) the population began to decline. The current demographic composition of swamp populations shows that ~99% of current mitogenomes are derived from only two ancestral haplotypes (SA1′2 and SB), both dated around the LGM (~26–19 Kya). Afterwards, the major haplogroups SA1′2 and SB differentiated into 8 haplotypes (SA1, SA2, SB1a, SB1b, SB2a, SB2b, SB3 and SB4).

  4. After 11 Kya the increasing temperature raised the sea level, which had a profound impact in the regions from Sundaland to Southeast China35. Paleoenvironmental data on the Holocene indicate a warm period between 11 and 6 Kya in southern China, known as the early Holocene optimum36. This overlapped with the first phases of the rice cultivation, which is believed to have triggered the domestication of the swamp buffalo at 7–3 Kya and an expansion of the water buffalo population1, 6, 10, 14, 26, 37.

  5. Finally, we observe a rapid increase since 3 Kya to the present population size. As previously proposed27, this was due to expansion of the domestic buffalo to the large current distribution range, harboring the several present populations with distinct haplogroup distributions.

Thus, the demographic history of swamp buffalo seems to be linked to the historic glacial events, establishing isolated refugia of swamp buffalo in which only a limited number of subhaplogroups survived. This may explain why only one divergence event (SA-SE) has been dated to the period of 180–40 Kya. The current demographic composition of swamp populations suggests that ~99% of current mitogenomes are derived from only two ancestors (SA1′2 and SB), which are both dated around the Last Glacial Maximum (~26–19 Kya). The time estimates in Fig. 1 and Table 1 further indicate that also the divergence of the current domestic subhaplogroups SA1, SA2, SB1a, SB1b, SB2a, SB2b, SB3 and SB4 preceded domestication, which gives 8 haplotypes as a minimum estimate of the swamp buffalo diversity captured by the first farmers.

Our account of the pre-domestic history of the swamp buffalo provides a context to previous studies of the diversity of mtDNA control region and cytochrome b gene6, 10, which has been summarized in Supplementary Fig. S5. The high diversity of domestic SA and SB haplotypes in the China-Vietnam border region was proposed as evidence supports an initial major domestication event of swamp buffalo in Southeast Asia, probably between southern China and Vietnam. The finding of SC, SD and SE haplotypes almost exclusively in Thailand and Bangladesh suggests incorporation of these haplotypes after the domestic buffaloes had reached the west bank of the Mekong river6 in a scenario of recurrent restocking the domestic population with wild females as proposed previously for the horse20, 38, 39. Extending the analysis to other loci or even to the entire genome and also a wider sampling covering the entire geographic range of the swamp buffalo is desirable to unravel further the domestication and subsequent demographic history of swamp buffalo and to enable an interesting comparison with the related river buffalo6.

Methods

Sample Collection

Most of the samples used for this work were already collected from previous collaborative works5, 10. All samples were already classified into mtDNA haplogroups based on control-region data. We selected 107 mtDNA for complete sequencing in order to represent all swamp lineages and to include the highest possible molecular variability avoiding potential redundancies. An additional mitogenome from river buffalo was sequenced to be used as an outgroup.

Mitogenome sequencing

DNA was extracted10 from 107 swamp buffaloes (blood, ear tissue and hair follicle) from China (46), Laos (23), Myanmar (3), Thailand (14), Vietnam (16) and India (5) and one Chinese river buffalo. Complete mitogenome sequences were obtained by using two different approaches: 1) PCR amplification (with 27 primer pairs, Supplementary Table S2A)40 and Sanger sequencing; 2) Long-Range PCR amplification (Supplementary Table S2B) and Illumina sequencing41. GenBank accession number, sequencing method, coverage and depth of each sample are reported in Supplementary Dataset S1. MtDNA genome sequences were analyzed using DNASTAR 7.0, Sequencher v5, DNAsp v5, Clustal X and GeneSyn packages.

All experimental procedures were performed in accordance with the Regulations for the Administration of Affairs Concerning Experimental Animals approved by the State Council of People’s Republic of China. The study was approved by Institutional Animal Care and Use Committee of Northwest A&F University (Permit Number: NWAFAC1019).

Phylogeny Construction and Demographic Inferences

The phylogeny construction was performed following a maximum parsimony (MP) criterion by hand and confirmed using an adapted version of mtPhyl4.01542, as previously described20, 21, 43, 44. The modified.txt files to be loaded in the program are available upon request. The tree was rooted on the Bos taurus reference sequence (V00654.1) and on the ancient Bos primigenius mtDNA (GU985279). A maximum likelihood (ML) tree was computed using MEGA7.045 with 1000 bootstrapping replicates.

A first ML analysis was performed using PAML X46 by considering only synonymous mutations in the protein coding genes. The ND6 gene was reverse-complemented to present the same reading direction as the other genes and non-synonymous substitutions were replaced with the ancestral base pairs. Stop codons were excluded from the analysis. Total lengths of coding genes were joined together and the final alignment (11370 bps/3790 codons long) was analyzed with CODEML to calculate a synonymous mutation rate. A second tree was calculated in the same way, but considering all coding mutations. The third and fourth trees with molecular ages were calculated by PAML X and BEAST v. 1.8.3 software47, respectively, while considering two partitions in the molecule corresponding to the coding (including all genes coding for mRNA, rRNA and tRNA) and control regions. Modelgenerator v.85 indicated for our dataset HKY + G + I as the best-supported model according to the AIC2 and BIC criterions. These substitution and site heterogeneity models with 8 gamma categories – the lowest number significantly increasing (>1.0) the likelihood – were selected for the subsequent ML and BEAST estimates. The generalized likelihood ratio statistic was always used to verify the clock hypothesis. In order to calibrate the molecular clock, we built a Bovini tree by including one African Buffalo (NC020617) and two Bos mitogenomes (one Bos taurus, V00654; one ancient Bos primigenius, GU985279) used as an outgroup. For the calibration point we used the estimated archaeological age of the Bovini tribe (8.8 ± 1.1 My; 95% CI: 7–11 My)24. Since multiple calibration points are preferable24, the age of the ancient Bos primigenius (6.7 ± 0.2 Ky; 95% CI: 6.3–7.1 ky) was also used as an internal (recent) calibration point. The major haplogroups were considered as monophyletic in order of being able to calculate their age estimates. The analyses were also repeated excluding the aurochs sequence, but the estimates changed by only ~4% on average. We then obtained a Bayesian skyline plot (BSP)48 from the swamp buffalo phylogeny by running 50,000,000 iterations with samples drawn every 10,000 steps. We constructed spatial frequency distribution plots with the program Surfer 9 (Golden Software, http://www.goldensoftware.com/products/surfer) by using the control-region data currently available in previous literature6 or deposited in GenBank (Supplementary Table S1).

Data accessibility

Sequences of the novel water buffalo mitogenomes have been deposited in GenBank under accession numbers KX758295 - KX758402 (108 complete mtDNAs).

Electronic supplementary material

41598_2017_4830_MOESM1_ESM.pdf (3.6MB, pdf)

Supplementary Figures S1-S6 and Tables S1-S2

Supplementary Dataset S2 (195.5KB, xls)
Supplementary Dataset S1 (25.8KB, xlsx)

Acknowledgements

This research received support from: Natural Science Foundation of China (31272399) and the Program of National Beef Cattle and Yak Industrial Technology System [CARS-38] (to CL); the Italian Ministry of Education, University and Research: Progetti Futuro in Ricerca 2012 (RBFR126B8I) (to AA) and Progetti Ricerca Interesse Nazionale 2012 (2012JA4BTY) (to AA); the University of Pavia strategic theme MIGRATions: towards an “INterdisciplinary Governance model” (MIGRAT.IN.G; to AA), and the Asia Link Project Reproductive biotechnology funded by the European Union.

Author Contributions

Conceived and designed the experiments: S.W., N.C., M.R.C., A.A., C.L.; Performed the experiments: S.W., N.C., M.R.C., L.C. Analyzed the data: S.W., N.C., M.R.C., H.La., A.A., C.L. Contributed reagents/materials/analysis tools: T.Z., A.A., C.L. Performed the collection of biological samples: S.W., N.C., T.Z., H.Z., Y.M., V.C., M.W., M.Y., Y.Z., H.Lu., R.D., Y.H., X.L., M.P., H.C., J.A.L., C.L. Wrote the paper: S.W., N.C., M.R.C., H.La., H.Lu., M.P., J.A.L., A.A., C.L. All authors reviewed and approved the manuscript.

Competing Interests

The authors declare that they have no competing interests.

Footnotes

S. Wang, N. Chen, M. R. Capodiferro, A. Achilli and C. Lei contributed equally to this work.

Electronic supplementary material

Supplementary information accompanies this paper at doi:10.1038/s41598-017-04830-2

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

A. Achilli, Email: alessandro.achilli@unipv.it

C. Lei, Email: leichuzhao1118@126.com

References

  • 1.Cockrill WR. The water buffalo: a review. Br. Vet. J. 1981;137:8–16. doi: 10.1016/s0007-1935(17)31782-7. [DOI] [PubMed] [Google Scholar]
  • 2.Kumar S, et al. Mitochondrial DNA analyses of Indian water buffalo support a distinct genetic origin of river and swamp buffalo. Anim. Genet. 2007;38:227–232. doi: 10.1111/j.1365-2052.2007.01602.x. [DOI] [PubMed] [Google Scholar]
  • 3.FAO. http://dad.fao.org/ (2014).
  • 4.Cockrill, W. R. The husbandry and health of the domestic buffalo. (Food and agricultural organization of the United nations, Rome 1974).
  • 5.Lei CZ, et al. Independent maternal origin of Chinese swamp buffalo (Bubalus bubalis) Anim. Genet. 2007;38:97–102. doi: 10.1111/j.1365-2052.2007.01567.x. [DOI] [PubMed] [Google Scholar]
  • 6.Zhang Y, et al. Strong and stable geographic differentiation of swamp buffalo maternal and paternal lineages indicates domestication in the China/Indochina border region. Mol. Ecol. 2016;25:1530–1550. doi: 10.1111/mec.13518. [DOI] [PubMed] [Google Scholar]
  • 7.Lau CH, et al. Genetic diversity of Asian water buffalo (Bubalus bubalis): mitochondrial DNA D-loop and cytochrome b sequence variation. Anim. Genet. 1998;29:253–264. doi: 10.1046/j.1365-2052.1998.00309.x. [DOI] [PubMed] [Google Scholar]
  • 8.Pandya P, et al. Bacterial diversity in the rumen of Indian Surti buffalo (Bubalus bubalis), assessed by 16S rDNA analysis. J. Appl. Genet. 2010;51:395–402. doi: 10.1007/BF03208869. [DOI] [PubMed] [Google Scholar]
  • 9.Mishra BP, et al. Genetic analysis of river, swamp and hybrid buffaloes of north-east India throw new light on phylogeography of water buffalo (Bubalus bubalis) J. Anim. Breed. Genet. 2015;132:454–466. doi: 10.1111/jbg.12141. [DOI] [PubMed] [Google Scholar]
  • 10.Yue X-P, et al. Phylogeography and Domestication of Chinese Swamp Buffalo. PLoS One. 2013;8:e56552. doi: 10.1371/journal.pone.0056552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Barker JSF, et al. Genetic diversity of Asian water buffalo (Bubalus bubalis): microsatellite variation and a comparison with protein-coding loci. Anim. Genet. 1997;28:103–115. doi: 10.1111/j.1365-2052.1997.00085.x. [DOI] [PubMed] [Google Scholar]
  • 12.Navani N, Jain PK, Gupta S, Sisodia BS, Kumar S. A set of cattle microsatellite DNA markers for genome analysis of riverine buffalo (Bubalus bubalis) Anim. Genet. 2002;33:149–154. doi: 10.1046/j.1365-2052.2002.00823.x. [DOI] [PubMed] [Google Scholar]
  • 13.Barker JSF, Tan SG, Selvaraj OS, Mukherjee TK. Genetic variation within and relationships among populations of Asian water buffalo (Bubalus bubalis) Anim. Genet. 1997;28:1–13. doi: 10.1111/j.1365-2052.1997.00036.x. [DOI] [PubMed] [Google Scholar]
  • 14.Kierstein G, et al. Analysis of mitochondrial D-loop region casts new light on domestic water buffalo (Bubalus bubalis) phylogeny. Mol. Phylogenet. Evol. 2004;30:308–324. doi: 10.1016/S1055-7903(03)00221-5. [DOI] [PubMed] [Google Scholar]
  • 15.Kumar S, Nagarajan M, Sandhu JS, Kumar N, Behl V. Phylogeography and domestication of Indian river buffalo. BMC Evol. Biol. 2007;7:186. doi: 10.1186/1471-2148-7-186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lei C, et al. Two maternal lineages revealed by mitochondrial DNA D-loop sequences in Chinese native water buffaloes (Bubalus bubalis) Asian-australas. J. Anim. Sci. 2007;20:471. doi: 10.5713/ajas.2007.471. [DOI] [Google Scholar]
  • 17.Yindee M, et al. Y-chromosomal variation confirms independent domestications of swamp and river buffalo. Anim. Genet. 2010;41:433–435. doi: 10.1111/j.1365-2052.2010.02020.x. [DOI] [PubMed] [Google Scholar]
  • 18.Amano T, Miyakoshi Y, Takada T, Kikkawa Y, Suzuki H. Genetic variants of ribosomal DNA and mitochondrial DNA between swamp and river buffaloes. Anim. Genet. 1994;25:29–36. doi: 10.1111/j.1365-2052.1994.tb00400.x. [DOI] [PubMed] [Google Scholar]
  • 19.Soares P, et al. Correcting for purifying selection: an improved human mitochondrial molecular clock. Am. J. Hum. Genet. 2009;84:740–759. doi: 10.1016/j.ajhg.2009.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Achilli A, et al. Mitochondrial genomes from modern horses reveal the major haplogroups that underwent domestication. Proc. Natl. Acad. Sci. USA. 2012;109:2449–2454. doi: 10.1073/pnas.1111637109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Colli L, et al. Whole mitochondrial genomes unveil the impact of domestication on goat matrilineal variability. BMC Genomics. 2015;16:1115. doi: 10.1186/s12864-015-2342-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tanaka K, et al. Nucleotide diversity of mitochondrial DNAs between the swamp and the river types of domestic water buffaloes, Bubalus bubalis, based on restriction endonuclease cleavage patterns. Biochem. Genet. 1995;33:137–148. doi: 10.1007/BF00554726. [DOI] [PubMed] [Google Scholar]
  • 23.Tanaka K, et al. Phylogenetic relationship among all living species of the genus Bubalus based on DNA sequences of the cytochromeb gene. Biochem. Genet. 1996;34:443–452. doi: 10.1007/BF00570125. [DOI] [PubMed] [Google Scholar]
  • 24.Bibi F. A multi-calibrated mitochondrial phylogeny of extant Bovidae (Artiodactyla, Ruminantia) and the importance of the fossil record to systematics. BMC Evol. Biol. 2013;13:166. doi: 10.1186/1471-2148-13-166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Edwards CJ, et al. A complete mitochondrial genome sequence from a mesolithic Wild Aurochs (Bos primigenius) PLoS One. 2010;5:e9255. doi: 10.1371/journal.pone.0009255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Patel, A. K. & Meadow, R. H. In Archaeology of the near east: Proceedings of the third international symposium on the archeozoology of the southwestern Asia and adjacent areas (eds H. L. Buitenhuis, L. Bartosiewicz, & A. M. Choyke) (ARC- publications, 1998).
  • 27.Finlay EK, et al. Bayesian inference of population expansions in domestic bovines. Biol. Lett. 2007;3:449–452. doi: 10.1098/rsbl.2007.0146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Woodruff DS. Biogeography and conservation in Southeast Asia: how 2.7 million years of repeated environmental fluctuations affect today′s patterns and the future of the remaining refugial-phase biodiversity. Biodivers. Conserv. 2010;19:919–941. doi: 10.1007/s10531-010-9783-3. [DOI] [Google Scholar]
  • 29.De Deckker P, Tapper NJ, van der Kaars S. The status of the Indo-Pacific Warm Pool and adjacent land at the Last Glacial Maximum. Glob. Planet. Change. 2003;35:25–35. doi: 10.1016/S0921-8181(02)00089-9. [DOI] [Google Scholar]
  • 30.Clark PU, et al. Global climate evolution during the last deglaciation. Proc. Natl. Acad. Sci. USA. 2012;109:E1134–E1142. doi: 10.1073/pnas.1116619109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Yan X-H, Ho C-R, Zheng Q, Klemas V. Temperature and Size Variabilities of the Western Pacific Warm Pool. Science. 1992;258:1643. doi: 10.1126/science.258.5088.1643. [DOI] [PubMed] [Google Scholar]
  • 32.Crowley JT. CLIMAP SSTs re-revisited. Clim. Dyn. 2000;16:241–255. doi: 10.1007/s003820050325. [DOI] [Google Scholar]
  • 33.Cannon C. The Ecology of Tropical East Asia by Richard T. Corlett. Q. Rev. Biol. 2015;90:433. doi: 10.1086/683725. [DOI] [Google Scholar]
  • 34.Graur D, Martin W. Reading the entrails of chickens: molecular timescales of evolution and the illusion of precision. Trends Genet. 2004;20:80–86. doi: 10.1016/j.tig.2003.12.003. [DOI] [PubMed] [Google Scholar]
  • 35.Pelejero C, Kienast M, Wang L, Grimalt JO. The flooding of Sundaland during the last deglaciation: imprints in hemipelagic sediments from the southern South China Sea. Earth Planet. Sci. Lett. 1999;171:661–671. doi: 10.1016/S0012-821X(99)00178-8. [DOI] [Google Scholar]
  • 36.Zhou W, et al. High-resolution evidence from southern China of an early Holocene optimum and a mid-Holocene dry event during the past 18,000 years. Quat. Res. 2004;62:39–48. doi: 10.1016/j.yqres.2004.05.004. [DOI] [Google Scholar]
  • 37.Nagarajan M, Nimisha K, Kumar S. Mitochondrial DNA variability of domestic river buffalo (Bubalus bubalis) populations: Genetic evidence for domestication of river buffalo in Indian Subcontinent. Genome Biol. Evol. 2015;7:1252–1259. doi: 10.1093/gbe/evv067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Librado P, et al. The evolutionary origin and genetic makeup of domestic horses. Genetics. 2016;204:423–434. doi: 10.1534/genetics.116.194860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lindgren G, et al. Limited number of patrilines in horse domestication. Nat. Genet. 2004;36:335–336. doi: 10.1038/ng1326. [DOI] [PubMed] [Google Scholar]
  • 40.Parma P, Erra-Pujada M, Feligini M, Greppi G, Enne G. Water buffalo (Bubalus bubalis): Complete nucleotide mitochondrial genome sequence. DNA Seq. 2004;15:369–373. doi: 10.1080/10425170400019318. [DOI] [PubMed] [Google Scholar]
  • 41.Olivieri A, et al. Mitogenomes from Egyptian cattle breeds: new clues on the origin of haplogroup Q and the early spread of Bos taurus from the Near East. PLoS One. 2015;10:e0141170. doi: 10.1371/journal.pone.0141170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Eltsov, N. P. & Volodko, N. V. In http://eltsov.org (2011).
  • 43.Achilli A, et al. Mitochondrial genomes of extinct aurochs survive in domestic cattle. Curr. Biol. 2008;18:R157–158. doi: 10.1016/j.cub.2008.01.019. [DOI] [PubMed] [Google Scholar]
  • 44.Lancioni H, et al. Phylogenetic relationships of three Italian merino-derived sheep breeds evaluated through a complete mitogenome analysis. PLoS One. 2013;8:e73712. doi: 10.1371/journal.pone.0073712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Mol. Biol. Evol. 2013;30:2725–2729. doi: 10.1093/molbev/mst197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Yang Z. PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol. Biol. Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
  • 47.Drummond AJ, Rambaut A. BEAST: Bayesian Evolutionary Analysis by Sampling Trees. BMC Evol. Biol. 2007;7:214. doi: 10.1186/1471-2148-7-214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Drummond AJ, Rambaut A, Shapiro B, Pybus OG. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 2005;22:1185–1192. doi: 10.1093/molbev/msi103. [DOI] [PubMed] [Google Scholar]
  • 49.Bollongino R, et al. Modern Taurine Cattle descended from small number of Near-Eastern founders. Mol. Biol. Evol. 2012;29:2101–2104. doi: 10.1093/molbev/mss092. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

41598_2017_4830_MOESM1_ESM.pdf (3.6MB, pdf)

Supplementary Figures S1-S6 and Tables S1-S2

Supplementary Dataset S2 (195.5KB, xls)
Supplementary Dataset S1 (25.8KB, xlsx)

Data Availability Statement

Sequences of the novel water buffalo mitogenomes have been deposited in GenBank under accession numbers KX758295 - KX758402 (108 complete mtDNAs).


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES