Skip to main content
Evolutionary Applications logoLink to Evolutionary Applications
. 2018 Dec 13;12(2):292–300. doi: 10.1111/eva.12716

Introgression of Chinese haplotypes contributed to the improvement of Danish Duroc pigs

Minhui Chen 1,2, Guosheng Su 1,, Jinluan Fu 2, Aiguo Wang 2, Jian‐Feng Liu 2,, Mogens S Lund 1, Bernt Guldbrandtsen 1
PMCID: PMC6346729  PMID: 30697340

Abstract

The distribution of Asian ancestry in the genome of Danish Duroc pigs was investigated using whole‐genome sequencing data from European wild boars, Danish Duroc, Chinese Meishan and Bamaxiang pigs. Asian haplotypes deriving from Meishan and Bamaxiang occur widely across the genome. Signatures of selection on Asian haplotypes are common in the genome, but few of these haplotypes have been fixed. By defining 50‐kb windows with more than 50% Chinese ancestry, which did not exhibit extreme genetic differentiation between Meishan and Bamaxiang as candidate regions, the enrichment of quantitative trait loci in candidate regions supports that Asian haplotypes under selection play an important role in contributing genetic variation underlying production, reproduction, meat and carcass, and exterior traits. Gene annotation of regions with the highest proportion of Chinese ancestry revealed genes of biological interest, such as NR6A1. Further haplotype clustering analysis suggested that a haplotype of Chinese origin around the NR6A1 gene was introduced to Europe and then underwent a selective sweep in European pigs. Besides, functional genes in candidate regions, such as AHR and PGRMC2, associated with fertility, and SAL1, associated with meat quality, were identified. Our results demonstrate the contribution of Asian haplotypes to the genomes of European pigs. Findings herein facilitate further genomic studies such as genomewide association study and genomic prediction by providing ancestry information of variants.

Keywords: genomics, introgression, pigs, population genetics

1. INTRODUCTION

Adaptive introgression is a process whereby beneficial variants are acquired by introgression from related species or populations and are then spread further in the recipient population by positive selection. Since introgression can be an important source of advantageous alleles and the initial frequencies of beneficial alleles can be high if genetic admixture is intense (Hedrick, 2013; Jordan, 2016), introgression has the potential to speed adaptive evolution. Indeed, recent studies have provided evidence of adaptive evolution of introgressed haplotypes in humans (Huerta‐Sanchez et al., 2014; Juric, Aeschbacher, & Coop, 2015), plants (Hufford et al., 2013) and animals (Eriksson et al., 2008; Figueiro et al., 2017; Pardo‐Diaz et al., 2012).

Domestication of pigs has occurred independently from wild boars in Europe and Asia (Giuffra et al., 2000; Larson et al., 2005). It is well documented that around 1,700 Chinese pigs were introduced into Northern Europe to improve local breeds to meet the demand of intensified agriculture (White, 2011). A genomic study of Isla del Coco feral pigs, a population derived from British pigs and isolated since 1793, showed evidence of crossbreeding with Chinese pigs, demonstrating crossing as early as the 18th century (Bianco, Soto, Vargas, & Perez‐Enciso, 2015). A previous study (Chen et al., 2017) confirmed that most European pig breeds contained about 20% ancestry from Chinese pigs. Artificial selection on introgressed Chinese haplotypes has further been demonstrated in European Large White pigs (Bosse et al., 2014). Besides, adaptive introgression in pigs has also been observed on the X chromosome (Ai et al., 2015).

To identify introgressed haplotypes accurately, extant groups genetically similar to the source populations of admixture should be sampled. In our previous study (Chen et al., 2017), we found that the main source of the introgression from China to Europe was pigs from South China, which were genetically close to Bamaxiang. To better characterize the hybrid nature of European pig genome, in this study, we used both Bamaxiang pigs from South China and Meishan pigs from East China to identify the introgressed Chinese haplotypes in the Danish Duroc pigs.

The aim of the study was to detect selection on introgressed Chinese haplotypes and evaluate their contribution to the improvement of European pigs. To achieve this goal, we carried out a study utilizing whole‐genome sequencing data from 16 European wild boars (EUW), 90 Danish Duroc, 30 Meishan and 6 Bamaxiang pigs. EUW, Meishan and Bamaxiang pigs were used as donors to paint the genomes of Duroc pigs using ChromoPainter (Lawson, Hellenthal, Myers, & Falush, 2012), a recently developed software for local ancestry inference.

2. METHODS

2.1. Sample collection and DNA preparation

Samples from 30 Meishan pigs and 90 Danish Duroc pigs were collected. For Meishan pigs, genomic DNA was extracted from ear tissue using a standard phenol–chloroform method; for Duroc pigs, genomic DNA was extracted from blood sample using the QIAsymphony DNA Mini Kit (Qiagen). Sequencing was performed on the Illumina HiSeq 2000 platform. Whole‐genome sequencing data from 16 EUW (Bosse et al., 2014; Frantz et al., 2015) and six Bamaxiang pigs (Ai et al., 2015) were obtained. For EUW, FASTQ files for seven individuals were downloaded from European Nucleotide Archive (ENA, accession PRJEB9922), and BAM files for nine individuals were downloaded from ENA (accession ERP001813). For Bamaxiang pigs, we downloaded FASTQ files from NCBI Sequence Read Archive (accession SRA096093).

2.2. SNP calling and filtering

For each individual, the read‐pairs were pre‐processed using PRINSEQ (Schmieder & Edwards, 2011). Reads were trimmed to a minimum base PHRED quality of 20 from the 3′‐end and removed if shorter than 51 bp, with more than 3 Ns, or a mean quality score <18. Filtered reads were aligned to the porcine reference genome build 10.2 (Groenen et al., 2012) by the Burrows–Wheeler Aligner (BWA version 0.7.17) (Li & Durbin, 2009), employing the "mem" algorithm. SAMtools (version 1.8) (Li et al., 2009) was used for sorting, merging and removing potential PCR duplications. Variants were identified in all samples simultaneously using the Genome Analysis Toolkit's (GATK version 3.5) UnifiedGenotyper (McKenna et al., 2010) with default parameter values, which included downsampling of SNP sites with more than 250 reads per sample. We used vcftools (version 0.1.15) (Danecek et al., 2011) to retain only SNPs. Finally, we identified 40,116,524 SNPs in 142 genomes. As introgression differs between sex chromosomes and autosomes, we only retained SNPs on autosomes for subsequent analyses.

2.3. Population structure analyses

Population structure analyses were conducted together with our previously merged SNP array data from Eurasian domestic pigs and wild boars (Chen et al., 2017). These array data were collected from multiple studies (Ai, Huang, & Ren, 2013; Chen et al., 2017; Goedbloed et al., 2013; Manunza et al., 2013; Wilkinson et al., 2013), involving 695 individuals and 30,549 autosomal SNPs. Therefore, we extracted from the sequence data to obtain genotypes at all variants present in the merged data set. To reduce the effect of uneven sample size per population (McVean, 2009), we randomly selected six individuals for SNP array genotyped populations with more than six individuals, while keeping all sequenced individuals. We performed PCA (principal component analysis) using EIGENSOFT 6.0.1 (Patterson, Price, & Reich, 2006) and population structure analysis by ADMIXTURE (version 1.23) (Alexander, Novembre, & Lange, 2009) for K = 1 to 20 ancestral populations using default options.

2.4. Chromosome painting using ChromoPainter

Chromosome painting is a method to characterize shared ancestry between individuals that takes linkage disequilibrium (LD) information into account. We reconstructed haplotypes and imputed missing genotypes using BEAGLE (version 4.1) (Browning & Browning, 2007). Only bi‐allelic SNPs were used for chromosome painting. We used ChromoPainter v2 (Lawson et al., 2012) to perform chromosome painting on Duroc individuals using EUW, Meishan and Bamaxiang pigs as donor populations. We ran ChromoPainter twice, as recommended in the user manual. In the first run, we used all donor individuals to paint nine randomly selected Duroc individuals on six randomly selected chromosomes (SSC 2, 5, 7, 10, 11, 16 and 17). The aim of this run was to estimate switch rate, global mutation rate and copying probabilities by running 50 iterations of the expectation–maximization algorithm. We averaged estimated values of each parameter across these chromosomes, weighting by the number of SNPs, and then across individuals, to obtain estimates of parameters. In the second run, we used the estimates from the first run to perform chromosome painting for all individuals and autosomes. As a result, we obtained an estimated ancestry proportion for each SNP in the genome of Duroc.

To evaluate the variation of ancestry proportion, we divided the genome into non‐overlapping 50‐kb windows. In each window, we separately estimated the average proportion of ancestry from Meishan, Bamaxiang and EUW across SNPs. We focused our analyses on windows with a high proportion of introgression from Chinese pigs, rather than windows with a significant signal of selection. Specifically, we selected windows with more than 50% Chinese ancestry (the sum of Meishan and Bamaxiang ancestries) as candidate regions of adaptive introgression. To avoid the confounding of introgression from European pigs to China, we further excluded windows with extreme F ST between Meishan and Bamaxiang, that is, windows with F ST falling in the top 5% of the empirical distribution.

2.5. F ST calculation

To quantify genomic differentiation, we calculated F ST values between Duroc and Meishan, Duroc and Bamaxiang, and Meishan and Bamaxiang. Hudson's F ST (Hudson, Boos, & Kaplan, 1992) was calculated for each bi‐allelic SNP using the “PopGenome” package (Pfeifer, Wittelsburger, Ramos‐Onsins, & Lercher, 2014) in R. Then, we averaged F ST scores across SNPs within non‐overlapping 50‐kb windows. Because the range of true F ST values by definition is between 0 and 1 (Wright, 1951), we set F ST to 0 for windows with negative average estimates of F ST. The differentiation between Duroc and Chinese pigs was calculated as the average of F ST between Duroc and Meishan, and F ST between Duroc and Bamaxiang.

2.6. Gene annotation and functional analyses

We conducted gene annotation on five regions with the highest proportions of Meishan ancestry and five regions with the highest proportions of Bamaxiang ancestry. We took the following approaches to define these regions using Meishan ancestry as an example. We first selected 50‐kb windows with the highest proportions of Meishan ancestry, which did not exhibit extreme F ST between Meishan and Bamaxiang (i.e., exclude windows with F ST higher than 95% quantile of the empirical distribution); then, these windows were extended in both directions until the next window with <50% Meishan ancestry or with extreme F ST between Meishan and Bamaxiang. Since long regions were more likely to be caused by selection, we chose the five regions with at least two windows. Gene contents and QTL (quantitative trait locus) numbers in these regions were retrieved from the Ensembl Genes 89 Database using BioMart (Kinsella et al., 2011) and from the Animal QTL Database (Hu, Park, & Reecy, 2016).

We counted the number of QTLs in Animal QTL Database (Hu et al., 2016) which overlapped with candidate regions with more than 50% Chinese ancestry without extreme F ST between Meishan and Bamaxiang. First, 25,610 QTLs were extracted from the database. Among these, 8,937 have been identified by linkage analysis. These were excluded due to uncertain genomic locations. The remaining 16,673 QTLs, identified by association studies, were retained for further analysis. These QTLs were classified into five groups corresponding to five classes of traits as defined in the Animal QTL Database (Hu et al., 2016): meat and carcass, production, reproduction, health and exterior traits. We excluded 817 QTLs not located on autosomes or spanning more than 1 Mb. A total of 15,856 QTLs remained, with 7,654, 957, 837, 4,774 and 1,634 associated with meat and carcass, production, reproduction, health and exterior traits. The length of those QTL ranged from 40 bp to 1 Mb, with a mean of 24 ± 115 kb (mean ± SD). Of note, the database defined QTL regions as follows: (a) when flanking SNPs were given, the region was defined by the actual locations; (b) when only the centre SNP was given, the region was defined as a small region to create a minimum visible map window in a genome browser (the middle of the region was the actual location of centre SNP); (c) when SNPs were completely missing, the region was estimated from linkage association. We defined the mid‐point of a QTL region as the peak position. Finally, we counted the number of QTLs whose peak positions were located within candidate regions. When two QTL records associated with traits in the same trait group and had the same genomic interval in the Animal QTL Database, they were counted as one QTL. The same method was applied to each trait group separately.

To test for enrichment among all QTLs and among trait‐specific QTLs within candidate regions, we applied a permutation test. We concatenated the autosomes into a circular genome, in the order from SSC1 to SSC18. A permuted sample was formed by moving candidate regions along the genome by a randomly chosen amount. This permutation did not change the relative positions between these windows preserving their correlation structure. We then computed the number of all QTLs and those belonging to each trait group that presented in these simulated windows using the same method as above. In total, 10,000 permutations were performed. The distribution of numbers of QTLs observed in the permutated regions was treated as the null distribution from which we computed the significance levels of the number of QTLs observed in the real data.

2.7. Haplotype clustering in NR6A1

To check whether the NR6A1 haplotype in Danish Duroc pigs was introgressed from Meishan pigs or a breed genetically close to Meishan, we compared haplotypes within a reference pool of more Chinese pig breeds. We further collected whole‐genome sequencing data from Chinese pigs, including five Erhualian (Ai et al., 2015), two Jinhua (Frantz et al., 2015), four Tongcheng (Wang et al., 2015), six Luchuan (Ai et al., 2015) and six Wuzhishan pigs (Ai et al., 2015). For Meishan and Duroc, we randomly selected 10 individuals from each population. We used the same procedure as above to call variants in the 2‐Mb region (298–300 Mb on SSC1) around the NR6A1 gene. BEAGLE (version 4.1) (Browning & Browning, 2007) was used to impute missing genotypes and to phase haplotypes. Haplostrips (Marnetto, Huerta‐Sánchez, & Price, 2017) was used to cluster haplotypes in the peak region 299.3–299.4 Mb (including two windows with more than 90% Meishan ancestry from ChromoPainter analysis) and haplotype in the region 299.05–299.45 Mb (including eight windows with more than 50% Meishan ancestry from ChromoPainter analysis, regardless of F ST between Meishan and Bamaxiang). We used Duroc as the reference population, and calculated the distance of other haplotypes from the reference as the number of SNPs with different alleles. Similar haplotypes were clustered together and ordered based on the distance to the reference.

3. RESULTS

3.1. Population structure

The first four components of PCA are shown in Figure 1. The first principal component separated Chinese pigs from European pigs; the second principal component separated Duroc pigs from other European pigs; the third and fourth principal components separated populations into subgroups. As expected, sequenced individuals clustered together with corresponding genotyped individuals from the same breed. These results indicated that sequenced individuals are representative for their populations. ADMIXTURE analysis supported this result. The cross‐validation error dropped slowly after K reached 10, and was minimized at K = 19 (Supporting Information Figure S1). To evaluate the genetic similarity between sequenced individuals and genotyped individuals, we examined ADMIXTURE results for K = 4 (Figure 2). ADMIXTURE identified four components corresponding to 1: Duroc, 2: EUD (European domestic pigs) except Duroc, 3: EUW and 4: Chinese wild boars and domestic pigs. These four components could reveal whether there was recent gene flow to our sequenced individuals. The ancestry composition was similar in sequenced individuals and genotyped individuals from the same breed. However, one EUW individual (Sample ID: WB44U06) had about 5% of EUD ancestry, and was removed from further analysis. The remaining individuals of EUW, Meishan and Bamaxiang pigs contained little ancestry from other components.

Figure 1.

Figure 1

The plot of principal component analysis on Eurasian domestic pigs and wild boars. Genotyped wild boars are coloured with black; Chinese domestic pigs are coloured with red; European domestic pigs are coloured with green. ASW: Asian wild boars; BK: Berkshire; BMX: Bamaxiang; BMX_seq: sequenced Bamaxiang; BS: British Saddleback; CA: Canarian; DLS: Danish Landrace; DS: Dongshan; DU: Duroc; DU_seq: sequenced Duroc; EHL: Erhualian; EUW_seq: sequenced EUW; GLOS: Gloucestershire Old Spots; GX: Ganxi; HA: Hampshire; IB: Ibérico; JH: Jinhua; KL: Kele; LB: Large Black; LR: Landrace; LW: Large White; MA: Mangalica; MS: Meishan; MS_seq: sequenced Meishan; MW: Middle White; MZ: Min; PI: Pietrain; RC: Rongchang; SLS: Pied Landrace; SUT: Sutai; SZL: Shaziling; TA: Tamworth; TC: Tongcheng; WD: White Duroc; WE: Welsh; ZZ: Tibetan

Figure 2.

Figure 2

Ancestries of each sample using four ancestral populations (K = 4) in the ADMIXTURE analysis. Genotyped wild boars are labelled with black; genotyped Chinese domestic pigs are labelled with red; genotyped European domestic pigs are labelled with green; sequenced wild boars and domestic pigs are labelled with blue. ASW: Asian wild boars; BK: Berkshire; BMX: Bamaxiang; BMX_seq: sequenced Bamaxiang; BS: British Saddleback; CA: Canarian; DLS: Danish Landrace; DS: Dongshan; DU: Duroc; DU_seq: sequenced Duroc; EHL: Erhualian; EUW_seq: sequenced EUW; GLOS: Gloucestershire Old Spots; GX: Ganxi; HA: Hampshire; IB: Ibérico; JH: Jinhua; KL: Kele; LB: Large Black; LR: Landrace; LW: Large White; MA: Mangalica; MS: Meishan; MS_seq: sequenced Meishan; MW: Middle White; MZ: Min; PI: Pietrain; RC: Rongchang; SLS: Pied Landrace; SUT: Sutai; SZL: Shaziling; TA: Tamworth; TC: Tongcheng; WD: White Duroc; WE: Welsh; ZZ: Tibetan

3.2. Chromosome painting

The first run of ChromoPainter (Lawson et al., 2012) arrived at final values of N e = 3,106 for switch rate, 0.015 for global mutation rate, and 0.14, 0.13 and 0.73 for probabilities of copying from Meishan, Bamaxiang and EUW. In the second run of ChromoPainter, we estimated the proportion of ancestry from Meishan, Bamaxiang and EUW for each SNP in the Duroc genome. Figure 3 shows the distribution of proportions of ancestries contributed by Meishan, Bamaxiang, and EUW across the genome. As expected, most SNPs were fixed or were approaching fixation for variants derived from EUW. Besides, Meishan ancestry and Bamaxiang ancestry had almost the same distribution, confirming our previous result that pigs from East China also contributed to the introgression from Chinese pigs to European ones (Chen et al., 2017). The average proportions of ancestries over non‐overlapping 50‐kb windows are shown in Supporting Information Figure S2. Ancestry proportions varied a lot along the genome, with a genomic average of 0.12 ± 2.90e‐5 (mean ± SE), 0.13 ± 3.27e‐5 and 0.75 ± 4.28e‐5 from Meishan, Bamaxiang and EUW, respectively. Out of 48,555 windows, 8,086 had more than 50% ancestry from Chinese pigs (sum of Meishan and Bamaxiang ancestries) and F ST between Meishan and Bamaxiang lower than 95% quantile (F ST = 0.51) of the empirical distribution (details in Supporting Information Table S1). Among these, 1,640 windows had more than 50% Meishan ancestry, and 2,630 windows had more than 50% Bamaxiang ancestry. These results suggest selection for Chinese‐derived haplotypes in the Duroc genome.

Figure 3.

Figure 3

The distribution of ancestry proportion across SNPs in the genome

3.3. Population differentiation

F ST values were calculated to measure the degree of genetic differentiation between Duroc and Chinese pigs. Supporting Information Figure S2 shows the distribution of F ST values averaged over all SNPs within non‐overlapping 50‐kb windows. Regions with a high proportion of ancestry from Chinese pigs presented low F ST values. The correlation between 50‐kb window‐based F ST value and proportion of Chinese ancestry was r = −0.58 (p‐value <2.2e‐16). The strong correlation confirmed the accuracy of local ancestry inference. In windows with more than 70% Meishan ancestry, the average of 50‐kb window‐based F ST between Duroc and Meishan was 0.33 ± 0.18 (mean ± SD), much lower than the value (average F ST 0.45 ± 0.17) between Duroc and Bamaxiang. This demonstrates the presence of Meishan ancestry in Duroc.

3.4. Gene annotation and functional analyses

Gene annotation of five regions with the highest proportion of Meishan ancestry and five regions with the highest proportions of Bamaxiang ancestry is shown in Table S2. In total, these ten regions covered 4.1 Mb. All of the 10 regions contained genes that were annotated as known or novel genes. The 32.4–33.75 Mb region on SSC 7 with 94% Chinese Meishan ancestry contained the highest number of QTLs, in total 282 QTLs. These QTLs associated with a variety of traits, such as meat quality and carcass, production and exterior traits.

In order to identify potentially selected traits, QTLs located within candidate regions (windows with more than 50% Chinese ancestry and F ST lower than 95% quantile of empirical distribution) were counted. The 1,710 QTLs within these windows included 587, 162, 177, 575 and 209 of them associated with meat and carcass, production, reproduction, health and exterior traits, respectively. Figures 4 and 5 compare the observed number of QTLs with the null distribution simulated by a permutation test. As shown in Figure 4, the combined number of QTLs associated with all traits was significantly enriched (p = 0.0455). Figure 5 shows the number of QTLs counting for each trait group separately. QTLs associated with meat and carcass, exterior, production and reproduction were significantly enriched in candidate regions with p values of 0.0230, 0.0445, 0.0008 and 0.0002.

Figure 4.

Figure 4

The number of QTLs in 50‐kb windows with more than 50% Chinese ancestry. The histogram is the null distribution of QTL numbers simulated by permutation test. The dashed line indicates the observed number of QTLs

Figure 5.

Figure 5

The number of QTLs associated with meat and carcass, production, reproduction, health, and exterior traits in 50‐kb windows with more than 50% Chinese ancestry. The histograms are the null distributions of number of QTLs associated with each trait group by permutation test. The dashed line indicate the observed number of QTLs associated with each trait group

3.5. Introgression of NR6A1 haplotype

Figure 6 showed the result of haplotype clustering in the region 299.05–299.45 Mb on SSC1. Duroc and EUW haplotypes formed a subgroup of Chinese pigs, with some Meishan haplotypes dispersing among those of Duroc. This result suggested that NR6A1 haplotypes in Duroc were introgressed from Meishan pigs. In the region 299.3–299.4 Mb (Supporting Information Figure S3), Duroc and a portion of Meishan haplotypes clustered together and formed an outgroup of EUW and Chinese pigs. This result confirmed that the NR6A1 haplotypes in Duroc were genetically closer to those in Meishan pigs, while the direction of introgression was not clear. And haplotypes in Meishan and Duroc might originate from a third population, which was genetically distinct from European and Chinese pigs.

Figure 6.

Figure 6

Haplotype clustering on NR6A1 locus (299.05–299.45 Mb on SSC1). Haplotypes are clustered and sorted by increasing distance to the Duroc reference population. Each row represents one haplotype, and each column represents one SNP. The colours in the left column of the panel indicate to which population the haplotypes belong. Reference alleles are represented as white spots, and alternative alleles are represented as black spots. BMX: Bamaxiang; DU: Duroc; EHL: Erhualian; EUW: European wild boars; JH: Jinhua; LUC: Luchuan; MS: Meishan; TC: Tongcheng; WZS: Wuzhishan

4. DISCUSSION

In this study, we used ChromoPainter (Lawson et al., 2012) to analyse the genome of Danish Duroc to infer local ancestry by using EUW, Meishan pigs and Bamaxiang pigs as donor populations. Our results revealed a large number of genomic regions with a high level of introgression of Asian haplotypes. It was suggested that pigs from both South and East China contributed to the introgression from Chinese pigs into European ones. Meishan and Bamaxiang pigs can reveal the majority of introgressed Asian haplotypes in the genome of European pigs. However, of note, since Meishan and Bamaxiang might not be the precise direct source of introgression to European pigs, it could result in the loss of Asian haplotypes which are genetically distinct from Meishan and Bamaxiang.

Some regions with introgression from Chinese pigs were highly significantly enriched in QTLs affecting production and reproduction. These introgressed haplotypes were assumed to improve the performance of European pigs. This can have two (mutually non‐exclusive) causes: First, haplotypes from Chinese pigs may harbour alleles conveying superior performance for these traits, reflecting superior performance of Chinese pigs at the time of introgression; and two, pigs have been selected for traits that generally tend to produce QTLs that have not been fixed in the population. However, there is limited evidence to reflect selective pressure on Chinese haplotypes to improve health traits. It may be because these traits have been difficult to measure and have historically not been under strong selection. However, the QTLs analysed here are derived from different pig breeds and lines. As a result, some of them might not remain associated with traits in Danish Duroc. Further studies using QTLs from Danish Duroc population are needed to confirm the selection on introgressed haplotypes.

The 10 regions with the highest proportions of ancestry from Meishan or Bamaxiang overlapped hundreds of QTLs and genes of potential interest. Of particular interest is the region 299.3–299.45 Mb on SSC 1, which contains two protein‐coding genes including the NR6A1 gene. This introgression from Meishan into Danish Duroc is further proved in the result of haplotype clustering that Duroc and Meishan haplotypes cluster together and locate as a subgroup of Chinese pigs (Figure 6). However, in the result of the region 299.3–299.4 Mb (Supporting Information Figure S3), Meishan and Duroc haplotypes locate as an outgroup of EUW and Chinese pigs. One possible reason might be that there is not enough variation in this region to correctly place Meishan and Duroc cluster in the tree. This gene has been extensively investigated and is considered a causal gene affecting the number of vertebrae (Mikawa et al., 2005, 2007 ). There is also evidence showing that this gene underwent a selective sweep in European commercial and local breeds (Rubin et al., 2012). In our study, the high haplotype homozygosity in Duroc haplotypes (as shown in Figure 6) also confirms the selection on this haplotype. These results suggest that the NR6A1 haplotype from Meishan or a closely related breed was introduced into European breeds and subsequently fixed due to selective breeding for large body size.

We here identified a list of genes where the introgressed Chinese haplotypes have spread extensively in Duroc pigs. Many of these have effects on traits under selection. For example, there is more than 99% Chinese ancestry in the 95.5‐ to 95.55‐Mb window on SSC 9, containing the AHR gene, the only one gene in this window. This gene plays an important role in the female reproductive system (Hernandez‐Ochoa, Karman, & Flaws, 2009). Selection on AHR haplotypes of China origin has also been reported in European Large White pigs (Bosse et al., 2014). Examples of positively selected Chinese haplotypes, that were reported previously and confirmed in this study, include genes associated with fertility (PGRMC2: Bosse et al., 2014; with 62% Chinese ancestry), meat quality (SAL1: Bosse et al., 2014; with 72% Chinese ancestry), fat deposition (FASN: Bosse et al., 2015; with 55% Chinese ancestry), growth (BMP3: Bosse et al., 2015; with 52% Chinese ancestry) and pigmentation (RAB38: Bosse et al., 2015; with 53% Chinese ancestry). In contrast, we confirmed relatively low proportions of Chinese ancestry for several genes as previously observed, such as MC1R (Bosse et al., 2015; Fang, Larson, Soares Ribeiro, Li, & Andersson, 2009; with 4% Chinese ancestry), KIT (Bosse et al., 2015; with <1% Chinese ancestry) and MBNL1 (Bosse et al., 2015; with 19% Chinese ancestry). These results suggest that in these cases, Chinese haplotypes were not positively selected.

5. CONCLUSION

Admixture during the Industrial Revolution of Chinese pigs imported into Europe to improve the performance of European breeds has contributed a substantial fraction of the genomes of European domesticated pigs. This introgression is highly uneven across the genome, and that introgressed regions are associated with traits that have been actively selected for in the direction that would favour haplotypes of Asian origin.

CONFLICT OF INTEREST

None declared.

DATA ACCESSIBILITY

Sequencing data from Meishan pigs have been deposited in the NCBI Short Read Archive (SRA) (project accession no. PRJNA378496). Chromosome painting results are available from the Dryad Digital Repository: https://doi.org/10.5061/dryad.vh6t0c7.

Supporting information

 

 

 

 

 

ACKNOWLEDGEMENT

This research was supported by the Centre for Genomic Selection in Animals and Plants (GenSAP) partially funded by Innovation Fund Denmark (grant 0603‐00519B); the National High Technology Research and Development Program of China (863 Program 2013AA102503); the Program for Changjiang Scholar and Innovation Research Team in University (IRT1191); and the National Natural Science Foundations of China (31661146013).

Chen M, Su G, Fu J, et al. Introgression of Chinese haplotypes contributed to the improvement of Danish Duroc pigs. Evol Appl. 2019;12:292–300. 10.1111/eva.12716

Contributor Information

Guosheng Su, Email: guosheng.su@mbg.au.dk.

Jian‐Feng Liu, Email: liujf@cau.edu.cn.

REFERENCES

  1. Ai, H. , Fang, X. , Yang, B. , Huang, Z. , Chen, H. , Mao, L. , … Huang, L. (2015). Adaptation and possible ancient interspecies introgression in pigs identified by whole‐genome sequencing. Nature Genetics, 47(3), 217–225. 10.1038/ng.3199 [DOI] [PubMed] [Google Scholar]
  2. Ai, H. , Huang, L. , & Ren, J. (2013). Genetic diversity, linkage disequilibrium and selection signatures in chinese and Western pigs revealed by genome‐wide SNP markers. Plos One, 8(2), e56001 10.1371/journal.pone.0056001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Alexander, D. H. , Novembre, J. , & Lange, K. (2009). Fast model‐based estimation of ancestry in unrelated individuals. Genome Research, 19(9), 1655–1664. 10.1101/gr.094052.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bianco, E. , Soto, H. W. , Vargas, L. , & Perez‐Enciso, M. (2015). The chimerical genome of Isla del Coco feral pigs (Costa Rica), an isolated population since 1793 but with remarkable levels of diversity. Molecular Ecology, 24(10), 2364–2378. 10.1111/mec.13182 [DOI] [PubMed] [Google Scholar]
  5. Bosse, M. , Lopes, M. S. , Madsen, O. , Megens, H. J. , Crooijmans, R. P. , Frantz, L. A. , … Groenen, M. A. (2015). Artificial selection on introduced Asian haplotypes shaped the genetic architecture in European commercial pigs. Proceedings of the Royal Society B: Biological Sciences, 282(1821). 10.1098/rspb.2015.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bosse, M. , Megens, H. J. , Frantz, L. A. , Madsen, O. , Larson, G. , Paudel, Y. , … Groenen, M. A. (2014). Genomic analysis reveals selection for Asian genes in European pigs following human‐mediated introgression. Nature Communications, 5, 4392 10.1038/ncomms5392 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Browning, S. R. , & Browning, B. L. (2007). Rapid and accurate haplotype phasing and missing‐data inference for whole‐genome association studies by use of localized haplotype clustering. American Journal of Human Genetics, 81(5), 1084–1097. 10.1086/521987 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chen, M. , Su, G. , Fu, J. , Zhang, Q. , Wang, A. , Sando Lund, M. , & Guldbrandtsen, B. (2017). Population admixture in Chinese and European Sus scrofa. Scientific Reports, 7(1), 13178 10.1038/s41598-017-13127-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Danecek, P. , Auton, A. , Abecasis, G. , Albers, C. A. , Banks, E. , DePristo, M. A. , … 1000 Genomes Project Analysis Group (2011). The variant call format and VCFtools. Bioinformatics, 27(15), 2156–2158. 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Eriksson, J. , Larson, G. , Gunnarsson, U. , Bed'hom, B. , Tixier‐Boichard, M. , Stromstedt, L. , … Andersson, L. (2008). Identification of the yellow skin gene reveals a hybrid origin of the domestic chicken. PLoS Genetics, 4(2), e1000010 10.1371/journal.pgen.1000010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Fang, M. , Larson, G. , Soares Ribeiro, H. , Li, N. , & Andersson, L. (2009). Contrasting mode of evolution at a coat color locus in wild and domestic pigs. PLoS Genetics, 5(1), e1000341 10.1371/journal.pgen.1000341 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Figueiro, H. V. , Li, G. , Trindade, F. J. , Assis, J. , Pais, F. , Fernandes, G. , … Eizirik, E. (2017). Genome‐wide signatures of complex introgression and adaptive evolution in the big cats. Science Advances, 3(7), e1700299 10.1126/sciadv.1700299 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Frantz, L. A. , Schraiber, J. G. , Madsen, O. , Megens, H. J. , Cagan, A. , Bosse, M. , … Groenen, M. A. (2015). Evidence of long‐term gene flow and selection during domestication from analyses of Eurasian wild and domestic pig genomes. Nature Genetics, 47, 1141–1148. 10.1038/ng.3394 [DOI] [PubMed] [Google Scholar]
  14. Giuffra, E. , Kijas, J. M. , Amarger, V. , Carlborg, O. , Jeon, J. T. , & Andersson, L. (2000). The origin of the domestic pig: Independent domestication and subsequent introgression. Genetics, 154(4), 1785–1791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Goedbloed, D. J. , Megens, H. J. , Van Hooft, P. , Herrero‐Medrano, J. M. , Lutz, W. , Alexandri, P. , … Prins, H. H. (2013). Genome‐wide single nucleotide polymorphism analysis reveals recent genetic introgression from domestic pigs into Northwest European wild boar populations. Molecular Ecology, 22(3), 856–866. 10.1111/j.1365-294X.2012.05670.x [DOI] [PubMed] [Google Scholar]
  16. Groenen, M. A. , Archibald, A. L. , Uenishi, H. , Tuggle, C. K. , Takeuchi, Y. , Rothschild, M. F. , … Schook, L. B. (2012). Analyses of pig genomes provide insight into porcine demography and evolution. Nature, 491(7424), 393–398. 10.1038/nature11622 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hedrick, P. W. (2013). Adaptive introgression in animals: Examples and comparison to new mutation and standing variation as sources of adaptive variation. Molecular Ecology, 22(18), 4606–4618. 10.1111/mec.12415 [DOI] [PubMed] [Google Scholar]
  18. Hernandez‐Ochoa, I. , Karman, B. N. , & Flaws, J. A. (2009). The role of the aryl hydrocarbon receptor in the female reproductive system. Biochemical Pharmacology, 77(4), 547–559. 10.1016/j.bcp.2008.09.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hu, Z. L. , Park, C. A. , & Reecy, J. M. (2016). Developmental progress and current status of the Animal QTLdb. Nucleic Acids Research, 44(D1), D827–D833. 10.1093/nar/gkv1233 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hudson, R. R. , Boos, D. D. , & Kaplan, N. L. (1992). A statistical test for detecting geographic subdivision. Molecular Biology and Evolution, 9(1), 138–151. [DOI] [PubMed] [Google Scholar]
  21. Huerta‐Sanchez, E. , Jin, X. , Asan Bianba, Z. , Peter, B. M. , Vinckenbosch, N. , … Nielsen, R. (2014). Altitude adaptation in Tibetans caused by introgression of Denisovan‐like DNA. Nature, 512(7513), 194–197. 10.1038/nature13408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hufford, M. B. , Lubinksy, P. , Pyhajarvi, T. , Devengenzo, M. T. , Ellstrand, N. C. , & Ross‐Ibarra, J. (2013). The genomic signature of crop‐wild introgression in maize. PLoS Genetics, 9(5), e1003477 10.1371/journal.pgen.1003477 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Jordan, I. K. (2016). The Columbian Exchange as a source of adaptive introgression in human populations. Biology Direct, 11(1), 17 10.1186/s13062-016-0121-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Juric, I. , Aeschbacher, S. , & Coop, G. (2015). The strength of selection against Neanderthal introgression. PLoS Genetics, 12, e1006340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kinsella, R. J. , Kahari, A. , Haider, S. , Zamora, J. , Proctor, G. , Spudich, G. , … Flicek, P. (2011). Ensembl BioMarts: A hub for data retrieval across taxonomic space. Database: the Journal of Biological Databases and Curation, 2011, bar030 10.1093/database/bar030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Larson, G. , Dobney, K. , Albarella, U. , Fang, M. , Matisoo‐Smith, E. , Robins, J. , … Cooper, A. (2005). Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science, 307(5715), 1618–1621. 10.1126/science.1106927 [DOI] [PubMed] [Google Scholar]
  27. Lawson, D. J. , Hellenthal, G. , Myers, S. , & Falush, D. (2012). Inference of population structure using dense haplotype data. PLoS Genetics, 8(1), e1002453 10.1371/journal.pgen.1002453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Li, H. , & Durbin, R. (2009). Fast and accurate short read alignment with Burrows‐Wheeler transform. Bioinformatics, 25(14), 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Li, H. , Handsaker, B. , Wysoker, A. , Fennell, T. , Ruan, J. , Homer, N. , … Durbin, R. (2009). The Sequence Alignment/Map format and SAMtools. Bioinformatics, 25(16), 2078–2079. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Manunza, A. , Zidi, A. , Yeghoyan, S. , Balteanu, V. A. , Carsai, T. C. , Scherbakov, O. , … Amills, M. (2013). A high throughput genotyping approach reveals distinctive autosomal genetic signatures for European and Near Eastern wild boar. Plos One, 8(2), e55891 10.1371/journal.pone.0055891 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Marnetto, D. , Huerta‐Sánchez, E. , & Price, S. (2017). Haplostrips: Revealing population structure through haplotype visualization. Methods in Ecology and Evolution, 8(10), 1389–1392. 10.1111/2041-210x.12747 [DOI] [Google Scholar]
  32. McKenna, A. , Hanna, M. , Banks, E. , Sivachenko, A. , Cibulskis, K. , Kernytsky, A. , … DePristo, M. A. (2010). The genome analysis toolkit: A MapReduce framework for analyzing next‐generation DNA sequencing data. Genome Research, 20(9), 1297–1303. 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. McVean, G. (2009). A genealogical interpretation of principal components analysis. PLoS Genetics, 5(10), e1000686 10.1371/journal.pgen.1000686 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Mikawa, S. , Hayashi, T. , Nii, M. , Shimanuki, S. , Morozumi, T. , & Awata, T. (2005). Two quantitative trait loci on Sus scrofa chromosomes 1 and 7 affecting the number of vertebrae. Journal of Animal Science, 83(10), 2247–2254. [DOI] [PubMed] [Google Scholar]
  35. Mikawa, S. , Morozumi, T. , Shimanuki, S. , Hayashi, T. , Uenishi, H. , Domukai, M. , … Awata, T. (2007). Fine mapping of a swine quantitative trait locus for number of vertebrae and analysis of an orphan nuclear receptor, germ cell nuclear factor (NR6A1). Genome Research, 17(5), 586–593. 10.1101/gr.6085507 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Pardo‐Diaz, C. , Salazar, C. , Baxter, S. W. , Merot, C. , Figueiredo‐Ready, W. , Joron, M. , … Jiggins, C. D. (2012). Adaptive introgression across species boundaries in Heliconius butterflies. PLoS Genetics, 8(6), e1002752 10.1371/journal.pgen.1002752 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Patterson, N. , Price, A. L. , & Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics, 2(12), e190 10.1371/journal.pgen.0020190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Pfeifer, B. , Wittelsburger, U. , Ramos‐Onsins, S. E. , & Lercher, M. J. (2014). PopGenome: An efficient Swiss army knife for population genomic analyses in R. Molecular Biology and Evolution, 31(7), 1929–1936. 10.1093/molbev/msu136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Rubin, C. J. , Megens, H. J. , Martinez Barrio, A. , Maqbool, K. , Sayyab, S. , Schwochow, D. , … Andersson, L. (2012). Strong signatures of selection in the domestic pig genome. Proceedings of the National Academy of Sciences of the United States of America, 109(48), 19529–19536. 10.1073/pnas.1217149109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Schmieder, R. , & Edwards, R. (2011). Quality control and preprocessing of metagenomic datasets. Bioinformatics, 27(6), 863–864. 10.1093/bioinformatics/btr026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wang, C. , Wang, H. , Zhang, Y. , Tang, Z. , Li, K. , & Liu, B. (2015). Genome‐wide analysis reveals artificial selection on coat colour and reproductive traits in Chinese domestic pigs. Molecular Ecology Resources, 15(2), 414–424. 10.1111/1755-0998.12311 [DOI] [PubMed] [Google Scholar]
  42. White, S. (2011). From globalized pig breeds to capitalist pigs: A study in animal cultures and evolutionary history. Environmental History, 16(1), 94–120. 10.1093/envhis/emq143 [DOI] [Google Scholar]
  43. Wilkinson, S. , Lu, Z. H. , Megens, H. J. , Archibald, A. L. , Haley, C. , Jackson, I. J. , … Wiener, P. (2013). Signatures of diversifying selection in European pig breeds. PLoS Genetics, 9(4), e1003453 10.1371/journal.pgen.1003453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Wright, S. S. (1951). The genetical structure of populations. Annals of Eugenics, 15(4), 323–354. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

 

 

 

 

 

Data Availability Statement

Sequencing data from Meishan pigs have been deposited in the NCBI Short Read Archive (SRA) (project accession no. PRJNA378496). Chromosome painting results are available from the Dryad Digital Repository: https://doi.org/10.5061/dryad.vh6t0c7.


Articles from Evolutionary Applications are provided here courtesy of Wiley

RESOURCES