Abstract
Jiaji Duck (JJ) is a Muscovy duck species that possesses many superior characteristics, and it has become an important genetic resource in China. However, to date, its genetic characteristics and genetic relationship with other duck breeds have not been explored yet, which greatly limits the utilization of JJ. In the present study, we investigated the genome sequences of 15 individual ducks representing five different duck populations, including JJ, French Muscovy duck (FF), mallard (YD), hong duck (HD) and Beijing duck (BD). Moreover, we investigated the characteristics of JJ-specific single nucleotide polymorphisms (SNPs) and compared the genome sequences of JJ vs. YD and JJ vs. BD using integrated strategies, including mutation detection, selective screening, and Gene Ontology (GO) analysis. More than 40 Gb of clean data were obtained for each population (mean coverage of 13.46 Gb per individual). A total number of 22,481,367 SNPs and 4,156,829 small insertion-deletions (Indels) were identified for the five duck populations, which could be used as molecular markers in breeding and utilization of JJ. Moreover, we identified 1,447,932 JJ-specific SNPs, and found that genes covering at least one JJ-specific SNP mainly involved in protein phosphorylation and dephosphorylation, as well as DNA modification. Phylogenetic tree and principal components analysis (PCA) revealed that the genetic relationship of JJ was closest to FF, while it was farthest to BD. A total of 120 and 111 genes were identified as positive selection genes for JJ vs. BD and JJ vs. YD, respectively. GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses showed that the positive selection genes for JJ vs. BD ducks mainly involved in pigmentation, muscle contraction and stretch, gland secretion, and immunology, while the positive selection genes obtained from JJ vs. YD ducks mainly involved in embryo development, muscle contraction and stretch, and gland secretion. Taken together, our findings enabled us to better understand the characteristics of JJ and provided a molecular basis for the breeding and hybrid utilization of JJ in the future.
1. Introduction
Jiaji Duck (JJ), a Muscovy duck variety, is initially raised at Jiaji Town, Qionghai County, Hainan Province. JJ duck is covered by black and white plumage and has been believed to be originated from Southeast Asia. JJ has been raised at Jiaji Town for more than 200 years, and it has become an important genetic resource in China. JJ duck has various superior characteristics, such as fast growth, heat resistance, strong adaptability, high lean meat rate, and good meat quality. However, only few studies on JJ have been reported, while none of them is related to the genetic characteristics of JJ and genetic relationship between JJ and other duck breeds. This greatly limits the breeding and hybrid utilization of JJ. The approach of whole genome re-sequencing (WGRS) has been proved to be a very powerful tool in genetic evaluation and genetic relationship exploration of various animals, including pigs [1], chickens [2], cattle [3], sheep [4], dogs [5] and rabbits [6]. For ducks, the genome sequences have been constructed in 2013 [7], providing a platform for the research of ducks using WGRS method and promoting the related exploration of ducks. Xu et al. have explored the genetic signatures of artificial selection using genome WGRS data [8]. Zhang et al. have indicated the genomic variation in Peking duck populations using WGRS data [9]. Zhou et al. have identified the genes associated with body size and plumage color in ducks using WGRS and genome-wide association study (GWAS) technologies [10]. However, no report has investigated the genetic characteristics of JJ and its relationship with other duck populations using WGRS method.
In this study, we selected five duck populations that are different in a couple of characteristics. The first one was JJ, which has been introduced as above-mentioned. Similar to JJ, French Muscovy duck (FF) is another Muscovy duck population that has been introduced into Hainan Province since 1990’s. However, FF grows faster than JJ, and it is fully covered by white plumage. Mallard (YD), widely accepted as the ancestor of domestic duck [11], is used to confirm whether JJ and FF are also derived from YD. Beijing duck (BD) has been subjected to intensive selection to provide the raw material for Beijing roast duck dishes. Hong duck (HD) is the hybrid offspring of Beijing duck and Sheldrake, which has higher quality of meat relative to Beijing duck and faster growth rate compared with Sheldrake. Four out of the five duck populations (JJ, FF, HD and BD) have been recorded by Animal Genetic Resources in China Poultry [12]. To characterize the genetic background of JJ duck and explore its genetic relationship with the above-listed duck breeds, we selected 15 ducks representing the five duck populations with three individuals for each population, and the whole genome of each duck was individually sequenced. Moreover, we, for the first time, investigated the genetic characteristics of JJ through analyzing JJ-specific single nucleotide polymorphisms (SNPs) and compared the genomic sequences of these five duck populations. Collectively, our results provided basis for the reasonable utilization of JJ in the future.
2. Materials and methods
2.1. Sample preparation
In the present study, 15 adult female ducks representing five different duck populations were selected, with three ducks for each population. Genomic DNA was extracted from about 5 mL venous blood for each individual as previously described [13]. After blood collection, the birds were housed for further analysis. JJ and FF ducks, obtained from Hainan Chuanwei Muscovy Duck Breeding Co., Ltd., were sampled. BD and HD ducks were developed by Institute of Animal Science, Chinese Academy of Agricultural Sciences. YD was raised by the Ji’ao Austrian Agricultural Science and Technology Co., Ltd. in Fenghua, Zhejiang Province.
2.2. Library construction and sequencing
Libraries were generated for each individual using standard Illumina sequencing protocols. The constructed libraries were sequenced as 150-bp paired-end reads on Illumina sequencing platform (HiSeqTM 4000), and more than 13 Gb clean data were generated for each individual (Table 1).
Table 1. The re-sequencing and mapping results.
Items | Clean Reads (M) | Clean Bases (Gb) | Mapped Bases (Gb) | Mapping Rate (%) | 1Dup. Rate (%) | 2Uniq. Rate (%) | Mean Depth | 3Cov. Rate (%) |
---|---|---|---|---|---|---|---|---|
BD1 | 88.67 | 13.30 | 12.51 | 94.04 | 25.38 | 94.37 | 12.09 | 96.51 |
BD2 | 90.83 | 13.63 | 12.79 | 93.86 | 24.93 | 93.88 | 12.39 | 96.85 |
BD3 | 91.35 | 13.70 | 12.85 | 93.80 | 25.04 | 93.86 | 12.45 | 96.85 |
FF1 | 90.35 | 13.55 | 12.45 | 91.84 | 23.53 | 93.96 | 12.32 | 94.10 |
FF2 | 91.14 | 13.67 | 12.48 | 91.29 | 23.54 | 93.65 | 12.43 | 94.01 |
FF3 | 90.41 | 13.56 | 12.48 | 92.06 | 23.99 | 93.93 | 12.33 | 94.09 |
HD1 | 90.78 | 13.62 | 12.82 | 94.13 | 7.81 | 94.41 | 12.38 | 97.00 |
HD2 | 90.07 | 13.51 | 12.91 | 95.52 | 24.99 | 95.23 | 12.28 | 96.51 |
HD3 | 84.46 | 12.67 | 12.01 | 94.76 | 24.29 | 94.52 | 11.52 | 96.80 |
JJ1 | 90.21 | 13.53 | 12.41 | 91.71 | 24.40 | 93.60 | 12.30 | 94.11 |
JJ2 | 90.26 | 13.54 | 12.35 | 91.24 | 23.06 | 93.61 | 12.31 | 94.02 |
JJ3 | 89.62 | 13.44 | 12.18 | 90.61 | 23.70 | 92.97 | 12.22 | 94.13 |
YD1 | 89.14 | 13.37 | 12.57 | 93.98 | 25.15 | 94.42 | 12.15 | 96.50 |
YD2 | 88.75 | 13.31 | 12.51 | 93.94 | 25.49 | 94.29 | 12.10 | 96.46 |
YD3 | 89.99 | 13.50 | 12.72 | 94.25 | 24.95 | 94.51 | 12.27 | 96.50 |
Means | 89.74 | 13.46 | 12.54 | 93.14 | 23.35 | 94.08 | 12.24 | 95.63 |
Total | 1346.03 | 201.90 | 188.03 | 195.78 |
1 Duplication rate
2 Unique rate
3 The coverage rate, referring to the percentage of the mapped bases divide by the total bases of duck genome, the mapped bases means bases that covered by cleans bases more than 1 X. BD1- BD3, Beijing duck, individual 1-individual 3; FF1- FF3, French Muscovy duck, individual 1- individual 3; HD1- HD3, Hong duck, individual 1-individual 3; JJ1- JJ3, Jiaji Duck, individual 1-individual 3; YD1- YD3, mallard, individual 1- individual 3.
2.3. Read mapping
Sequencing adaptors were first removed using Trim Galore version 0.3.7 (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/). Subsequently, the trimmed data (clean data) were mapped to the duck genome assembly (ensemble version: BGI_duck_1.0) using the Burrows-Wheeler Alignment tool (BWA) [14] with default settings. The uniquely mapped reads were used for downstream analysis.
2.4. Detection and annotation of SNPs and small insertion–deletions (Indels)
For each individual of a breed, the genome analysis toolkit (GATK) was used to detect SNPs and Indels (1–50 bp) with GATK best practices method using RealignerTargetCreator, IndelRealigner, HaplotypeCaller and GenotypeGVCFs [15]. Then SelectVariants was used to separate SNPs, Indels and other variants [16, 17]. The SNPs and Indels from the three individuals were merged to form the SNP and Indel sets of this breed. The merged SNPs and Indels were filtered using the parameters recommended by the GATK mentor, and those variants with ultra-high (> 500) or ultra-low (< 4) coverage were discarded. The remaining variants were used for downstream analysis.
SnpEff [18] was used to annotate SNPs and Indels against the reference genome annotation to identify the effects of sequence variants on gene functionality.
2.5. The genetic characteristics of JJ obtained through analyzing JJ-specific SNPs and the related genes
If an SNP was only found in JJ population, it was considered as JJ-specific SNP without considering whether it was a homozygous or heterozygous SNP. To enhance the sufficiency, a JJ-specific SNP must be found in two of the three individuals of JJ ducks. According to these criteria, the JJ-specific SNPs were identified using a homemade pearl script. The SNPs that were located on the exons of duck genes were then selected and then classified into different types based on the annotation of SNPs, such as synonymous mutation SNPs, nonsynonymous mutation SNPs, stop-gain SNPs and stop-loss SNPs. Finally, the genes covering at least one JJ-specific SNP that was located on exons of genes were picked out, and Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) functional enrichment analyses were performed as described in section 2.8.
2.6. Genetic structure analysis of five duck populations
The genetic relationships of the five duck populations were inferred through phylogenetic tree analysis and principal components analysis (PCA). SNPs for each duck population with a minor allele frequency (MAF) ≤ 0.1 were removed, and the remaining SNPs were used for phylogenetic tree analysis and PCA. For phylogenetic tree analysis, NJ method of PHYLIP software was used to construct phylogenetic tree [19] with the default settings, and Newick Utilities software was used to generate phylogenetic tree graphics [20]. For PCA, EIGENSOFT software [21] was used to infer genetic relationships among the five duck populations, and the first two principal components were used to construct the PCA figure.
2.7. Analysis of selective sweeps, positive selection genes
Selective sweeps occur when the frequency of beneficial genetic variants is increased as a result of positive selection along with nearby linked genetic variants [9]. Generally, positive selection can lead to reduced heterozygosity (Hp) of the selected population and increased fixation index (Fst) between populations around the selected site [6]. In order to define candidate regions undergoing positive selection, outlier approach in combination with Hp estimates was used for the five sequenced duck populations with estimates of Fst between duck populations. Since the current duck reference genome was assembled in sub-chromosomal scaffolds [22], only scaffolds>150 kb (1,226 scaffolds) were used for the downstream analysis to ensure the accuracy of our calculation of Hp and Fst, which was similar to studies on rabbits [6] and chickens [2]. In addition, to reduce the number of false positive windows, windows with > 20 SNPs were used in downstream analysis. The Hp values and Fst values of the genomic regions were calculated using a 100 kb-sliding window (step = 50 kb), Z-transform Hp (ZHp) and FST (ZFst) as previously described [5]. Windows with both their ZHp and ZFst values falling in the top 5% were identified as the positive selection regions. Genes that were overlapped with or harbored by positive selection regions were defined as positive selection genes and were used in downstream analysis.
2.8. GO and KEGG analyses of positive selection genes
The duck GO annotation information from the Ensembl Genome Browser was used to perform GO enrichment analysis of genes under positive selection, and the whole genome set of protein-coding genes in the annotation was used as the background. For each query, represented GO terms were tested against the background using the GOstats Bioconductor R package (https://www.r-project.org/) [23]. To further understand the biological functions of the positive selection genes, KEGG [24] was used to perform pathway enrichment analysis.
2.10. Funding
This work was financially supported by Key Research and Development Programs of Hainan Province (Grant no. ZDYF2019053, ZDYF2018224), Chinese Modern Technology System of Agricultural Industry (Grant no. CARS-42-50), and Central Public-interest Scientific Institution Basal Research Fund for Chinese Academy of Tropical Agricultural Sciences (Grant no. 1630032017034).
2.11. Ethics statement
The animal-related handling and sampling procedures were approved by the Animal Care and Use Committee of Chinese Academy of Tropical Agricultural Sciences (CATAS), and great efforts were made to minimize the suffering of animals in accordance with recommendations of European Commission (1997). All methods were conducted in accordance with relevant guidelines. Venous blood was kindly collected to ameliorate the suffering of ducks.
3. Results and discussion
3.1. A number of variants are found in duck genome
WGRS data were obtained from five duck populations with three samples of each population (JJ1, JJ2, JJ3; BD1, BD2, BD3; FF1, FF2, FF3; HD1, HD2, HD3; YD1, YD2, YD3) using HiSeq TM 4000 system. After quality control [25], averagely 13.46 Gb (12.24 X) data were obtained for each individual, > 40 Gb (> 36 X) data were obtained for each duck population, and a total of 201.90 Gb (195.78 X) high-quality data remained (Table 1). Subsequently, the high-quality data were mapped to the A. platyrhynchos genome (BGI_duck_1.0), resulting in an average mapping rate of 93.14% and an average coverage rate of 95.63% for each individual (Table 1). The average mapping rate of this study (93.14%) was higher compared with our previous report (89.71%) [8], indicating a better quality of sequencing data used in this study.
In this study, SNPs and Indels of each duck population were identified using GATK according to previous studies [26, 27]. The SNPs and Indels for codons with multiple alleles were removed. Since the duck genome only contains scaffolds without chromosomes, we could not compare the genome differences among the five duck populations at the chromosomal level. For JJ duck, a total number of 22,481,367 SNPs and 4,156,829 Indels were identified after filtration. For SNPs, ~50% of total SNPs were located in intergenic regions (11,042,847 SNPs, 49.12%), and 45.31% of total SNPs were located in intronic regions (10,186,307). SNPs located in exonic regions (303,498) only accounted for 1.35% of the total SNPs (Fig 1 and Table 2). Similar to SNPs, 50.02% of total Indels (2,079,246) were located in intergenic regions, 44.02% of total Indels (1,874,314) were located in intronic regions, and only 1.44% of total Indels (59,858) were harbored by exonic regions (Fig 1 and Table 2). These results indicated that the majority of the identified mutations were located in non-coding sequences (intergenic regions or intronic regions), and only a small proportion of them fell in exon sequence, showing that only very few variants along with duck genome could exert functional effects on protein function. Our results were consistent with those of Xu et al. [8] in ducks and Carneiro et al. in rabbits [6].
Table 2. Annotation of identified SNPs and Indels for JJ duck population.
Items | SNPs | Indels | ||
---|---|---|---|---|
Number | Percent (%) | Number | Percent (%) | |
Total | 22,481,367 | 100 | 4,156,829 | 100 |
UTR5 | 123,648 | 0.55 | 17,459 | 0.42 |
UTR3 | 312,491 | 1.39 | 60,274 | 1.45 |
exonic | 303,498 | 1.35 | 59,858 | 1.44 |
splicing | 2,248 | 0.01 | 1,247 | 0.03 |
exonic; splicing | 0 | 0 | 0 | 0 |
upstream | 249,543 | 1.11 | 46,141 | 1.11 |
downstream | 245,047 | 1.09 | 59,443 | 1.43 |
upstream | 13,489 | 0.06 | 2,494 | 0.06 |
intronic | 10,186,307 | 45.31 | 1,829,836 | 44.02 |
intergenic | 11,042,847 | 49.12 | 2,079,246 | 50.02 |
ncRNA_UTR3 | 0 | 0 | 0 | 0 |
ncRNA_UTR5 | 0 | 0 | 0 | 0 |
ncRNA_exonic | 0 | 0 | 0 | 0 |
ncRNA_splicing | 0 | 0 | 0 | 0 |
ncRNA_intronic | 2,248 | 0.01 | 831 | 0.02 |
other | 0 | 0 | 0 | 0 |
SNPs mean single nucleotide polymorphisms; Indels mean short insertions and deletions.
3.2. The JJ-specific SNPs
The specific SNPs are usually the basic genetic characteristics of a species. A high throughput SNP marker can offer strong accuracy for genotyping, which can be used in marker-assisted selection [28, 29]. In this study, we screened the JJ-specific SNPs and found 1,447,932 specific SNPs, of which only 56,651 SNPs (accounting for 4% of the total specific SNPs) were located in the exons of duck genes. Among these SNPs located in the exons, there were 36,517 synonymous mutation SNPs (64.46%), suggesting that the majority of the SNPs were random mutations that did not lead to the changes of amino acids. In addition, there were 19,817 non-synonymous mutation SNPs, which could lead to the changes of amino acids. Moreover, we detected 285 stop-gain SNPs and 11 stop-loss SNPs, indicating that SNP mutations resulting in alteration or loss of gene function were rare events, which was consistent with previous reports in rabbits [6], pigs [30], and chickens [2].
The genes covering at least one JJ-specific non-synonymous SNP could lead to functional differences of these genes for JJ compared with other duck populations, and herein might represent the specificity of JJ. In this study, we selected genes covering at least one JJ-specific non-synonymous SNP, and performed GO and KEGG analysis on these genes to explore JJ-specific genetic characteristics. The biological process (BP) of GO analysis demonstrated that genes covering at least one JJ-specific non-synonymous SNP mainly played roles in protein phosphorylation and dephosphorylation, as well as DNA modification (Fig 2A and S1 Table). We also carried out KEGG analysis and constructed the significant enrichment graphics of q-values via the 20 most significant pathways (Fig 2B and S2 Table). The results indicated that Fanconi anemia pathway was the most significant pathway, while the metabolic pathway covered the most genes. These findings might reflect the specificity of JJ in Fanconi anemia and metabolism.
3.3. Genetic relationships between JJ and other duck populations
A phylogenetic tree is a branching diagram or “tree” showing the inferred evolutionary relationships among various biological species or subspecies based on the similarities and differences of their genetic characteristics [8]. In this study, we constructed the phylogenetic tree to illustrate the genetic relationships among the five duck populations (Fig 3A). The results showed that the individuals of JJ were firstly clustered into a class, followed by FF, YD, HD, and BD sequentially, indicating that FF was the closest population genetically related to JJ, followed by YD, HD and BD. Interestingly, the genetic relationship between JJ and YD was closer than that between YD and HD or BD. Since YD (mallard) was considered as the ancestor of HD and BD, we hypothesized that JJ and FF were differently originated from HD and BD. In addition, we used PCA to infer genetic relationships among the five duck populations in another aspect (Fig 3B). The PCA results revealed that individuals of JJ and FF were clustered into one group, suggesting the closest genetic relationship of JJ and FF, which was consistent with the results of phylogenetic tree analysis. Individuals in the remaining three populations were clustered into three separate groups.
3.4. Many genomic regions and genes associated with positive selective between JJ and other duck populations are found
Selective sweeps occur when the frequency of beneficial genetic variants is increased as a result of positive selection along with nearby linked genetic variants [9]. Given that the current duck reference genome is assembled in sub-chromosomal scaffolds [22], only scaffolds>150 kb (1,226 scaffolds) were used for the downstream analysis in our study to ensure the accuracy of Hp and Fst calculation, which was similar to studies on rabbits [6] and chickens [2]. We calculated the Hp and Fst values of the genomic regions using a 100 kb-sliding window (step = 50 kb). Windows with both their ZHp and ZFst values in the top 5% were identified as the positive selection regions. Since individuals in JJ and FF were clustered into one group, and genetic structure analysis suggested that HD was very close to BD in genetics, we only explored the positive selection regions in two comparisons, namely JJ vs. YD and JJ vs. BD. We selected the longest 100 scaffolds to construct the ZHp and ZFst figures to show the basic information. For JJ vs. YD comparison, 79 positive selection regions were obtained, which covered 111 genes (Fig 4 and S3 Table). For JJ vs. BD comparison, 76 positive selection regions were obtained, which covered 120 genes (Fig 5 and S4 Table).
The GO project is a collaborative effort to develop and use ontology to support biologically meaningful annotation of genes and their products [31]. In this study, we performed GO analysis to describe the properties of positive selection genes of JJ vs. BD ducks and those of JJ vs. YD ducks. For positive selection genes of JJ vs. BD ducks, 232 significant GO terms were enriched (S5 Table). Of which, the 20 most significant terms (Fig 5) were mainly involved in pigmentation (terms included melanocyte differentiation, pigment cell differentiation, developmental pigmentation), muscle contraction and stretch (muscle contraction, muscle system process, detection of muscle stretch, Wnt signaling pathway [32, 33], skeletal muscle thin filament assembly), and immunology (negative regulation of mast cell apoptotic process, mast cell apoptotic process, regulation of mast cell apoptotic process, mast cell homeostasis) [34, 35]. The results were consistent with the differences in feather color, athletic ability, and immunity between JJ and YD. For positive selection genes of JJ vs. YD ducks, 152 significant GO terms were obtained (S6 Table). The positive selection genes of JJ vs. YD ducks were mainly involved in embryo development because eight GO terms were related to embryo development in the 20 most significant terms, including wing disc development, instar larval or pupal development, larval development, imaginal disc-derived wing morphogenesis, wing disc morphogenesis, imaginal disc morphogenesis, post-embryonic appendage morphogenesis, and instar larval or pupal morphogenesis. Subsequently, terms of muscle contraction and stretch (muscle contraction, response to muscle stretch, and detection of muscle stretch) were also enriched (Fig 6). These results highlighted the differences in embryo development and muscle contraction and stretch between JJ and YD.
KEGG is a "computer representation" of the biological system [24], and the KEGG database can be utilized for modeling, simulation, browsing and retrieval of data. In this study, the significant pathways (P<0.05) for JJ vs. BD ducks and JJ vs. YD ducks were selected (Table 3). The results showed that the pathways related to gland secretion were significantly enriched for JJ vs. BD and JJ vs. YD. For example, five out of eight significantly enriched pathways were associated with gland secretion for JJ vs. BD, while five out of 11 significantly enriched pathways for JJ vs. YD ducks were related with gland secretion. The results of KEGG analysis showed that gland secretion was the main difference between JJ and BD as well as YD.
Table 3. Significantly enriched KEGG pathways for JJ vs. BD ducks and JJ vs. YD ducks.
JJ vs. BD | ||
pathway names | p-value | genes |
Hypertrophic cardiomyopathy (HCM) | 0.002826 | gene4589, gene16244, gene13399 |
Salivary secretion | 0.003638 | gene13401, gene6509, gene12554 |
Insulin secretion | 0.005433 | gene14642, gene6509, gene12554 |
Pancreatic secretion | 0.006374 | gene13401, gene6509, gene12554 |
Bile secretion | 0.03918 | gene14642, gene12554 |
Dilated cardiomyopathy | 0.0404 | gene4589, gene13399 |
Gastric acid secretion | 0.0404 | gene13401, gene12554 |
PI3K-Akt signaling pathway | 0.042055 | gene3589, gene9998, gene16244 |
JJ vs. YD | ||
Salivary secretion | 0.002882 | gene13401, gene6509, gene12554 |
Insulin secretion | 0.004314 | gene14642, gene6509, gene12554 |
Pancreatic secretion | 0.005068 | gene13401, gene6509, gene12554 |
Aldosterone-regulated sodium reabsorption | 0.010621 | gene9288, gene12554 |
Jak-STAT signaling pathway | 0.015062 | gene9287, gene3589, gene9286 |
Ubiquitin mediated proteolysis | 0.024208 | gene12556, gene12904, gene9290 |
Hypertrophic cardiomyopathy (HCM) | 0.028686 | gene4589 gene13399 |
Bile secretion | 0.03379 | gene14642, gene12554 |
Cytokine-cytokine receptor interaction | 0.034 | gene9287, gene3589, gene9286 |
Dilated cardiomyopathy | 0.034851 | gene4589, gene13399 |
Gastric acid secretion | 0.034851 | gene13401, gene12554 |
Grey represents the comparison of JJ vs. BD and the secretion related pathways in the significant enriched KEGG pathways. Light green means the comparison of JJ vs. BD and the secretion related pathways in the significant enriched KEGG pathways.
Collectively, we, for the first time, explored the genetic characteristics of JJ and its genetic relationship with other four duck breeds using WGRS data. The findings of this study would benefit our understanding of the genetic characteristics of JJ and provide valuable references for the utilization of JJ breeds in the future.
Supporting information
Acknowledgments
We thank Dr. Li Chen for providing mallard blood and egg samples, Prof. Qi Zhang and Dr. Zhanbao Guo for their help with sample collection, Dr. Kyle Schachtschneider for editing the manuscript and Dr. Minggang Wang for modifying the initial manuscript.
Data Availability
Duck genomic resequencing datasets from JJ, FF, YD, BD and HD have been uploaded to the Short Read Archive (SRA) under the accession number PRJNA549423.
Funding Statement
The commercial company of Hainan Chuanwei Muscovy breeding Ltd is a commercial breeding company in Jiaji Duck. In this study, this company provided the blood samples of JJ Duck. This company did not provide support in the form of salaries for any author and did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Schiavo G, Galimberti G, Calo DG, Samore AB, Bertolini F, Russo V, et al. Twenty years of artificial directional selection have shaped the genome of the Italian Large White pig breed. Anim Genet. 2016;47(2):181–191. 10.1111/age.12392 [DOI] [PubMed] [Google Scholar]
- 2.Rubin CJ, Zody MC, Eriksson J, Meadows JR, Sherwood E, Webster MT, et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature. 2010;464:587–591. 10.1038/nature08832 [DOI] [PubMed] [Google Scholar]
- 3.Kim ES, Cole JB, Huson H, Wiggans GR, Van Tassell CP, Crooker BA, et al. Effect of artificial selection on runs of homozygosity in u.s. Holstein cattle. PLoS One. 2013; 8:e80813 10.1371/journal.pone.0080813 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Loehr J, Carey J, Hoefs M, Suhonen J, Ylonen H. Horn growth rate and longevity: implications for natural and artificial selection in thinhorn sheep (Ovis dalli). J Evol Biol. 2007;20:818–828. 10.1111/j.1420-9101.2006.01272.x [DOI] [PubMed] [Google Scholar]
- 5.Axelsson E, Ratnakumar A, Arendt ML, Maqbool K, Webster MT, Perloski M, et al. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature. 2013;495:360–364. 10.1038/nature11837 [DOI] [PubMed] [Google Scholar]
- 6.Carneiro M, Rubin CJ, Di Palma F, Albert FW, Alfoldi J, Barrio AM, et al. Rabbit genome analysis reveals a polygenic basis for phenotypic change during domestication.Science.2014;345:1074–1079. 10.1126/science.1253714 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Huang Y.H., Li Y.R., Burt D.W., Chen H.L., Zhang Y., et al. The duck genome and transcriptome provide insight into an avian influenza virus reservoir species. Nature Genetics 45 (2013) 776–783. 10.1038/ng.2657 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Xu TS, Gu LH, Yu HP, Jiang XF, Zhang YS, Zhang XH, et al. Analysis of Anasplatyrhynchos genome resequencing data reveals genetic signatures of artificialselection.PLOSONE.2019;14:e0211908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhang Z, Jia Y, Chen Y, Wang L, Lv X, Yang F, et al. Genomic variation in Pekin duck populations developed in three different countries as revealed by whole-genome data. Anim Genet. 2018;49):132–136. 10.1111/age.12639 [DOI] [PubMed] [Google Scholar]
- 10.Zhou Z, Li M, Cheng H, Fan W, Yuan Z, Gao Q, et al. An intercross population study reveals genes associated with body size and plumage color in ducks. Nature Communication. 2018;9:2648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Li HF, Zhu WQ, Song WT, Shu JT, Han W, Chen KW. Origin and genetic diversity of Chinese domestic ducks. Mol Phylogenetics and Evolution. 2010;57:634–640. [DOI] [PubMed] [Google Scholar]
- 12.Chen KW, Yang N, Wang GY, Wang ZY, Wang KH, Wang JW, et al. Animal Genetic Resources in China Poultry. Beijing: China Agriculture Press; 2010 [Google Scholar]
- 13.Xu TS, Gu LH, Sun Y, Zhang XH, Ye BG, Liu XL, et al. Characterization of MUSTN1 gene and its relationship with skeletal muscle development at postnatal stages in Pekin ducks. Genet Mol Res. 2015;14:4448–4460. 10.4238/2015.May.4.2 [DOI] [PubMed] [Google Scholar]
- 14.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.De Summa S, Malerba G, Pinto R, Mori A, Mijatovic V, Tommasi S. GATK hard filtering: tunable parameters to improve variant calling for next generation sequencing targeted gene panel data. BMC Bioinformatics. 2017;18:119 10.1186/s12859-017-1537-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–498. 10.1038/ng.806 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43:111–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6(2):80–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Plotree D, Plotgram D. PHYLIP-phylogeny inference package (version 3.2). Cladistics. 1989;5:163–166. [Google Scholar]
- 20.Junier T, Zdobnov EM. The Newick utilities: high-throughput phylogenetic tree processing in the UNIX shell. Bioinformatics. 2010;26(13):1669–1670. Epub 2010/05/18. 10.1093/bioinformatics/btq243 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38(8):904–909. 10.1038/ng1847 [DOI] [PubMed] [Google Scholar]
- 22.Huang Y, Li Y, Burt DW, Chen H, Zhang Y, Qian W, et al. The duck genome and transcriptome provide insight into an avian influenza virus reservoir species. Nat Genet. 2013;45:776–783. 10.1038/ng.2657 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association.Bioinformatics.2007;23(2):257–258. 10.1093/bioinformatics/btl567 [DOI] [PubMed] [Google Scholar]
- 24.Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:D480–484. Epub 2007/12/14. 10.1093/nar/gkm882 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Patel RK, Jain M. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS One. 2012;7):e30619 10.1371/journal.pone.0030619 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Qiu Q, Wang L, Wang K, Yang Y, Ma T, Wang Z, et al. Yak whole-genome resequencing reveals domestication signatures and prehistoric population expansions. Nature Communications. 2015;6:10283 10.1038/ncomms10283 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Meyer RS, Choi JY, Sanches M, Plessis A, Flowers JM, Amas J, et al. Domestication history and geographical adaptation inferred from a SNP map of African rice. Nat Genet. 2016;48(9):1083–1088. 10.1038/ng.3633 [DOI] [PubMed] [Google Scholar]
- 28.Hunger S, Di Gaspero G, Mohring S, Bellin D, Schafer-Pregl R, Borchardt DC, et al. Isolation and linkage analysis of expressed disease-resistance gene analogues of sugar beet (Beta vulgaris L.). Genome. 2003;46:70–82. 10.1139/g02-106 [DOI] [PubMed] [Google Scholar]
- 29.Schneider K, Weisshaar B, Borchardt DC, Salamini F. SNP frequency and allelic haplotype structure of Beta vulgaris expressed genes. Mol. Breeding. 2001;8: 63–74. [Google Scholar]
- 30.Rubin CJ, Megens HJ, Martinez Barrio A, Maqbool K, Sayyab S, Schwochow D, et al. Strong signatures of selection in the domestic pig genome. Proc Natl Acad Sci U S A. 2012;109:19529–19536. 10.1073/pnas.1217149109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Gene Ontology C. The Gene Ontology project in 2008. Nucleic Acids Res. 2008;36:D440–444. 10.1093/nar/gkm883 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Barik A, Zhang B, Sohal GS, Xiong WC, Mei L. Crosstalk between Agrin and Wnt signaling pathways in development of vertebrate neuromuscular junction. Dev Neurobiol. 2014;74(8):828–838. 10.1002/dneu.22190 [DOI] [PubMed] [Google Scholar]
- 33.Tsivitse S. Notch and Wnt signaling, physiological stimuli and postnatal myogenesis. Int J Biol Sci. 2010;6(3):268–281. 10.7150/ijbs.6.268 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Elieh Ali Komi D, Grauwet K. Role of Mast Cells in Regulation of T Cell Responses in Experimental and Clinical Settings. Clin Rev Allergy Immunol. 2018;54:432–45. 10.1007/s12016-017-8646-z [DOI] [PubMed] [Google Scholar]
- 35.Cardet JC, Castells MC, Hamilton MJ. Immunology and clinical manifestations of non-clonal mast cell activation syndrome. Curr Allergy Asthma Rep. 2013;13:10–18. 10.1007/s11882-012-0326-8 [DOI] [PMC free article] [PubMed] [Google Scholar]