Abstract
Single-nucleotide polymorphism (SNP) markers and high-density genetic maps are important resources for marker-assisted selection, mapping of quantitative trait loci (QTLs) and genome structure analysis. Although linkage maps in certain catfish species have been obtained, high-density maps remain unavailable in the economically important southern catfish (Silurus meridionalis). Recently developed restriction site-associated DNA (RAD) markers have proved to be a promising tool for SNP detection and genetic map construction. The objective of the present study was to construct a high-density linkage map using SNPs generated by next-generation RAD sequencing in S. meridionalis for future genetic and genomic studies. An F1 population of 100 individuals was obtained by intraspecific crossing of two wild heterozygous individuals. In total, 77 634 putative high-quality bi-allelic SNPs between the parents were discovered by mapping the parents' paired-end RAD reads onto the reference contigs from both parents, of which 54.7% were transitions and 45.3% were transversions (transition/transversion ratio of 1.2). Finally, 26 714 high-quality RAD markers were grouped into 29 linkage groups by using de novo clustering methods (Stacks). Among these markers, 4514 were linked to the female genetic map, 23 718 to the male map and 6715 effective loci were linked to the integrated map spanning 5918.31 centimorgans (cM), with an average marker interval of 0.89 cM. High-resolution genetic maps are a useful tool for both marker-assisted breeding and various genome investigations in catfish, such as sequence assembly, gene localization, QTL detection and genome structure comparison. Hence, such a high-density linkage map will serve as a valuable resource for comparative genomics and fine-scale QTL mapping in catfish species.
Keywords: catfish, genetic map, single-nucleotide polymorphism, restriction site-associated DNA sequencing
1. Introduction
Discovering the genes or chromosomal regions that control morphological, physiological and behavioural characteristics is critical for understanding adaptive evolution and the evolutionary responses of natural populations. To identify such genes, a fine linkage map, which is an ordered listing of genetic markers located along the chromosomes in the genome, is needed. The linkage map is obtained by calculating the recombination frequency between genetic markers. It shows the relative position between genetic markers rather than the physical location of the genes on the chromosome. The phenomenon of genetic linkage underpins the genetic map. The construction of the linkage map provides an essential basis for identifying chromosomal regions and quantitative trait loci (QTLs) by genetic linkage analysis [1]. Linkage maps also serve as a link between genomic information of model organisms and that of related non-model organisms through the comparison of their genomes to facilitate the discovery of candidate genes of non-model organisms [2–4].
Genetic markers form the core of modern genetics and are basic tools to study adaptive evolution, ecological genetics and population genetics. After the introduction of the restriction fragment length polymorphism (RFLP) technique [5], approaches using other molecular markers, such as random amplified polymorphic DNA (RAPD), amplification fragment length polymorphism (AFLP), simple sequence repeats (SSRs) and single-nucleotide polymorphisms (SNPs), have also been developed. In particular, SNPs have a greater advantage over the other marker types because single base-pair changes between individuals [6] are the source of the most abundant small-scale genetic variation. SNPs have become an increasingly important tool for molecular genetic analysis, such as linkage map construction in fishes (e.g. [7,8]) and other species (e.g. [9,10]). Because SNPs are widely used in genetic and genomic studies, particularly to identify and tag economically important traits, a method that can uncover hundreds of polymorphic markers spanning the entire genome in a single, simple and reliable experiment is of paramount importance.
The emergence of massively parallel next-generation sequencing (NGS) enables identification of a large number of genetic markers, construction of high-resolution linkage maps, and comparative genomics studies in both model and non-model species. Several methods have been developed for high-throughput genetic marker discovery and genotyping using restriction enzymes, including restriction site-associated DNA sequencing (RAD-seq) [11]. This technique allows exploration of SNPs adjacent to restriction enzyme sites by sequencing only the DNA-flanking sequences across the whole genome, with simultaneous genotyping of markers in multiple individuals [11–14]. RAD-seq has been successfully applied to various organisms for linkage map construction [15,16], population genetics [17], QTL analysis [16,18], linkage disequilibrium analysis [19], comparative genomics [20–22] and genome assembly [23].
Animal breeders and geneticists are continuously improving the methods of high-resolution linkage map construction to make them more rapid and cost-effective. A high-resolution linkage map is an excellent tool for the assembly of genome sequences, comparative analysis of synteny, dissection of complex genetic traits and molecular marker-assisted breeding. To date, researchers have constructed linkage maps in over 45 fish species [24]. However, the density of these maps constructed in fish, e.g. in bighead carp (Aristichthys nobilis) [25], bighead catfish (Clarias macrocephalus) [26], mirror carp (Cyprinus carpio carpio) [27], common carp (Cyprinus carpio) [28] or channel catfish (Ictalurus punctatus) [29,30], is still low. With the development of NGS, it has become easier to construct high-density SNP-based linkage maps for various economically important fish species, such as channel catfish (Ictalurus punctatus) [8,31], large yellow croaker (Larimichthys crocea) [32], rainbow trout (Oncorhynchus mykiss) [33], Nile tilapia (Oreochromis niloticus) [34], Japanese flounder (Paralichthys olivaceus) [35], Atlantic salmon (Salmo salar) [36] and yellowtail (Seriola quinqueradiata) [37]. For example, by using the linkage map of farmed Atlantic salmon (Salmo salar) with 6712 SNPs, 50 QTL-linked SNPs were found to be associated with the resistance to infectious pancreatic necrosis [38]. Thus, such high-density genetic maps have shown significant advantages in fine QTL mapping, comparative genomics and assembly of genome sequences.
Southern catfish (Silurus meridionalis), also known as Chinese large-mouth catfish, is the largest catfish in China. It is also one of the most widely cultivated catfish species for food consumption in China. Although it is an economically important fish, its genetic resources are very limited. To facilitate genetic and genomic studies in S. meridionalis, the development of its high-resolution genetic map is imperative. The aim of the present study was to construct a high-resolution genetic linkage map for S. meridionalis to facilitate QTL analysis, comparative genomics, genome assembly and selective breeding of this economically important fish species.
2. Material and methods
We declare that the sample locations were public, and no specific permission was required to access these locations. The studied fish species was not protected under any wildlife conservation programmes. Euthanasia methods followed the guidelines of the Journal of the American Veterinary Medical Association (AVMA guidelines) [39]. All experiments, especially the use of ethanol for euthanasia, were conducted in accordance with the regulations provided by the guide for care and use of laboratory animals and were approved by the Committee of Laboratory Animal Experimentation of the Southwest University (Chongqing, China).
2.1. Mapping population and DNA extraction
The full-sib F1 population consisted of 200 progeny from an intraspecific cross between two unrelated wild fish caught from the upper Yangtze River in 2012. The resource family was reared in 1000 litre tanks until the collection of tissue samples for genotyping. In total, 100 progeny were randomly sampled for genetic map construction. Fresh muscle and both pelvic and pectoral fins from each individual were removed and preserved in 95% ethanol at −20°C before DNA extraction. The remaining tissues of each fish were stored in 95% ethanol at room temperature and labelled with a unique voucher number. All of the DNA samples were extracted from small pieces of fins using a DNeasy Blood and Tissue kit (Qiagen, Shanghai, China). The residual RNA was removed from the genomic DNA (gDNA) by RNase treatment. The quality and quantity of gDNA were determined using a NanoDrop 2000 UV-vis spectrophotometer (Thermo Fisher Scientific Inc., Wilmington, DE, USA). The integrity of the DNA in every sample was evaluated by agarose gel electrophoresis.
2.2. Library construction and RAD sequencing
Approximately 0.3–0.5 µg of genomic DNA was digested in a 50 µl reaction volume for 60 min at 37°C with 20 units of EcoRI (New England Biolabs, Ipswich, MA, USA) that recognizes the 5′-G/AATTC3-3′ sequence. The RAD sequencing protocols applied were as described by Baird et al. [11], except for several modifications. To track different samples, we used a modified Illumina P1 adapter containing individual-specific nucleotide barcodes with a length of 4–8 bp. At least two nucleotides differed in all barcodes to minimize sample misassignment due to sequencing error. The adapter-ligated fragments included in different samples within each library were subsequently pooled and randomly sheared with a Covaris E210 device (Covaris, Woburn, MA, USA) to an average size of 450 bp. They were then size-selected by electrophoresis on a 1% agarose (Sigma, Shanghai, China) 0.5× TBE gel from 350 bp to 550 bp and isolated using a MinElute Gel Extraction Kit (Qiagen, Shanghai, China). A modified Illumina P2 adapter was ligated, following which double-stranded DNA ends were treated with end-blunting enzymes, and 3′-adenine overhangs were added. Finally, libraries were enriched by PCR amplification, and the PCR products were sequenced on an Illumina HiSeq 2000 using paired-end reads (90 bp).
2.3. RAD-seq data processing
The raw reads were processed by deleting the reads with adapter sequences and removing the reads containing more than 50% of low-quality bases (quality value, ≤ 5). Then, all reads were assigned to the individuals by unambiguous barcodes with one mismatch. Reads without the unique barcodes or specific sequence were discarded. Finally, 1548.19 million clean reads were further trimmed to the RAD tags with a uniform length of 82 nucleotides (nt). Each RAD tag comprised 5 nt of the EcoRI recognition site and 77 nt of the potentially variable sequence. To obtain the SNPs, Rainbow 2.02 [40] was used to assemble RAD paired-end data for the male, female and both parents (the lengths of the first and second reads were 82 bp and 90 bp, respectively). Default parameters were used except for the minimum number of reads used to generate a contig, which we chose to be 10. Contigs shorter than 200 bp from both parents were removed. SOAP2.02 [41] was used to map the parents' paired-end RAD reads onto the reference contigs. SNPs were detected by SOAPsnp [42] to calculate the likelihood of genotypes for each individual. SNPs with a sequencing depth above 200 were removed as RAD sequences that were too abundant and probably represented repetitive sequences in the genome [43]. The genotypes that differed between the two parents were treated as potential high-quality SNP markers if the following criteria were satisfied: sequencing depth of 5–200 and base quality of ≥25. For the linkage genetic map construction, Stacks version 0.9998 pipeline [44] was used, and the RAD data showed a substantial increase in the total number of parents' putative SNPs at the tails of the sequences (the last eight nucleotides from positions 75 to 82) (electronic supplementary material, additional file 1), suggestive of sequencing errors, as described in earlier reports [45,46]. These were consequently removed from the analyses. For subsequent analyses, the final read length was trimmed to 74 nucleotides, and only the first (left) paired read was used.
DNA fragments created by RAD-tag library preparation have a restriction site at one end and are randomly sheared at the other end, so that the second paired-end reads are less suitable for SNP calling because of lower coverage than that of the first reads [43]. The clean reads were then used to assemble the RAD sequences into loci. Alleles were identified, and SNPs were detected using Stacks version 0.9998. First, the RAD tags were clustered into exactly matching stacks (ustacks). A minimum stack depth of five was required to create a stack in parents and progeny; a maximum sequence mismatch of two was allowed between stacks to merge into a locus within an individual. Second, a catalogue of the possible parents' loci was created and three mismatches were allowed between male and female tags when generating the catalogue. Third, each individual was matched against the catalogue. Finally, at the genotyping stage, a minimum stack depth of 10 reads was used, which was the number of exactly matching reads that had to be found to create a stack in an individual while correcting for the heterozygous alleles neglected due to their low coverage depth.
2.4. Linkage map construction
Only loci that were polymorphic within one or both parents could be mapped according to the double pseudo-test cross strategy [47], and markers showing significantly distorted segregation (p-value < 0.01) were excluded from map construction. Linkage analysis was performed for at least 80% of the markers by using Lep-MAP [48]. The genotype data were first filtered manually to remove obvious Mendelian errors from the offspring. Linkage groups (LGs) were identified with a logarithm of odds score limit of 10. The marker order was obtained using the OrderMarkers module of Lep-MAP [48]. The eventual linkage maps were drawn using MapChart v. 2.2 [49].
2.5. Sequence comparison
Consensus sequences of RAD-tag markers (75 bases in length) were aligned with chromosome sequences of channel catfish (Ictalurus punctatus). The channel catfish genome sequences were retrieved from ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCA_001660625.1_IpCoco_1.2/GCA_001660625.1_IpCoco_1.2_assembly_structure/Primary_Assembly/assembled_chromosomes/FASTA/, and compared by the method previously described by Kakioka et al. [50]. In brief, The RAD-tag markers were mapped to the channel catfish genome sequences using the BLAST software suite (NCBI BLAST+ version 2.2.26) with an e-value cut-off of 10−10. In cases where the search of a query sequence hit two or more loci, the hit with the smallest e-value was considered significant, whereas if the difference in e-values between the first and the second smallest hits was not greater than 103, the hit was considered insignificant [50].
3. Results and discussion
3.1. RAD sequencing and SNP discovery
Taking advantage of massively parallel sequencers, we constructed and characterized a high-density SNP linkage map of southern catfish derived from RAD-seq. The genomic DNA of each sample was digested using the restriction enzyme EcoRI. Nine RAD libraries from the F1 population (two parents and their 100 progeny) were constructed and sequenced.
Approximately 145.02 G bases (approx. 1611 M reads) were generated (raw data) for all pooling lanes. After quality filtering, 141.99 G bases were generated (clean data) and retained. All libraries were split into different samples by barcoding, yielding 139.34 G (approx. 1548 M reads) bases for the F1 population. As shown in the electronic supplementary material, additional file 2, 16.99 M reads and 15.14 M reads, on average, were produced for parents and progeny, respectively. The Q20 value (Q ≥ 20) of each sample was above 98%; therefore, the quality of the data was very high. According to Zhao et al. [46], the parents' RAD paired-end reads can be useful for assembly into consensus contig sequences and discovery of SNPs. Based on the methods of locally assembling RAD tags, some software packages have been developed to identify SNPs and define putative haplotypes in populations [14,40,51]. Here, the two parents' RAD data were assembled into 241 007 contigs with a mean length of 246 bp (N90 = 246 bp) (table 1) using the software Rainbow 2.0 [40], which suggests that most of the SNPs can be used as candidate markers for marker-assisted selection or breeding. The distribution of contig lengths, as shown in the electronic supplementary material, additional file 3, had two peaks. The total contig lengths for male, female and both were 68.89 Mb, 60.15 Mb and 77.83 Mb, respectively, which is approximately 10% of the southern catfish genome (the total size of the catfish genome is 780 Mb according to k-mer analysis; unpublished data).
Table 1.
RAD paired-end contig assembly.
| male |
female |
male and female |
||||
|---|---|---|---|---|---|---|
| contig length (bp) | number | contig length (bp) | number | contig length (bp) | number | |
| N50 | 278 | 220 045 | 267 | 200 185 | 286 | 125 758 |
| N60 | 271 | 192 354 | 260 | 174 730 | 280 | 153 274 |
| N70 | 264 | 165 855 | 252 | 150 438 | 272 | 181 454 |
| N80 | 255 | 140 143 | 243 | 126 953 | 263 | 210 510 |
| N90 | 241 | 115 049 | 229 | 104 113 | 246 | 241 007 |
| total_length | 68 892 708 | 60 150 047 | 77 828 008 | |||
| maximum length | 699 | 1136 | 782 | |||
| number ≥ 200 bp | 250 708 | 227 972 | 275 278 | |||
| GC_rate | 0.382 | 0.381 | 0.382 | |||
Between the parents, 77 634 high-quality SNPs with a sequencing depth of ≥5× and base quality of ≥25 were detected (electronic supplementary material, additional file 4). Nucleotide variations of 77 634 SNPs in southern catfish were bi-allelic, of which 54.7% were transitions and 45.3% were transversions (figure 1). Thus, the rate of transitions was higher (transition/transversion ratio of 1.20). In brief, 42 447 transitions (A/G and C/T) and 35 187 transversions (A/C, A/T, C/G and G/T) were detected among polymorphic SNPs, with A/G being the most common (50.3%) and C/G being the least common (13.6%) substitutions observed, which was similar to the findings reported in other vertebrate species [52].
Figure 1.
Transitions and transversions within 77 634 bi-allelic single-nucleotide polymorphisms (SNPs) detected among southern catfish parents.
3.2. Stacks analysis in F1 population
The Stacks software is a useful analysis tool set for population genomics [44,46,53]. More than 6.16 million RAD-tag sequences were generated in each of the two parents using the pipeline of Stacks version 0.9998. These sequences were clustered and counted, which yielded more than 270 000 unique stacks in each parent (electronic supplementary material, additional file 5). A set of 304 782 stacks and 247 310 shared stacks contained both monomorphic and polymorphic RAD tags. The 304 782 stacks were the total stacks of the combination of male and female unique stacks (merged the shared stacks). In total, 62 547 and 24 880 putative SNPs were found in the female and male parent, respectively, and 56 987 RAD markers between the parents were screened (table 2). With respect to the RAD-tag sequences, the heterozygosity rate in Pfemale (maternal parent) (0.308%) was much higher than that in Pmale (paternal parent) (0.121%; table 2).
Table 2.
RAD-tag and SNP discovery in the parents. Note: Filtered RAD tags are the number of the reads with the specific recognition site (AATTC). The unique stacks are the number of clustered RAD-tag sequences found in the data from each parent using ustacks with two mismatches. The shared stacks are the number of unique stacks shared between the two parents including both polymorphic and monomorphic loci. Putative SNPs are the stacks containing loci in each parent. The heterozygosity rate is calculated as the ratio of the number of SNPs to the total length of the unique stacks in each parent. The RAD markers are the different loci between the two parents by matching the loci from each parent against the catalogue with a 10-fold minimum stack depth.
| parent | filtered RAD tags | unique stacks | shared stacks | putative SNPs | heterozygosity rate | RAD markers |
|---|---|---|---|---|---|---|
| Pfemale | 6 162 903 | 274 293 | 247 310 | 62 547 | 0.308% | 56 987 |
| Pmale | 9 958 522 | 277 799 | 247 310 | 24 880 | 0.121% | 56 987 |
3.3. Linkage map construction
To date, there have been several attempts to construct catfish genetic maps, but most of them were in channel catfish (Ictalurus punctatus) [8,29,30]. No high-density linkage maps for any silurid catfish species have been obtained. In the present study, a high-resolution RAD-based SNP genetic map of S. meridionalis was constructed using Lep-MAP for the first time. The integrated linkage map consisted of 29 LGs comprising 26 714 SNPs (table 3), which corresponded to 6715 effective loci. The effective loci have different genetic distances, i.e. SNPs that have the same genetic distances represent only one effective locus. The number of LGs was congruent with the number of chromosomes of both southern catfish (2n = 58) [54,55] and Silurus asotus (2n = 58) [56], a species belonging to the same genus as southern catfish. The total integrated map length was 5918.31 cM, with a mean distance between markers of 0.89 cM. Moreover, the maximal interval between genetic RAD loci was 49.27 cM (LG6), whereas the minimal one was 0.01 cM. The genetic length of each LG ranged from 62.59 cM (LG29) to 445.84 cM (LG4), with an average interval between loci of 0.44–1.77 cM. Moreover, the number of markers mapped on an LG varied from 125 (LG29) to 331 (LG3) per group. LG24 was the densest, having 235 effective loci with an average density of 0.44 cM, whereas LG29 had the lowest number of effective loci (only 125). On average, each LG contained 231.6 effective loci spanning 204.08 cM (table 3). Locus names and SNP positions on the 29 LGs of the genetic map are listed in the electronic supplementary material, additional file 6.
Table 3.
A summary of the genetic linkage map constructed in southern catfish (Silurus meridionalis).
| maternal map |
paternal map |
integrated_map |
||||||
|---|---|---|---|---|---|---|---|---|
| group | number of marker | genetic distance (cM) | number of marker | genetic distance (cM) | number of marker | genetic distance (cM) | no. of effective loci | average interlocus distance (cM) |
| LG1 | 1269 | 253.80 | 243 | 218.97 | 1424 | 303.83 | 287 | 1.06 |
| LG2 | 1144 | 365.98 | 209 | 167.24 | 1276 | 389.37 | 289 | 1.35 |
| LG3 | 1121 | 285.71 | 227 | 462.22 | 1264 | 337.69 | 331 | 1.02 |
| LG4 | 1049 | 521.91 | 208 | 367.79 | 1195 | 445.84 | 288 | 1.55 |
| LG5 | 1031 | 182.77 | 172 | 268.29 | 1149 | 252.78 | 343 | 0.74 |
| LG6 | 1027 | 593.78 | 199 | 294.68 | 1147 | 417.85 | 236 | 1.78 |
| LG7 | 971 | 257.45 | 173 | 218.48 | 1103 | 283.41 | 245 | 1.16 |
| LG8 | 949 | 141.25 | 185 | 155.80 | 1059 | 179.72 | 275 | 0.66 |
| LG9 | 894 | 163.47 | 156 | 148.03 | 991 | 148.89 | 195 | 0.77 |
| LG10 | 901 | 106.97 | 137 | 102.95 | 987 | 105.11 | 208 | 0.51 |
| LG11 | 868 | 177.06 | 133 | 180.61 | 965 | 191.38 | 242 | 0.79 |
| LG12 | 858 | 110.41 | 160 | 131.55 | 961 | 134.48 | 239 | 0.57 |
| LG13 | 819 | 145.58 | 180 | 282.80 | 942 | 233.04 | 271 | 0.86 |
| LG14 | 836 | 120.21 | 157 | 252.61 | 942 | 221.64 | 206 | 1.08 |
| LG15 | 825 | 115.72 | 153 | 113.76 | 931 | 172.90 | 222 | 0.78 |
| LG16 | 806 | 113.77 | 151 | 142.95 | 911 | 144.19 | 246 | 0.59 |
| LG17 | 765 | 202.95 | 163 | 130.17 | 890 | 244.45 | 258 | 0.95 |
| LG18 | 793 | 234.10 | 150 | 200.00 | 890 | 217.55 | 176 | 1.24 |
| LG19 | 704 | 269.80 | 136 | 157.02 | 786 | 204.76 | 217 | 0.95 |
| LG20 | 675 | 91.81 | 140 | 144.38 | 771 | 131.38 | 262 | 0.50 |
| LG21 | 680 | 236.81 | 127 | 161.83 | 770 | 236.22 | 190 | 1.25 |
| LG22 | 682 | 88.44 | 126 | 94.95 | 762 | 90.38 | 195 | 0.47 |
| LG23 | 644 | 242.23 | 149 | 237.80 | 743 | 249.95 | 233 | 1.08 |
| LG24 | 645 | 77.99 | 129 | 98.10 | 738 | 102.49 | 235 | 0.44 |
| LG25 | 613 | 84.70 | 131 | 86.30 | 697 | 108.96 | 179 | 0.61 |
| LG26 | 599 | 86.31 | 131 | 100.78 | 686 | 111.20 | 171 | 0.65 |
| LG27 | 592 | 99.87 | 129 | 95.62 | 669 | 117.13 | 195 | 0.60 |
| LG28 | 550 | 68.39 | 87 | 71.26 | 603 | 79.13 | 156 | 0.51 |
| LG29 | 418 | 63.15 | 73 | 58.45 | 462 | 62.59 | 125 | 0.50 |
| total: | 23 728 | 5502.39 | 4514 | 5145.39 | 26 714 | 5918.31 | 6715 | 0.89 |
Currently, most catfish genetic maps are constructed using expressed sequence tag (EST) [57], AFLP [26,29,58] and SSR [30,57,58]. The density of these linkage maps in catfish is still low, with an average density of 1.40–15.20 cM. The average density of the southern catfish linkage map in this study was much higher than those described above. Furthermore, a fine linkage map is an essential tool for genomics. A high-density linkage map, such as the one we constructed, provides detailed information on genomic structure and is a valuable resource for comparative genomics and fine-scale QTL mapping in catfishes.
3.4. Analysis of recombination rates
Recombination refers to the occurrence of gene combinations in the progeny that differ from those of the parents as a result of independent assortment and crossing over. It plays a central role in genetics. Within each LG (figure 2 and electronic supplementary material, additional file 7), mild-to-strong localized, specific recombination patterns were observed. Strongly labelled regions may be regarded as recombination hotspots, whereas weakly labelled regions may indicate no recombination or missing markers. Large intervals were often found in some LGs, possibly because no recombination events occurred during meiosis between the markers used for mapping analysis. In our study, large intervals were found in LG6, indicating that some regions of the southern catfish genome had extremely low recombination rates. This suggests the presence of a centromere or repetitive region in this LG, given the higher recombination rate near the telomeres and lower recombination rate near the centre of the chromosomes [59]. It has, however, also been observed that genetic recombination rates might be artificially inflated by genotyping errors or if markers with large amounts of missing data are included in the analysis (e.g. [60]).
Figure 2.
The linkage map of southern catfish (Silurus meridionalis). Within each linkage group, red, blue and black lines, respectively, represent maternal heterozygous SNPs, paternal heterozygous SNPs and SNPs heterozygous in both parents. Genetic map details are given in the electronic supplementary material, additional file 7.
3.5. Syntenic relationship between southern catfish and channel catfish
BLAST searches of the mapped southern catfish RAD-tag marker consensus sequences against the genome sequences of channel catfish indicated a syntenic relationship between them. Generally, the organization of the southern catfish genome is similar to that of channel catfish such that synteny can be identified. Of the hits against the channel catfish genome, most were aligned to the chromosomes that had a one-to-one relationship with southern catfish linkage groups. Karyotypes of southern catfish and channel catfish both have 29 chromosomes (2n = 58) [31,55], 27 (93.10%) of which have a simple one-to-one relationship. For example, nearly all of the southern catfish RAD-tag marker consensus sequences in southern catfish LG2 were in channel catfish chromosome (chr) 6. This suggests highly conserved synteny between southern catfish and channel catfish (figure 3).
Figure 3.
Comparison of 29 linkage groups of southern catfish with the genome of channel catfish (Ictalurus punctatus; 29 chromosomes).
Most of the RAD loci located in southern catfish LGs had a good syntenic relationship with a single channel catfish chromosome, and most of the LGs had one-to-one relationships with channel catfish chromosomes, except that LG1 corresponded to chr11 and chr12, whereas chr12 shared loci with LG1 and LG29 (figure 3). This suggested that southern catfish and channel catfish have high chromosomal collinearity, as they belong to the same order Siluriformes. The result is consistent with previous comparative analyses, which indicated that inter-chromosomal rearrangements are less frequent in teleost fishes than in mammals [61–65]. Although synteny was observed, chromosome fusion and fission are not rare. It is likely that extensive inter-chromosomal rearrangements have not occurred in either of the lineages that led to southern catfish and channel catfish since they diverged. However, our results also revealed one channel catfish chromosome (chr12) that shared loci with two southern catfish linkage groups (LG1 and LG29), whereas LG1 shared loci with chr11 and chr12. This could have resulted from a chromosome recombination or transposition followed by fusion/fission events [66].
Our results set the foundation for future analysis of the genetic mechanisms and evolutionary history that underlie the phenotypic diversity of catfishes, which are globally distributed and inhabit rivers, lakes, ponds, bog, paddy fields, channels and even underground water. Given these complex and different habitats, catfishes may be highly adaptable. However, evolutionary ecological studies of catfishes are rare. Comparative analyses can be used to investigate catfish genomes. Such studies may be used to predict candidate genes responsible for the traits that have ecological and evolutionary significance and are related to phenotypic divergence.
4. Conclusion
In the present study, we constructed a highly dense genetic linkage map of southern catfish and applied RAD sequencing in silurid catfish for the first time. The extensive synteny between southern catfish and channel catfish has been demonstrated, and differences between these species have also been identified. Our results establish the basis for the dissection of complex genetic traits and molecular marker-assisted breeding in catfish species. This map provides detailed genome sequence data for future gene-finding and genomic studies in this economically important catfish species. The map will also be a valuable resource for improving and validating silurid catfish whole-genome sequencing and assembly, which are currently underway.
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Acknowledgements
We thank Yanping Li, Deyong Pu and Wenjing Tao for the sample collections.
Ethics
All experiments were conducted in accordance with the regulations provided by the guide for care and use of laboratory animals and were approved by the Committee of Laboratory Animal Experimentation at Southwest University (Chongqing, China). The fish under study is not protected under wildlife conservation. Euthanasia methods followed the guidelines of the Journal of the American Veterinary Medical Association (AVMA guidelines).
Data accessibility
The RAD sequencing raw data are available under accession numbers (SRP118891) at the NCBI Sequence Read Archive database (http://www.ncbi.nlm.nih.gov/Traces/sra). Additional results are available as electronic supplementary material, additional files 1–7.
Authors' contributions
Z.P., Y.Z. and F.S. conceived and designed the experiments. M.X. carried out the experiments. M.X., Y.M., F.S. and J.J. performed the data analyses. M.X. and Z.P. wrote the paper. All the authors gave their final approval for publication.
Competing interests
The authors declare no competing interests.
Funding
Z.P. was supported by the Fundamental Research Funds for the Central Universities (grant no. XDJK2015A011).
References
- 1.Ellegren H, Sheldon BC. 2008. Genetic basis of fitness differences in natural populations. Nature 452, 169–175. (doi:10.1038/nature06737) [DOI] [PubMed] [Google Scholar]
- 2.Gross JB, Protas M, Conrad M, Scheid PE, Vidal O, Jeffery WR, Borowsky R, Tabin CJ. 2008. Synteny and candidate gene prediction using an anchored linkage map of Astyanax mexicanus. Proc. Natl Acad. Sci. USA 105, 20 106–20 111. (doi:10.1073/pnas.0806238105) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Sarropoulou E, Nousdili D, Magoulas A, Kotoulas G. 2008. Linking the genomes of nonmodel teleosts through comparative genomics. Mar. Biotechnol. 10, 227–233. (doi:10.1007/s10126-007-9066-5) [DOI] [PubMed] [Google Scholar]
- 4.Sarropoulou E, Fernandes JM. 2011. Comparative genomics in teleost species: knowledge transfer by linking the genomes of model and non-model fish species. Comp. Biochem. Physiol. Part D Genomics Proteomics 6, 92–102. (doi:10.1016/j.cbd.2010.09.003) [DOI] [PubMed] [Google Scholar]
- 5.Grodzicker T, Williams J, Sharp P, Sambrook J. 1974. Physical mapping of temperature-sensitive mutations of adenoviruses. Cold Spring Harb Symp. Quant. Biol. 39, 439–446. (doi:10.1101/SQB.1974.039.01.056) [DOI] [PubMed] [Google Scholar]
- 6.Vignal A, Milan D, SanCristobal M, Eggen A. 2002. A review on SNP and other types of molecular markers and their use in animal genetics. Genet. Sel. Evol. 34, 275–305. (doi:10.1186/1297-9686-34-3-275) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lien S, Gidskehaug L, Moen T, Hayes BJ, Berg PR, Davidson WS, Omholt SW, Kent MP. 2011. A dense SNP-based linkage map for Atlantic salmon (Salmo salar) reveals extended chromosome homeologies and striking differences in sex-specific recombination patterns. BMC Genomics 12, 615 (doi:10.1186/1471-2164-12-615) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Liu S, Li Y, Qin Z, Geng X, Bao L, Kaltenboeck L, Kucuktas H, Dunham R, Liu Z. 2016. High-density interspecific genetic linkage mapping provides insights into genomic incompatibility between channel catfish and blue catfish. Anim. Genet. 47, 81–90. (doi:10.1111/age.12372) [DOI] [PubMed] [Google Scholar]
- 9.Huang X, et al. 2010. Genome-wide association studies of 14 agronomic traits in rice landraces. Nat. Genet. 42, 961–967. (doi:10.1038/ng.695) [DOI] [PubMed] [Google Scholar]
- 10.Xu X, et al. 2013. The genetic basis of white tigers. Curr. Biol. 23, 1031–1035. (doi:10.1016/j.cub.2013.04.054) [DOI] [PubMed] [Google Scholar]
- 11.Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA. 2008. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 3, e3376 (doi:10.1371/journal.pone.0003376) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Davey JW, Blaxter ML. 2010. RADSeq: next-generation population genetics. Brief Funct. Genomics 9, 416–423. (doi:10.1093/bfgp/elq031) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Stapley J, et al. 2010. Adaptation genomics: the next generation. Trends Ecol. Evol. 25, 705–712. (doi:10.1016/j.tree.2010.09.002) [DOI] [PubMed] [Google Scholar]
- 14.Etter PD, Bassham S, Hohenlohe PA, Johnson EA, Cresko WA. 2011. SNP discovery and genotyping for evolutionary genetics using RAD sequencing. Methods Mol. Biol. 772, 157–178. (doi:10.1007/978-1-61779-228-1_9) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Baxter SW, Davey JW, Johnston JS, Shelton AM, Heckel DG, Jiggins CD, Blaxter ML. 2011. Linkage mapping and comparative genomics using next-generation RAD sequencing of a non-model organism. PLoS ONE 6, e19315 (doi:10.1371/journal.pone.0019315) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Pfender WF, Saha MC, Johnson EA, Slabaugh MB. 2011. Mapping with RAD (restriction-site associated DNA) markers to rapidly identify QTL for stem rust resistance in Lolium perenne. Theor. Appl. Genet. 122, 1467–1480. (doi:10.1007/s00122-011-1546-3) [DOI] [PubMed] [Google Scholar]
- 17.Hohenlohe PA, Bassham S, Etter PD, Stiffler N, Johnson EA, Cresko WA. 2010. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet. 6, e1000862 (doi:10.1371/journal.pgen.1000862) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chutimanitsakun Y, Nipper RW, Cuesta-Marcos A, Cistue L, Corey A, Filichkina T, Johnson EA, Hayes PM. 2011. Construction and application for QTL analysis of a restriction site associated DNA (RAD) linkage map in barley. BMC Genomics 12, 4 (doi:10.1186/1471-2164-12-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hohenlohe PA, Bassham S, Currey M, Cresko WA. 2012. Extensive linkage disequilibrium and parallel adaptive divergence across threespine stickleback genomes. Phil. Trans. R. Soc. B 367, 395–408. (doi:10.1098/rstb.2011.0245) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Amores A, Catchen J, Ferrara A, Fontenot Q, Postlethwait JH. 2011. Genome evolution and meiotic maps by massively parallel DNA sequencing: spotted gar, an outgroup for the teleost genome duplication. Genetics 188, 799–808. (doi:10.1534/genetics.111.127324) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Brieuc MS, Waters CD, Seeb JE, Naish KA. 2014. A dense linkage map for Chinook salmon (Oncorhynchus tshawytscha) reveals variable chromosomal divergence after an ancestral whole genome duplication event. G3 4, 447–460. (doi:10.1534/g3.113.009316) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Li X, Wei Y, Acharya A, Jiang Q, Kang J, Brummer EC. 2014. A saturated genetic linkage map of autotetraploid alfalfa (Medicago sativa L.) developed using genotyping-by-sequencing is highly syntenous with the Medicago truncatula genome. G3 4, 1971–1979. (doi:10.1534/g3.114.012245) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dasmahapatra KK, et al. 2012. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species. Nature 487, 94–98. (doi:10.1038/nature11041) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Yue GH. 2014. Recent advances of genome mapping and marker-assisted selection in aquaculture. Fish Fish. 15, 376–396. (doi:10.1111/faf.12020) [Google Scholar]
- 25.Zhu C, et al. 2014. A second-generation genetic linkage map for bighead carp (Aristichthys nobilis) based on microsatellite markers. Anim. Genet. 45, 699–708. (doi:10.1111/age.12194) [DOI] [PubMed] [Google Scholar]
- 26.Poompuang S, Na-Nakorn U. 2004. A preliminary genetic map of walking catfish (Clarias macrocephalus). Aquaculture 232, 195–203. (doi:10.1016/S0044-8486(03)00489-7) [Google Scholar]
- 27.Zhang TQ, Zhang XF, Tan ZJ, Cao Z, Wang XP, Li WS, Chu ZY, Sun XW. 2011. Mapping and genetic effect analysis of quantitative trait loci related to body length in mirror carp (Cyprinus carpio L.). Hereditas 33, 1245–1250. (doi:10.3724/SP.J.1005.2011.01245) [DOI] [PubMed] [Google Scholar]
- 28.Zhang X, et al. 2013. A consensus linkage map provides insights on genome character and evolution in common carp (Cyprinus carpio L.). Mar. Biotechnol. 15, 275–312. (doi:10.1007/s10126-012-9485-9) [DOI] [PubMed] [Google Scholar]
- 29.Liu Z, Karsi A, Li P, Cao D, Dunham R. 2003. An AFLP-based genetic linkage map of channel catfish (Ictalurus punctatus) constructed by using an interspecific hybrid resource family. Genetics 165, 687–694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kucuktas H, et al. 2009. Construction of genetic linkage maps and comparative genome analysis of catfish using gene-associated markers. Genetics 181, 1649–1660. (doi:10.1534/genetics.108.098855) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Li Y, et al. 2015. Construction of a high-density, high-resolution genetic map and its integration with BAC-based physical map in channel catfish. DNA Res. 22, 39–52. (doi:10.1093/dnares/dsu038) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ao J, et al. 2015. Construction of the high-density genetic linkage map and chromosome map of large yellow croaker (Larimichthys crocea). Int. J. Mol. Sci. 16, 26 237–26 248. (doi:10.3390/ijms161125951) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Nichols KM, et al. 2003. A consolidated linkage map for rainbow trout (Oncorhynchus mykiss). Anim. Genet. 34, 102–115. (doi:10.1046/j.1365-2052.2003.00957.x) [DOI] [PubMed] [Google Scholar]
- 34.Palaiokostas C, Bekaert M, Khan MGQ, Taggart JB, Gharbi K, McAndrew BJ, Penman DJ. 2015. A novel sex-determining QTL in Nile tilapia (Oreochromis niloticus). BMC Genomics 16, 171 (doi:10.1186/s12864-015-1383-x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Shao C, et al. 2015. Genome-wide SNP identification for the construction of a high-resolution genetic map of Japanese flounder (Paralichthys olivaceus): applications to QTL mapping of Vibrio anguillarum disease resistance and comparative genomic analysis. DNA Res. 22, 161–170. (doi:10.1093/dnares/dsv001) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gonen S, Lowe NR, Cezard T, Gharbi K, Bishop SC, Houston RD. 2014. Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing. BMC Genomics 15, 166 (doi:10.1186/1471-2164-15-166) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Aoki JY, Kai W, Kawabata Y, Ozaki A, Yoshida K, Koyama T, Sakamoto T, Araki K. 2015. Second generation physical and linkage maps of yellowtail (Seriola quinqueradiata) and comparison of synteny with four model fish. BMC Genomics 16, 406 (doi:10.1186/s12864-015-1600-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Houston RD, et al. 2012. Characterisation of QTL-linked and genome-wide restriction site-associated DNA (RAD) markers in farmed Atlantic salmon. BMC Genomics 13, 244 (doi:10.1186/1471-2164-13-244) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Leary S, Underwood W, Anthony R, Cartner S, Corey D, Grandin T, Greenacre CB.2013. AVMA guidelines for the euthanasia of animals: 2013 edition. See http://works.bepress.com/cheryl_greenacre/14 .
- 40.Chong Z, Ruan J, Wu CI. 2012. Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads. Bioinformatics 28, 2732–2737. (doi:10.1093/bioinformatics/bts482) [DOI] [PubMed] [Google Scholar]
- 41.Li R, Li Y, Kristiansen K, Wang J. 2008. SOAP: short oligonucleotide alignment program. Bioinformatics 24, 713–714. (doi:10.1093/bioinformatics/btn025) [DOI] [PubMed] [Google Scholar]
- 42.Li R, Li Y, Fang X, Yang H, Wang J, Kristiansen K, Wang J. 2009. SNP detection for massively parallel whole-genome resequencing. Genome Res. 19, 1124–1132. (doi:10.1101/gr.088013.108) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Etter PD, Preston JL, Bassham S, Cresko WA, Johnson EA.. 2011. Local de novo assembly of RAD paired-end contigs using short sequencing reads. PLoS ONE 6, e18561 (doi:10.1371/journal.pone.0018561) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. 2013. Stacks: an analysis tool set for population genomics. Mol. Ecol. 22, 3124–3140. (doi:10.1111/mec.12354) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Pujolar JM, et al. 2013. A resource of genome-wide single-nucleotide polymorphisms generated by RAD tag sequencing in the critically endangered European eel. Mol. Ecol. Resour. 13, 706–714. (doi:10.1111/1755-0998.12117) [DOI] [PubMed] [Google Scholar]
- 46.Zhao J, et al. 2014. Rapid SNP discovery and a RAD-based high-density linkage map in jujube (Ziziphus Mill.). PLoS ONE 9, e109850 (doi:10.1371/journal.pone.0109850) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Grattapaglia D, Sederoff R. 1994. Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: mapping strategy and RAPD markers. Genetics 137, 1121–1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Rastas P, Paulin L, Hanski I, Lehtonen R, Auvinen P. 2013. Lep-MAP: fast and accurate linkage map construction for large SNP datasets. Bioinformatics 29, 3128–3134. (doi:10.1093/bioinformatics/btt563) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Voorrips RE. 2002. MapChart: software for the graphical presentation of linkage maps and QTLs. J. Hered. 93, 77–78. (doi:10.1093/jhered/93.1.77) [DOI] [PubMed] [Google Scholar]
- 50.Kakioka R, Kokita T, Kumada H, Watanabe K, Okuda N. 2013. A RAD-based linkage map and comparative genomics in the gudgeons (genus Gnathopogon, Cyprinidae). BMC Genomics 14, 32 (doi:10.1186/1471-2164-14-32) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Willing EM, Hoffmann M, Klein JD, Weigel D, Dreyer C. 2011. Paired-end RAD-seq for de novo assembly and marker design without available reference. Bioinformatics 27, 2187–2193. (doi:10.1093/bioinformatics/btr346) [DOI] [PubMed] [Google Scholar]
- 52.Plyler ZE, Hill AE, McAtee CW, Cui X, Moseley LA, Sorscher EJ. 2015. SNP formation bias in the murine genome provides evidence for parallel evolution. Genome Biol. Evol. 7, 2506–2519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Catchen JM, Amores A, Hohenlohe P, Cresko W, Postlethwait JH. 2011. Stacks: building and genotyping loci de novo from short-read sequences. G3 1, 171–182. (doi:10.1534/g3.111.000240) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Cai ZH. 1991. Studies on the chromosomes G-banding pattern in south catfish. (S. Soldatovi meridionalis Chen). J. Chongqing Teachers College 8, 69–72. [Google Scholar]
- 55.He G, Fang CL, Wang WP, Xu YF, Cheng BJ, Yang XH, Gan JH. et al. 2011. Study on the morphology and genetics of F1 hybrids of Silurus meridionalis and Silurus soldatovi. Hubei Agric. Sci. 23, 4916–4919. [Google Scholar]
- 56.Liu L, Liu Y. 1987. Studies on the chromosome high resolution g-banding patterns in Silurus asotus and Pseudobageus fulvidraco. J. Beijing Normal University 3, 81–87. [Google Scholar]
- 57.Waldbieser GC, Bosworth BG, Nonneman DJ, Wolters WR. 2001. A microsatellite-based genetic linkage map for channel catfish, Ictalurus punctatus. Genetics 158, 727–734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Ninwichian P, et al. 2012. Second-generation genetic linkage map of catfish and its integration with the BAC-based physical map. G3 2, 1233–1241. (doi:10.1534/g3.112.003962) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Nachman MW, Churchill GA. 1996. Heterogeneity in rates of recombination across the mouse genome. Genetics 142, 537–548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ronin Y, Mester D, Minkov D, Korol A. 2010. Building reliable genetic maps: different mapping strategies may result in different maps. Nat. Sci. 2, 576–589. [Google Scholar]
- 61.Jaillon O, et al. 2004. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature 431, 946–957. (doi:10.1038/nature03025) [DOI] [PubMed] [Google Scholar]
- 62.Kai W, et al. 2005. A genetic linkage map for the tiger pufferfish, Takifugu rubripes. Genetics 171, 227–238. (doi:10.1534/genetics.105.042051) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kai W, et al. 2011. Integration of the genetic map and genome assembly of fugu facilitates insights into distinct features of genome evolution in teleosts and mammals. Genome Biol. Evol. 3, 424–442. (doi:10.1093/gbe/evr041) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Kasahara M, et al. 2007. The medaka draft genome and insights into vertebrate genome evolution. Nature 447, 714–719. (doi:10.1038/nature05846) [DOI] [PubMed] [Google Scholar]
- 65.Mouse Genome Sequencing Consortium. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562. (doi:10.1038/nature01262) [DOI] [PubMed] [Google Scholar]
- 66.Amores A, Catchen J, Nanda I, Warren W, Walter R, Schartl M, Postlethwait JH. 2014. A RAD-tag genetic map for the platyfish (Xiphophorus maculatus) reveals mechanisms of karyotype evolution among teleost fish. Genetics 197, 625–641. (doi:10.1534/genetics.114.164293) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The RAD sequencing raw data are available under accession numbers (SRP118891) at the NCBI Sequence Read Archive database (http://www.ncbi.nlm.nih.gov/Traces/sra). Additional results are available as electronic supplementary material, additional files 1–7.



