Abstract
Simple sequence repeats (SSRs) are among the most important markers for population analysis and have been widely used in plant genetic mapping and molecular breeding. Expressed sequence tag-SSR (EST-SSR) markers, located in the coding regions, are potentially more efficient for QTL mapping, gene targeting, and marker-assisted breeding. In this study, we investigated 51,694 nonredundant unigenes, assembled from clean reads from deep transcriptome sequencing with a Solexa/Illumina platform, for identification and development of EST-SSRs in Chinese cabbage. In total, 10,420 EST-SSRs with over 12 bp were identified and characterized, among which 2744 EST-SSRs are new and 2317 are known ones showing polymorphism with previously reported SSRs. A total of 7877 PCR primer pairs for 1561 EST-SSR loci were designed, and primer pairs for twenty-four EST-SSRs were selected for primer evaluation. In nineteen EST-SSR loci (79.2%), amplicons were successfully generated with high quality. Seventeen (89.5%) showed polymorphism in twenty-four cultivars of Chinese cabbage. The polymorphic alleles of each polymorphic locus were sequenced, and the results showed that most polymorphisms were due to variations of SSR repeat motifs. The EST-SSRs identified and characterized in this study have important implications for developing new tools for genetics and molecular breeding in Chinese cabbage.
1. Introduction
Chinese cabbage (Brassica rapa L. ssp. pekinensis) is a diploid (2n = 2x = 20) dicot with a genomic size of 550 Mb (http://www.brassica.info/resource/). It is a subspecies of B. rapa with the A genome [1]. The species originated in China and now has become one of the most important and widely cultivated leaf vegetables in Asia. Chinese cabbage has rosette leaves (RLs) and folding leaves (FLs). The tight leafy head is the main edible part. After a long history of domestication, Chinese cabbage evolves into different cultivars with a variety of characteristics, such as rosette leaf morphology, heading leaf morphology, leafy head shape, size, and structure, flowering time, nutrient composition, and resistance to biotic and abiotic. A better understanding of the molecular mechanism of evolution of Chinese cabbage and further development of marker-assisted selection (MAS) will accelerate the selection process of improved cultivars to meet the growing consumers and environmental needs. Although progress has been made in underlining the molecular mechanism [2–5], many aspects are still unclear.
Molecular markers have been widely used to study the genetic basis of important traits and map regulatory genes in plants. Markers tightly linked with important agronomic traits can potentially be used for molecular breeding to develop improved cultivars. Many molecular markers and genetic maps of Chinese cabbage have been reported previously [6–25]. However, there is still a great need to develop novel molecular markers for construction of high-density linkage maps for genetics and molecular studies of important traits in Chinese cabbage.
Simple sequence repeat (SSR) markers or microsatellite markers are among the most important markers in plants. SSRs have been widely used in genetic mapping and molecular breeding in plants because they are highly abundant and have significant polymorphism. Other factors, like accessibility for detection, reliability, and codominance, also make them perfect markers for such purposes [26]. SSRs found in transcribed sequences are called expressed sequence-simple sequence repeats (EST-SSRs). Compared with genomic-SSRs detected in noncoding sequences, EST-SSRs are more efficient for QTL mapping, gene targeting, and MAS [27]. As transcribed sequences are more conserved than noncoding sequences, the transferability of EST-SSRs is better than genomic-SSRs [28–30], which can be utilized for cross genome comparison and evolutionary analysis [27, 31]. Additionally, abundant ESTs were generated in recent years with the development of next-generation sequencing approaches, making identification of EST-SSRs more practical and cost-efficient [32]. Many EST-SSRs have been identified in Chinese cabbage [16, 20, 25, 33–36]. Because whole genome sequencing of Chinese cabbage is still underway, new EST-SSRs could also be identified for further studies such as high-density genetic linkage map construction, gene/QTL mapping, and cultivar identification.
In our previous study, the whole transcriptomes were analyzed for the rosette leaves and folding leaves of a typical heading Chinese cabbage, namely, FuShanBaoTou, using a Solexa/Illumina RNA-Seq platform, and a large-scale EST database was generated [37]. In this study, we further assembled those ESTs from the RL and FL libraries into nonredundant unigenes. A total of 10,420 EST-SSRs were identified, among which 2744 EST-SSRs are detected for the first time, according to the SSR marker database for Brassica (http://oilcrops.info/SSRdb). We characterized these identified EST-SSRs and designed 7877 PCR primer pairs for 1561 EST-SSRs. Furthermore, serving as a validation purpose, we tested polymorphisms of 24 EST-SSRs. We expect this study can pave the road for further investigation of new EST-SSR markers and for construction of high-density genetic maps.
2. Materials and Methods
2.1. Plant Materials
For EST-SSR identification and primer design, a typical heading Chinese cabbage, namely, FuShanBaoTou, was used in this study. For primer assessment and SSR polymorphism analysis, a panel of twenty-four cultivars of Chinese cabbage was used, including nineteen morphologically diverse cultivars of Brassica rapa L. ssp. pekinensis (B. pekinensis L.) and five Brassica rapa L. chinensis (B. chinensis L.). All plants were grown in a greenhouse with 16/8 photoperiod at 22 ± 2°C. Leaves were collected after they were grown for two weeks from ten seedlings of each cultivar and were pooled together for DNA extraction.
2.2. De Novo Assembly
We assembled the clean read dataset presented by Wang et al. [37] from the RL and FL libraries according to the methods described by Wang et al. [38] using the Trinity software (http://trinityrnaseq.sourceforge.net/). Contigs and unigenes were obtained from these two libraries, respectively. Redundant sequences were removed and overlapping unigenes were assembled into continuous sequences by the TIGR Gene Indices Clustering (TGICL) tools [39]. Similarity was set at 94% and an overlap length was set at 100 bp.
2.3. Identification of EST-Derived SSRs and Primer Design
SSRs were detected with the MicroSAtellite software (MISA; http://pgrc.ipk-gatersleben.de/misa/). Parameters were set with a minimum number of 12, 6, 5, 5, 4, and 4 repeat units for identification of mono-, di-, tri-, tetra-, penta-, and hexanucleotide motifs, respectively. Primers were designed using primer 3 with no SSR allowed in primers. Primer length ranged from 18 to 28 bp (with an optimality at 23). Annealing temperature was set at 55–65°C (with an optimality at 60°C). The size of a PCR product ranged from 80 to 300 bp.
2.4. Mapping EST-SSRs
The physical positions of the EST-SSRs identified in the study were determined by aligning the SSRs and flanking sequences (50 bp at each side) to the Brassica rapa (Chiifu-401) reference genome (http://brassicadb.org/brad/) using BLASTN. New EST-SSRs were identified by comparing with previously reported SSRs in the SSR marker database for Brassica (http://oilcrops.info/SSRdb) [25].
2.5. SSR Amplification and SSR Polymorphism Analysis
DNA was extracted following a CTAB DNA extraction protocol [40]. The DNA sample of the Chinese cabbage FuShanBaoTou was used as template to detect the availability of SSR primers designed above. The DNA samples of those aforementioned twenty-four cultivars of Chinese cabbage were used as templates for SSR polymorphism analysis. The polymorphisms of EST-SSRs were validated by 6% denaturing polyacrylamide gel electrophoresis, 12% nondenaturing polyacrylamide gel electrophoresis, and sequencing.
3. Results
3.1. De Novo Assembly
High quality clean read data from the RL and FL libraries by Wang et al. [37] were assembled using the Trinity software package [41]. A total of 99,684 and 95,411 contigs were obtained, with an average length of 333 and 342 bp and a median length (N50) of 531 and 536 bp, from the RL and FL libraries, respectively (Table 1).
Table 1.
Sample | Total number | Total length (nt) | Average length (nt) | N50 | Total consensus sequences | Distinct clusters | Distinct singletons | |
---|---|---|---|---|---|---|---|---|
Contig | RL | 99,684 | 33,205,708 | 333 | 531 | — | — | — |
FL | 95,441 | 32,596,297 | 342 | 536 | — | — | — | |
| ||||||||
Unigene | RL | 46,294 | 32,729,586 | 707 | 1000 | 46,294 | 19,512 | 26,782 |
FL | 48,473 | 32,971,187 | 680 | 980 | 48,473 | 19,749 | 28,724 | |
All | 51,694 | 40,724,256 | 788 | 1154 | 51,694 | 23,850 | 27,844 |
Contigs from the same transcript were detected with paired-end reads, as well as the distances between these contigs. Using the Trinity software package, we assembled these contigs into unigenes, in which Ns were removed. These unigenes were set to be not extendable on either end of the sequences. A total of 46,294 and 48,473 unigenes from the RL and FL libraries were obtained with an average length of 707 and 680 bp and a median length (N50) of 1000 and 980 bp, respectively (Table 1). Size distribution of the contigs and unigenes is consistent with the RL and FL libraries as shown in Figure 1, indicating that our Illumina sequencing solution is reliable and reproducible. Unigenes from the two samples were combined; redundant unigenes were removed; and the rest was assembled with TGICL [39] to form a single dataset, which represents 40.7 Mb of sequence and contains a total of 51,694 nonredundant unigenes, with an average read length of 788 bp, and a median read length (N50) of 1154 bp (Table 1). The sequences of the unigenes are listed in Table s1 (see Supplementary Material available online at http://dx.doi.org/10.1155/2015/473028).
The length of 24,271 nonredundant unigenes (46.95%) is between 200 and 500 bp; the length of 13,613 (26.33%) is between 501 and 1,000 bp, and the length of 13,810 (26.72%) is longer than 1,000 bp (Figure 1).
3.2. Characterization of EST-SSRs in Chinese Cabbage
A total of 10420 EST-SSRs were detected with the MicroSAtellite software (MISA; http://pgrc.ipk-gatersleben.de/misa/) in 8571 unigenes, accounting for 16.6% of total nonredundant unigenes (Tables 2 and s2). The mean SSR density is one per 3.9 Kb, corresponding to one for every 5.0 nonredundant unigenes. 1502 unigenes (17.5%) harbored more than one SSR and 666 SSRs (6.4%) were present in compound formation that had more than one repeat type (Table 2).
Table 2.
Searching items | Numbers |
---|---|
Total number of sequences examined | 51694 |
Total size of examined sequences (bp) | 40724256 |
Total number of identified SSRs | 10420 |
Number of SSR-containing sequences | 8571 |
Number of sequences containing more than one SSR | 1502 |
Number of SSRs present in compound formation | 666 |
% EST-SSRs | 16.6% |
The size of SSR repeat units ranged from one to six. The number of SSRs with each repeat unit was found to be quite different. The SSRs with tri- and dinucleotide repeat motifs were the most common (4,405, 42.27%; 4,043, 38.80%, resp.), followed by mono- (1,644, 15.78%), hexa- (126, 1.21%), penta- (112, 1.07%) and tetra- (90, 0.86%) nucleotide repeat motifs (Figure 2). The most common two repeat motif types accounted for 81.07% of the total SSRs detected, and the rest repeat motifs types only accounted for 18.93%.
The iterate number of repeat units in an EST-SSR ranged from 4 to 25. The occurrence frequency of EST-SSTs with different iterate numbers was found to be unequal either. EST-SSRs with iterate number of 5 (2832, 27.18%) were the most common ones, followed by 6 (2739, 26.29%), 7 (1368, 13.13%), 8 (703, 6.75%), 12 (542, 5.20%), and 9 (480, 4.61%) (Table s3). A dinucleotide containing EST-SSRs with a maximum of 25 repeat units was identified. For EST-SSRs with more than 10 repeat units, the mononucleotide repeat motifs were the most abundant, accounting for 93.46% of these EST-SSRs. The lengths of EST-SSR sequences ranged from 12 to 65 bp (Table s4). The longest one is a pentanucleotide containing EST-SSR with 65 bp in length. The lengths of most EST-SSRs are from 12 to 20 bp, accounting for 91.47% of the total EST-SSRs, followed by EST-SSRs with 21–30 bp in length (874 SSRs, 8.39%). Only 13 EST-SSRs were identified with over 30 bp, accounting for 0.12% of the total EST-SSRs.
A total of 124 EST-SSR motifs were identified, including 2 mono-, 3 di-, 10 tri-, 13 tetra-, 33 penta-, and 63 hexanucleotide repeat units containing EST-SSRs. The dominant motif identified in our EST-SSRs was AG/CT (3,519, 33.8%), followed by A/T (1,562, 15.0%), AAG/CTT (1,445, 13.9%), AGG/CCT (776, 7.4%), ATC/ATG (627, 6.0%), AAC/GTT (392, 4.4%), ACC/GGT (392, 3.8%), AC/GT (349, 3.3%), and AGC/CTG (317, 3.0%) (Figure 3). The other 115 motifs have low frequency, accounting only for 9.3% of total EST-SSRs.
Physical locations of the EST-SSRs were assigned by searching against the nonredundant (nr) protein database of NCBI (http://www.ncbi.nlm.nih.gov/) and the Brassica database (http://brassicadb.org/brad/) using BLASTX. Our results showed that 4329 EST-SSRs (44.4%) were located in coding regions (CDSs), 3456 (35.5%) in 5′-UTRs, and 1297 (13.3%) in 3′-UTRs (Figure 4, Table s4). Locations of the remaining 672 EST-SSRs (6.9%) were not successfully assigned (Figure 4, Table s4). For the EST-SSRs localized in the CDS region, trinucleotide repeats were the most common ones, accounting for 62.72% of the total EST-SSRs localized in this region, followed by dinucleotide repeats (897, 20.72%), mononucleotide repeats (325, 7.51), and compound formation (287, 6.63%) (Table s4). Dinucleotide repeats (1909, 55.24%) were the dominant types in 5′-UTRs, followed by trinucleotide repeats (730, 21.12%), mononucleotide repeats (483, 13.98%), and compound formation ones (214, 6.19%) (Table s4). Mono-, di-, and trinucleotide repeat EST-SSRs were the top three types found in 3′-UTRs, accounting for 35.08%, 30.07%, and 28.60% of the total EST-SSRs localized in these regions, respectively.
3.3. New EST-SSRs Identification
The EST-SSRs and the flanking sequences (50 bp on each side) were aligned to the Brassica rapa (Chiifu-401) reference genome (http://brassicadb.org/brad/) using BLASTN to determine their physical positions. New EST-SSRs were identified by comparing with the earlier reported SSRs in the SSR marker database for Brassica (http://oilcrops.info/SSRdb). A total of 2744 new EST-SSRs (26.3%) were identified in the study. Of the 7676 known SSRs (73.6%), 2317 EST-SSRs (22.2%) show polymorphism with different repeat numbers, and 5359 (51.4%) were exactly the same with the earlier reported SSRs based on the Brassica rapa (Chiifu-401) genomic sequence [25] (Table s2).
3.4. Primer Design and Evaluation of EST-SSRs in Chinese Cabbage
A total of 7877 PCR primer pairs from the unique sequences flanking 1561 EST-SSR loci were designed according to the criteria described in Section 2 using primer 3 (Table s5). For each EST-SSR locus, a maximum of 5 alternative primer pairs was designed. The other 8859 EST-SSRs, which had no appropriate PCR primer pairs designed as their flanking sequences, did not fulfill the primer design criteria mentioned above. For the 1561 EST-SSRs with PCR primers designed, PCR primers of those aforementioned 24 loci with n ≥ 20 bp were selected for primer synthesis and amplification evaluation in Chinese cabbage FuShanBaoTou. Nineteen (79.2%) of these 24 EST-SSR loci successfully yielded PCR amplicons in FuShanBaoTou. We sequenced these nineteen PCR amplicons and found that the amplicons in thirteen loci were exactly the same as expected; two were longer than the expected size, and four were shorter (Table 3). Size deviation of five EST-SSRs loci with the expected sizes (BR-es6, BR-es7, BR-es8, BR-es12, and BR-es18) was due to the variations of SSR repeat motifs (Table s6). One amplicon (BR-es16) deviated from the expected sizes and had an additional 86 bp containing a (TC)9 motif near the SSR repeat motif region (Table s6).
Table 3.
Code | EST-SSR name | Motif | Product size expected (bp) | Product size validated (bp) | SSR location | Number of alleles |
---|---|---|---|---|---|---|
BR-es1 | CL3455.Contig1_All-2 | (TC)11 | 97 | 97 | CDS | 2 |
BR-es2 | CL4114.Contig2_All-2 | (TCA)7 | 157 | 157 | 3-UTR | 3 |
BR-es3 | CL2525.Contig4_All-1 | (TAG)9 | 160 | 160 | CDS | 4 |
BR-es4 | Unigene10387_All-1 | (CTC)9 | 127 | 127 | CDS | 2 |
BR-es5 | CL7077.Contig2_All-1 | (AATC)5 | 153 | 153 | 3-UTR | 3 |
BR-es6 | Unigene16359_All-1 | (AACC)5 | 134 | 130 | 5-UTR | 1 |
BR-es7 | CL4685.Contig1_All-1 | (CCTT)6 | 160 | 148 | 3-UTR | 2 |
BR-es8 | CL5247.Contig3_All-1 | (TTTC)6 | 133 | 141 | CDS | 3 |
BR-es9 | CL3462.Contig4_All-1 | (AATCG)4 | 155 | 155 | CDS | 2 |
BR-es10 | CL5726.Contig2_All-2 | (TCTCT)4 | 146 | 146 | 5-UTR | 3 |
BR-es11 | Unigene6713_All-1 | (AAAAC)4 | 118 | 118 | CDS | 1 |
BR-es12 | CL7282.Contig2_All-1 | (GAGGA)5 | 140 | 117 | CDS | 4 |
BR-es13 | Unigene2970_All-1 | (GAACT)5 | 106 | 106 | 3-UTR | 3 |
BR-es14 | Unigene8739_All-1 | (GATTT)5 | 130 | 130 | 5-UTR | 4 |
BR-es15 | CL5873.Contig4_All-2 | (CCCTAA)4 | 146 | 146 | 3-UTR | 4 |
BR-es16 | Unigene14449_All-1 | (CTCAAG)5 | 99 | 185 | CDS | 6 |
BR-es17 | Unigene5096_All-1 | (ACTCCC)5 | 141 | 141 | CDS | 3 |
BR-es18 | CL4691.Contig2_All-1 | (GATGGT)7 | 155 | 117 | CDS | 6 |
BR-es19 | Unigene13507_All-1 | (ATTTG)4 | 152 | 152 | CDS | 2 |
EST-SSRs shown in bold have sizes different from the expected sizes.
3.5. Validation of Polymorphism of EST-SSRs
Nineteen effective primer pairs were used for polymorphism validation for these aforementioned 24 Chinese cabbage cultivars. The results showed that 17 loci (89.5%) were polymorphic (Figure 5). A total of 56 alleles at the 17 polymorphic loci were identified and the average number of alleles per SSR locus was 3.29 with a range between 2 and 6. A maximum of 6 alleles was detected for BR-es16 and BR-es18 loci. BR-es6 and BR-es11 had no polymorphic allele in all 24 cultivars in this study (Figure 5, Tables 3 and s4). Of the 17 polymorphic loci, twelve loci were polymorphic in all cultivars of B. pekinensis L. and B. chinensis L. Three loci (BR-es2, BR-es9, and BR-es19) had no polymorphism in the cultivars of B. pekinensis L. but had polymorphism in the cultivars of B. chinensis L., while two loci (BR-es4 and BR-es7) were polymorphic in the cultivars of B. pekinensis L. but were not polymorphic in the cultivars of B. chinensis L. (Figure 5, Table s8).
We sequenced the polymorphic alleles of the 17 polymorphic loci and found that polymorphisms of 9 loci (BR-es1, BR-es4, BR-es7, BR-es8, BR-es10, BR-es14, BR-es17, BR-es18, and BR-es19) were because of different iterate numbers of SSR repeat motifs. In another 6 polymorphic loci (BR-es2, BR-es3, BR-es12, BR-es13, BR-es15, and BR-es16), the most polymorphic alleles were found in the repeat motifs with additional changes in other regions (Table s7). For example, compared with the allele BR-es3-160 bp in FuShanBaoTou, the polymorphic alleles BR-es3-163 bp and 145 bp had different iterate numbers of the TAG/ATC repeat motif, while the polymorphic allele 99 bp had not only a different number of the repeat motif, but also a deletion in another region (Table s7). The other two polymorphic loci, BR-es5 and BR-es9, had polymorphisms that are not related with the repeat numbers of SSR motifs (Table s7).
4. Discussion
4.1. High-Throughput RNA Sequencing Provides Substantial Knowledge for EST-SSRs
Illumina paired-end RNA sequencing is one of the fast immerging next-generation sequencing (NGS) technologies. Because of its advantages in high-throughput, high accuracy, and low cost, Illumina paired-end sequencing has been widely used for de novo transcriptome sequencing and assembly and transcriptome quality and quantity analysis in many plants [37, 38, 42, 43]. In our previous study, the transcriptome of rosette and folding leaves in Chinese cabbage was analyzed using the Illumina paired-end RNA sequencing technology, and abundant clean reads and ESTs with high quality were obtained [37]. The large quantity of clean reads would increase coverage depth of transcriptome nucleotide, enhance sequencing accuracy, and provide useful information for developing new tools for genetic mapping and molecular breeding of Chinese cabbage. In this study, we further assembled the clean reads into contigs and unigenes from the RL and FL libraries, respectively. The parameters for both contigs and unigenes between the two libraries had no significant differences (Table 1), indicating our Illumina sequencing solutions have high reliability and reproducibility. The unigenes of the two libraries were further assembled and a total of 51,694 nonredundant unigenes were obtained from the 40.7 Mb sequence data. We discovered more nonredundant unigenes than those in previous studies [35, 36], which represent a large portion of the Chinese cabbage transcriptome and are important for a comprehensive understanding of EST-SSRs.
4.2. Frequency and Distribution of EST-SSRs in Chinese Cabbage
A total of 10,420 SSRs with over 12 bp were identified from the deep transcriptome sequence dataset of Chinese cabbage. About 16.6% of the unigenes have SSRs. The frequency of occurrence of SSRs is slightly higher than those reported in previous studies on Chinese cabbage (about 8.4–15.6%) [20, 34–36] and also higher than those of other dicotyledonous species such as peanut (6.8%) [44], sweetpotato (8.2%) [21], sesame (8.9%) [43], pigeonpea (7.6%) [45], grapes (2.5%) [46], pepper (4.9%) [47], and flax (3.5%) [48], but it is lower than those of coffee (18.5%) [49], radish (23.8%) [38], and caster bean (28.4%) [50]. Detection of EST-SSRs depends on a number of factors such as genome structure [51], tools and parameters for EST-SSRs detection and exploration [43], and size of dataset for unigene assembly [27].
The frequency of SSRs with different sizes of repeat units is not evenly distributed in plants. Previous studies showed dinucleotide SSR loci are the most abundant class in safflower [52], pigeonpea [45], and sesame [43], whereas trinucleotide repeats are the most frequent ones in barley [53], sweetpotato [21], Jatropha curcas [54], iris [55], pepper [47], caster bean [50], flax [48], Cucurbita pepo [56], and radish [38]. In ramie [57] and wheat [58], dinucleotide and trinucleotide repeat motifs are the most two abundant types. In the present study, trinucleotide (4405, 42.3%) was found to be the most common repeat motif class in Chinese cabbage, followed by dinucleotide (4043, 38.8%) (Figure 2). It is consistent with previous reports for SSRs identification from unigenes of Chinese cabbage [20]. However, on the genomic level, of Chinese cabbage, dinucleotide is the most common repeat motif, followed by trinucleotide [25].
We found the most dominant mononucleotide repeat motif in Chinese cabbage was A/T (1,562, accounting for 15.0% of the total EST-SSRs), which is consistent with previous reports for Chinese cabbage [25] and for other plants such as Arabidopsis [59], rice [59], wheat [60], radish [38], castor bean [50], Gossypium raimondii [61], oil palm [62], and eggplant [63]. For dinucleotide motif, AG/CT was the most common repeat motif, accounting for 87.0% of the total dinucleotide EST-SSRs. It is in close agreement with the results in previous studies for genic SSRs in Chinese cabbage [20, 36] and those in most other plants such as sweetpotato [21], iris [55], sesame [43], and radish [38]. The AG/CT repeat motif was also the most dominant repeat among all the EST-SSRs identified in this study, accounting for 33.8% of the total EST-SSRs. However, for genomic-SSRs in Chinese cabbage, AT/TA is the most common dinucleotide motif [25]. The AAG/CTT (1,445, 13.9%) motif was the most frequent motif among trinucleotide EST-SSRs in the study, which is consistent with the results in previous studies in Chinese cabbage [25, 36] and many dicot species, for example, Arabidopsis [64], soybean [65], peanut [44], sweetpotato [21], radish [38], and sesame [43]. In many monocot species such as maize, barley, and sorghum [66, 67], CCG/GGC is the most dominant trinucleotide repeat motif. It is considered a specific feature of monocot genomes due to the high GC content in monocot genomes [68].
4.3. New EST-SSRs Identification
Of all 10420 EST-SSRs identified in this study, more than 70% have been identified and presented in the SSR marker database (http://oilcrops.info/SSRdb), among which over half were exactly the same with the earlier reported SSRs based on the Brassica rapa (Chiifu-401) genomic sequence (Table s2) [25]. It demonstrates that our method is highly reliable for EST-SSR identification. 2317 EST-SSRs (22.2%) with polymorphism in different repeat numbers could further be used for identification of Chiifu-401 and FuShanBaoTou and for genetic linkage map constructions using these two cultivars as parents. A total of 2744 new EST-SSRs (26.3%) were identified in the study, which, in combination with previously discovered EST-SSRs, could be used for high-density genetic linkage map construction, gene/QTL mapping, cultivar identification, and so forth.
4.4. High Polymorphism of Chinese Cabbage EST-SSRs
In the present study, 79.2% of the EST-SSRs primer pairs selected for primer evaluation successfully generated high quality amplicons, indicating that the ESTs from the high-throughput RNA sequencing of Chinese cabbage transcriptome are suitable for specific primer design. The unsuccessfully designed primer pairs may be due to splice sites, large introns, chimeric primer(s), or poor quality sequences [27]. We sequenced all PCR amplicons in Chinese cabbage FuShanBaoTou yielding 19 successful primer pairs. We found that all amplicons contained the expected SSRs and the SSRs in 13 amplicons were exactly the same as predicted (Table s6). The deviation of EST-SSR PCR amplicons from the expected size is likely due to the presence of introns, large insertions or repeat number variations, a lack of specificity, or assembly errors [43]. In the present study, we found five of six amplicons with unexpected sizes had different iterate number of SSR repeat units, while the other one had a 86 bp insertion near the expected SSR repeat motif region (Table s6). These results suggested that the unigenes assembled from the high-throughput RNA sequencing of Chinese cabbage transcriptome are reliable, and the EST-SSRs identified in our dataset could be used for further studies, such as genetic mapping and cultivar identification.
Most of the EST-SSR loci (accounting for 89.5% of the tested loci) were found to be polymorphic among the 24 tested cabbage cultivars. The mean number of alleles per SSR locus was 3.29 with a range between 2 and 6 (Table 3), indicating that polymorphism of EST-SSRs in Chinese cabbage is relatively high. Most of the polymorphisms of the tested EST-SSR loci are due to the variations of SSR repeat motifs in this study. There were only two loci where the polymorphisms were not related to the SSR repeat motif variations (Table s6). The results indicate that the EST-SSRs identified and the PCR primers designed in this study could further be used for constructing high-density genetic linkage maps, mapping quantitative trait loci, assessing germplasm polymorphism and evolution, marker-assisted selection, and cloning functional gene in Chinese cabbage.
In summary, we assembled a large set of clean reads with high quality derived from the Chinese cabbage transcriptome using high-throughput RNA sequencing technology with a Solexa/Illumina platform. A total of 51,694 nonredundant unigenes were obtained from 40.7 Mb sequence data, providing substantial knowledge for EST-SSR identification and characterization. 10,420 EST-SSRs were identified and characterized, and PCR primer pairs for 1561 EST-SSRs were designed. By comparing with previously reported SSRs in the SSR marker database for Brassica (http://oilcrops.info/SSRdb), we identified a total of 2744 new EST-SSRs. Primer pairs for 24 EST-SSRs were selected for primer evaluation, and 79.2% of the 24 EST-SSR loci successfully generated high quality amplicons. Among the effective primers, 89.5% of them showed polymorphism in 24 cultivars of Chinese cabbage. The EST-SSRs developed in this study, in combination with previously reported EST-SSRs, will provide valuable resources for constructing high-density genetic linkage maps, mapping quantitative trait loci, assessing germplasm polymorphism and evolution, marker-assisted selection, and cloning functional gene in Chinese cabbage. To our knowledge, this is the first successful attempt to develop large quantity of EST-SSRs with high quality based on the transcriptome of Chinese cabbage using high-throughput RNA sequencing technology.
Supplementary Material
Acknowledgments
This research was supported by the China Postdoctoral Science Foundation Funded Project (2013M541948), the Shandong Postdoctoral Science Foundation Funded Project, China (201303030), the National Natural Science Foundation of China (31401869), the National High-tech R&D Program of China (863 Program) (Grant 2012AA100103), the Modern Agricultural Industrial Technology System Funding of Shandong Province, China (SDAIT-02-022-04), and the Project for Cultivation of Major Achievements in Science and Technology in SAAS (2015CGPY09).
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Authors' Contribution
Qian Ding and Jingjuan Li contributed equally to this work.
References
- 1.Wang X., Wang H., Wang J., et al. The genome of the mesopolyploid crop species Brassicarapa . Nature Genetics. 2011;43:1035–1039. doi: 10.1038/ng.919. [DOI] [PubMed] [Google Scholar]
- 2.Yu X., Peng J., Feng X., et al. Cloning and structural and expressional characterization of BcpLH gene preferentially expressed in folding leaf of Chinese cabbage. Science in China, Series C: Life Sciences. 2000;43(3):328–329. doi: 10.1007/BF02906829. [DOI] [PubMed] [Google Scholar]
- 3.Wu H., Yu L., Tang X.-R., Shen R.-J., He Y.-K. Leaf downward curvature and delayed flowering caused by AtLH overexpression in Arabidopsis thaliana . Acta Botanica Sinica. 2004;46(9):1106–1113. [Google Scholar]
- 4.Lee J., Han C. T., Hur Y. Overexpression of BrMORN, a novel ‘membrane occupation and recognition nexus’ motif protein gene from Chinese cabbage, promotes vegetative growth and seed production in Arabidopsis . Molecules and Cells. 2010;29(2):113–122. doi: 10.1007/s10059-010-0006-2. [DOI] [PubMed] [Google Scholar]
- 5.Wang B., Zhou X., Xu F., Gao J. Ectopic expression of a Chinese cabbage BrARGOS gene in Arabidopsis increases organ size. Transgenic Research. 2010;19(3):461–472. doi: 10.1007/s11248-009-9324-6. [DOI] [PubMed] [Google Scholar]
- 6.Song K. M., Suzuki J. Y., Slocum M. K., Williams P. M., Osborn T. C. A linkage map of Brassica rapa (syn. campestris) based on restriction fragment length polymorphism loci. Theoretical and Applied Genetics. 1991;82(3):296–304. doi: 10.1007/bf02190615. [DOI] [PubMed] [Google Scholar]
- 7.Chyi Y.-S., Hoenecke M. E., Sernyk J. L. A genetic linkage map of restriction fragment length polymorphism loci for Brassica rapa (syn. campestris) Genome. 1992;35(5):746–757. doi: 10.1139/g92-115. [DOI] [Google Scholar]
- 8.Suwabe K., Iketani H., Nunome T., Kage T., Hirai M. Isolation and characterization of microsatellites in Brassica rapa L. Theoretical and Applied Genetics. 2002;104(6-7):1092–1098. doi: 10.1007/s00122-002-0875-7. [DOI] [PubMed] [Google Scholar]
- 9.Lowe A. J., Moule C., Trick M., Edwards K. J. Efficient large-scale development of microsatellites for marker and mapping applications in Brassica crop species. Theoretical and Applied Genetics. 2004;108(6):1103–1112. doi: 10.1007/s00122-003-1522-7. [DOI] [PubMed] [Google Scholar]
- 10.Suwabe K., Iketani H., Nunome T., Ohyama A., Hirai M., Fukuoka H. Characteristics of microsatellites in Brassica rapa genome and their potential utilization for comparative genomics in cruciferae. Breeding Science. 2004;54(2):85–90. doi: 10.1270/jsbbs.54.85. [DOI] [Google Scholar]
- 11.Choi S. R., Teakle G. R., Plaha P., et al. The reference genetic linkage map for the multinational Brassica rapa genome sequencing project. Theoretical and Applied Genetics. 2007;115(6):777–792. doi: 10.1007/s00122-007-0608-z. [DOI] [PubMed] [Google Scholar]
- 12.Jung S. K., Tae Y. C., King G. J., et al. A sequence-tagged linkage map of Brassica rapa . Genetics. 2006;174(1):29–39. doi: 10.1534/genetics.106.060152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Suwabe K., Tsukazaki H., Iketani H., et al. Simple sequence repeat-based comparative genomics between Brassica rapa and Arabidopsis thaliana: the genetic origin of clubroot resistance. Genetics. 2006;173(1):309–319. doi: 10.1534/genetics.104.038968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wu J., Yuan Y.-X., Zhang X.-W., et al. Mapping QTLs for mineral accumulation and shoot dry biomass under different Zn nutritional conditions in Chinese cabbage (Brassica rapa L. ssp. pekinensis) Plant and Soil. 2008;310(1-2):25–40. doi: 10.1007/s11104-008-9625-1. [DOI] [Google Scholar]
- 15.Li F., Kitashiba H., Inaba K., Nishio T. A brassica rapa linkage map of EST-based SNP markers for identification of candidate genes controlling flowering time and leaf morphological traits. DNA Research. 2009;16(6):311–323. doi: 10.1093/dnares/dsp020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Li L., Zheng X. Y. The development of multiplex EST-SSR markers to identification Chinese cabbage [Brassica campestris L. chinensis (L.) Makino and Brassica campestris L. pekinensis (Lour.) Olsson] cultivars. Acta Horticulturae Sinica. 2009;37(11):1627–1634. [Google Scholar]
- 17.Iniguez-Luy F. L., Lukens L., Farnham M. W., Amasino R. M., Osborn T. C. Development of public immortal mapping populations, molecular markers and linkage maps for rapid cycling Brassica rapa and B. oleracea . Theoretical and Applied Genetics. 2009;120(1):31–43. doi: 10.1007/s00122-009-1157-4. [DOI] [PubMed] [Google Scholar]
- 18.Feng H., Wei P., Piao Z.-Y., et al. SSR and SCAR mapping of a multiple-allele male-sterile gene in Chinese cabbage (Brassica rapa L.) Theoretical and Applied Genetics. 2009;119(2):333–339. doi: 10.1007/s00122-009-1042-1. [DOI] [PubMed] [Google Scholar]
- 19.Li X., Ramchiary N., Choi S. R., et al. Development of a high density integrated reference genetic linkage map for the multinational Brassica rapa Genome Sequencing Project. Genome. 2010;53(11):939–947. doi: 10.1139/g10-054. [DOI] [PubMed] [Google Scholar]
- 20.Parida S. K., Yadava D. K., Mohapatra T. Microsatellites in Brassica unigenes: relative abundance, marker design, and use in comparative physical mapping and genome analysis. Genome. 2010;53(1):55–67. doi: 10.1139/g09-084. [DOI] [PubMed] [Google Scholar]
- 21.Wang Z., Li J., Luo Z., et al. Characterization and development of EST-derived SSR markers in cultivated sweetpotato (Ipomoea batatas) BMC Plant Biology. 2011;11, article 139 doi: 10.1186/1471-2229-11-139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Li W., Zhang J., Mou Y., et al. Integration of Solexa sequences on an ultradense genetic map in Brassica rapa L. BMC Genomics. 2011;12, article 249 doi: 10.1186/1471-2164-12-249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zou J., Fu D., Gong H., et al. De novo genetic variation associated with retrotransposon activation, genomic rearrangements and trait variation in a recombinant inbred line population of Brassica napus derived from interspecific hybridization with Brassica rapa . Plant Journal. 2011;68(2):212–214. doi: 10.1111/j.1365-313x.2011.04679.x. [DOI] [PubMed] [Google Scholar]
- 24.Bagheri H., El-Soda M., van Oorschot I., et al. Genetic analysis of morphological traits in a new, versatile, rapid-cycling Brassica rapa recombinant inbred line population. Frontiers in Plant Science. 2012;3, article 183 doi: 10.3389/fpls.2012.00183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Shi J., Huang S., Zhan J., et al. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species. DNA Research. 2014;21(1):53–68. doi: 10.1093/dnares/dst040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Powell W., Morgante M., Andre C., et al. The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis. Molecular Breeding. 1996;2(3):225–238. doi: 10.1007/bf00564200. [DOI] [Google Scholar]
- 27.Varshney R. K., Graner A., Sorrells M. E. Genic microsatellite markers in plants: features and applications. Trends in Biotechnology. 2005;23(1):48–55. doi: 10.1016/j.tibtech.2004.11.005. [DOI] [PubMed] [Google Scholar]
- 28.Eujayl I., Sledge M. K., Wang L., et al. Medicago truncatula EST-SSRs reveal cross-species genetic markers for Medicago spp. Theoretical and Applied Genetics. 2004;108(3):414–422. doi: 10.1007/s00122-003-1450-6. [DOI] [PubMed] [Google Scholar]
- 29.Zhang L. Y., Bernard M., Leroy P., Feuillet C., Sourdille P. High transferability of bread wheat EST-derived SSRs to other cereals. Theoretical and Applied Genetics. 2005;111(4):677–687. doi: 10.1007/s00122-005-2041-5. [DOI] [PubMed] [Google Scholar]
- 30.Saha M. C., Cooper J. D., Mian M. A. R., Chekhovskiy K., May G. D. Tall fescue genomic SSR markers: development and transferability across multiple grass species. Theoretical and Applied Genetics. 2006;113(8):1449–1458. doi: 10.1007/s00122-006-0391-2. [DOI] [PubMed] [Google Scholar]
- 31.Varshney R. K., Sigmund R., Börner A., et al. Interspecific transferability and comparative mapping of barley EST-SSR markers in wheat, rye and rice. Plant Science. 2005;168(1):195–202. doi: 10.1016/j.plantsci.2004.08.001. [DOI] [Google Scholar]
- 32.Zalapa J. E., Cuevas H., Zhu H., et al. Using next-generation sequencing approaches to isolate simple sequence repeat (SSR) loci in the plant sciences. The American Journal of Botany. 2012;99(2):193–208. doi: 10.3732/ajb.1100394. [DOI] [PubMed] [Google Scholar]
- 33.Li L., He W. M., Ma L. P., et al. Construction Chinese cabbage (Brassica rapa L.) core collection and its EST-SSR fingerprint database by EST-SSR molecular markers. Genomics and Applied Biology. 2009;28:76–88. [Google Scholar]
- 34.Xin Y., Cui H., Lu M., et al. Data mining for SSRs in ESTs and EST-SSR marker development in Chinese cabbage. Acta Horticulturae Sinica. 2006;33(3):549–554. [Google Scholar]
- 35.Ge Y., Ramchiary N., Wang T., et al. Development and linkage mapping of unigene-derived microsatellite markers in Brassica rapa L. Breeding Science. 2011;61(2):160–167. doi: 10.1270/jsbbs.61.160. [DOI] [Google Scholar]
- 36.Ramchiary N., Nguyen V. D., Li X., et al. Genic microsatellite markers in brassica rapa: development, characterization, mapping, and their utility in other cultivated and wild brassica relatives. DNA Research. 2011;18(5):305–320. doi: 10.1093/dnares/dsr017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wang F., Li L., Li H., et al. Transcriptome analysis of rosette and folding leaves in Chinese cabbage using high-throughput RNA sequencing. Genomics. 2012;99(5):299–307. doi: 10.1016/j.ygeno.2012.02.005. [DOI] [PubMed] [Google Scholar]
- 38.Wang S., Wang X., He Q., et al. Transcriptome analysis of the roots at early and late seedling stages using Illumina paired-end sequencing and development of EST-SSR markers in radish. Plant Cell Reports. 2012;31(8):1437–1447. doi: 10.1007/s00299-012-1259-3. [DOI] [PubMed] [Google Scholar]
- 39.Pertea G., Huang X., Liang F., et al. TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003;19(5):651–652. doi: 10.1093/bioinformatics/btg034. [DOI] [PubMed] [Google Scholar]
- 40.Winnepenninckx B., Backeljau T., de Wachter R. Extraction of high molecular weight DNA from molluscs. Trends in Genetics. 1993;9(12):p. 407. doi: 10.1016/0168-9525(93)90102-n. [DOI] [PubMed] [Google Scholar]
- 41.Grabherr M. G., Haas B. J., Yassour M., et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology. 2011;29(7):644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wang Z., Fang B. P., Chen J., et al. De novo assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweetpotato (Ipomoea batatas) BMC Genomics. 2010;11, article 726 doi: 10.1186/1471-2164-11-726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wei W., Qi X., Wang L., et al. Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics. 2011;12, article 451 doi: 10.1186/1471-2164-12-451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Liang X., Chen X., Hong Y., et al. Utility of EST-derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachis wild species. BMC Plant Biology. 2009;9, article 35 doi: 10.1186/1471-2229-9-35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Dutta S., Kumawat G., Singh B. P., et al. Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh] BMC Plant Biology. 2011;11, article 17 doi: 10.1186/1471-2229-11-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Scott K. D., Eggler P., Seaton G., et al. Analysis of SSRs derived from grape ESTs. Theoretical and Applied Genetics. 2000;100(5):723–726. doi: 10.1007/s001220051344. [DOI] [Google Scholar]
- 47.Shirasawa K., Ishii K., Kim C., et al. Development of Capsicum EST-SSR markers for species identification and in silico mapping onto the tomato genome sequence. Molecular Breeding. 2013;31(1):101–110. doi: 10.1007/s11032-012-9774-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cloutier S., Niu Z., Datla R., Duguid S. Development and analysis of EST-SSRs for flax (Linum usitatissimum L.) Theoretical and Applied Genetics. 2009;119(1):53–63. doi: 10.1007/s00122-009-1016-3. [DOI] [PubMed] [Google Scholar]
- 49.Aggarwal R. K., Hendre P. S., Varshney R. K., Bhat P. R., Krishnakumar V., Singh L. Identification, characterization and utilization of EST-derived genic microsatellite markers for genome analyses of coffee and related species. Theoretical and Applied Genetics. 2007;114(2):359–372. doi: 10.1007/s00122-006-0440-x. [DOI] [PubMed] [Google Scholar]
- 50.Qiu L., Yang C., Tian B., Yang J.-B., Liu A. Exploiting EST databases for the development and characterization of EST-SSR markers in castor bean (Ricinus communis L.) BMC Plant Biology. 2010;10, article 278 doi: 10.1186/1471-2229-10-278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kumpatla S. P., Mukhopadhyay S. Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species. Genome. 2005;48(6):985–998. doi: 10.1139/g05-060. [DOI] [PubMed] [Google Scholar]
- 52.Yamini K. N., Ramesh K., Naresh V., Rajendrakumar P., Anjani K., Dinesh Kumar V. Development of EST-SSR markers and their utility in revealing cryptic diversity in safflower (Carthamus tinctorius L.) Journal of Plant Biochemistry and Biotechnology. 2013;22(1):90–102. doi: 10.1007/s13562-012-0115-4. [DOI] [Google Scholar]
- 53.Thiel T., Michalek W., Varshney R. K., Graner A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.) Theoretical and Applied Genetics. 2003;106(3):411–422. doi: 10.1007/s00122-002-1031-0. [DOI] [PubMed] [Google Scholar]
- 54.Wen M., Wang H., Xia Z., Zou M., Lu C., Wang W. Developmenrt of EST-SSR and genomic-SSR markers to assess genetic diversity in Jatropha Curcas L. BMC Research Notes. 2010;3, article 42 doi: 10.1186/1756-0500-3-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Tang S., Okashah R. A., Cordonnier-Pratt M.-M., et al. EST and EST-SSR marker resources for Iris. BMC Plant Biology. 2009;9, article 72 doi: 10.1186/1471-2229-9-72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Blanca J., Cañizares J., Roig C., Ziarsolo P., Nuez F., Picó B. Transcriptome characterization and high throughput SSRs and SNPs discovery in Cucurbita pepo (Cucurbitaceae) BMC Genomics. 2011;12, article 104 doi: 10.1186/1471-2164-12-104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Liu T., Zhu S., Fu L., et al. Development and characterization of 1,827 expressed sequence tag-derived simple sequence repeat markers for ramie (Boehmeria nivea L. Gaud) PLoS ONE. 2013;8(4):1091–1104. doi: 10.1371/journal.pone.0060346.e60346 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Pan H., Wang J., Wang Y., Qi Z., Li S. Development and mapping of EST-SSR markers in wheat. Scientia Agricultura Sinica. 2010;43:452–461. [Google Scholar]
- 59.Sonah H., Deshmukh R. K., Sharma A., et al. Genome-wide distribution and organization of Microsatellites in plants: an insight into marker development in Brachypodium . PLoS ONE. 2011;6(6) doi: 10.1371/journal.pone.0021298.e21298 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Peng J. H., Lapitan N. L. V. Characterization of EST-derived microsatellites in the wheat genome and development of eSSR markers. Functional and Integrative Genomics. 2005;5(2):80–96. doi: 10.1007/s10142-004-0128-8. [DOI] [PubMed] [Google Scholar]
- 61.Wang C., Guo W., Cai C., Zhang T. Characterization, development and exploitation of EST-derived microsatellites in Gossypium raimondii Ulbrich. Chinese Science Bulletin. 2006;51(5):557–561. doi: 10.1007/s11434-006-0557-y. [DOI] [Google Scholar]
- 62.Singh R., Zaki N. M., Ting N.-C., et al. Exploiting an oil palm EST database for the development of gene-derived SSR markers and their exploitation for assessment of genetic diversity. Biologia. 2008;63(2):227–235. doi: 10.2478/s11756-008-0041-z. [DOI] [Google Scholar]
- 63.Stàgel A., Portis E., Toppino L., Rotino G. L., Lanteri S. Gene-based microsatellite development for mapping and phylogeny studies in eggplant. BMC Genomics. 2008;9, article 357 doi: 10.1186/1471-2164-9-357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Cardle L., Ramsay L., Milbourne D., Macaulay M., Marshall D., Waugh R. Computational and experimental characterization of physically clustered simple sequence repeats in plants. Genetics. 2000;156(2):847–854. doi: 10.1093/genetics/156.2.847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Gao L., Tang J., Li H., Jia J. Analysis of microsatellites in major crops assessed by computational and experimental approaches. Molecular Breeding. 2003;12(3):245–261. doi: 10.1023/A:1026346121217. [DOI] [Google Scholar]
- 66.Varshney R. K., Thiel T., Langridge N. S. P., Graner A. In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species. Cellular and Molecular Biology Letters. 2002;7(2):537–546. [PubMed] [Google Scholar]
- 67.La Rota M., Kantety R. V., Yu J.-K., Sorrells M. E. Nonrandom distribution and frequencies of genomic and EST-derived microsatellite markers in rice, wheat, and barley. BMC Genomics. 2005;6, article 23 doi: 10.1186/1471-2164-6-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Morgante M., Hanafey M., Powell W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nature Genetics. 2002;30(2):194–200. doi: 10.1038/ng822. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.