Abstract
Platostoma palustre (Blume) A.J.Paton is an annual herbaceous persistent plant of the Labiatae family. However, there is a lack of genomic data for this plant, which severely restricts its genetic improvement. In this study, we performed genome survey sequencing of P. palustre and developed simple sequence repeat (SSR) markers based on the resulting sequence. K-mer analysis revealed that the assembled genome size was approximately 1.21 Gb. A total of 15,498 SSR motifs were identified and characterized in this study; among them, dinucleotide, and hexanucleotide repeats had the highest and lowest, respectively. Among the dinucleotide repeat motifs, AT/TA repeat motifs were the most abundant, and GC/CG repeat motifs were rather rare, accounting for 44.28% and 0.63%, respectively. Genetic similarity coefficient analysis by the UPMGA methods clustered 12 clones, of P. palustre and related species into two subgroups. These results provide helpful information for further research on P. palustre resources and variety improvements.
Subject terms: Genetic markers, Natural variation in plants, Plant genetics
Introduction
Platostoma palustre (Blume) A.J.Paton, also known as Chinese mesona, is an annual herbaceous persistent plant of the Labiatae (Lamiaceae) family1. In China, P. palustre is mainly distributed in Taiwan, Zhejiang, Jiangxi, Guangdong, Fujian, and Guangxi provinces2. As a traditional Chinese edible and medicinal plant, it contains polysaccharides3, triterpenoid acids4,5, flavonoids6, phenolic compounds (such as epicatechins7 and caffeic acid8), and trace elements8. Wang et al.9 isolated five new caffeic acid oligomers, as well as four known analogues, and one compound showed significant in vitro antiviral activity against respiratory syncytial virus. A study by Song et al.10 showed that an extract of P. palustre had antioxidant and α-glucosidase inhibitory activities.P. palustre is widely used as a raw material for herbal tea, Guiling paste, and Chinese medicine. The caffeic acid extracted from P. palustre was proven to have antioxidative activity11. Moreover, it was also reported that P. palustre polysaccharide (MP) treatment can increase the immunomodulatory activity of mice12. Water and alcohol extracts of P. palustre were reported to be effective in ameliorating hypertension13 and hyperglycaemia14 in rats and can inhibit the growth of Escherichia coli and Salmonella15,16.
To date, research on P. palustre has mainly focused on component extraction, activity, and development for food, and few studies on the genetic diversity of germplasm resources have been reported because of the limited genetic and genomic resources for this species. The concentrations of polysaccharides, triterpenoid acid, flavonoids, and other compounds of different P. palustre varieties vary widely, which directly affects their palatability and use in production17. Hence, variety identification is very important for P. palustre.
For the identification of P. palustre, morphological features such as leaf colour, tillering number, and flowering time have been employed, but this method relies on the accumulated experience of the appraiser, which is vulnerable to environmental and subjective factors and is time-consuming, laborious, and inaccurate. Therefore, it is very important to establish a set of rapid, accurate, and economical identification technologies to promote the utilization of P. palustre. Simple sequence repeat (SSR) marker are a powerful and cost-effective molecular method for quantifying genetic variation in plants due to their abundance in genome, polymorphism, co-dominance and high reproducibility18, and have been developed for many plant species19, especially those used in traditional Chinese herbal medicine, such as watermelon20, Psidium21, Bupleurum falcatum22, and Ligusticum chuanxiong23. These SSR markers have been broadly applied for genetic purity detection (for identifying off-types and selfed females in many hybrid seeds)20, species identification21, haplotype determination, quantitative trait locus (QTL) discovery, marker-assisted selection (MAS) for desired traits and breeding, cultivar DNA fingerprinting, genome-wide association studies (GWASs), and harnessing heterosis19,24. It is almost certain that, when developed, SSR markers can be used for P. palustre identification and accelerating breeding. There have been a few studies on SSR marker development for Chinese herbal medicines; however, few studies on P. palustre molecular markers have been reported. Thus, it is urgent to develop SSR markers for P. palustre.
With advances in next-generation sequencing (NGS) technology, genome survey sequencing has proven to be an important and cost-effective strategy for exploring genomic information and developing molecular markers for plants25, especially for non-model plants which no genetic information is known19. In this study, genome survey sequencing was employed to investigate the genome of P. palustre. We first mined SSRs from genome survey sequences of P. palustre and validated 90 SSRs to understand the genetic relationships among six P. palustre varieties and six other Labiatae species. We aimed to provide a reference for the genotyping, breeding, germplasm collection, and management of P. palustre.
Results
Genome sequencing and estimation of genome size
Paired-end sequencing with 270-bp short inserts of P. palustre was conducted using genomic DNA from sample MX 1. A total of 54.99 Gb of raw data was generated by the Illumina HiSeq sequencing platform, which was approximately 45.37-fold the estimated genome size. All reads were used for k-mer analysis, and abnormal k-mers were removed to calculate genome size, the repeat rate, and heterozygosity. We used 270-bp library data to construct a k-mer distribution map with k = 19 (Fig. 1). For the 19-mer frequency distribution, the peak of the depth distribution was approximately 38. The sequence at the k-mer depth was more than twice the depth at the main peak, which can be attributed to the repeated k-mer sequence with a depth of more than 76. Moreover, a k-mer depth at half the main peak (near 19) represents heterozygosity.
The sequencing data yielded total k-mer values of 48,380,469 and 234. When k-mers with depth abnormalities were removed, the remaining k-mer values were found to be 46,608,868 and 033. These values were further used for the estimation of gene leaders. The genome size was estimated to be 1.21 Gbp, using the following formula: genome size = k-mer count/peak of the k-mer distribution. To ensure the accuracy of the genome size prediction, GenomeScope2 and findGSE software with different k-mer sizes (k = 21, 23, 25, and 27) as well as MGSE were used for genome size prediction. The genome sizes predicted by the different tools with different parameters were in the range of 1.3 Gb to 1.4 Gb (Supplementary Table 1). Based on the k-mer distribution, almost 70.62% of the sequence was repeated. The peak heterozygosity was as low as 0.33%; thus, there was no obvious heterozygosity. The results suggest that the genome of P. palustre is highly complex and has a high degree of repetition.
The resequencing data was de novo assembled by SOAP denovo software. A total of 6,968,859 raw contigs were observed. Unique contigs for scaffold generation were obtained after blasting reads and contigs. Gaps resulting from sequence repetition were filled with paired-end reads. Consequently, the genome was assembled in the form of a total of 5,822,179 scaffolds with a length of 1,374,372,218 bp. We obtain totally 401,762,775 raw reads. Among them, 393,971,228 (98.06%) reads were properly mapped against the assembled sequence by Bwa mem software. Among the scaffolds, scaffold N50 was found to be 191 bp in length and L50 was 1,359,845 from the SOAP denovo software, as shown in Table 1. The raw sequencing data have been submitted to the NCBI database (accession number: PRJNA706453). As the N50 value was very low in contigs as well as scaffolds, we also performed assembly with another software programme, SPAdes. The results showed that contig N50 was 193 bp, which was consistent with results from SOAP. The low quality of the assembled sequence might have been due to the complexity of the P. palustre genome.
Table 1.
ID | Features | Sequencing reads | Value (SOAP) | Value (SPADes) |
---|---|---|---|---|
1 | Total number of bases (Gb) | 63.19 | – | – |
2 | Clean reads (Gb) | 54.99 | – | – |
3 | Clean reads proportion (%) | 87.02% | – | – |
4 | Q20% | 99.43 | – | – |
5 | Q30% | 95.31 | – | – |
6 | GC content (%) | 40.19% | – | – |
7 | Total number of contigs | – | 6,968,859 | 6,783,951 |
8 | Assembly length (bp) | – | 1,374,372,218 | 1,335,937,505 |
9 | Largest contig size (bp) | – | 61,900 | 62,170 |
10 | L50 | – | 1,359,845 | 1,310,596 |
11 | N50 (bp) | – | 191 | 193 |
12 | Number of SSR identified through MISA | – | 15,498 | – |
13 | Number of SSRs tested | – | 90 | – |
14 | Number of polymorphic SSR markers | – | 37 | – |
Identification and characterisation of SSR motifs
In total, 15,498 SSRs were identified from the P. palustre genome survey results (Supplementary Table 2). The identified SSR motifs included dinucleotide (71.96%), trinucleotide (26.26%), tetranucleotide (1.52%), pentanucleotide (0.19%), and hexanucleotide (0.07%) repeats, as shown in Fig. 2a. AT/TA was the most abundant type of dinucleotide repeat, with a content of 44.28% (4939 of all dinucleotide repeats). The AG/CT content was 43.54% (4856 repeats), and the very rare type GC/CG accounted for only 0.63% (70 repeats) (Fig. 2b). In the case of trinucleotides, the most abundant type was ATT/AAT (1183, 29.07% of all trinucleotide repeats), followed by ATG/CAT (17.47%, 711 repeats) and AAC/GTT (14.40%, 586 repeats) (Fig. 2c). Among the tetra-, penta-, and hexanucleotide repeats, the most abundant type was TTTA/TAAA (48.94%, 115 repeats). Furthermore, 99.99% di-, 99.92% tri-, 100% tetra-, 79.31% penta- and 90.00% hexanucleotide repeats were shorter than 30 bp.
SSR motif analysis of P. palustre revealed repeat frequencies of 6–15, 5–10, and 5–6 for dinucleotide, trinucleotide, and hexanucleotide repeats, respectively. The repeat frequencies for both tetra- and pentanucleotides were in the range of 5–7, as shown in Fig. 3. The results further revealed the highest frequency for motifs with 6 tandem repeats (37.11%, 5751), followed by motifs with 5 tandem repeats (18.93%, 2934), 7 tandem repeats (18.07%, 2801), and 8 tandem repeats (11.60%, 1789).
SSR marker verification
A subset of 90 SSR markers representing each repeat class were randomly selected for their validation through PCR amplification (Supplementary Table 3). Among the selected markers, the di-, tri-, tetra-, penta-, and hexanucleotide repeat classes, were represented by 8, 6, 37, 29, and 10 SSR primer pairs, respectively. The results showed that 79 SSRs (87.78%) were successfully amplified, and 37 of these (46.83%) demonstrated polymorphic banding pattern.
The thirty-seven SSR markers were further investigated among P. palustre and related Labiatae genera including Mentha haplocalyx, M. spicata, Prunella vulgaris, Salvia miltiorrhiza, Scutellaria indica, and S. barbata (Supplementary Table 4). A total of 685 fragments were generated through the PCR amplification of the 12 accessions with a mean of 18.5 alleles per marker loci (Fig. 4 and Supplementary Table 4; the full-length gels are presented in Supplementary Fig. 1). Among the tested SSRs, 10 were specifically amplified in P. palustre while the remaining 27 showed a varied level of cross transferability in other related taxa. According to the clustering analysis performed with 27 SSR markers (Fig. 5), 12 accessions were divided into two groups. The 6 P. palustre accessions were clustered into one group, and the 6 accessions of related Labiatae genera were clustered into another group.
Discussion
P. palustre is an important traditional Chinese medicine and edible plant resource with heat-clearing and detoxifying functions. The leaves, roots, and stems of P. palustre have been widely found to contain gel mainly consisting of cortex phellodendri, benzoic acid, ursolic acid, organic acids, flavones, and catechins3–5. Because food and medicinal products of P. palustre have different requirements in terms of quality, it is necessary to breed varieties with different characteristics through genetic improvement. In addition, adulterant plants are common in P. palustre collections. Thus, establishing an accurate and rapid method by molecular markers to identify P. palustre and related species is important for the genetic identification and improvement of P. palustre. Shi et al.2 analysed P. palustre and its adulterants using the internal transcribed spacer 2 (ITS2) region and found that the ITS2 region, as a DNA barcode, could accurately and effectively distinguish P. palustre from its adulterants, including Isodon serra Maxim. However, the study showed that there was no difference in the ITS2 region among the 26 P. palustre accessions from Guangxi province, Guangdong province, Jiangxi province, Fujian province, and Hainan province in China. The results showed that the ITS2 region is not suitable for identifying P. palustre cultivars. Therefore, it is necessary to develop alternativemolecular markers for genetic resource evaluation and improvement.
A genome survey of P. palustre was applied for the first time in this study, with the aim of identifying markers for P. palustre and understanding the genetic diversity and relationships among cultivars and related species. According to the k-mer analysis of the genome survey sequences, the genome of P. palustre is approximately 1.21 Gbp and is complex with a low level of heterozygosity (0.33%). The genome of P. palustre is smaller than that of its related species; for instance, the genome of S. miltiorrhiza is 8.19 Gbp26. However, it is much larger than that of other dicotyledons, such as buckwheat (497 Mb)27, shantung maple (529 Mb)28 and jute (338 Mb)29. In plants, there is a positive correlation between genome size and repetitive elements30. For example, the repetitive element content of P. palustre is 70.62%, which is higher than that of shantung maple (529 Mb, 48.8%) and lower than that of Radix bupleuri (2.11 Gb, 83.89%)31.
A draft reference de novo assembly with sequencing data was used to explore SSRs. A total of 54.99 Gb of clean reads were generated and de novo assembled into 6,968,859 contigs. Due to the complex genome of P. palustre, the contig N50 value was lower. SSRs with high polymorphism and codominance have been used to evaluate genetic resources and in a variety of improvement programs32,33. In this study, a total of 15,498 SSRs were identified in P. palustre using genome survey sequencing. Morgante et al.claimed that there was a negative correlation between genome size and SSR distribution frequency34. However, the SSR distribution frequency in this genome survey was estimated to be 12.80 SSRs per Mb, which is lower than that in R. bupleuri (43.11 SSR per Mb)31 and buckwheat (49.30 SSR per Mb)27. Obviously, P. palustre did not follow this rule. The di- and trinucleotide repeats accounted for the majority of the SSRs, while tetra-, penta-, and hexanucleotide repeats accounted for a very small proportion. Similarly, among the five tandem repeat types of SSRs in P. palustre, di- and trinucleotide repeats accounted for 98.22% of the total SSRs, while tetra-, penta-, and hexanucleotide repeat SSRs accounted for only 1.52%. In P. palustre, we found that AT/TA (44.28%) and ATT/AAT (29.07%) were frequent among the di- and trinucleotide repeat SSRs; these percentages are different not only from those in sorghum35 (AT/AT, 54.4% and CCG/CGG, 18.1%), rice33 (AG/CT, 41.9% and CCG/CGG, 47.5%), and buckwheat27 (AT/AT, 78.60% and AAT/TTA, 31.83%) but also from those in the majority of grasses (GA/TC dimers, A/T monomers, and GCG/CGC trimers were the most abundant SSR types), with some exceptions34. Interestingly, in a study of 16 tree species, a similar trend was observed, where AT/TA base pairs were found to be the most prevalent dimers, followed by AG/TC. AAT/TTA were the most frequent trimers36. In summary, SSR types have different distribution patterns among species at a large evolutionary scale37,38, but the distribution patterns of closely related species and even different parts of the same species differ39,40. The reason for the high polymorphism at these loci needs much more exploration.
As high variability in repeat unit number is observed, SSRs are highly polymorphic and are suitable for use as specific markers for different species/genera and germplasm characterization. In this study, we identified 64 SSRs with polymorphisms among the P. palustre accessions. By using 37 of the 64 SSRs, 395 specific fragments of P. palustre, accounting for 58.96% of all fragments, were detected. The results showed that there was significant genetic differentiation between P. palustre and related Labiatae species. The high polymorphism and specificity of the SSR markers developed in this research suggest that these SSRs could be further used in genetic linkage mapping, MAS, and the identification of genuine hybrids between cultivated P. palustre varieties and the other 6 related Labiatae genera.
This study revealed genomic information for P. palustre and unique SSR loci, providing valuable information for follow-up studies on cultivar identification, improvement and genetic resource management. However, because of the current absence of a reference genome sequence for this species, the genome location/genome coverage of these SSRs makers is unknown. In future, with more genome information for P. palustre is revealed, more molecular makers could be developed and accelerate genetic improvement of P. palustre.
Methods
Plant materials
The plant materials comprised six P. palustre accessions (MX 1, TW 1, ZC 1, ZC 2, XU 1, and XU 2) and six accessions of related Labiatae species, including M. haplocalyx, M. spicata, P. vulgaris, S. miltiorrhiza, S. indica, and S. barbata. Of the six P. palustre accessions, M X 1 and TW 1 was from Fujian province, China. While ZC 1, ZC 2, XU 1 and XU 2 were from Guangdong province, China.
Library construction, genome sequencing and genome character estimation
Total genomic DNA was isolated from young leaf tissue of all plants following a modified CTAB procedure41, and the quality was evaluated by 1% agarose gel electrophoresis. The concentrations of DNA were checked by a BioPhotometer (Eppendorf, Germany). The most widely planted P. palustre MX 1, was selected for the genome survey.
The genomic DNA was broken into fragments of approximately 270 bp by the ultrasonic vibration. The small-insert fragment library was constructed from fragmented random genomic DNA following the manufacturer’s instructions (NEBNext® Ultra DNA Library Prep Kit for Illumina). Adapter ligation and DNA cluster preparation were performed, followed by sequencing using an Illumina Genome Analyzer (Illumina HiSeq 2000, USA) according to the manufacturer’s standard protocol.
In total, four paired-end sequencing libraries with insert sizes of approximately 270 bp were constructed, and paired-ends of 150 bp were sequenced using the Illumina HiSeq 2100 platform. The quality control and pre-processing of sequencing raw reads were carried out using the fastp software42. 284, Raw reads were filtered by Trimmomatic software (v0.39; http://www.usadellab.org/cms/?page=trimmomatic) to remove low quality reads and adaptor sequences. GC distribution analysis was performed by in-house perl code After filtering, clean reads were obtained and used for the following analyses. K-mer (k = 19) analysis was performed, and the abnormal k-mers were filtered out for subsequent analysis. The rate of heterozygosity and the repeat rate were estimated according to k-mer analysis43. GenomeScope244 and findGSE45 with different k-mer sizes (k = 21, 23, 25, and 27) as well as MGSE software46 were employed to predict genome size. The genome size was estimated with the formula: Genome_Size = K-mer coverage/Mean k-mer depth47.
Genome assembly and SSR marker development
After removing the adapters, raw sequencing data were further cleaned for downstream analysis by filtering out reads containing low-quality bases, reads < 100 bp in length, and duplicated reads. The clean reads of all the libraries were assembled into scaffolds and contigs using SOAPdenovo v2 (http://soap.genomics.org.cn/soapdenovo.html) software. SSRs in the DNA sequences were identified using MIcro-SAtellite (MISA) software (version 1.0)48. SSR identification was based on two parameters. First, SSR minimum numbers of 6, 5, 5, 5, and 5 were adopted for the identification of di-, tri-, tetra-, penta-, and hexanucleotides, respectively. Second, an interruption of less than 100 bp between two SSRs was defined as a compound repeat of SSR. Primer Premier V5.0 software (Premier Biosoft International, Palo Alto, CA) was used for primer design with the following parameters: 100–300 bp for final product length, 18–25 bp for primer size (with an optimum size of 20 nucleotides), 35–70% for GC content, and 55–65 °C for annealing temperature.
Verification of SSR markers and genetic similarity analysis
A total of six accessions of P. palustre and six related species, including M. haplocalyx, M. spicata, P. vulgaris, S. miltiorrhiza, S. indica, and S. barbata, were used for the verification of SSR markers developed by genome survey sequencing. In total, 90 SSR markers were selected to verify the quality of SSR markers and polymorphisms in the six accessions of P. palustre. Thirty-seven SSR markers were used to analyse the genetic similarity among the 12 accessions of P. palustre and related species. PCR was performed using EasyTaq® DNA Polymerase (TransGen Biotech, China) with the following programme: 94 °C for 5 min (initial denaturation) followed by 35 cycles of 94 °C for 30 s, 58–61 °C for 30 s, and 72 °C for 1 min, with an extension of 72 °C for 10 min and hold at 4 °C. The products obtained from the PCR were analyzed with 7% polyacrylamide gel electrophoresis (PAGE) and detected by staining with AgNO3 solution. Clear and strong allelic fragments in the same horizontal position were scored manually as 0 (absent) or 1 (present), and the number of alleles (Na), effective number of alleles (Ne), percentage of polymorphic loci (PIC) and expected heterozygosity were calculated using GenAlEx 6.549,50. The genetic similarity coefficients of these clones were calculated and cluster analysis was performed based the neighbor-joining method using the pvclust R package51.
Supplementary Information
Acknowledgements
We are grateful to Xiangbo Zhang, Xiaomin Feng, the staff at INSI, GAAS for critical reading and revision of the manuscript. This work was supported by the GDAS Project of Science and Technology Development (2019GDASYL-0104013; 2020GDASYL-20200302005), the National Natural Science Foundation of China (32072027), the Science and Technology Planting Project of Guangdong Province, China (2019B020238001).
Author contributions
Y.Q. and Q.Z. designed the experiments; N.Z. and Y.H. collected plant materials; N.Z., Z.Z. and Z.H. performed the SSR experiments and analyzed the data; the whole genome sequencing assembly was performed by Q.Z.; Y.Q., N.Z. and Z.Z. drafted this manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Zhao Zheng and Nannan Zhang.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-04264-x.
References
- 1.Santoso H. Platostoma palustre (Blume) A.J.Paton Lamiaceae. In: Franco FM, editor. Ethnobotany of the Mountain Regions of Southeast Asia. Ethnobotany of Mountain Regions. Springer; 2021. [Google Scholar]
- 2.Shi YH, et al. Identification of Herbal tea ingredient Mesona chinensis and its adulterants using ITS2 barcode. Chin. J. Pharm. 2015;50:1282–1285. [Google Scholar]
- 3.Huang L, et al. Effect of highpressure microfluidization treatment on the physicochemical properties and antioxidant activities of polysaccharide from Mesona chinensis Benth. Carbohydr. Polym. 2018;200:191–199. doi: 10.1016/j.carbpol.2018.07.087. [DOI] [PubMed] [Google Scholar]
- 4.Zheng L, et al. The analysis of natural acid-benzoic acid from Mesona chinensis benth by HPLC. China Food Addit. 2013;5:206–213. [Google Scholar]
- 5.Liu BR. Determination of ursolic acid in Mesona chinensis Benth by HPLC. J. Foshan Univ. 2008;6:13–15. [Google Scholar]
- 6.Liu ZW, Wu HM, Zhang C. Enzymatic extraction of flavonoids from Mesona chinensis Benth. Lishizhen Med. Med. Res. 2010;11:2903–2904. [Google Scholar]
- 7.Qiu T, Lin XC, Wang BY. Identification of epicatechin from Mesona chinensis Benth. Natl. Process. Res. Dev. 2010;5:798–800. [Google Scholar]
- 8.Hung CY, Yen G. Antioxidant activity of phenolic compounds isolated from Mesona procumbens Hemsl. J. Agric. Food Chem. 2002;50:2993–2997. doi: 10.1021/jf011454y. [DOI] [PubMed] [Google Scholar]
- 9.Wang ZQ, et al. Caffeic acid oligomers from Mesona chinensis and their In Vitro antiviral activities. Fitoterapia. 2020;144:104603. doi: 10.1016/j.fitote.2020.104603. [DOI] [PubMed] [Google Scholar]
- 10.Song X, et al. Study on antioxidant activity and inhibitory effect of α-glucosidase of different polar extracts of Mesona chinensis. Herald Medic. 2020;39:286–291. [Google Scholar]
- 11.Qin LH, et al. Anti-anoxic constituents from Mesona chinensis Benth. J. Shenyang Pharm. Univ. 2006;23(10):633–636. [Google Scholar]
- 12.Huang L, et al. Mesona chinensis Benth polysaccharides protect against oxidative stress and immunosuppression in cyclophosphamide-treated mice via MAPKs signal transduction pathways. Int. J. Biol. Macromol. 2020;152:766–774. doi: 10.1016/j.ijbiomac.2020.02.318. [DOI] [PubMed] [Google Scholar]
- 13.Yeh CT, et al. Antihypertensive effects of Hsian-tsao and its active compound in spontaneously hypertensive rats. J. Nutr. Biochem. 2009;20:866–875. doi: 10.1016/j.jnutbio.2008.07.015. [DOI] [PubMed] [Google Scholar]
- 14.Liu Y, et al. Hypoglycemic effect and acute toxicity test of Mesona chinensis Benth. J. Fuzhou Gen. Hosp. 2005;12(4–5):266–267. [Google Scholar]
- 15.Liu FL, Feng CL. Study on substituting antibiotics in duck production with Mesona chinensis Benth. Chin. J. Vet. Med. 2009;2:7–10. [Google Scholar]
- 16.Liu FL, Feng CL. The in vitro bacteriostasis test on avian Escherichia coli of Herba Herba. Guangdong J. Anim. Vet. Sci. 2008;33(6):17–43. [Google Scholar]
- 17.Li XH, Li YJ, Huang RS, Jiang ML, Wang HH. SCoT an ISSR analysis of genetic diversity of Mesona chinensis. Southwest China J. Agric. Sci. 2012;25:1834–1840. [Google Scholar]
- 18.Vieira ML, Santini L, Diniz AL, Munhoz C. Microsatellite markers: what they mean and why they are so useful. Genet. Mol. Biol. 2016;39(3):312–328. doi: 10.1590/1678-4685-GMB-2016-0027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Sima T, et al. Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules. 2018;23(2):399–399. doi: 10.3390/molecules23020399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lu X, et al. Identification of high-efficiency SSR markers for assessing watermelon genetic purity. J. Genet. 2018;97(5):1295–1306. [PubMed] [Google Scholar]
- 21.Tuler AC, Carrijo TT, Nóia LR, Ferreira A, Peixoto AL, Ferreira M. SSR markers: a tool for species identification in Psidium (myrtaceae) Mol. Biol. Rep. 2015;42(11):1501–1503. doi: 10.1007/s11033-015-3927-1. [DOI] [PubMed] [Google Scholar]
- 22.Zhu CR, Xu J, Du ML, Wang LH, Sui C, Wei JH. Genome survey analysis and SSR loci mining of Bupleurum falcatum. Zhongguo Zhong Yao Za Zhi. 2019;44(18):3960–3966. doi: 10.19540/j.cnki.cjcmm.20190730.103. [DOI] [PubMed] [Google Scholar]
- 23.Yuan C, et al. EST-SSR identification, markers development of Ligusticum chuanxiong based on Ligusticum chuanxiong transcriptome sequences. Zhongguo Zhong Yao Za Zhi. 2017;42(17):3332–3340. doi: 10.19540/j.cnki.cjcmm.20170814.004. [DOI] [PubMed] [Google Scholar]
- 24.Tabkhkar N, Rabiei B, Lahiji HS, Chaleshtori MH. Genetic variation and association analysis of the ssr markers linked to the major drought-yield QTLs of rice. Biochem. Genet. 2018;56(4):356–374. doi: 10.1007/s10528-018-9849-6. [DOI] [PubMed] [Google Scholar]
- 25.Davey J, et al. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat. Rev. Genet. 2011;12:499–510. doi: 10.1038/nrg3012. [DOI] [PubMed] [Google Scholar]
- 26.Xu H, et al. Analysis of the genome sequence of the medicinal plant Salvia miltiorrhiza. Mol. Plant. 2016;9:949–952. doi: 10.1016/j.molp.2016.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hou S, et al. Genetic diversity of buckwheat cultivars (Fagopyrum tartaricum Gaertn.) assessed with SSR markers developed from genome survey sequences. Plant Mol. Biol. Rep. 2016;34:233–241. [Google Scholar]
- 28.Wang RK, et al. Genome survey sequencing of Acer truncatum Bunge to identify genomic information, simple sequence repeat (SSR) markers and complete chloroplast genome. Forest. 2019;10(2):87. [Google Scholar]
- 29.Yao JY, et al. Evaluation and characteristic analysis of SSRs from the whole genome of jute (Corchorus capsularis) Acta Agron. Sin. 2019;45(1):10–17. [Google Scholar]
- 30.Wang C, et al. Genome survey sequencing of purple elephant grass (Pennisetum purpureum Schum ‘Zise’) and identification of its SSR markers. Mol. Breed. 2018;38(7):1–10. [Google Scholar]
- 31.Zhu CR, et al. Genome survey analysis and SSR loci mining of Bupleurum falcatum. China J. Chin. Mater. Med. 2019;44(18):3960–3966. doi: 10.19540/j.cnki.cjcmm.20190730.103. [DOI] [PubMed] [Google Scholar]
- 32.Tautz D. Hypervariabilty of simple sequences as a general source of polymorphic DNA markers. Nucleic Acids Res. 1989;17:6463–6471. doi: 10.1093/nar/17.16.6463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.McCouch SR, et al. Microsatellite marker development, mapping and applications in rice genetics and breeding. Plant Mol. Biol. 1997;35:89–99. [PubMed] [Google Scholar]
- 34.Morgante M, et al. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat. Genet. 2002;30(2):194–200. doi: 10.1038/ng822. [DOI] [PubMed] [Google Scholar]
- 35.Sonah H, et al. Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium. PLoS ONE. 2011;6:e21298. doi: 10.1371/journal.pone.0021298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Xia X, et al. Using the Genome-wide analysis of SSR and ILP markers in trees: diversity profiling, alternate distribution, and applications in duplication. Sci. Rep. 2017;7:17902. doi: 10.1038/s41598-017-17203-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Srivastava S, et al. Patterns of microsatellite distribution across eukaryotic genomes. BMC Genom. 2019;20(1):153. doi: 10.1186/s12864-019-5516-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tóth G, Gáspári Z, Jurka J. Microsatellites in different eukaryotic genomes survey and analysis. Genome Res. 2000;10:967–981. doi: 10.1101/gr.10.7.967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang XT, et al. Comparative analyses of simple sequence repeats SSRs in 23 mosquito species genomes: Identification, characterization and distribution (Diptera: Culicidae) Insect Sci. 2018;26:606–619. doi: 10.1111/1744-7917.12577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Xu Y, et al. Characterization of perfect microsatellite based on genome-wide and chromosome level in Rhesus monkey (Macaca mulatta) Gene. 2016;592:269–275. doi: 10.1016/j.gene.2016.07.016. [DOI] [PubMed] [Google Scholar]
- 41.Saghi Maroof MA, Soliman KM, Jorgensen AR, Allard RW. Ribisinal DNA space length polymorphism in barley: mendelian inheritance, chromosomal location and population dynamics. Proc. Natl. Acad. Sci. USA. 1984;81:8014–8018. doi: 10.1073/pnas.81.24.8014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Chen S, et al. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Zeng Q, et al. Genome survey sequence and the development of simple sequence repeat (SSR) markers in Erianthus arundinaceus. Sugar Tech. 2020 doi: 10.1007/s12355-020-00872-5. [DOI] [Google Scholar]
- 44.Vurture GW, et al. GenomeScope: Fast reference-free genome profiling from short reads. Bioinformatics. 2017;33:2202–2204. doi: 10.1093/bioinformatics/btx153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sun H, Ding J, Mathieu P, Korbinian S. findGSE: Estimating genome size variation within human and Arabidopsis using k-mer frequencies. Bioinformatics. 2018;34:550–557. doi: 10.1093/bioinformatics/btx637. [DOI] [PubMed] [Google Scholar]
- 46.Shi JP, et al. Chromosome conformation capture resolved near complete genome assembly of broomcorn millet. Nat. Commun. 2019;10(1):464. doi: 10.1038/s41467-018-07876-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pucker B. Mapping-based genome size estimation. Boas Pucker. bioRxiv. 2019 doi: 10.1101/607390. [DOI] [Google Scholar]
- 48.Thiel T, et al. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.) Theor. Appl. Genet. 2003;106(3):411–422. doi: 10.1007/s00122-002-1031-0. [DOI] [PubMed] [Google Scholar]
- 49.Peakall R, Smouse PE. GenAlEx 6.5: Genetic analysis in Excel Population genetic software for teaching and research—An update. Bioinformatics. 2012;28:2537–2539. doi: 10.1093/bioinformatics/bts460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Peakall R, Smouse PE. GENALEX 6: Genetic analysis in Excel. Population genetic software forteaching and research. Mol. Ecol. Notes. 2006;6:288–295. [Google Scholar]
- 51.Suzuki R, Shimodaira H. Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics. 2006;22:1540–1542. doi: 10.1093/bioinformatics/btl117. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.