Abstract
Iris lactea var. chinensis is a well-regarded ornamental plant in the genus Iris (family Iridaceae). In this report, we present the complete chloroplast (cp) genome sequence of I. lactea var. chinensis for the first time. The complete cp genome of I. lactea var. chinensis was assembled using high-throughput sequencing, and phylogenetic analysis was undertaken based on a dataset of coding regions. The cp genome of I. lactea var. chinensis measures 152,409 bp in length, with regions having two inverted copies (IR 26,026 bp), and separated by the large single copy (LSC 82,256 bp) and small single copy (SSC 18,101 bp) regions. The cp genome encodes 133 unique genes, including 87 different protein-coding genes, 38 tRNA genes, and 8 rRNA genes. Based on a dataset of 69 chloroplast coding regions, the maximum-likelihood (ML) phylogenetic tree analysis indicated that Iris lactea var. chinensis clusters closely with Iris sanguinea. Thus, the complete chloroplast genome presented in this report may provide valuable genetic information not only for the future exploitation and utilization of this plant resource but also for further research investigating its relationship with other Iris species.
Keywords: Iris lactea var. chinensis, chloroplast genome, phylogenetic analysis
Iris lactea var. chinensis is a perennial herb of the genus Iris (family Iridaceae) native to China that is widely distributed in Northeast China, North China, and Northwest China. This species is one of the most valuable iris plants (Xu et al. 2011). Because the flowers of this species exhibit good appearance and gorgeous colors, it has high ornamental value and can be planted on the edges of garden roads, flower beds and flower borders, embellished in lawns, or directly used as ground cover plants (Tang et al. 2018). I. lactea var. chinensis has a well-developed root system and strong drought resistance; therefore, it is also a useful sand-fixation and greening plant. In addition, as a halophyte, I. lactea var. chinensis has strong salt resistance and can be used in the improvement of saline land (Gu et al. 2018). I. lactea var. chinensis also showed strong Cd tolerance and accumulation ability, indicating significant potential for application in the phytoremediation of Cd-contaminated soil (Gu et al. 2017; Tian et al. 2019; Liu et al. 2020). In this study, for the first time, we report the chloroplast (cp) genome of this species based on Illumina HiSeq paired-end sequencing data, which may provide valuable genetic information not only for the future exploitation and utilization of this plant resource but also for further research investigating its relationship with other Iris species.
The leaf sample of Iris lactea var. chinensis was collected from Zhengzhou (34°48′3″N, 113°48′44″E), Henan Province, China. The voucher specimen (IRII20190026) is kept in the herbarium of Henan Agricultural University. Total genomic DNA was extracted using a modified CTAB method (Stefanova et al. 2013). The high-quality DNA was cleaved, and paired-end library preparation and sequencing were performed on an Illumina HiSeq platform. The raw data were quality filtered at a Phred score < 30. All remaining sequences were assembled into contigs using NOVOPlasy-v3.3 (Dierckxsens et al. 2017) to reconstruct the cp genome, with Iris sanguinea (GenBank accession number: NC_029227) serving as a reference. The annotation and correction of the cp genome were performed through the program Geneious Prime (Kearse et al. 2012). This genome sequence was deposited into GenBank (accession number MT740331).
The complete cp genome of I. lactea var. chinensis measures 1,52,409 bp in length, containing a large single-copy (LSC) region of 82,256 bp, a small single-copy (SSC) region of 18,101 bp, and two inverted repeat (IR) regions of 26,026 bp. The overall GC content of the whole plastid genome is 38.0%, whereas the corresponding values of the LSC, SSC, and IR regions are 36.2%, 31.7%, and 42.9%, respectively. The whole sequence of I. lactea var. chinensis contains 133 genes, including 87 protein-coding genes, 38 tRNA genes, and 8 rRNA genes. Among these genes, 20 are duplicated, including eight protein-coding genes (ndhB, rpl2, rpl23, rps12, rps7, rps12, rps19 and ycf2), four ribosomal RNA genes, and eight transfer RNA genes (trnA-UGC, trnH-GUG, trnI-CAU, trnI-GAU, trnL-CAA, trnN-GUU, trnR-ACG, trnV-GAC). In addition, twenty-three of the genes contain one or two introns.
To perform phylogenetic analysis, a dataset of 69 chloroplast coding regions of 11 species, including six species from Iridaceae and five species from other families in Asparagales, was aligned using the MAFFT method in Geneious Prime. The maximum likelihood (ML) phylogenetic tree was constructed using RAxML-HPC2 on XSEDE v8.2.12 on the CIPRES cluster (Miller et al. 2010). The ML phylogenetic results indicated that six species from Iridaceae clustered into one branch, which was sister to another branch containing three species from Amaryllidaceae, Asparagaceae and Asphodelaceae (Figure 1). Four Iris species formed a monophyletic group, in which I. gatesii was the earliest species to diverge, and I. lactea var. chinensis was determined to be closely related to I. sanguinea.
Funding Statement
This study was supported by the Natural Science Foundation of Henan [Grant 162300410146].
Disclosure statement
The authors have no conflicts of interest to declare.
Data availability statement
The data that support the findings of this study are publicly available in the National Center for Biotechnology Information (NCBI) at https://www.ncbi.nlm.nih.gov, accession number MT740331.
References
- Dierckxsens N, Mardulyn P, Smits G.. 2017. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45(4):e18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gu CS, Liu LQ, Deng YM, Zhang YX, Wang ZQ, Yuan HY, Huang SZ.. 2017. De novo characterization of the Iris lactea var. chinensis transcriptome and an analysis of genes under cadmium or lead exposure. Ecotoxicol Environ Saf. 144:507–513. [DOI] [PubMed] [Google Scholar]
- Gu CS, Xu S, Wang ZQ, Liu LQ, Zhang YX, Deng YM, Huang SZ.. 2018. De novo sequencing, assembly, and analysis of Iris lactea var. chinensis roots’ transcriptome in response to salt stress. Plant Physiol Biochem. 125:1–12. [DOI] [PubMed] [Google Scholar]
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 28(12):1647–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu QQ, Zhang YX, Wang YJ, Wang WL, Gu CS, Huang SZ, Yuan HY, Dhankher OP.. 2020. Quantitative proteomic analysis reveals complex regulatory and metabolic response of Iris lactea Pall. var. chinensis to cadmium toxicity. J Hazard Mater. 400:123165. [DOI] [PubMed] [Google Scholar]
- Miller MA, Pfeiffer W, Schwartz T.. 2010. Creating the CIPRES science gateway for inference of large phylogenetic trees. In Gateway Computing Environments Workshop (GCE). New Orleans(LA): IEEE. p 1–8. [Google Scholar]
- Stefanova P, Taseva M, Georgieva T, Gotcheva V, Angelov A.. 2013. A modified CTAB method for DNA extraction from soybean and meat products. Biotechnol Biotechnol Equip. 27(3):3803–3810. [Google Scholar]
- Tang J, Liu Q, Yuan H, Zhang Y, Wang W, Huang S.. 2018. Molecular cloning and characterization of a novel salt-specific responsive WRKY transcription factor gene IlWRKY2 from the halophyte Iris lactea var. chinensis. Genes Genomics. 40(8):893–903. [DOI] [PubMed] [Google Scholar]
- Tian XX, Mao PC, Guo Q, Meng L.. 2019. Effect of cadmium stress on root activity and mineral nutrient elements absorption of Iris lacteal. Southwest China J Agric Sci. 32(9):2090–2096. [Google Scholar]
- Xu YF, Shi GX, Jin G, Wang WY, Wang J.. 2011. Recent progresses in the research of Iris lacteal Pall. var. chinesis. Seed. 30(4):67–70. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data that support the findings of this study are publicly available in the National Center for Biotechnology Information (NCBI) at https://www.ncbi.nlm.nih.gov, accession number MT740331.