Abstract
Bolboschoenus planiculmis (F.Schmidt) T.V.Egorova is a typical wetland plant in the species-rich Cyperaceae family. This species contributes prominently to carbon dynamics and trophic integration in wetland ecosystems. Previous studies have reported that the chromosomes of B. planiculmis are holocentric; i.e. they have kinetic activity along their entire length and carry multiple centromeres. This feature was suggested to lead to a rapid genome evolution through chromosomal fissions and fusions and participate to the diversification and ecological success of the Bolboschoenus genus. However, the specific mechanism remains uncertain, partly due to the scarcity of genetic information on Bolboschoenus. We present here the first chromosome-level genome assembly for B. planiculmis. Through the integration of high-quality long-read and short-read data, together with chromatin conformation using Hi-C technology, the ultimate genome assembly was 238.01 Mb with a contig N50 value of 3.61 Mb. Repetitive elements constituted 37.04% of the genome, and 18,760 protein-coding genes were predicted. The low proportion of long terminal repeat retrotransposons (∼9.62%) was similar to that reported for other Cyperaceae species. The Ks (synonymous substitutions per synonymous site) distribution suggested no recent large-scale genome duplication in this genome. The haploid assembly contained a large number of 54 pseudochromosomes with a small mean size of 4.10 Mb, covering most of the karyotype. The results of centromere detection support that not all the chromosomes in B. planiculmis have multiple centromeres, indicating more efforts are needed to fully reveal the specific style of holocentricity in cyperids and its evolutionary significance.
Keywords: Bolboschoenus, Scirpus, genome assembly, karyotype, holocentric chromosome
Significance.
Bolboschoenus planiculmis (clubrush), a typical wetland plant, has potential for practical applications in phytoremediation and wetland conservation. The deficiency in available genetic information limits insights on its ecological adaptability and chromosomal evolution. We present the first chromosome-level genome assembly for the species, with high continuity, completeness, and detailed annotation of protein-coding genes and repeated sequences. The assembly suggests that the majority of the karyotype is constituted by similarly small chromosomes and reveals a heterogeneous distribution of centromeres along chromosomes and among them.
Introduction
Bolboschoenus planiculmis (F.Schmidt) T.V.Egorova (Fig. 1A) is a critical element in wetland ecosystems. The species is a member of the species-rich sedge family (Cyperaceae) and exhibits several unique traits of evolutionary and ecological importance (Ning et al. 2014; An et al. 2018; Ljevnaić-Mašić et al. 2020). By vigorous clonal growth as a result of rhizome elongation and corm formation, B. planiculmis and its relatives show strong adaptability to highly disturbed environments, for example, and contribute to the carbon dynamics in global coastal ecosystems (Peng et al. 2022). Although this species is recognized as a notorious weed in paddy fields owing to its strong competitiveness, its starch-rich tubers can provide a food source for wetland birds (An et al. 2018). Moreover, its potential for application in phytoremediation is attracting increasing attention. A recent study has highlighted its propensity for bioaccumulation through adsorption from the rhizosphere, providing an efficient means to remove deleterious ions to prevent their transfer to other trophic levels (Almaamary et al. 2022). However, the available genetic information on B. planiculmis is limited, except for the plastid genome (Ning et al. 2022a) and a population genetic analysis based on multilocus genotype data (Píšová et al. 2017). A high-quality reference genome is urgently needed to assist with in-depth scientific research and industrial application of this species.
Fig. 1.
Overview of the B. planiculmis genome assembly. A) The sampled individual with corms, a floret, and a seed (inset images below). B) Circos plot showing the distribution of genomic features. The four circular tiers (a–d) represent chromosome ideograms, transposable-element density, gene density, and GC content, respectively. Central lines indicate putative homology among linked sections. The colors of these links are arbitrary and for visual purposes only. C) Gaussian mixture modeling of the Ks distribution of B. planiculmis and Kobresia myosuroides. No convincing peak was detected in the B. planiculmis genome subsequent to its divergence from K. myosuroides.
The evolutionary uniqueness of Bolboschoenus plants warrants specific research attention. The Cyperaceae is well known for its exceptional variation in chromosome number (2n = 4 to 224), which may have facilitated its extreme diversification and cosmopolitan distribution (Roalson 2008; Hipp et al. 2010; Martín-Bravo et al. 2019). The flexibility in chromosome number may be attributable to the prevalence of holocentric chromosomes in the Cyperaceae. In contrast to monocentric chromosomes with fixed centromeres, holocentricity enables kinetochoric activity along the entire chromosome (Márquez-Corro et al. 2019). This unique trait reduces the chance of abnormalities accompanying chromosome fusion and fission, thus promoting speciation without ploidy change and variation in genome size (i.e. dysploidy) (Escudero et al. 2014). However, karyotypes may evolve differently among the clades of the Cyperaceae. Hipp et al. (2008) suggested that polyploidy is rare in Carex, but this may not apply to other Cyperaceae members. A phylogeny-based simulation revealed that the modes of chromosomal evolution were clade specific with different rates and frequencies of fission and fusion events (Márquez-Corro et al. 2019). Nevertheless, no genome assembly has been published for the genus Bolboschoenus. A previous study by Jarolímová and Hroudová (1998) has recorded a chromosome count of 54 for B. planiculmis at the gametophytic stage, and the Bolboschoeneae lineage tends to host karyotypes with high chromosome number and low average chromosome size (Elliott et al. 2022). Nevertheless, the exact individual chromosome information is still lacking. Thus, the genome assembly presented herein for B. planiculmis, together with the associated annotation data, will provide a valuable data source for future evolutionary studies of the Cyperaceae and for practitioners involved in the breeding and engineering of wetland plants.
Results and Discussion
Evaluation of the Genome Assembly
We generated 22.79 Gb (∼96×) of PacBio long-read data through circular consensus sequencing (CCS) sequencing, 29.05 Gb (∼122×) of short-read data for Hi-C mapping, 31.09 Gb (∼132×) of Illumina data for the genome profiling, and 13.95 Gb (∼58×) of transcriptomic data to assist in gene modeling (supplementary table S1, Supplementary Material online). The genome profiling revealed that the sampled individual had a small genome of moderate complexity. The genome was approximately 236.00 Mb in size. Repetitive elements accounted for ∼38.18% of the genome. The Guanine-Cytosine (GC) content was estimated to be ∼33.47%, and the inferred heterozygosity was ∼1.76% (see the detailed results of GenomeScope profiling in supplementary figs. S1 and S2, Supplementary Material online).
For genome assembly, we first established a preliminary de novo assembly based on high-quality PacBio CCS long reads (supplementary fig. S3, Supplementary Material online). The final chromosome-level assembly was generated by combining high-quality Hi-C data (supplementary table S3, Supplementary Material online) and the preliminary assembly. Using Hi-C, we were able to detect the association between most contigs and cluster them into pseudomolecules that image the real chromosome identity. In our case, 54 pseudochromosomes were constructed with an anchored rate of 93.34%, meaning 93.34% of the total base were mapped into these pseudochromosomes; thus, the preliminary assembly was polished to chromosome level (supplementary fig. S4, Supplementary Material online; Table 1). The final genome assembly of B. planiculmis comprised a total of 238.01 Mb (contig N50 value of 3.61 Mb, GC content 35.30%). This final assembly included both the well-anchored and nonanchored bases. We constructed a chromosome ideogram that represented the collinearity relations, GC content, transposable elements (TEs), and gene density (Fig. 1B). The genome size was consistent with the results of the genome profiling (supplementary fig. S1, Supplementary Material online) and flow cytometry (supplementary fig. S2 and table S2, Supplementary Material online).
Table 1.
Statistics for the B. planiculmis genome assembly and BUSCO scores
| Type | Statistics |
|---|---|
| Sequence | |
| Assembly size (bp) | 238,008,122 |
| GC content (%) | 35.30 |
| Number of scaffolds | 148 |
| Longest scaffold (bp) | 9,066,051 |
| Scaffold N50 size (bp) | 4,037,645 |
| Number of contigs | 168 |
| Longest contig (bp) | 5,698,134 |
| Contig N50 size (bp) | 3,612,616 |
| Pseudochromosome | |
| Number | 54 |
| Anchored rate (%) | 93.34 |
| Size range (M) | 3.16 ∼ 9.41 |
| BUSCO score | |
| Complete BUSCOs (%) | 95.48 |
| Complete and single-copy BUSCOs (%) | 93.99 |
| Complete and duplicated BUSCOs (%) | 1.49 |
| Fragmented BUSCOs (%) | 0.31 |
| Missing BUSCOs (%) | 4.21 |
| Total groups searched | 1,614 |
“Anchored rate” refers to the proportion of bases that are well mapped into pseudochromosomes. Those unmapped bases are also included in the final assembly. “Size range” delimits the minimum and maximum size of pseudochromosomes.
The high quality of the assembly was attested by the following evidences: (i) the mapping-back rate and average depth were high for both long reads (98.68%, 86×) and short reads (96.04%, 107×) (details provided in supplementary table S4, Supplementary Material online); and (ii) the complete BUSCO score of 95.48% (Table 1) was comparable with those recently reported for four cyperid genomes (Planta et al. 2022). (iii) Of all the 54 chromosomes, 48 (∼88.89% of the total number) showed signals of telomeres, among which 32 chromosomes exhibited telomeres at both ends (supplementary fig. S5, Supplementary Material online). Notably, the present assembly showed that B. planiculmis has a haploid chromosome number of n = 54. This value was consistent with the results of a previous cytological study (Jarolímová and Hroudová 1998) and was considerably higher than those reported for all published cyperid genomes (Can et al. 2020; Hofstatter et al. 2022; Ning et al. 2022b; Planta et al. 2022). The average pseudochromosome size was small (∼4.1 Mb), with 43 (∼80%) pseudochromosomes corresponding to only one contig (supplementary table S5, Supplementary Material online). Moreover, the proportion of duplicated BUSCO scores was extremely low (1.49%; Table 1). Compared with the genome of Rhynchospora pubera, which has a duplication percentage of 95.29% attributed to whole-genome duplication (WGD) events (Hofstatter et al. 2022), the genome of B. planiculmis is unlikely to have undergone lineage-specific duplication bursts. Instead, dysploidy evolution, which leads to variation in chromosome number but limited change in genome size, may better explain the karyotype evolution of this species.
Synteny and Karyotype
To evaluate support for the inferred absence of recent WGD events in B. planiculmis, we investigated the intragenomic synteny and synonymous substitution rate (Ks) distribution of the assembled genome. Neither the synteny block (segments with a high confidence of synteny) plot (supplementary fig. S6A, Supplementary Material online) nor the original unfiltered synteny dot plot (supplementary fig. S6B, Supplementary Material online) showed convincing evidence for lineage-specific WGD events. Regarding Ks distribution modeling, we used the genome assembly of Kobresia myosuroides as a reference. The results provided no indication of an intensive burst of synonymous substitutions in the B. planiculmis genome after its divergence from K. myosuroides (Fig. 1C). Gaussian mixture modeling failed to fit a unimodal distribution (supplementary fig. S7, Supplementary Material online). Combined with the extremely low percentage of duplicated BUSCO scores, it was concluded that no lineage-specific WGD event was likely to have contributed to the evolution of the B. planiculmis genome. Previous models have shown that the Fuireneae–Abildgaardieae–Eleocharideae–Cypereae clade (including Bolboschoenus) in the Cyperaceae experiences a 3-fold increase in diversification rate and the evolution of a majority of this clade is dominated by high rates of fusion and fission events (Escudero and Hipp 2013; Márquez-Corro et al. 2019). The present genome assembly supports the duplication-limited and dysploidy-prone evolutionary mode.
Genome Annotation and Gene Features
Approximately 37.04% (∼88.17 Mb) of the B. planiculmis genome comprised repetitive sequences. TEs constituted 21.81% of the genome and tandem repeats accounted for 15.23% of the genome. Detailed information for these two categories is shown in supplementary tables S6 and S7, Supplementary Material online. Among the various components of TEs, long terminal repeat retrotransposons (LTR-RTs) comprised ∼9.62% of the genome. It is well documented that variation in activities of LTR-RTs is largely responsible for genome size evolution (Zhao and Ma 2013). Thus, the relation between the low proportion of LTR-RTs and the small genome of B. planiculmis merits further investigation. In addition, a consensus of low proportions of LTR-RTs (6.1% to 15.4% of the genome) is evident in other genome assemblies in the Cyperaceae (Planta et al. 2022). Previous studies have shown that LTR-RTs and satellite repeats tend to be enriched in centromeric regions (Zhao and Ma 2013), and an edging study has highlighted the evolutionary significance of repeat-based holocentromeres in the genomes of three beak-sedges (Rhynchospora spp.) (Hofstatter et al. 2022). Thus, we established a thorough search for centromeres using quarTeT (Lin et al 2023). The results indicated that not all the chromosomes in B. planiculmis have multiple centromeres. Specifically, ten chromosomes showed no traces of centromere. Fifteen chromosomes were monocentric. The rest 29 chromosomes tended to have multiple centromeres (details in supplementary fig. S5, Supplementary Material online). We also noticed that the longest chromosome (Chr LG07) hosted the highest number of centromeres (seven candidates). Our results suggest that the distribution of centromeres may be heterogeneous among chromosomes. Thus, the importance of repeat elements in clubrush needs further investigation.
After masking the repeats, we established a gene model that integrated three methods (based on ab initio, homology, and transcriptome sequencing data, respectively). In total, 18,760 protein-coding genes were predicted in the B. planiculmis genome (supplementary table S8, Supplementary Material online). Approximately 82.79% of these predicted genes were supported by all three methods. The overall BUSCO score for the gene prediction was 94.18% (complete and single-copy 92.26% and complete and duplicated 1.92%). Further analysis revealed that 98.94% of the predicted genes were annotated in the public databases (EggNOG, NR, Swiss-Prot, KEGG, Pfam, KOG, TrEMBL, and GO; details are provided in supplementary table S8, Supplementary Material online). In addition, a noncoding RNA library was constructed, comprising 3,689 rRNAs, 487 tRNAs, 100 miRNAs, 50 snRNAs, and 21 snoRNAs. Furthermore, 22 pseudogenes were detected with a total length of 39,709 bp.
Materials and Methods
Collection and Preparation of Plant Materials
Healthy individuals of B. planiculmis were gathered from the Changgou Wetland (39°35′13″N, 115°53′32″E). We carefully chose healthy, clean leaves for DNA extraction and genomic sequencing. All samples were handled with care to prevent contamination by external pollutants and stored at −80 °C. A voucher specimen is housed in the herbarium of the National Natural History Museum of China (ID: BJM0272524, available upon reasonable request from the corresponding author).
Genome Sequencing
Genomic DNA was extracted using the cetyltrimethylammonium bromide method. Following a standard protocol, we generated a 15-kb DNA SMRTbell library for CCS sequencing. For Hi-C library construction, we followed a previously published protocol involving HindIII enzymatic digestion (Xie et al. 2015). An Illumina NovaSeq 6000 platform was used for Hi-C library sequencing.
Transcriptome Sequencing
We collected fresh samples from five tissues (corm, spikelet, stem, root, and leaf) for RNA sequencing. Paired-end libraries were generated using the conventional mRNA-seq prep kit following the manufacturer's instructions. The insertion size was approximately 350 bp on average. An Illumina NovaSeq 6000 platform was used to sequence the RNA libraries.
Genome Profiling and Draft Assembly
Sequences generated from the pair-end libraries were filtered to survey the genome. Inspection was performed using Genome Scope (v. 2.0) (Ranallo-Benavidez et al. 2020) and Jellyfish (v. 2.1.4) (Marçais and Kingsford 2011). A 19-nt k-mer distribution model was ultimately selected. In addition, flow cytometry experiments were conducted to validate the estimate of the genome size.
Clean data for the PacBio long reads were assembled using HIFIASM (v. 0.14) (Cheng et al. 2021). The preliminary contigs were then adjusted using Pilon (v. 1.24) (Walker et al. 2014). We utilized the Burrows–Wheeler Aligner (BWA; v. 0.7.17) and SAMtools (v. 1.13) to obtain alignments and for file format conversion (Li and Durbin 2009). Assembly integrity and completeness were assessed using BUSCO (v. 3.0) (Waterhouse et al. 2017). The embryophyta_odb10 database was chosen as the reference.
Finalizing the Genome Assembly Using Hi-C Technology
The clean Hi-C data were truncated according to the ascertained junction sites and then aligned with the draft assembly using BWA. For subsequent analysis, only read pairs that were uniquely aligned with mapping quality > 20 were retained. HiC-Pro (v. 2.8.1) (Servant et al. 2015) was used to filter out invalid read pairs.
Initially, we divided the preliminary scaffolds into partitions of ∼50 kb with error correction. The validated Hi-C data were then aligned onto these partitions using BWA, and only singularly aligned data were retained and further processed with LACHESIS (Burton et al. 2013). We carefully checked and corrected any erroneous placements or orientations that exhibited obvious discrete chromatin interaction patterns.
Gene Prediction and Annotation
To identify high-quality protein-coding genes, we used the MAKER pipeline (Cantarel et al. 2007). The de novo gene models were generated by implementing two ab initio gene prediction software tools (Augustus and SNAP). Homology detection was executed using four monocotyledonous species as references (Oryza sativa, Triticum aestivum, Panicum virgatum, and Carex littledalei). For transcript-based prediction, the RNA-sequencing data were used to generate unigenes for gene predictions. Finally, the outcomes of the three methods were reconciled using EVM (v. 1.1.1) (Haas et al. 2008).
To annotate the functions of genes in the B. planiculmis genome, we conducted a BLASTP search (details are provided in supplementary table S8, Supplementary Material online) with an E-value threshold of 1.0 × 10−5. We used tRNAscan-SE (v. 1.3.1) (Lowe and Eddy 1997) to detect tRNA with eukaryote parameters. We used Barrnap (v. 0.9) (Loman 2017) to identify rRNA genes. The miRNA, snoRNA, and snRNA genes were determined using INFERNAL1.1 (Nawrocki and Eddy 2013) with reference to the Rfam (release 12.0) database; detailed procedures could be found in the manual of IINFERNAL (http://eddylab.org/infernal/).
Detection of Repetitive Elements
We first built a de novo repeat library using RepeatModeler2 (v. 2.0.1) (Flynn et al. 2020) and then identified and retrieved high-quality intact LTRs using LTR_retriever (v. 2.8) (Ou and Jiang 2017). After removing redundant repeats, the final species-specific TE library was obtained through a combination of database mapping and correction with RepeatMasker (v. 4.10) (Tarailo-Graovac and Chen 2009). In addition, we used TRF (Benson 1999) and MISA (v. 2.1) (Beier et al. 2017) to identify tandem repeats and enhance the dimensions of the repetitive element library. We screened the genome for potential centromeres and telomeres using quarTeT (http://www.atcgn.com:8080/quarTeT/home.html) (Lin et al 2023) and visualized the results using RIdeogram (v. 0.2.2) (Hao et al 2020).
Detection of Intragenome Synteny and Potential Duplication Bursts
The WGDI toolkit (Sun et al. 2022) was used to ascertain intragenomic synteny and potential duplication bursts. WGDI constructs a hierarchical algorithm to improve the sensitivity and accuracy of collinearity detection. The functions “-d”, “-icl”, “-ks”, “-bi”, “-bk”, “-kp”, and “-kf” were invoked following the toolkit instructions (https://wgdi.readthedocs.io/en/latest/Introduction.html). Finally, an ideogram of pseudochromosomes was generated to intuitively represent the multidimensional genomic information for B. planiculmis.
Supplementary Material
Acknowledgments
We thank Mr Jiachen SUN and Miss Xinyan WU for their help in our fieldwork. We thank Robert McKenzie, PhD, from Liwen Bianji (Edanz) for editing the draft of this manuscript.
Contributor Information
Yu Ning, Wetland Research Center, Institute of Ecological Conservation and Restoration, Chinese Academy of Forestry, Beijing, China; Sichuan Zoige Wetland Ecosystem Research Station, Tibetan Autonomous Prefecture of Aba, China.
Yang Li, Huzhou University, Huzhou, China.
Hai Yan Lin, Institute of Information Technology, Chongqing Academy of Forestry Sciences, Chongqing, China.
En Ze Kang, Wetland Research Center, Institute of Ecological Conservation and Restoration, Chinese Academy of Forestry, Beijing, China; Sichuan Zoige Wetland Ecosystem Research Station, Tibetan Autonomous Prefecture of Aba, China.
Yu Xin Zhao, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China.
Shu Bin Dong, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing, China.
Yong Li, Wetland Research Center, Institute of Ecological Conservation and Restoration, Chinese Academy of Forestry, Beijing, China; Sichuan Zoige Wetland Ecosystem Research Station, Tibetan Autonomous Prefecture of Aba, China.
Xiao Fei Xia, National Natural History Museum of China, Beijing, China.
Yi Fei Wang, Wetland Research Center, Institute of Ecological Conservation and Restoration, Chinese Academy of Forestry, Beijing, China; Sichuan Zoige Wetland Ecosystem Research Station, Tibetan Autonomous Prefecture of Aba, China.
Chun Yi Li, Wetland Research Center, Institute of Ecological Conservation and Restoration, Chinese Academy of Forestry, Beijing, China.
Supplementary Material
Supplementary material is available at Genome Biology and Evolution online.
Author Contributions
Y.N. and Y.F.W. conceived and designed the research; Y.F.W. and C.Y.L. issued the funding and organized the related resources; H.Y.L., E.Z.K., and Y.X.Z. participated in the fieldwork and curated the raw data; Y.N. and Y.L. drafted the manuscript; C.Y.L., S.B.D., and Y.L. contributed in logic improvement and data visualization; and X.F.X. maintained the voucher specimen and carried out the cytological experiments. The revision and approval of this final manuscript were established by all the authors.
Funding
The funding sources for this research are Fundamental Research Funds of the Chinese Academy of Forestry (CAFYBB2020SY042, CAFYBB2018SZ018, and CAFYBB2020ZA004) and National Natural Science Foundation of China (NSFC31800348 and NSFC31972948).
Data Availability
The reported genome assembly is archived in NCBI under accession number PRJNA937020. Biosample information could be accessed at SAMN33386850. The read data involved in this research are deposited at Sequence Read Archive (SRA) with accession numbers SRR23592699 (Illumina short reads) and SRR23592700 (PacBio long reads). The related annotation files could be found at https://figshare.com/projects/genome_annotations_files_for_Bolboschoenus_planiculmis/162637, including coding sequences, protein sequences, and functional annotation in gff3 format.
Literature Cited
- Almaamary EAS, Abdullah SRS, Ismail N, Idris M, Kurniawan SB, Imron MF. Comparative performance of Scirpus grossus for phytotreating mixed dye wastewater in batch and continuous pilot subsurface constructed wetland systems. J Environ Manag. 2022:307:114534. 10.1016/j.jenvman.2022.114534. [DOI] [PubMed] [Google Scholar]
- An Y, Gao Y, Tong S. Emergence and growth performance of Bolboschoenus planiculmis varied in response to water level and soil planting depth: implications for wetland restoration using tuber transplantation. Aquat Bot. 2018:148:10–14. 10.1016/j.aquabot.2018.04.005. [DOI] [Google Scholar]
- Beier S, Thiel T, Münch T, Scholz U, Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics. 2017:33(16):2583–2585. 10.1093/bioinformatics/btx198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999:27(2):573–580. 10.1093/nar/27.2.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burton JN, Adey A, Patwardhan RP, Qiu R, Kitzman JO, Shendure J. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat Biotechnol. 2013:31(12):1119–1125. 10.1038/nbt.2727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Can M, Wei W, Zi H, Bai M, Liu Y, Gao D, Tu D, Bao Y, Wang L, Chen S, et al. Genome sequence of Kobresia littledalei, the first chromosome-level genome in the family Cyperaceae. Sci Data. 2020:7(1):175. 10.1038/s41597-020-0518-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cantarel BL, Korf I, Robb SMC, Parra G, Ross E, Moore B, Holt C, Sánchez Alvarado A, Yandell M. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 2007:18(1):188–196. 10.1101/gr.6743907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021:18(2):170–175. 10.1038/s41592-020-01056-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elliott TL, Zedek F, Barrett RL, Bruhl JJ, Escudero M, Hroudová Z, Joly S, Larridon I, Luceño M, Márquez-Corro JI, et al. Chromosome size matters: genome evolution in the cyperid clade. Ann Bot. 2022:130(7):999–1013. 10.1093/aob/mcac136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Escudero M, Hipp A. Shifts in diversification rates and clade ages explain species richness in higher-level sedge taxa (Cyperaceae). Am J Bot. 2013:100(12):2403–2411. 10.3732/ajb.1300162. [DOI] [PubMed] [Google Scholar]
- Escudero M, Martín-Bravo S, Mayrose I, Fernández-Mazuecos M, Fiz-Palacios O, Hipp AL, Pimentel M, Jiménez-Mejías P, Valcárcel V, Vargas P, et al. Karyotypic changes through dysploidy persist longer over evolutionary time than polyploid changes. PLoS One. 2014:9(1):e85266. 10.1371/journal.pone.0085266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, Smit AF. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci U S A. 2020:117(17):9451–9457. 10.1073/pnas.1921046117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haas BJ, Salzberg SL, Zhu W, Pertea M, Allen JE, Orvis J, White O, Buell CR, Wortman JR. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 2008:9(1):1–22. 10.1186/gb-2008-9-1-r7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hao Z, Lv D, Ge Y, Shi J, Weijers D, Yu G, Chen J. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput Sci. 2020:6:e251. 10.7717/peerj-cs.251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hipp AL, Rothrock PE, Roalson EH. The evolution of chromosome arrangements in Carex (Cyperaceae). Bot Rev. 2008:75(1):96–109. 10.1007/s12229-008-9022-8. [DOI] [Google Scholar]
- Hipp AL, Rothrock PE, Whitkus R, Weber JA. Chromosomes tell half of the story: the correlation between karyotype rearrangements and genetic diversity in sedges, a group with holocentric chromosomes. Mol Ecol. 2010:19(15):3124–3138. 10.1111/j.1365-294x.2010.04741.x. [DOI] [PubMed] [Google Scholar]
- Hofstatter PG, Thangavel G, Lux T, Neumann P, Vondrak T, Novak P, Zhang M, Costa L, Castellani M, Scott A, et al. Repeat-based holocentromeres influence genome architecture and karyotype evolution. Cell. 2022:185(17):3153–3168.e18. 10.1016/j.cell.2022.06.045. [DOI] [PubMed] [Google Scholar]
- Jarolímová V, Hroudová Z. Chromosome numbers within the genus Bolboschoenus in Central Europe. Folia Geobot. 1998:33(4):415–428. 10.1007/bf02803643. [DOI] [Google Scholar]
- Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009:25(14):1754–1760. 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin Y, Ye C, Li X, Chen Q, Wu Y, Zhang F, Pan R, Zhang S, Chen S, Wang X, et al. Quartet: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification. Hortic Res. 2023:10(8):uhad127. 10.1093/hr/uhad127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ljevnaić-Mašić B, Džigurski D, Nikolić L, Brdar-Jokanović M, Čabilovski R, Ćirić V, Petrović A. Assessment of the habitat conditions of a rare and endangered inland saline wetland community with Bolboschoenus maritimus (L.) Palla dominance in Southeastern Europe: the effects of physical–chemical water and soil properties. Wetlands Ecol Manage. 2020:28(3):421–438. 10.1007/s11273-020-09721-4. [DOI] [Google Scholar]
- Loman T. A novel method for predicting ribosomal RNA genes in prokaryotic genomes. Lund (Sweden): LUND University; 2017. [Google Scholar]
- Lowe T, Eddy S. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997:25(5):955–964. 10.1093/NAR/25.5.0955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marçais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011:27(6):764–770. 10.1093/bioinformatics/btr011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Márquez-Corro JI, Martín-Bravo S, Spalink D, Luceño M, Escudero M. Inferring hypothesis-based transitions in clade-specific models of chromosome number evolution in sedges (Cyperaceae). Mol Phylogenet Evol. 2019:135:203–209. 10.1016/j.ympev.2019.03.006. [DOI] [PubMed] [Google Scholar]
- Martín-Bravo S, Jiménez-Mejías P, Villaverde T, Escudero M, Hahn M, Spalink D, Roalson EH, Hipp AL, Benítez-Benítez C, Bruederle LP, et al. A tale of worldwide success: behind the scenes of Carex (Cyperaceae) biogeography and diversification. J Sys Evol. 2019:57(6):695–718. 10.1111/jse.12549. [DOI] [Google Scholar]
- Nawrocki EP, Eddy SR. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013:29(22):2933–2935. 10.1093/bioinformatics/btt509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ning Y, Dong SB, Zhao YX, Pan Z, Ma H. The complete chloroplast genome sequence of Bolboschoenus planiculmis (Cyperaceae). Mitochondrial DNA B Resour. 2022a:7(1):241–242. 10.1080/23802359.2021.2024773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ning Y, Li Y, Dong SB, Yang HG, Li CY, Xiong B, Yang J, Hu YK, Mu XY, Xia XF. The chromosome-scale genome of Kobresia myosuroides sheds light on karyotype evolution and recent diversification of a dominant herb group on the Qinghai–Tibet Plateau. DNA Res. 2022b:30(1):dsac049. 10.1093/dnares/dsac049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ning Y, Zhang ZX, Cui LJ, Zou CL. Adaptive significance of and factors affecting plasticity of biomass allocation and rhizome morphology: a case study of the clonal plant Scirpus planiculmis (Cyperaceae). Pol J Eco. 2014:62:77–88. 10.3161/104.062.0108. [DOI] [Google Scholar]
- Ou S, Jiang N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 2017:176(2):1410–1422. 10.1104/pp.17.01310 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peng Y, Zhou C, Jin Q, Ji M, Wang F, Lai Q, Shi R, Xu X, Chen L, Wang G. Tidal variation and litter decomposition co-affect carbon emissions in estuarine wetlands. Sci Total Environ. 2022:839:156357. 10.1016/j.scitotenv.2022.156357. [DOI] [PubMed] [Google Scholar]
- Píšová S, Hroudová Z, Chumová Z, Fér T. Ecological hybrid speciation in central-European species of Bolboschoenus. Preslia. 2017:89(1):17–39. 10.23855/preslia.2017.017. [DOI] [Google Scholar]
- Planta J, Liang YY, Xin H, Chansler MT, Prather LA, Jiang N, Jiang J, Childs KL. Chromosome-scale genome assemblies and annotations for Poales species Carex cristatella, Carex scoparia, Juncus effusus, and Juncus inflexus. G3 (Bethesda). 2022:12(10):jkac211. 10.1093/g3journal/jkac211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ranallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020:11(1):1–10. 10.1038/s41467-020-14998-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roalson EH. A synopsis of chromosome number variation in the Cyperaceae. Bot Rev. 2008:74(2):209–393. 10.1007/s12229-008-9011-y. [DOI] [Google Scholar]
- Servant N, Varoquaux N, Lajoie BR, Viara E, Chen CJ, Vert JP, Heard E, Dekker J, Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015:16(1):1–11. 10.1186/s13059-015-0831-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun P, Jiao B, Yang Y, Shan L, Li T, Li X, Xi Z, Wang X, Liu J. WGDI: a user-friendly toolkit for evolutionary analyses of whole-genome duplications and ancestral karyotypes. Mol Plant. 2022:15(12):1841–1851. 10.1016/j.molp.2022.10.018. [DOI] [PubMed] [Google Scholar]
- Tarailo-Graovac M, Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics. 2009:25(1):4–10. 10.1002/0471250953.bi0410s25. [DOI] [PubMed] [Google Scholar]
- Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014:9(11):e112963. 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. 2017:35(3):543–548. 10.1093/molbev/msx319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xie T, Zheng JF, Liu S, Peng C, Zhou YM, Yang QY, Zhang HY. De novo plant genome assembly based on chromatin interactions: a case study of Arabidopsis thaliana. Mol Plant. 2015:8(3):489–492. 10.1016/j.molp.2014.12.015. [DOI] [PubMed] [Google Scholar]
- Zhao M, Ma J. Co-evolution of plant LTR-retrotransposons and their host genomes. Protein Cell. 2013:4(7):493–501. 10.1007/s13238-013-3037-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The reported genome assembly is archived in NCBI under accession number PRJNA937020. Biosample information could be accessed at SAMN33386850. The read data involved in this research are deposited at Sequence Read Archive (SRA) with accession numbers SRR23592699 (Illumina short reads) and SRR23592700 (PacBio long reads). The related annotation files could be found at https://figshare.com/projects/genome_annotations_files_for_Bolboschoenus_planiculmis/162637, including coding sequences, protein sequences, and functional annotation in gff3 format.

