Abstract
The enzyme myo-inositol oxygenase (MIOX) catalyzes the myo-inositol into glucuronic acid. In this study, 6 MIOX genes were identified from all of the three diploid cotton species (Gossypium arboretum, Gossypium herbaceum and Gossypium raimondii) and Gossypioides kirkii, 12 MIOX genes were identified from two domesticated tetraploid cottons Gossypium hirsutum, Gossypium barbadense, and 11 MIOX genes were identified from three wild tetraploid cottons Gossypium tomentosum, Gossypium mustelinum and Gossypium darwinii. The number of MIOX genes in tetraploid cotton genome is roughly twice that of diploid cotton genome. Members of MIOX family were classified into six groups based on the phylogenetic analysis. Integrated analysis of collinearity events and chromosome locations suggested that both whole genome duplication and segmental duplication events contributed to the expansion of MIOX genes during cotton evolution. The ratios of non-synonymous (Ka) and synonymous (Ks) substitution rates revealed that purifying selection was the main force driving the evolution of MIOX genes. Numerous cis-acting elements related to light responsive element, defense and stress responsive element were identified in the promoter of the MIOX genes. Expression analyses of MIOX genes based on RNA-seq data and quantitative real time PCR showed that MIOX genes within the same group shared similar expression patterns with each other. All of these results provide the foundation for further study of the biological functions of MIOX genes in cotton environmental adaptability.
Background
Myo-inositol (MI) is a small molecule that is important in many different developmental and physiological processes in eukaryotic cells [1]. MI participates in the production of stress-related molecules, cell wall biosynthesis, phytic acid biosynthesis, auxin transport and storage [2]. The Myo-inositol oxidation pathway (MIOP) effectively control MI homeostasis. In this pathway, Myo-inositol oxygenase (MIOX, E.C. 1.13.99.1) is a key enzyme that catalyzes the conversion of MI into D-glucuronic acid (D-GlcUA), which is subsequently activated to UDP-D-glucuronic acid (UDP-GlcA) and serves as an important precursor for plant cell wall polysaccharides [3, 4]. Furthermore, previous reports suggested that MIOX might also be involved in the production of ascorbate, consequently, protection from ROS-mediated injury [5]. Studies have shown that MIOX plays a role in the response to environmental stresses. In Arabidopsis, MIOX was encoded by a gene family of four members, it has been reported that AtMIOX4 over expression can increase the tolerance to cold, heat and salt [3, 6–8]. In addition, it has also been reported that rice MIOX has a specific function in drought stress tolerance by decreasing oxidative damage [9].
Cotton (Gossypium) is one of the most economically important fiber crop plants throughout the world. It is a remarkably diverse genus, with over 50 species divided into eight diploid genomic groups, designated A-G and K, and one tetraploid genomic group, namely AD [10]. Divergence analysis suggests that the major diploid branches of the cotton genus diverged about 7~11 million years ago, and the polyploid clade originated circa 1~2 million years ago [11–14]. Due to the rapid development of next generation sequencing, the whole genomes have been reported for the diploid D-genome of G. raimondii [15], A-genome of G. arboretum [16], G. herbaceum [17], and allopolyploid AD genome species of G. hirsutum [18], G. barbadense [18], G. darwinii, G. mustelinum and G. tomentosum [19]. The sequence of these genomes provides useful resources for studying the gene families and a series of families have been genome-wide analyzed in cotton [20, 21].
Although cotton has better abiotic stress tolerance compared with other crops, its yield and fiber quality under saline-alkali, drought and heat stress are still under great threat. Considering the role of MIOX gene in various abiotic stress responses in plants and there is no genome-wide identification and characterization of MIOX gene family members in cotton. We conducted a systematic analysis of the cotton MIOX gene family by a genome-wide search, including the phylogeny evolutionary relationships, group classification, conserved motifs and the expression profile etc. Our results will provide good foundation for understanding the key roles of MIOX genes in cotton stress-responsive and other biological processes.
Results
Identification of the MIOX gene family
A total of 81 MIOX proteins were identified from nine species, including 75 MIOX proteins in eight Gossypium genomes and 6 MIOX proteins in the Gossypioides kirkii genome. Except for one MIOX protein from G. herbaceum has significantly more amino acids (896 aa) and two MIOX proteins from G. barbadense has significantly less amino acids (101 aa and 113 aa), the others are relatively conservative (S1 Table). Furthermore, we found that all of the three diploid Gossypium species and the Gossypioides kirkii contained 6 MIOX proteins, the number of MIOX proteins in tetraploid cotton was roughly twice that of diploid cotton, but the three wild tetraploid cottons (G. tomentosum, G. mustelinum and G. darwinii) missed a MIOX protein.
For convenience, we renamed the MIOX gene family. Gh, Gb, Ga, Gher, Gr, Gd, Gt, Gm, and Gkk were used as prefixes before the names of MIOX genes from G. hirsutum, G. barbadense, G. arboreum, G. herbaceum, G. raimondii, G. darwinii, G. tomentosum, G. mustelinum and Gossypioides kirkii respectively. We assigned the names to MIOX proteins as (GhMIOX01- GhMIOX12) for G. hirsutum, (GbMIOX01—GbMIOX12) for G. barbadense and so on. The new names of the sequence are only used in this study.
Phylogenetic analysis of the MIOX gene family
To reveal the evolutionary relationship of identified MIOX proteins, the amino acid sequences of 75 proteins from Gossypium species, 6 proteins from Gossypioides kirkii and 4 proteins from Arabidopsis thaliana were used to construct a phylogenetic tree (Fig 1, S1 Fig). According to the result, 81 MIOX proteins from 8 Gossypium species and Gossypioides kirkii were divided into six groups of A-F, and the number of MIOX proteins was very stable in each group, diploid species containing 6 MIOX proteins had one distribution in each group, while tetraploid cottons containing 11 or 12 MIOX proteins had one or two distribution in each group. All of these genes showed one-to-one homology relationship among different genomes or subgenomes. Compared to the A~F groups, the group G~I only contained A. thaliana MIOX proteins.
Gene structure and protein motifs of the MIOX gene family
To explore the structural diversity of MIOX genes, the intron-exon organization of each MIOX gene was analyzed (Fig 2). The number of introns ranged from 3 to 21, most (53/81) MIOX genes contained 9 or 10 introns. In the same group, the intron numbers exhibited comparable exon-intron structure and intron numbers, while many Gossypioides kirkii MIOX genes were different from cotton, implying important evidence for the phylogenetic relationship among members of the MIOX gene family.
Furthermore, we investigated the conserved motifs in MIOX proteins to understand the diversity of motif compositions (Fig 2). A total of 13 motifs, named Motif 1~Motif 13, were identified in MIOX proteins. The number of motifs varied from 3 to 12 in each MIOX protein and most MIOX proteins within the same group exhibited similar motif composition and arrangement, which indicates that the members of MIOX gene family that clustered in the same group may have similar biological functions. Motif 5 was found in 78 MIOX proteins, Motif 8 was completely missing only in group B, and Motif 13 was specific to group C. The gene structure and motif composition of the MIOX members from each group that obtained from phylogenetic analysis were similar, which indicates the classification was reliable.
Chromosomal location and gene duplications of the MIOX gene family
To more intuitively understand the distribution of MIOX family genes on the chromosomes, we performed a chromosomal localization analysis. The result shows that MIOX genes were mapped onto 5/10 different chromosomes of diploid/tetraploid cotton. Each chromosome contained only one MIOX gene usually, while some chromosome contained 2 MIOX genes (Fig 3). In addition, the distribution of Gossypioides kirkii MIOX genes showed similar to Gossypium.
Whole genome duplication, segmental duplication and tandem duplication provides major forces that drive the expansion of gene families. The number of MIOX genes in tetraploid Gossypium species was twice as much as that in diploid Gossypium species which indicates the expansion of MIOX gene family during polyploidization. We searched the segmental duplication using MCScanX within each genome, and identified 143 collinear genes pairs among the 72 MIOX genes. Of the 75 MIOX genes in eight Gossypium genomes, only 3 were located outside of the duplicated blocks, while 96% (72 of 75) were located in duplicated regions. In addition, 60, 38 duplication gene-pairs were found between diploid G. arboretum A-genome and tetraploid G. hirsutum, G. barbadense A-subgenome respectively. 50, 17 duplication gene-pairs were found between diploid G. raimondii D-genome and tetraploid G. hirsutum, G. barbadense D-subgenome respectively (Fig 4).
According to Holub’s description, a chromosomal region within 200 kb containing two or more genes was defined as a tandem duplication event [22]. Our results indicated that the MIOX genes of the nine species have no tandem duplication. Furthermore, the role of transposable elements also play an important role in the expansion of many genes in many plants [23–25]. However, our results show that none of MIOX gene was amplified by transposable elements.
Selection pressure of the MIOX gene family
To further elucidate the selection pressure of functional diversification of each MIOX group, we have comprehensively evaluated the non-synonymous (Ka) and synonymous (Ks) substitution rates (Fig 5). Generally, the value of Ka/Ks represents the ratio between Ka and Ks of two homologous protein-coding genes. Ka/Ks > 1 indicates that a gene has been positively selected, while a Ka/Ks = 1 indicates that a gene has been neutrally selected, and a Ka/Ks < 1 indicates that a gene has been selectively purified. Our results showed that the median Ka/Ks ratios of the 6 groups ranged from 0.164324 (Group E) to 0.500071 (Group A), and except for very few Ka/Ks ratios >1, majority of Ka/Ks ratios were <1. These results indicated that MIOX gene family was under purifying selection and might play specialized roles in the adaptive evolution of cotton.
Cis-elements in the promoter of MIOX genes
Cis-elements in promoters play vital roles in regulating the expression of genes. To gain more insight into the functions of MIOX genes, the cis-regulatory elements were scanned in the 2000 bp upstream of the transcription start sites of cotton MIOX genes (Fig 6, S2 Table, S2 Fig). The results showed that there were many kinds of response elements, such as light responsive element, defense and stress responsive element involved in drought and low temperature, and hormone responsive element associated with salicylic acid, abscisic acid, gibberellin and MeJA. All of the cotton MIOX genes contained more than one light responsive element, however only 29.33% (22/75) of the cotton MIOX genes contained auxins responsive element.
The cis-elements of MIOX genes in the same phylogenetic group were similar. In addition, within the same group, half of the cis-elements of MIOX genes in tetraploid Gossypium species are similar to the diploid A genome species (G. herbaceum and G. arboretum), and half are similar to the D genome species (G. raimondii). These results further indicated the expansion of MIOX gene family during polyploidization.
Tissue-specific expression profiles of MIOX genes
To study the tissue-specific expression patterns of the MIOX genes, we analyzed the expression profiles of the MIOX genes in different tissues. As shown in Fig 7, MIOX genes showed different expression levels in different tissues. In G. hirsutum, the expression of GhMIOX02 and GhMIOX08 were higher in leaf, torus, pistil, bract and sepal, while their expression was lower in root. The expression of GhMIOX04 and GhMIOX10 was higher in root, leaf, torus and sepal, while their expression was lower in pistil and bract. In G. barbadense, GbMIOX02, GbMIOX08, GbMIOX04 and GbMIOX10 had similar expression profiles with G. hirsutum. In addition, GhMIOX03 of G. hirsutum showed higher transcription level only in root and stem, however, GbMIOX03 of G. barbadense showed higher transcription levels in root, stem and pistil.
The expression levels of MIOX genes at ovule and fiber developmental stages were also investigated (Fig 7). At most stages of development, the expression of GhMIOX08 from G. hirsutum, GbMIOX08 from G. barbadense were higher than that of the other genes; Several genes were expressed at high levels during specific developmental stages; for instance, GhMIOX03, GhMIOX09, GhMIOX10 from G. hirsutum and GbMIOX03, GbMIOX09, GbMIOX10 from G. barbadense. In contrast, the RNA transcript levels of GhMIOX05, GhMIOX06, GhMIOX11 from G. hirsutum and GbMIOX01, GbMIOX05, GbMIOX06, GbMIOX11 from G. barbadense were low at all stages and all tissues. These findings indicated the MIOX genes play differential roles in tissue development.
Stress-induced expression patterns of MIOX genes
The expression patterns of MIOX genes were further analyzed in G. hirsutum and G. barbadense exposed to different durations of cold, heat, salt, and drought stresses for different times by RNA-seq data downloaded from the public database (Fig 8). Based on the clustering analysis, MIOX genes in the same group had similar expression patterns. GhMIOX02, GhMIOX08, GbMIOX02 and GbMIOX08 in group D showed higher expression levels in all of these stresses, while GhMIOX04, GhMIOX10, GbMIOX04 and GbMIOX10 in Group E and GhMIOX03, GhMIOX09, GbMIOX03 and GbMIOX09 in Group C had different expression patterns under different conditions. In addition, the genes in other groups were not significantly responded to these stresses. These findings suggested the functional divergence of the MIOX genes in abiotic stress.
Expression analysis of MIOX genes under cold, PEG, and NaCl treatments
We analyzed the expression patterns of the G. hirsutum MIOX genes after0, 1, 3, 6, 12, and 24h of cold (4°C), PEG, and NaCl treatment, respectively. The results showed that most of the MIOX genes responded to the different stress treatments (Fig 9). The upregulation (increased 3.9 ~ 5.3 times) of GhMIOX03, GhMIOX09 and GhMIOX04 reached their peaks after 24h treatment of Cold, NaCl and PEG respectively. The expression levels of GhMIOX03, GhMIOX08 and GhMIOX12 decreased after 1h of the PEG treatment. GhMIOX02, GhMIOX12 under NaCl treatment and GhMIOX09 under Cold, PEG conditions showed a down-up-down-regulation trend, and the fluctuating trend between the different genes was significantly different. In addition, some MIOX genes only showed slightly change, for example, GhMIOX02 response to cold and GhMIOX10 response to PEG. Therefore, the G. hirsutum MIOX genes are widely involved in abiotic stress responses.
Discussion
MI is a crucial substance in various aspects of plant physiology. Plants maintain an MI pool at a basal level throughout their life cycle, and MIOX is used to control the metabolite level of MI in plants [1, 2, 5, 8]. MIOX proteins are conserved and present in nearly all eukaryotes. MIOX is crucial in abiotic stress tolerance and previous studies on the MIOX gene family have been performed in many plants [2, 5], including Arabidopsis thaliana [3, 4], Oryza sativa [9] and Solanum lycopersicum [26]. In this study, we performed a comprehensive identification of MIOX genes in 5 tetraploid cotton species, 3 diploid cotton species and a cotton closely related species Gossypioides kirkii, with an aim of understanding the important and diverse roles of this gene family in abiotic stress tolerance in plants.
The number of MIOX genes in many species was small, for example, 4 MIOX genes were identified from Arabidopsis thaliana, 1 MIOX gene from Oryza sativa and 5 MIOX genes from Solanum lycopersicum. In addition, the number of MIOX genes in different species of Gossypium is also small. Our results indicated that no MIOX gene was amplified by tandem duplication and transposable elements, which suggested that the number of MIOX genes is relatively stable in the process of evolution. The number of MIOX genes roughly doubled during Gossypium polyploidization, our results further indicated that the amplified pair belong to the same group and maintained similar gene structure, motif composition and expression profile.
An analysis of gene expression patterns can be used to some extent to predict the molecular functions of genes involved in different processes. Our heatmap data showed that most MIOX genes within each group shared similar expression patterns. The MIOX genes in Group D in G. hirsutum (GhMIOX02 and GhMIOX08) and G. barbadense (GbMIOX02, GbMIOX08) were expressed in leaf, torus, pistil, bract, speal and responded to cold, heat, salt, and drought stresses. In contrast, the expression of MIOX genes in Group F in G. hirsutum (GhMIOX05, GhMIOX11) and G. barbadense (GbMIOX05, GbMIOX11) were barely observable.
Conclusions
We performed a genome-wide analysis of the MIOX gene family in 8 cotton species. The MIOX family was divided into 6 groups based on the phylogenetic tree. To understand the expansion mechanism of MIOX family, the gene duplication events was investigated, and the result indicated that segmental duplication and whole genome duplication were the major driving forces of MIOX family expansion, no tandem duplication and transposable element amplification was found. The expression profiles of G. hirsutum and G. barbadense MIOX genes in different tissues and under abiotic stresses were also analyzed. The results indicated that MIOX genes within the same group shared similar expression pattern and they exhibited different expression levels in different groups. Furthermore, the Ka/Ks ratio suggesting that all groups experienced purifying selection pressure. Our results will provide clues for researchers regarding the evolution and biological functions of MIOXs.
Methods
Identification of MIOX genes in cotton
The genome sequences of G. darwinii (AD5 HGS_v1.1), G. mustelinum (AD4 JGI_v1.1) and G. tomentosum (AD3 HGS_v1.1) [19] were downloaded from NCBI (https://www.ncbi.nlm.nih.gov/genome/), the genome sequences of G. hirsutum (TM-1 HAU_v1.1) [18], G. barbadense (HAU_v2_a1) [18], G. arboretum(A2 CRI-updated_v1) [16], G. herbaceum(A1 WHU_v1) [17], G. raimondii(D5 NSF_v1) [15] and a cotton closely related species Gossypioides kirkii (ISU_v3_a3) [27] were downloaded from COTTONGEN (http://www.cottongen.org). Arabidopsis MIOX proteins were used to search possible cotton MIOX sequences by BLASTP with an E-value≤1.0E−3 [28, 29]. The HMMER [30] program was used to identify MIOX proteins with the Hidden Markov Model of the MIOX domain (PF05153), which was downloaded from the Pfam database (http://pfam.xfam.org). Furthermore, NCBI CD-Search (https://www.ncbi.nlm.nih.gov/cdd/) and Search Pfam tools (http://pfam.xfam.org/search) were used to confirm the candidate sequences. The number of amino acids, molecular weights and theoretical isoelectric points of MIOX proteins were calculated using the ExPASy online server tool (https://www.expasy.org/).
Phylogenetic, gene structure, conserved motif and cis-elements analysis of MIOXs
The ClustalX was used to align MIOX protein sequences. A neighbor-joining tree of MIOX proteins was constructed using MEGA-X with 1000 bootstrap replications [31]. The phylogenetic tree was drawn using EvolView [32]. The structure of MIOX genes were analyzed with TBtools [33]. The conserved motifs of MIOX protein sequences were obtained using MEME [34]. To investigate putative cis–acting regulatory elements of MIOX genes, 2000 base pair genomic DNA sequences upstream of the initiation codon were retrieved and screened against the PlantCARE database [35].
Chromosomal distribution and gene duplication of MIOX genes
The chromosome distributions of MIOX genes were extracted from the GFF files and the Mapchart [36] software was used to visually map the chromosomal location. We made use of Multiple collinear scanning toolkits (MCScanX) [37] to detect the gene duplication events and tandem duplications were identified as previously described. The synonymous (Ks) and nonsynonymous (Ka) substitution rates of MIOX genes were calculated by KaKs_Calculator [38].
Transposable elements expansion analysis of MIOX genes
To investigate whether transposable elements played roles in expansion of the MIOX genes, LTR_retriever [39], LTRharvest [40], LTR_FINDER [41], RepeatModeler and RepeatMasker [42] were used to identify transposable elements. We then compared those results with the genome annotation to predict genes inside transposable elements in the 10 genomes.
Expression patterns of MIOX genes
To study the expression patterns of MIOX genes, G. hirsutum and G. barbadense high-throughput transcriptome sequencing data including various tissues, developmental stages and stress treatments were downloaded from the NCBI SRA (PRJNA490626). Trimmomatic [43] was used to remove the adapters and to perform quality control. The program hisat2 [44] was used to map the reads to the genomes, then Fragments Per Kilobase of transcript per Million fragments (FPKM) values of the MIOX genes were calculated by Cufflinks [45, 46]. Heat maps of gene expression profiles were drawn by TBtools [33].
RNA isolation and real-time PCR analysis of MIOX genes
Total RNA was extracted by using an EASYspin Plus Plant RNA Rapid Extraction Kit (Aidlab Biotech) and RNA integrity and concentration are measured using NanoDrop2000. The first cDNA strand was synthesized with 1 ng total RNA by using a TransScript II All-in-One First-Strand cDNA Synthesis SuperMix for qPCR (TransGen Biotech) based on the manufacturer’s instructions. The qRT-PCR reactions were performed on the ABI 7500 Fast (Applied Biosystems, Foster City, CA, USA) using TransStart Top Green qPCR SuperMix (TransGen Biotech) with specific primers designed by NCBI-Primer-BLAST (Table 1). Three technical replicates were used for each sample. The relative gene expression was calculated with the 2−ΔΔCt method [47].
Table 1. The specific primers for qRT-PCR of G. hirsutum MIOX genes.
Gene name | Forward Primer sequences (5’-3’) | Reverse Primer sequences (5’-3’) |
---|---|---|
GhMIOX02 | CTTGCTGCAAACAGCTGAGG | GCTCCCCAAAACCAGGATGA |
GhMIOX03 | CGAGAAGCCTGAGCTAGTGT | CCACGCTCTTTTGCCTTTCA |
GhMIOX04 | TCCTCATTGATCAACCTGATTTTGG | CACCCTGTTGCCTCTCACTT |
GhMIOX08 | CCTCAGTGGGCTGTTGTAGG | TCTCTGTTCTCCAACAGAAGAAA |
GhMIOX09 | CCTTCAGCAGGGCTGTTCAT | GCCACCTTGCTCTTGCTGTA |
GhMIOX10 | CTTGCTGCAAACAGCTGAGG | CCTACAACAGCCCACTGAGG |
GhMIOX12 | CCCATTGCACAAGCATGGAG | CGTGAGCTTTGCTCTTGCTG |
Supporting information
Acknowledgments
We thank all colleagues in our laboratory for helpful discussions and technical assistance.
Data Availability
All relevant data are within the paper and its Supporting information files.
Funding Statement
This work is partially support by College Student Innovation and Entrepreneurship Training Program in Henan Province(S202011330003), China Postdoctoral Science Foundation(2018M632761), the National Natural Science Foundation of China (No. 31471548), the National Key Research and Development Program of China (No. 2018YFD0100302) and the Science and Technology Development Project of Anyang City (2018-66-117). The funders had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
References
- 1.Ye W, Ren W, Kong L, et al. Transcriptomic Profiling Analysis of Arabidopsis thaliana Treated with Exogenous Myo-Inositol. Plos One. 2016; 11(9): e161949. doi: 10.1371/journal.pone.0161949 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Loewus FA, Murthy PPN. myo-Inositol metabolism in plants. Plant science (Limerick). 2000; 150(1): 1–19. [Google Scholar]
- 3.Alford S. myo-Inositol oxygenase is required for responses to low energy conditions in Arabidopsis thaliana. Front Plant Sci. 2012; 3. doi: 10.3389/fpls.2012.00069 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Endres S, Tenhaken R. Down-regulation of the myo-inositol oxygenase gene family has no effect on cell wall composition in Arabidopsis. Planta. 2011; 234(1): 157–69. doi: 10.1007/s00425-011-1394-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Valluru R, Van den Ende W. Myo-inositol and beyond-Emerging networks under stress. tPlant Science. 2011; 181(4): 387–400. doi: 10.1016/j.plantsci.2011.07.009 [DOI] [PubMed] [Google Scholar]
- 6.Nepal N, Yactayo-Chang JP, Medina-Jiménez K, et al. Mechanisms underlying the enhanced biomass and abiotic stress tolerance phenotype of an Arabidopsis MIOX over-expresser. Plant Direct. 2019; 3(9): e165. doi: 10.1002/pld3.165 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chen C, Sun X, Duanmu H, et al. Ectopic Expression of a Glycine soja myo-Inositol Oxygenase Gene (GsMIOX1a) in Arabidopsis Enhances Tolerance to Alkaline Stress. PLOS ONE. 2015; 10(6): e129998. doi: 10.1371/journal.pone.0129998 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Endres S, Tenhaken R. Myoinositol Oxygenase Controls the Level of Myoinositol in Arabidopsis, but does not increase ascorbic acid. Plant Physiology. 2009; 149(2): 1042–1049. doi: 10.1104/pp.108.130948 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Duan J, Zhang M, Zhang H, et al. OsMIOX, a myo-inositol oxygenase gene, improves drought tolerance through scavenging of reactive oxygen species in rice (Oryza sativa L.). Plant Science. 2012; 196: 143–151. doi: 10.1016/j.plantsci.2012.08.003 [DOI] [PubMed] [Google Scholar]
- 10.Wendel JF, Cronn RC. Polyploidy and the evolutionary history of cotton. Advances in Agronomy. 2003. [Google Scholar]
- 11.Chen Z, Feng K, Grover CE, et al. Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species. PLoS One. 2016; 11(6): e157183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chen Z, Grover CE, Li P, et al. Molecular evolution of the plastid genome during diversification of the cotton genus. Mol Phylogenet Evol. 2017; 112: 268–276. doi: 10.1016/j.ympev.2017.04.014 [DOI] [PubMed] [Google Scholar]
- 13.Wendel JF. New World tetraploid cottons contain Old World cytoplasm. Proc Natl Acad Sci U S A. 1989; 86(11): 4132–4136. doi: 10.1073/pnas.86.11.4132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Senchina DS, Alvarez I, Cronn RC, et al. Rate variation among nuclear genes and the age of polyploidy in Gossypium. Mol Biol Evol. 2003; 20(4): 633–643. doi: 10.1093/molbev/msg065 [DOI] [PubMed] [Google Scholar]
- 15.Udall JA, Long E, Hanson C, et al. De Novo Genome Sequence Assemblies of Gossypium raimondii and Gossypium turneri. G3 (Bethesda). 2019; 9(10): 3079–3085. doi: 10.1534/g3.119.400392 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Du X, Huang G, He S, et al. Resequencing of 243 diploid cotton accessions based on an updated A genome identifies the genetic basis of key agronomic traits. Nat Genet. 2018; 50(6): 796–802. doi: 10.1038/s41588-018-0116-x [DOI] [PubMed] [Google Scholar]
- 17.Huang G, Wu Z, Percy RG, et al. Genome sequence of Gossypium herbaceum and genome updates of Gossypium arboreum and Gossypium hirsutum provide insights into cotton A-genome evolution. Nat Genet. 2020. doi: 10.1038/s41588-020-0607-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wang M, Tu L, Yuan D, et al. Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense. Nat Genet. 2019; 51(2): 224–229. doi: 10.1038/s41588-018-0282-x [DOI] [PubMed] [Google Scholar]
- 19.Chen ZJ, Sreedasyam A, Ando A, et al. Genomic diversifications of five Gossypium allopolyploid species and their impact on cotton improvement. Nat Genet. 2020; 52(5): 525–533. doi: 10.1038/s41588-020-0614-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Li Y, Liu Z, Zhang K, et al. Genome-wide analysis and comparison of the DNA-binding one zinc finger gene family in diploid and tetraploid cotton (Gossypium). PLoS One. 2020; 15(6): e235317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ge Q, Cui Y, Li J, et al. Disequilibrium evolution of the Fructose-1,6-bisphosphatase gene family leads to their functional biodiversity in Gossypium species. BMC Genomics. 2020; 21(1): 379. doi: 10.1186/s12864-020-6773-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Holub EB. The arms race is ancient history in Arabidopsis, the wildflower. Nat Rev Genet. 2001; 2(7): 516–527. doi: 10.1038/35080508 [DOI] [PubMed] [Google Scholar]
- 23.Li M, Wang R, Liu Z, et al. Genome-wide identification and analysis of the WUSCHEL-related homeobox (WOX) gene family in allotetraploid Brassica napus reveals changes in WOX genes during polyploidization. BMC Genomics. 2019; 20(1). doi: 10.1186/s12864-019-5684-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kim S, Park J, Yeom SI, et al. New reference genome sequences of hot pepper reveal the massive evolution of plant disease-resistance genes by retroduplication. Genome Biol. 2017; 18(1): 210. doi: 10.1186/s13059-017-1341-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jiang SY, Ramachandran S. Genome-wide survey and comparative analysis of LTR retrotransposons and their captured genes in rice and sorghum. PLoS One. 2013; 8(7): e71118. doi: 10.1371/journal.pone.0071118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Munir S, Mumtaz MA, Ahiakpa JK, et al. Genome-wide analysis of Myo-inositol oxygenase gene family in tomato reveals their involvement in ascorbic acid accumulation. BMC Genomics. 2020; 21(1): 284. doi: 10.1186/s12864-020-6708-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Udall JA, Long E, Ramaraj T, et al. The Genome Sequence of Gossypioides kirkii Illustrates a Descending Dysploidy in Plants. Front Plant Sci. 2019; 10: 1541. doi: 10.3389/fpls.2019.01541 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Nowicki M, Bzhalava D, BaLa P. Massively Parallel Implementation of Sequence Alignment with Basic Local Alignment Search Tool Using Parallel Computing in Java Library. J Comput Biol. 2018; 25(8): 871–881. doi: 10.1089/cmb.2018.0079 [DOI] [PubMed] [Google Scholar]
- 29.Matsuda F, Tsugawa H, Fukusaki E. Method for assessing the statistical significance of mass spectral similarities using basic local alignment search tool statistics. Anal Chem. 2013; 85(17): 8291–8297. doi: 10.1021/ac401564v [DOI] [PubMed] [Google Scholar]
- 30.Finn RD, Clements J, Eddy SR. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 2011; 39(Web Server issue): W29–W37. doi: 10.1093/nar/gkr367 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hall BG. Building phylogenetic trees from molecular data with MEGA. Mol Biol Evol. 2013; 30(5): 1229–1235. doi: 10.1093/molbev/mst012 [DOI] [PubMed] [Google Scholar]
- 32.He Z, Zhang H, Gao S, et al. Evolview v2: an online visualization and management tool for customized and annotated phylogenetic trees. Nucleic Acids Res. 2016; 44(W1): W236–W241. doi: 10.1093/nar/gkw370 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chen C, Chen H, Zhang Y, et al. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol Plant. 2020; 13(8): 1194–1202. doi: 10.1016/j.molp.2020.06.009 [DOI] [PubMed] [Google Scholar]
- 34.Bailey TL, Boden M, Buske FA, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 2009; 37(Web Server issue): W202–W208. doi: 10.1093/nar/gkp335 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lescot M, Dehais P, Thijs G, et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002; 30(1): 325–327. doi: 10.1093/nar/30.1.325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2002; 93(1): 77–78. doi: 10.1093/jhered/93.1.77 [DOI] [PubMed] [Google Scholar]
- 37.Wang Y, Tang H, Debarry JD, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012; 40(7): e49. doi: 10.1093/nar/gkr1293 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Wang D, Zhang Y, Zhang Z, et al. KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies. Genomics Proteomics Bioinformatics. 2010; 8(1): 77–80. doi: 10.1016/S1672-0229(10)60008-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ou S, Jiang N. LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons. Plant Physiol. 2018; 176(2): 1410–1422. doi: 10.1104/pp.17.01310 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ellinghaus D, Kurtz S, Willhoeft U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics. 2008; 9(1): 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Xu Z, Wang H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res. 2007; 35(Web Server issue): W265–W268. doi: 10.1093/nar/gkm286 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Tempel S. Using and understanding RepeatMasker. Methods Mol Biol. 2012; 859: 29–51. doi: 10.1007/978-1-61779-603-6_2 [DOI] [PubMed] [Google Scholar]
- 43.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30(15): 2114–2120. doi: 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015; 12(4): 357–360. doi: 10.1038/nmeth.3317 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ghosh S, Chan CK. Analysis of RNA-Seq Data Using TopHat and Cufflinks. Methods Mol Biol. 2016; 1374: 339–361. doi: 10.1007/978-1-4939-3167-5_18 [DOI] [PubMed] [Google Scholar]
- 46.Pollier J, Rombauts S, Goossens A. Analysis of RNA-Seq data with TopHat and Cufflinks for genome-wide expression analysis of jasmonate-treated plants and plant cultures. Methods Mol Biol. 2013; 1011: 305–315. doi: 10.1007/978-1-62703-414-2_24 [DOI] [PubMed] [Google Scholar]
- 47.Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 22DDCTMethod[Z]. [DOI] [PubMed]