Abstract
Aquaporins (AQPs) are known to have a vital role in water transport in all living organisms including agriculturally important crops, but a comprehensive genomic study of AQPs in pepper has not been implemented. Here, we updated previous gene annotations and generated a total of 259 AQP genes from five plants, including pepper. Phylogenetic and motif analyses revealed that a large proportion of pepper AQP genes belong to the specific subgroup of tonoplast intrinsic protein (TIP) subfamily, TIP4. Chromosomal localization and estimated duplication times illustrated that genes in TIP4 formed a tandem array on the short arm of chromosome 1, resulting from pepper-specific expansion after its divergence with Solanaceae species. Transcriptome analyses under various abiotic stress conditions revealed that transport-, photosystem-, and thylakoid-related genes were generally enriched in expression clusters containing AQP genes in pepper. These results provide valuable genomic resources and insight into the evolutionary mechanism that generate genomic diversity of the AQP gene family in pepper.
Keywords: aquaporin, water transport, re-annotation, pepper, abiotic stress
1. Introduction
Aquaporins (AQPs) are small transmembrane proteins (21–34 KDa) from a large family of major intrinsic proteins (MIPs) that facilitate the movement of water, glycerol, and small molecules such as carbon dioxide (CO2) and hydrogen peroxide (H2O2) through biological membranes [1,2]. In plants, the AQP gene family is responsible for the selective transport of water and regulates the response to abiotic stresses such as drought and saline conditions [3]. For example, AtPIP2;2 transports water in Arabidopsis [4], and SlTIP2;2 increases the transpiration of tomato under drought and salt stresses [5]. Aquaporins typically have six transmembrane α-helices (H1 to H6) that are connected by five spiral loops (LA-LE). Loops B and E have two conserved asparagine–proline–alanine (NPA) motifs. Groups of amino acid residues, two groups in helices 2 and 5 (H2 and H5) and two groups in loop E (LE1 and LE2), form the aromatic/Arg (ar/R) selectivity filter. The AQP gene family in plants is classified into five major subfamilies: plasma membrane intrinsic proteins (PIPs), tonoplast intrinsic proteins (TIPs), NOD26-like intrinsic proteins (NIPs), small basic intrinsic proteins (SIPs), and uncategorized X intrinsic proteins (XIPs) [6].
Hot pepper (Capsicum annuum) is an economically important crop belonging to the Solanaceae family with tomato and potato. It is not only a source of the popular spice capsaicin but is of nutritional value as a source of vitamins, minerals, and phenolic compounds [7,8,9]. According to the United Nations Food and Agricultural Organization (FAO, Rome, Italy), world production of hot pepper increased by about 28% over the last decade, and in 2019, the top 10 pepper-producing countries cultivated 32 million tons (http://www.fao.org/home/en/ (accessed on 2 December 2020)). Recently, pepper genome sequencing projects with transcriptome studies have enabled identification and characterization of important gene families related to agricultural traits [8,10,11,12]; however, comprehensive understanding of AQP genes in pepper through integrative genomic and transcriptome studies has not been achieved.
In this study, we updated the annotation of AQP gene family including AQP-like as well as AQP genes in pepper, tomato, potato, rice, and Arabidopsis and performed a comparative analysis. Of the five species we studied, pepper had the most AQP genes, and 42% of these were in the TIP subfamily. Specifically, from phylogenetic analysis, we determined that most of these pepper genes (74%) were in the TIP4 subgroup. We also investigated the AQP domain motif composition of each subfamily of these five species. Motifs 7, 19, and 5, located in downstream positions 11, 12, and 13, respectively, were specific to the TIP4 subgroup of pepper. We demonstrated, with chromosomal localization, that 52% of the pepper AQP genes in the TIP4 subgroup clustered in the short arm of chromosome 1. Most of these genes had been duplicated after the divergence time between pepper and Solanaceae species. Expression clustering and gene ontology (GO) enrichment analyses demonstrated that genes with putative functions related to chloroplast, thylakoid, and transport were enriched in clusters containing the pepper AQP genes, which may be evidence that these AQP genes work with other pepper genes in the plant response to abiotic stresses, especially in the transport system of thylakoid membranes.
We anticipate that these updated AQP gene models will provide novel genomic resources to horticulturalists and enrich understanding of recent evolutionary divergence in pepper AQP genes and the complex transcriptional network that operates in abiotic stresses.
2. Results
2.1. Re-Annotation and Classification of AQP Genes in Pepper and Other Species
To reduce annotation bias from different annotation methods with resources [13], we performed re-annotation and generated a total of 259 AQP genes containing 29 (11.2%) newly annotated genes in addition to previously annotated genes (Table 1). In the updated AQP genes, we observed 73 pepper AQP genes that were 1.8-fold of AQP genes in rice and Arabidopsis genomes. In addition, numbers of AQP genes in other Solanaceae genomes were relatively higher than those of rice and Arabidopsis (Table 1).
Table 1.
Common Name | Total (Newly Annotated Genes) |
PIPs | TIPs | NIPs | SIPs | XIPs |
---|---|---|---|---|---|---|
Rice | 41 (3) | 13 (-) | 11 (-) | 13 (1) | 2 (-) | 2 (2) |
Arabidopsis | 41 (3) | 14 (1) | 12 (1) | 11 (1) | 4 (-) | - (-) |
Pepper | 73 (10) | 19 (2) | 31 (3) | 16 (3) | 5 (2) | 2 (-) |
Potato | 56 (9) | 19 (3) | 11 (-) | 13 (1) | 3 (-) | 10 (5) |
Tomato | 48 (4) | 15 (2) | 11 (-) | 12 (-) | 3 (-) | 7 (2) |
PIP, plasma membrane intrinsic protein subfamily; TIP, tonoplast intrinsic protein subfamily; NIP, NOD26-like intrinsic protein subfamily; SIP, small basic intrinsic protein subfamily; XIP, uncategorized X intrinsic protein subfamily. The numeric values in parentheses indicate the number of newly annotated genes.
We classified AQP genes into five subfamilies by comparing specific amino acid sequences in ar/R selectivity filter positions among five genomes (Table 2 and Table S2). A large number of the pepper AQP genes belonged to the TIP subfamily, approximately threefold more than in the other species. Specifically, 31 pepper AQP genes (42.5%) were in the TIP subfamily, suggesting that specific AQP genes have been expanded in the pepper genome (Table 1). We also observed many genes belong to the XIP subfamily in potato and tomato genomes, but none in Arabidopsis; this is consistent with a previous report [14,15]. These results demonstrated that the copy numbers of AQP genes in the five plant genomes have changed via proliferation of genes belonging to specific subfamilies in the Solanaceae, especially in pepper.
Table 2.
No. | Gene Name | Locus ID | Subfamily | Subgroup | Position (Amino Acid Residues) | Chromosome/ Scaffold Location |
|||
---|---|---|---|---|---|---|---|---|---|
H2 (37) | H5 (143) | LE1 (152) | LE2 (158) | ||||||
1 | CaPIP1;1 | CA.PGAv.1.6.scaffold890.5 | PIP | PIP1 | F | H | T | R | chr02 |
2 | CaPIP1;2 | CA.PGAv.1.6.scaffold216.2 | PIP | PIP1 | F | H | T | R | chr10 |
3 | CaPIP1;3 | CA.PGAv.1.6.scaffold588.86 | PIP | PIP1 | F | H | T | R | PGAv.1.6.scaffold588 |
4 | CaPIP2;1 | CANN_MIP.chr06.1 | PIP | PIP2 | - | H | T | R | chr06 |
5 | CaPIP2;2 | CA.PGAv.1.6.scaffold920.3 | PIP | PIP2 | F | - | - | - | chr06 |
6 | CaPIP2;3 | CA.PGAv.1.6.scaffold888.8 | PIP | PIP2 | F | H | T | R | chr09 |
7 | CaPIP2;4 | CA.PGAv.1.6.scaffold888.7 | PIP | PIP2 | F | H | T | R | chr09 |
8 | CaPIP2;5 | CA.PGAv.1.6.scaffold888.6 | PIP | PIP2 | F | H | T | R | chr09 |
9 | CaPIP2;6 | CA.PGAv.1.6.scaffold1169.2 | PIP | PIP2 | F | H | T | R | chr10 |
10 | CaPIP2;7 | CA.PGAv.1.6.scaffold631.27 | PIP | PIP2 | F | H | T | R | chr11 |
11 | CaPIP3;1 | CA.PGAv.1.6.scaffold774.55 | PIP | PIP3 | F | H | T | R | chr01 |
12 | CaPIP3;2 | CA.PGAv.1.6.scaffold792.78 | PIP | PIP3 | F | H | T | R | chr01 |
13 | CaPIP3;3 | CA.PGAv.1.6.scaffold613.1 | PIP | PIP3 | L | Y | T | R | chr03 |
14 | CaPIP3;4 | CA.PGAv.1.6.scaffold1184.2 | PIP | PIP3 | F | H | T | R | chr05 |
15 | CaPIP3;5 | CA.PGAv.1.6.scaffold78.85 | PIP | PIP3 | F | H | T | R | chr08 |
16 | CaPIP3;6 | CA.PGAv.1.6.scaffold815.3 | PIP | PIP3 | F | H | T | R | chr12 |
17 | CaPIP3;7 | CA.PGAv.1.6.scaffold855.26 | PIP | PIP3 | F | H | T | R | PGAv.1.6.scaffold855 |
18 | CaPIP3;8 | CA.PGAv.1.6.scaffold855.28 | PIP | PIP3 | - | P | I | G | PGAv.1.6.scaffold855 |
19 | CaPIP3;9 | CANN_MIP.PGAv.1.6.scaffold855.1 | PIP | PIP3 | - | P | I | R | PGAv.1.6.scaffold855 |
20 | CaTIP1;1 | CA.PGAv.1.6.scaffold610.69 | TIP | TIP1 | H | I | A | R | chr11 |
21 | CaTIP2;1 | CA.PGAv.1.6.scaffold1058.46 | TIP | TIP2 | - | I | G | R | chr01 |
22 | CaTIP2;2 | CA.PGAv.1.6.scaffold48.31 | TIP | TIP2 | N | V | G | Y | chr03 |
23 | CaTIP2;3 | CA.PGAv.1.6.scaffold438.40 | TIP | TIP2 | H | I | G | R | chr03 |
24 | CaTIP2;4 | CA.PGAv.1.6.scaffold62.35 | TIP | TIP2 | H | I | G | R | chr06 |
25 | CaTIP2;5 | CA.PGAv.1.6.scaffold1152.13 | TIP | TIP2 | H | I | G | R | chr06 |
26 | CaTIP3;1 | CA.PGAv.1.6.scaffold861.16 | TIP | TIP3 | H | V | A | R | chr03 |
27 | CaTIP3;2 | CA.PGAv.1.6.scaffold65.60 | TIP | TIP3 | H | V | A | R | chr06 |
28 | CaTIP4;1 | CA.PGAv.1.6.scaffold298.3 | TIP | TIP4 | H | T | A | M | chr01 |
29 | CaTIP4;2 | CA.PGAv.1.6.scaffold298.4 | TIP | TIP4 | H | T | A | M | chr01 |
30 | CaTIP4;3 | CA.PGAv.1.6.scaffold298.5 | TIP | TIP4 | - | T | A | M | chr01 |
31 | CaTIP4;4 | CANN_MIP.chr01.3 | TIP | TIP4 | H | - | Q | F | chr01 |
32 | CaTIP4;5 | CA.PGAv.1.6.scaffold298.6 | TIP | TIP4 | M | V | Y | G | chr01 |
33 | CaTIP4;6 | CA.PGAv.1.6.scaffold777.2 | TIP | TIP4 | H | - | - | - | chr01 |
34 | CaTIP4;7 | CA.PGAv.1.6.scaffold777.1 | TIP | TIP4 | H | L | A | M | chr01 |
35 | CaTIP4;8 | CANN_MIP.chr01.1 | TIP | TIP4 | H | T | A | M | chr01 |
36 | CaTIP4;9 | CA.PGAv.1.6.scaffold1381.7 | TIP | TIP4 | H | N | A | M | chr01 |
37 | CaTIP4;10 | CA.PGAv.1.6.scaffold1432.12 | TIP | TIP4 | H | I | A | V | chr06 |
38 | CaTIP4;11 | CA.PGAv.1.6.scaffold394.19 | TIP | TIP4 | H | I | A | V | chr06 |
39 | CaTIP4;12 | CA.PGAv.1.6.scaffold467.1 | TIP | TIP4 | H | I | A | G | chr07 |
40 | CaTIP4;13 | CANN_MIP.chr01.2 | TIP | TIP4 | H | T | A | M | chr01 |
41 | CaTIP4;14 | CA.PGAv.1.6.scaffold488.29 | TIP | TIP4 | H | I | A | V | chr10 |
42 | CaTIP4;15 | CA.PGAv.1.6.scaffold820.9 | TIP | TIP4 | - | I | A | M | chr11 |
43 | CaTIP4;16 | CA.PGAv.1.6.scaffold420.10 | TIP | TIP4 | H | V | A | L | chr11 |
44 | CaTIP4;17 | CA.PGAv.1.6.scaffold1381.8 | TIP | TIP4 | H | T | A | M | chr01 |
45 | CaTIP4;18 | CA.PGAv.1.6.scaffold280.18 | TIP | TIP4 | H | I | A | M | chr11 |
46 | CaTIP4;19 | CA.PGAv.1.6.scaffold1381.9 | TIP | TIP4 | H | T | A | M | chr01 |
47 | CaTIP4;20 | CA.PGAv.1.6.scaffold963.13 | TIP | TIP4 | - | I | A | M | PGAv.1.6.scaffold963 |
48 | CaTIP4;21 | CA.PGAv.1.6.scaffold1331.1 | TIP | TIP4 | H | - | - | G | PGAv.1.6.scaffold1331 |
49 | CaTIP4;22 | CA.PGAv.1.6.scaffold298.1 | TIP | TIP4 | H | T | A | M | chr01 |
50 | CaTIP4;23 | CA.PGAv.1.6.scaffold298.2 | TIP | TIP4 | H | T | A | M | chr01 |
51 | CaNIP1;1 | CA.PGAv.1.6.scaffold38.2 | NIP | NIP1 | A | V | - | - | chr01 |
52 | CaNIP1;2 | CA.PGAv.1.6.scaffold792.10 | NIP | NIP1 | A | I | G | R | chr01 |
53 | CaNIP1;3 | CA.PGAv.1.6.scaffold407.53 | NIP | NIP1 | S | I | A | R | chr03 |
54 | CaNIP3;1 | CA.PGAv.1.6.scaffold1545.1 | NIP | NIP3 | W | V | A | R | chr02 |
55 | CaNIP3;2 | CA.PGAv.1.6.scaffold1067.9 | NIP | NIP3 | W | - | - | - | chr02 |
56 | CaNIP3;3 | CA.PGAv.1.6.scaffold411.30 | NIP | NIP3 | W | V | A | R | chr02 |
57 | CaNIP3;4 | CA.PGAv.1.6.scaffold411.29 | NIP | NIP3 | W | S | A | R | chr02 |
58 | CaNIP3;5 | CA.PGAv.1.6.scaffold625.87 | NIP | NIP3 | W | V | A | R | chr02 |
59 | CaNIP3;6 | CANN_MIP.chr03.1 | NIP | NIP3 | W | - | - | - | chr03 |
60 | CaNIP3;7 | CANN_MIP.chr05.1 | NIP | NIP3 | W | - | - | R | chr05 |
61 | CaNIP3;8 | CA.PGAv.1.6.scaffold65.202 | NIP | NIP3 | W | - | - | K | chr06 |
62 | CaNIP3;9 | CA.PGAv.1.6.scaffold65.203 | NIP | NIP3 | W | I | A | R | chr06 |
63 | CaNIP3;10 | CA.PGAv.1.6.scaffold1298.6 | NIP | NIP3 | - | V | A | R | chr11 |
64 | CaNIP3;11 | CANN_MIP.chr11.1 | NIP | NIP3 | W | D | S | S | chr11 |
65 | CaNIP3;12 | CA.PGAv.1.6.scaffold1122.2 | NIP | NIP3 | W | A | - | K | chr12 |
66 | CaNIP3;13 | CA.PGAv.1.6.scaffold1569.5 | NIP | NIP3 | W | V | A | R | PGAv.1.6.scaffold1569 |
67 | CaSIP1;1 | CANN_MIP.chr01.4 | SIP | SIP1 | - | - | S | S | chr01 |
68 | CaSIP1;2 | CA.PGAv.1.6.scaffold245.6 | SIP | SIP1 | I | - | G | S | chr01 |
69 | CaSIP1;3 | CA.PGAv.1.6.scaffold415.6 | SIP | SIP1 | I | - | P | N | chr03 |
70 | CaSIP1;4 | CANN_MIP.chr07.1 | SIP | SIP1 | - | - | - | - | chr07 |
71 | CaSIP1;5 | CA.PGAv.1.6.scaffold960.54 | SIP | SIP1 | L | - | P | N | PGAv.1.6.scaffold960 |
72 | CaXIP1;1 | CA.PGAv.1.6.scaffold564.27 | XIP | XIP1 | I | T | A | R | chr04 |
73 | CaXIP1;2 | CA.PGAv.1.6.scaffold588.13 | XIP | XIP1 | I | T | A | R | PGAv.1.6.scaffold588 |
We investigated the motif composition of AQP domains to characterize the structural diversity of these domains, which are mostly conserved and essential for transport function of those genes (Figure 1 and Figure S1, Table S3). Overall, motif compositions were similar in each subfamily among the five plant species. Specifically, the motifs at the N- and C-terminal regions were mostly conserved among subfamilies, whereas the central regions, which includes the subfamily-specific motifs with ar/R selectivity filter position, were relatively variable. Thus, it may be that the motifs in the central account for the structural diversity of AQP domains observed among subfamilies (Figure 1 and Figure S1). When we examined the motifs of AQP domains in pepper, we observed the specific motifs (7, 19, 5) were enriched in the TIP subfamily compared to the other species (Figure 1 and Figure S1). This suggests that those motifs contributed to the structural diversity of pepper AQP domains in the TIP subfamily.
2.2. Phylogenetic Analysis of Plant AQP Genes
We next performed a phylogenetic analysis and determined the subgroups (of the subfamilies) to which the AQP genes belonged. The subfamilies PIP, TIP, and NIP were further separated into three, four, and three subgroups, respectively, whereas SIP and XIP remained as single groups (Figure 2A). Notably, a large number of AQP pepper genes in the TIP subfamily were mostly in the TIP4 subgroup—23 genes, representing 74% of the pepper genes in the TIP subfamily (Figure 2B). Examination of the motif composition of AQP domains in TIP subgroups revealed a set of specific motifs, 7, 9, and 15 on downstream regions, were only observed in pepper AQP domains of TIP4 (Figure 2C). These results are consistent with proliferation of specific TIP genes in pepper genome via lineage-specific duplication, which led to expansion of pepper TIP genes (Figure 2A).
2.3. Chromosomal Location and Gene Duplication Analyses of Plant AQP Genes
To identify the location of AQP genes, we mapped their physical positions on the chromosomes of each genome (Figure 3 and Figure S2). Overall, AQP genes were distributed across all chromosomes, with a few exceptions. We mapped 64 AQP genes in pepper, which has 12 chromosomes, and 33% of them mapped to chromosome 1 (Figure 3). More specifically, 12 of the pepper AQP genes in TIP4 consisted of a tandem array on the short arm of chromosome 1 (Figure 3). Considering that there were no TIP4 genes in the syntenic regions of the tomato or potato chromosomes, this suggests that these 12 pepper genes emerged recently as a result of pepper-specific gene duplication. We also noted that seven potato and five tomato AQP genes belonging to the XIP1 subgroup formed clusters on the long arm of chromosome 10, which was specific to the Solanum spp. (Figure S2).
We next explored the duplication history of AQP genes. Gene duplication time was estimated by calculating the synonymous substitution rates (Ks) between recently duplicated gene pairs (Figure 4). Our analysis showed that the pepper AQP genes, especially those in the TIP4 subgroup, were actively duplicated, with a maximum Ks value of approximately 0.15. This also suggests that most of the duplication of the pepper AQP genes occurred after the divergence between pepper and other Solanum species, which have a Ks value of 0.3 [8] (Figure 4B,C). Taken together, these results indicate that recent gene duplications mainly occurred in TIP4 subgroup was a major contributor for construction of diversified AQP gene repertories in the pepper genome through expansion of specific AQP genes.
2.4. Expression Profiles and Clustering Analyses of Pepper AQP Genes from Plants Grown under Abiotic Stress
Plants adapt to stressful environments through various systemic physiological changes. We examined responses to abiotic stresses in pepper at the level of transcription and expression. We first established an expression profile for the AQP genes under cold, heat, salt, and mannitol stresses (Figure S3). Overall, a few genes (14) belonging to the PIP subfamily were highly expressed in all stress conditions, whereas several genes (14) belonging to the TIP subfamily were not, with one exception: CaTIP4;10 was highly expressed in all stress treatments, suggesting that it may have a particular function in various abiotic stresses. To group pepper genes with parallel expression patterns and thus predict genes with similar functions, we then performed expression clustering analyses and identified 18,052 (50%), 17,438 (48.6%), 15,926 (44.3%), and 16,105 (45%) DEGs in plants subjected to cold, heat, salt, and mannitol stress, respectively. Among these DEGs, 36, 42, 35, and 35 were AQP genes in cold, heat, salt, and mannitol, respectively (Figure 5A). The clusters containing the most AQP genes were the cold cluster 5 (36%, 13 genes), the heat-cluster 4 (26%, 11 genes), the salt cluster 6 (29%, 10 genes), and the mannitol cluster 1 (26%, 9 genes) (Figure 5B). Most of the differentially expressed AQPs were in the PIP subfamily, and all gene clusters from plants exposed to heat and mannitol treatments contained PIP genes. The plants subjected to heat stress had 15 genes in the TIP subfamily, more than plants exposed to the other stresses.
Furthermore, functional enrichment analysis, performed to predict the putative gene functions, revealed that genes with chloroplast-, photosystem-, organelle-, and thylakoid-related GO terms were generally enriched in the clusters including AQP genes, suggesting that those genes could be involved in response to various abiotic stresses along with AQP genes (Figure 5C). Specifically, many plants use osmotic adjustment and ion transport through AQP [16] to adapt to stress, and the finding that anion and organic substance transport-related GO terms were enriched in the salt cluster 5 (Figure 5C) supports this suggestion. Moreover, enrichment of GO terms related to the thylakoid membrane in cold cluster 5 were consistent with previous studies, which showed that the TIP genes in Arabidopsis control the volume of the thylakoid lumen during water or light stress, thereby creating an optimal photosynthetic environment [17]. Our results suggest that the pepper AQP genes also have an important role in molecular responses to changing conditions and act in concert with other pepper genes.
3. Discussion
AQP is an important gene family involved in water transport and other small molecules such as CO2 and H2O2 plants, but the complete understanding of the AQP gene family in pepper has not been accomplished. For in-depth studies, an accurate annotation process involving structural annotation to fully predict gene structures containing target domains within the genome sequence is essential to provide a stronger foundation for biological investigation of living organisms [18]. However, reports of missing functional genes [19] or incomplete gene models [13] continue to be registered, generating bias in downstream studies.
In this study, we re-annotated AQP genes in five plant genomes using the target gene family annotation approach [20] to update AQP gene information via detection of previously omitted AQP genes. In general, whole-gene annotation methods, which have been applied to various species including model plants, have pre-processing steps such as repeat-masking to efficiently reduce computing power and manual work [21,22]. However, these steps often generate omission of genes in annotations [13], which are genes that exist in the assembled genome but are obscured by repeat-masking or unidentified due to other problems. Target gene family annotation methods were developed to complement these issues, and we conducted re-annotation using TGFam-Finder, developed to identify all target genes in assembled genome via intensive annotation of genomic regions containing target domain(s) [20].
Prior studies have also reported improvement of annotations of model plants such as Arabidopsis and rice, as well as human genome annotation by identification of missing genes in previous annotations [13,20]. Of the updated genes, we newly identified AQP genes omitted in previous annotations including rice and Arabidopsis and especially in the pepper genome. The updated AQP genes could serve as genomic resources for further comparative and evolutionary analyses on the basis of their accurate copy numbers in the assembled genomes.
Comparison of the five updated AQP gene models revealed that pepper had the highest number of AQP genes. Many previous studies reported that different copy numbers of gene families between species are mainly derived from species-specific evolution and resulted in genetic diversification [23,24]. We also observed that most pepper AQP genes were contained in the TIP4 subgroup with pepper-specific motif sequences, suggesting that these genes were lineage-specifically expanded in pepper, thereby increasing the genetic diversity in the AQP repertories.
Chromosomal localization and gene duplication history analyses showed that these pepper-specific genes in the TIP4 subgroup formed a tandem array in the short arm of chromosome 1, and the genes mainly emerged after the speciation time between pepper and other Solanum spp. A previous study also reported a similar case in which certain genes in the pepper nucleotide-binding, leucine-rich repeat (NLRs) gene family formed a tandem array in a specific chromosome via massive duplication, indicating that it occurred specifically in pepper [25]. Although most of their roles are unknown, a previous study reported that one gene, Pvr4, emerged eight million years ago after speciation between pepper and tomato, gaining a specific function in disease resistance for various potyviruses and showing that a pepper-specific NLR gene had an important role [25]. Similarly, the AQP genes in TIP4, which have undergone species-specific duplication after the pepper speciation time, may also possess important characteristics of agricultural value.
We further inspected the forward genetic screen using the expression level of pepper genes to identify those that are potentially involved in responses to different abiotic stress conditions. The genes belonging to the PIP were highly expressed. This was consistent with many reports that PIP is known to function in a variety of crops; for example, in Acacia auriculiformis, the AaPIP1-2 gene eliminated ROS and played a positive role in the drop stress; in Populus, the overexpression of PIP1;1 induced more sensitivity of transgenic in osmotic conventions; and in rice, the downregulation of PIP2;1 inhibited root growth [26,27,28]. Expression clustering and GO enrichment analyses on the pepper DEGs under abiotic stresses revealed clusters including AQP were generally enriched in GO terms related to the chloroplast, photosystem, organelle, and thylakoid, suggesting that genes involved in these functions may work together with AQP in response to various stresses [29]. Similarly, we observed newly identified genes in various clusters with other pepper genes. For example, CANN_MIP.PGAv.1.6.scaffold855.1 and CANN_MIP.chr06.1 belonging to salt cluster1 and salt cluster3, respectively, could function together with other pepper genes in cellular processes such as cell communication and membrane-bounded organelles under a salt stress condition. Although the actual functions of TIP4 genes in pepper are difficult to elucidate, their potential functions were predicted on the basis of the expression profiling and clustering analyses. Most genes belonging to the TIP4 subgroup were not significantly expressed, which is consistent with other reports that gene expression of newly emerged genes is not actively detected [30]. However, we found one gene, CA.PGAv.1.6.scaffold1432.12, in mannitol cluster2 that had GO enrichment in ribosome-related tasks, suggesting that this gene may also play a role in plant physiological processes related to ribosomes.
The updated AQP genes will be valuable genomic resources for further functional and breeding research in agriculturally valuable crops. Moreover, our findings provide insights into the evolutionary mechanism and expressions of pepper AQP genes that contribute to increasing genetic diversity and unveiling potential functions, respectively.
4. Materials and Methods
4.1. Re-Annotation of AQP Genes in Five Plant Genomes
To re-annotate putative aquaporin (AQP) genes including AQP-like genes, we first collected genome resources of Arabidopsis thaliana [31], Oryza sativa [32], Solanum lycopersicum [33], Solanum tuberosum [34], and Capsicum annuum [8], including assembled genomes, annotated genes, and RNA-Seq data (Table S1). The re-annotation was performed with TGFam-Finder v1.20 considering parameters described previously [20]. We used tsv files generated from the functional annotation of proteins in the five genomes using InterproScan-5 [35] as “TSV_FOR_DOMAIN_IDENTIFICATION”. PF00230 (MIP) from Pfam database was used as the “TARGET_DOMAIN_ID”. Finally, we obtained re-annotated AQP genes as updated annotations for the five plant genomes, including previously annotated genes from downloaded annotations and genes that were newly annotated from the TGFam-Finder.
We assigned gene names for re-annotated AQP genes. For previously annotated genes, if names had been assigned in previous studies, as was the case for the AQP genes in potato and rice, we matched new gene name to the original names. However, genes in Arabidopsis and tomato that had been annotated in previous studies that used different annotation versions. To address this problem, we compared the previously annotated AQP genes in updated annotations to their original versions with BLASTP, and the gene names with the highest identity score were used. Pepper has not been previously studied; therefore, all 73 re-annotated genes were given new names. In addition, newly annotated genes of other species were also assigned new names (Table 1, Table 2, and Table S2).
4.2. Classification and Phylogenetic Analysis of AQP Genes
The re-annotated AQP genes of the five species were aligned using MAFFT v7.470 [36], and unnecessary gaps in the alignments were trimmed with trimAL v1.4 (-gappyout) [37]. We obtained information on the ar/R selectivity filter from each of the four positions in H2 (37th position in the aligned sequences), H5 (143rd), LE1 (152nd), and LE2 (158th), on the basis of the location of the conserved NPA motifs in the alignments (Table 2 and Table S2). For Arabidopsis, rice, tomato, and potato, subfamilies were classified according to the predicted amino acids in each of four positions of the protein, as described previously [38,39,40,41]. We classified the subfamilies of pepper AQPs, however, by considering the known ar/R selectivity filter positions of other four species. We compared unclassified genes to the classified AQP genes using BLASTP with the best match score. From this, we identified members of all five subfamilies of AQP genes: PIP, TIP, NIP, SIP, and XIP in the five species (Table 1, Table 2, and Table S2).
To understand evolutionary relationships, a phylogenetic tree of aligned AQP sequences was constructed by IQ-TREE v2.0.6 (-alrt 1000-B 1000) [42] and visualized using IToL v3.2.317 (http://itol.embl.de/ (accessed on 18 June 2020)) [43]. On the basis of the phylogenetic tree, we classified 5 subfamilies into 12 subgroups.
4.3. Motif Analysis of Subfamilies in Pepper Genome
To identify conserved sequences of AQP domains in five plant species, we used MEME suite v5.1.1 (-mod zoops-nmotifs 50-minw 10-maxw 50-objfun se-markov_order 0) (Table S3) [44]. Then, the motif position in AQP domains were predicted using MAST v5.1.1 [45]. We determined the general 14 position constructing AQP domains using the identified motif sequences by manual inspections, considering sequence homology. We predicted the number of transmembrane helices and position from AQP domain using TMHMM (Figure 1) [46]. Finally, we counted motif frequencies located in the general position and visualized them using ggplot2 [47] in R package.
4.4. Location of AQP Genes on Chromosomes in Five the Plant Genomes
Chromosomal distribution of the re-annotated AQP genes was visualized using MapChart v2.32 [48]. Gene names were marked on the map, and members of the same subgroup, determined by the phylogenetic analysis in this study, were identified by the same color.
4.5. Estimation of Duplication Time of AQP Genes
We estimated the divergence times of the AQP genes by first identifying duplicated gene pairs with DupGen_Finder pipeline [49]. Next, the gene pairs were aligned with PRANK (-f = fasta-codon) [50]. We calculated the synonymous substitution rates (Ks) of each gene pair with the KaKs_Calculator v2.0 (-m MYN) [51]. Values of Ks < 2 were visualized with ggplot 2 [47] in an R package.
4.6. Expressional Clustering and GO Enrichment Analyses in the Pepper Genome
To examine expression profiles of pepper AQP genes, we collected RNA-Seq data for leaf tissues under various abiotic stress (cold, heat, mannitol, and salt) at different times generated from a previous study [52]. We first performed quality trimming with CLC Assembly Cell (CLC Bio, Aarhus, Denmark) to clear sequences of poor quality. Next, we mapped the RNA-Seq to the genome assembly of pepper using HISAT 2 [53] (-dta-x). The mapped reads were quantified and the “Fragment Per Kilobase of transcript per Million mapped reads” (FPKM) values for the pepper AQP genes were calculated with StringTie [54] (-e-B-G). The overall expression levels of the pepper AQP genes in different conditions were visualized with log2 (FPKM + 1) values using pheatmap v1.0.12 [55] software in an R package.
We performed expression clustering of the differentially expressed genes (DEGs) in the whole annotated pepper genome, including the newly annotated AQP genes. Expression clustering was calculated from log2 transformation of fold changes between the FPKM values measured in plants under stress conditions and unstressed control plants. The DEGs that had a p value ≤ 0.05 were clustered using the Mfuzz program [56] in an R package. The number of expression clusters was determined by k-means analysis. The GO terms for genes in each cluster were identified using Omicsbox (https://www.biobam.com/omicsbox/ (accessed on 5 January 2021)), and enriched GO terms were determined by Fisher’s exact test (false discovery rates corrected p-value ≤ 0.01).
Supplementary Materials
The following are available online at https://www.mdpi.com/article/10.3390/plants10061039/s1, Figure S1: Numbers of motifs in AQP domains of five plant species. Figure S2: Location of AQP genes on chromosomes for four plant species. Figure S3: Expression profiling of pepper AQP genes using RNA-Seq data under various abiotic stresses. Table S1: Genomic resources of five plant species used in this study. Table S2: Detailed information of individual AQP genes in four plant genomes. Table S3: Statistics of motifs identified from five plant AQP genes by MEME.
Author Contributions
S.K. designed the research. Y.M.L., G.Y.C., M.K.K. and S.K. carried out the comparative and evolutionary analyses, as well as the re-annotation. Y.M.L. wrote the initial manuscript draft. All authors edited and reviewed the final version. The authors declare no competing interests. All authors have read and agreed to the published version of the manuscript.
Funding
This study was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (NRF-2017R1A6A3A04004014), by a grant from the Korea Forest Service of Korean government through the R&D Program for Forestry Technology (2014071H10-2022-AA04), and was supported by the “Graduate school of Post Plastic specialization” of Korea Environmental Industry & Technology Institute grant funded by the Ministry of Environment, Republic of Korea, both to S.K.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Uehlein N., Lovisolo C., Siefritz F., Kaldenhoff R. The tobacco aquaporin NtAQP1 is a membrane CO2 pore with physiological functions. Nature. 2003;425:734–737. doi: 10.1038/nature02027. [DOI] [PubMed] [Google Scholar]
- 2.Kjellbom P., Larsson C., Johansson I., Karlsson M., Johanson U. Aquaporins and water homeostasis in plants. Trends Plant Sci. 1999;4:308–314. doi: 10.1016/S1360-1385(99)01438-7. [DOI] [PubMed] [Google Scholar]
- 3.Fox A.R., Maistriaux L.C., Chaumont F. Toward understanding of the high number of plant aquaporin isoforms and multiple regulation mechanisms. Plant Sci. 2017;264:179–187. doi: 10.1016/j.plantsci.2017.07.021. [DOI] [PubMed] [Google Scholar]
- 4.Javot H., Lauvergeat V., Santoni V., Martin-Laurent F., Guclu J., Vinh J., Heyes J., Franck K.I., Schaffner A.R., Bouchez D., et al. Role of a single aquaporin isoform in root water uptake. Plant Cell. 2003;15:509–522. doi: 10.1105/tpc.008888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Sade N., Vinocur B.J., Diber A., Shatil A., Ronen G., Nissan H., Wallach R., Karchi H., Moshelion M. Improving plant stress tolerance and yield production: Is the tonoplast aquaporin SlTIP2;2 a key to isohydric to anisohydric conversion? New Phytol. 2009;181:651–661. doi: 10.1111/j.1469-8137.2008.02689.x. [DOI] [PubMed] [Google Scholar]
- 6.Kapilan R., Vaziri M., Zwiazek J.J. Regulation of aquaporins in plants under stress. Biol Res. 2018;51:1–11. doi: 10.1186/s40659-018-0152-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bae H., Jayaprakasha G.K., Jifon J., Patil B.S. Variation of antioxidant activity and the levels of bioactive compounds in lipophilic and hydrophilic extracts from hot pepper (Capsicum spp.) cultivars. Food Chem. 2012;134:1912–1918. doi: 10.1016/j.foodchem.2012.03.108. [DOI] [PubMed] [Google Scholar]
- 8.Kim S., Park M., Yeom S.I., Kim Y.M., Lee J.M., Lee H.A., Seo E., Choi J., Cheong K., Kim K.T., et al. Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat. Genet. 2014;46:270–278. doi: 10.1038/ng.2877. [DOI] [PubMed] [Google Scholar]
- 9.Liu Z.B., Zhang Y.P., Ou L.J., Kang L.Y., Liu Y.H., Lv J.H., Wei G., Yang B.Z., Yang S., Chen W.C., et al. Identification and characterization of novel microRNAs for fruit development and quality in hot pepper (Capsicum annuum L.) Gene. 2017;608:66–72. doi: 10.1016/j.gene.2017.01.020. [DOI] [PubMed] [Google Scholar]
- 10.Diao W.P., Snyder J.C., Wang S.B., Liu J.B., Pan B.G., Guo G.J., Ge W., Dawood M.H.S.A. Genome-wide analyses of the NAC transcription factor gene family in pepper (Capsicum annuum L.): Chromosome location, phylogeny, structure, expression patterns, cis-elements in the promoter, and interaction network. Int. J. Mol. Sci. 2018;19:1028. doi: 10.3390/ijms19041028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ali M., Luo D.X., Khan A., ul Haq S., Gai W.X., Zhang H.X., Cheng G.X., Muhammad I., Gong Z.H. Classification and genome-wide analysis of chitin-binding proteins gene family in pepper (Capsicum annuum L.) and transcriptional regulation to Phytophthora capsici, abiotic stresses and hormonal applications. Int. J. Mol. Sci. 2018;19:2216. doi: 10.3390/ijms19082216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Islam S., Das Sajib S., Jui Z.S., Arabia S., Islam T., Ghosh A. Genome-wide identification of glutathione S-transferase gene family in pepper, its classification, and expression profiling under different anatomical and environmental conditions. Sci. Rep. UK. 2019;9:1–15. doi: 10.1038/s41598-019-45320-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Pertea M., Shumate A., Pertea G., Varabyou A., Breitwieser F.P., Chang Y.C., Madugundu A.K., Pandey A., Salzberg S.L. CHESS: A new human gene catalog curated from thousands of large-scale RNA sequencing experiments reveals extensive transcriptional noise. Genom. Biol. 2018;19:208. doi: 10.1186/s13059-018-1590-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Abascal F., Irisarri I., Zardoya R. Diversity and evolution of membrane intrinsic proteins. BBA-Gen. Subj. 2014;1840:1468–1481. doi: 10.1016/j.bbagen.2013.12.001. [DOI] [PubMed] [Google Scholar]
- 15.Quigley F., Rosenberg J.M., Shachar-Hill Y., Bohnert H.J. From genome to function: The Arabidopsis aquaporins. Genom. Biol. 2001;3:1–17. doi: 10.1186/gb-2001-3-1-research0001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Younis A.A., Khattab H., Emam M.M. Impacts of silicon and silicon nanoparticles on leaf ultrastructure and TaPIP1 and TaNIP2 gene expressions in heat stressed wheat seedlings. Biol. Plant. 2020;64:343–352. doi: 10.32615/bp.2020.030. [DOI] [Google Scholar]
- 17.Beebo A., Zia A., Kinzel C.R., Herdean A., Bouhidel K., Kirchhoff H., Schoefs B., Spetea C. TIP family aquaporins play role in chloroplast osmoregulation and photosynthesis. bioRxiv. 2020 doi: 10.1101/2020.09.18.297978. [DOI] [Google Scholar]
- 18.Abril J.F., Castellano Hereza S. Genome Annotation. Elsevier; Amsterdam, The Netherlands: 2019. pp. 195–209. [Google Scholar]
- 19.Bayer P.E., Edwards D., Batley J. Bias in resistance gene prediction due to repeat masking. Nat. Plants. 2018;4:762–765. doi: 10.1038/s41477-018-0264-0. [DOI] [PubMed] [Google Scholar]
- 20.Kim S., Cheong K., Park J., Kim M.S., Kim J., Seo M.K., Chae G.Y., Jang M.J., Mang H., Kwon S.H., et al. TGFam-Finder: A novel solution for target-gene family annotation in plants. New Phytol. 2020;227:1568–1581. doi: 10.1111/nph.16645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cantarel B.L., Korf I., Robb S.M.C., Parra G., Ross E., Moore B., Holt C., Alvarado A.S., Yandell M. MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genom. Res. 2008;18:188–196. doi: 10.1101/gr.6743907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hoff K.J., Lange S., Lomsadze A., Borodovsky M., Stanke M. BRAKER1: Unsupervised RNA-seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics. 2016;32:767–769. doi: 10.1093/bioinformatics/btv661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Li X.Y., Si W.N., Qin Q.Q., Wu H., Jiang H.Y. Deciphering evolutionary dynamics of SWEET genes in diverse plant lineages. Sci. Rep. UK. 2018;8:13440. doi: 10.1038/s41598-018-31589-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhang R.Z., Murat F., Pont C., Langin T., Salse J. Paleo-evolutionary plasticity of plant disease resistance genes. BMC Genom. 2014;15:187. doi: 10.1186/1471-2164-15-187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kim S.B., Kang W.H., Huy H.N., Yeom S.I., An J.T., Kim S., Kang M.Y., Kim H.J., Jo Y.D., Ha Y., et al. Divergent evolution of multiple virus-resistance genes from a progenitor in Capsicum spp. New Phytol. 2017;213:886–899. doi: 10.1111/nph.14177. [DOI] [PubMed] [Google Scholar]
- 26.Zhang G., Yu Z., Teixeira da Silva J.A.T., Wen D. Identification of aquaporin members in Acacia auriculiformis and functional characterization of AaPIP1-2 involved in drought stress. Environ. Exp. Bot. 2021;185:104425. doi: 10.1016/j.envexpbot.2021.104425. [DOI] [Google Scholar]
- 27.Jiang C., Song X.Q., He H., Chu L.W., Zhou H.J., Zhao Y.Q., Xu Y.J., Zeng W., Lin X.C., Lu M.Z. Genome-wide identification of plasma membrane aquaporin gene family in Populus and functional identification of PIP1;1 involved in osmotic stress. Environ. Exp. Bot. 2020;179:104200. doi: 10.1016/j.envexpbot.2020.104200. [DOI] [Google Scholar]
- 28.Ding L., Uehlein N., Kaldenhoff R., Guo S.W., Zhu Y.Y., Kai L. Aquaporin PIP2;1 affects water transport and root growth in rice (Oryza sativa L.) Plant. Physiol. Bioch. 2019;139:152–160. doi: 10.1016/j.plaphy.2019.03.017. [DOI] [PubMed] [Google Scholar]
- 29.Ramoni M.F., Sebatiani P., Kohane I.S. Cluster analysis of gene expression dynamics. Proc. Natl. Acad. Sci. USA. 2002;99:9121–9126. doi: 10.1073/pnas.132656399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Qian W.F., Liao B.Y., Chang A.Y.F., Zhang J.Z. Maintenance of duplicate genes and their functional redundancy by reduced expression. Trends Genet. 2010;26:425–430. doi: 10.1016/j.tig.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lamesch P., Berardini T.Z., Li D.H., Swarbreck D., Wilks C., Sasidharan R., Muller R., Dreher K., Alexander D.L., Garcia-Hernandez M., et al. The Arabidopsis Information Resource (TAIR): Improved gene annotation and new tools. Nucleic Acids Res. 2012;40:D1202–D1210. doi: 10.1093/nar/gkr1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ouyang S., Zhu W., Hamilton J., Lin H., Campbell M., Childs K., Thibaud-Nissen F., Malek R.L., Lee Y., Zheng L., et al. The TIGR rice genome annotation resource: Improvements and new features. Nucleic Acids Res. 2007;35:D883–D887. doi: 10.1093/nar/gkl976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Fernandez-Pozo N., Menda N., Edwards J.D., Saha S., Tecle I.Y., Strickler S.R., Bombarely A., Fisher-York T., Pujar A., Foerster H., et al. The Sol Genomics Network (SGN)-from genotype to phenotype to breeding. Nucleic Acids Res. 2015;43:D1036–D1041. doi: 10.1093/nar/gku1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sharma S.K., Bolser D., de Boer J., Sonderkaer M., Amoros W., Carboni M.F., D’Ambrosio J.M., de la Cruz G., Di Genova A., Douches D.S., et al. Construction of reference chromosome-scale pseudomolecules for potato: Integrating the potato genome with genetic and physical maps. G3-Genes Genom. Genet. 2013;3:2031–2047. doi: 10.1534/g3.113.007153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Jones P., Binns D., Chang H.Y., Fraser M., Li W., McAnulla C., McWilliam H., Maslen J., Mitchell A., Nuka G., et al. InterProScan 5: Genome-scale protein function classification. Bioinformatics. 2014;30:1236–1240. doi: 10.1093/bioinformatics/btu031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Katoh K., Standley D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Capella-Gutierrez S., Silla-Martinez J.M., Gabaldon T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Wallace I.S., Roberts D.M. Homology modeling of representative subfamilies of Arabidopsis major intrinsic proteins. Classification based on the aromatic/arginine selectivity filter. Plant. Physiol. 2004;135:1059–1068. doi: 10.1104/pp.103.033415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Nguyen M.X., Moon S., Jung K.H. Genome-wide expression analysis of rice aquaporin genes and development of a functional gene network mediated by aquaporin expression in roots. Planta. 2013;238:669–681. doi: 10.1007/s00425-013-1918-9. [DOI] [PubMed] [Google Scholar]
- 40.Reuscher S., Akiyama M., Mori C., Aoki K., Shibata D., Shiratake K. Genome-wide identification and expression analysis of aquaporins in tomato. PLoS ONE. 2013;8:e79052. doi: 10.1371/journal.pone.0079052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Venkatesh J., Yu J.W., Park S.W. Genome-wide analysis and expression profiling of the Solanum tuberosum aquaporins. Plant. Physiol. Bioch. 2013;73:392–404. doi: 10.1016/j.plaphy.2013.10.025. [DOI] [PubMed] [Google Scholar]
- 42.Minh B.Q., Schmidt H.A., Chernomor O., Schrempf D., Woodhams M.D., von Haeseler A., Lanfear R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic Era. Mol. Biol. Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Letunic I., Bork P. Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44:W242–W245. doi: 10.1093/nar/gkw290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Bailey T.L., Williams N., Misleh C., Li W.W. MEME: Discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006;34:W369–W373. doi: 10.1093/nar/gkl198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bailey T.L., Gribskov M. Combining evidence using p-values: Application to sequence homology searches. Bioinformatics. 1998;14:48–54. doi: 10.1093/bioinformatics/14.1.48. [DOI] [PubMed] [Google Scholar]
- 46.Krogh A., Larsson B., Heijne G.V., Sonnhammer E.L.L. Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J. Mol. Biol. 2001;305:567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
- 47.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Springer; New York, NY, USA: 2016. [Google Scholar]
- 48.Voorrips R.E. MapChart: Software for the graphical presentation of linkage maps and QTLs. J. Hered. 2002;93:77–78. doi: 10.1093/jhered/93.1.77. [DOI] [PubMed] [Google Scholar]
- 49.Qiao X., Li Q., Yin H., Qi K., Li L., Wang R., Zhang S., Paterson A.H. Gene duplication and evolution in recurring polyploidization-diploidization cycles in plants. Genom. Biol. 2019;20:38. doi: 10.1186/s13059-019-1650-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Loytynoja A. Phylogeny-aware alignment with PRANK. Methods Mol. Biol. 2014;1079:155–170. doi: 10.1007/978-1-62703-646-7_10. [DOI] [PubMed] [Google Scholar]
- 51.Wang D., Zhang Y., Zhang Z., Zhu J., Yu J. KaKs_Calculator 2.0: A toolkit incorporating gamma-series methods and sliding window strategies. Genom. Proteom. Bioinform. 2010;8:77–80. doi: 10.1016/S1672-0229(10)60008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kang W.H., Sim Y.M., Koo N., Nam J.Y., Lee J., Kim N., Jang H., Kim Y.M., Yeom S.I. Transcriptome profiling of abiotic responses to heat, cold, salt, and osmotic stress of Capsicum annuum L. Sci. Data. 2020;7:17. doi: 10.1038/s41597-020-0352-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kim D., Landmead B., Salzberg S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods. 2015;12:357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pertea M., Pertea G.M., Antonescu C.M., Chang T.C., Mendell J.T., Salzberg S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015;33:290–295. doi: 10.1038/nbt.3122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Kolde R. Pheatmap: Pretty Heatmaps in R Package. R Package, Version 1.0.12. (2015) [(accessed on 9 September 2020)]; Available online: https://cran.r-project.org/package=pheatmap.
- 56.Kumar L., Futschik M.E. Mfuzz: A software package for soft clustering of microarray data. Bioinformation. 2007;2:5–7. doi: 10.6026/97320630002005. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Not applicable.