Abstract
Actinidia latifolia is one of the very few kiwifruit genotypes with extremely high ascorbic acid (AsA) content. However, a transcriptome atlas of this species is lacking. The accumulation of AsA during fruit development and ripening and the associated molecular mechanisms are still poorly understood. Herein, dynamic changes in AsA content at six different stages of A. latifolia fruit development and ripening were determined. AsA content of A. latifolia fruit reached 1108.76 ± 35.26 mg 100 g−1 FW at full maturity. A high-quality, full-length (FL) transcriptome of A. latifolia was successfully constructed for the first time using third-generation sequencing technology. The transcriptome comprises 326,926 FL non-chimeric reads, 15,505 coding sequences, 2882 transcription factors, 18,797 simple sequence repeats, 3328 long noncoding RNAs, and 231 alternative splicing events. The genes involved in AsA biosynthesis and recycling pathways were identified and compared with those in different kiwifruit genotypes. The correlation between the AsA content and expression levels of key genes in AsA biosynthesis and recycling pathways was revealed. LncRNAs that participate in AsA-related gene expression regulation were also identified. Gene expression patterns in AsA biosynthesis and metabolism exhibited a trend similar to that of AsA accumulation. Overall, this study paves the way for genetic engineering to develop kiwifruits with super-high AsA content.
Keywords: Actinidia latifolia, HPLC, ascorbic acid accumulation, PacBio SMRT sequencing, full-length transcriptome, ascorbic acid biosynthesis, recycling
1. Introduction
L-ascorbic acid (AsA, L-threo-hex-2-enono-1,4-lactone), also known as vitamin C and ascorbate, is an essential water-soluble, low molecular weight antioxidant [1] which supports plant growth and development [2] and the response of plants to environmental stresses [3]. In addition, AsA prevents and alleviates various human ailments and diseases, including some chronic and complex diseases (stroke, cancer, and scurvy) [4], and even has possible physiological effects on severe coronavirus disease 2019 (COVID-19) [5]. Throughout evolution, humans and other primates have lost the ability to synthesize AsA by themselves due to a lack of the L-gulonolactone oxidase (GLO) gene, and are thus entirely dependent upon food supplies, such as plant products [6]. Horticulture plants, particularly fresh fruits and vegetables, are abundant sources of AsA for human beings, sufficiently satisfying daily AsA requirements [7]. The benefits of the increase in AsA production in plants are profound.
Kiwifruit is a member of the genus Actinidia Lindl. belonging to the family Actinidiaceae. It is an important fruit crop widely distributed throughout most of East Asia, with the center of evolution being the Yangzi and Pearl Rivers and mountain ranges in China [8]. Kiwifruit has long been considered “the king of fruits” because of its remarkably high AsA content [8], and the presence of other nutritional phytochemicals that can provide several health benefits [9,10]. When compared with other commercially important fruit crops, the AsA content in equal fresh weight (FW) kiwifruit is approximately one-to-two times higher than that of orange, while it is 10 times higher than that of banana and up to 15 times higher than that of apple [11,12]. Within the genus Actinidia Lindl., there is great natural variation in AsA content. However, A. latifolia is one of the very few kiwifruit genotypes that are regarded as an exceptional source of extremely high AsA [13,14,15,16]. Accordingly, A. latifolia could serve as a valuable source for improving AsA content in other commercially important kiwifruit cultivars in future breeding strategies.
Several transgenic approaches have been employed to determine the molecular mechanisms underlying the AsA biosynthesis pathways in higher plants. The current consensus is that four AsA biosynthesis pathways, including the L-galactose, L-gulose, D-galacturonic acid, and inositol pathways, operate in higher plants, with the L-galactose pathway being the dominant route for AsA accumulation [1], as illustrated in Figure 1.
In recent decades, de novo assembly of the transcriptome by second-generation sequencing (NGS), Illumina RNA-sequencing (RNA-Seq), has become a cost-effective, accurate, and routine method. Nevertheless, NGS generates relatively short reads of only 100–500 bp, which poses a great challenge when it comes to assembly and annotation, especially without a well-annotated reference genome [17]. Recently, the advent of a third-generation sequence platform, PacBio long-read single-molecule real-time (SMRT) sequencing technology, has overcome the limitations of short read sequences and provides opportunities to generate reliable long reads (10–15 kb) which can meet the requirements for capturing genome-wide, full-length (FL) transcripts [17]. Although A. latifolia has extremely high AsA content [13,14,15,16], neither its reference genome nor transcriptome is available, which is a major bottleneck in understanding the functional genomics and molecular genetics of A. latifolia. To the best of our knowledge, there are no reports regarding the AsA accumulation pattern in this special AsA-rich kiwifruit cultivar. To elucidate the mechanisms regulating AsA concentration, it is of utmost importance to monitor how AsA accumulates during fruit development and ripening of cultivars with different AsA contents, especially in cultivars with extremely high AsA content. In addition, much remains to be learned regarding the molecular mechanisms underlying the dynamic accumulation of AsA in A. latifolia. A thorough elucidation of AsA accumulation, its associated gene expression, and metabolic pathways is required for genetic or cultural improvement in the future.
In the present study, the dynamic changes in AsA content at six different stages of fruit development and ripening of A. latifolia were evaluated using high-performance liquid chromatography (HPLC). The Isoform Sequencing (Iso-Seq™) protocol with the PacBio SMRT sequencing-based de novo transcriptome was employed to construct a high-quality reference FL transcriptome for A. latifolia. In addition, the associated transcriptional changes, along with AsA accumulation, were quantified using second-generation Illumina paired-end sequencing technology to elucidate the underlying molecular mechanisms. Furthermore, the regulatory genes associated with the AsA biosynthesis and recycling pathways, and the lncRNAs involved in AsA-related gene expression regulation, were identified. The paucity of A. latifolia genomic information implies that the reference FL transcriptome atlas obtained in this study would be vital for future genome annotation and studying gene function, especially those related to economically important traits. The results of this study would be beneficial for breeding programs aimed at increasing the AsA level of kiwifruit in the future.
2. Results
2.1. Morphological Changes and AsA Accumulation at Six Different Stages of the Fruit Development and Ripening of A. latifolia
Visual inspection of six different stages of A. latifolia fruit development and ripening, corresponding to 30, 60, 90, 120, 150, and 170 days after flowering (DAF), is shown in Figure 2a. During development, A. latifolia fruit underwent three distinguishable phases: a cell division and cell expansion phase from fruit set to approximately 60 DAF; a veraison phase with fruit color changing (60–90 DAF); and a ripening phase (90–170 DAF), as displayed in Figure 2a. AsA content was quantified by HPLC (Figure 2b). A significant (p < 0.05) decrease in AsA content occurred during the expansion and veraison stages, reaching the lowest level at 90 DAF (992.08 ± 46.59 mg 100 g−1 FW). As ripening progressed and the fruits transitioned to maturation, the AsA content showed an increasing trend from 90 to 170 DAF and reached 1108.76 ± 35.26 mg 100 g−1 FW at full maturity; however, ANOVA showed that this increase was not statistically significant. The changes in AsA content over time are expressed as the mean ± standard error (SE) on a FW basis and are shown in Figure 2b.
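The mean ± SE values quoted above can be illustrated with a minimal sketch, assuming the usual definition SE = SD/√n and three biological replicates per stage. The replicate values below are hypothetical and are not data from this study.

```python
import math
import statistics

def mean_se(values):
    """Return (mean, standard error) for a list of replicate measurements."""
    n = len(values)
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample SD (n - 1 denominator)
    return mean, sd / math.sqrt(n)

# Hypothetical AsA readings (mg per 100 g FW) from three biological replicates
replicates = [1000.0, 1100.0, 1200.0]
mean, se = mean_se(replicates)
print(f"{mean:.2f} ± {se:.2f} mg/100 g FW")
```

The sample standard deviation is used here because each stage is represented by a small number of replicates.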
2.2. PacBio SMRT Sequencing-Based FL Transcriptome Atlas of A. latifolia Fruit
To capture as many transcripts of A. latifolia fruit as possible, a total of eighteen RNA samples from six representative stages of fruit development and ripening of A. latifolia were equally pooled for library preparation (1–6 kb libraries). A total of 19.50 Gb of subreads were obtained from the offline data. The strict screening criteria (i.e., full passes greater than 1.0 and accuracy greater than 0.90) resulted in 358,138 CCS reads, comprising 743,399,287 read bases, with an average read length of 2087 bp. After error correction, a total of 34,148 polished high-quality isoforms were obtained, among which 91.80% (326,926 reads) were filtered as the FLNC reads of A. latifolia fruit transcriptome. The FLNC read-length distribution ranged from 700 to 3500 bp, and the overall distribution of each bin size agreed with the size of its cDNA library (Figure 3a).
2.3. Functional Annotation of A. latifolia Transcripts with Multiple Databases
Transcripts were scanned against and successfully annotated using the Nr, GO, COG, KOG, eggNOG, KEGG, Pfam, and Swiss-Prot databases, and the integrated alignment results are summarized in Table S2. Based on Nr functional annotation, the best BLAST hit of homologous species with A. latifolia fruit was Vitis vinifera (5689 isoforms, 31.33%), followed by Sesamum indicum (1260, 6.73%), and Theobroma cacao (1082, 5.78%) (Figure 3b). In total, 15,113 transcripts were successfully annotated using the GO database and classified into three categories: biological process (BP), cellular component (CC), and molecular function (MF). Genes involved in cellular process (8291 matched genes, 54.86%), metabolic processes (8072, 53.41%), and single-organism processes (5540, 36.66%) were highly represented in the BP category. Cells (8924 matched genes, 59.05%) were the most abundant subcategory within CC, followed by cell parts (8911, 58.96%) and organelles (6340, 41.95%). In the MF category, catalytic activity (7823 matched genes, 51.76%) was the most prominent category, followed by binding (7672, 50.76%), and transporter activity (1023, 6.77%) (Figure 3c).
2.4. Structural Analysis of the FL Transcriptome of A. latifolia
A total of 15,505 CDS were identified. The frequencies for each CDS length were evaluated, with the most frequent length ranging from 100 to 1200 bp (Figure S1a). Furthermore, by predicting non-redundant transcripts using the iTAK software, 2436 genes were predicted to be TFs. These TFs were classified into different families, among which the most abundant type identified was C3H (84 matched genes), followed by GRAS (76), AP2/ERF-ERF (68), MYB-related (56), B3-ARF (56), CAMK_CDPK (55), C2H2 (54), and RLK-Pelle_LRR_Xl-1 (51) (Figure S1b). A total of 3328 lncRNA candidates were identified, and 447, 445, 2459, and 1332 lncRNAs were identified using CNCI, CPC, Pfam, and CPAT prediction results, respectively. The comparison revealed that 169 transcripts were simultaneously identified using the four computational approaches (Figure 3d). In addition, transcripts were subjected to SSR analysis via MISA, and a total of 18,797 SSR, including seven SSR types (i.e., mononucleotide, dinucleotide, trinucleotide, tetranucleotide, pentanucleotide, hexanucleotide, and compound nucleotides) were detected. The dinucleotide SSR loci exhibited the highest frequency, followed by the mononucleotide and the compound SSR types (Figure S1c). A detailed breakdown of the SSR types from PacBio is presented in Table S3.
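The consensus step described above, in which only transcripts called non-coding by all four approaches are retained, amounts to a set intersection. The following is a minimal sketch under that reading; the transcript IDs are hypothetical placeholders.

```python
def consensus_lncrnas(predictions):
    """Return the transcript IDs called non-coding by every tool.

    `predictions` maps a tool name to the set of IDs that tool flagged.
    """
    call_sets = [set(ids) for ids in predictions.values()]
    return set.intersection(*call_sets)

# Hypothetical calls, for illustration only
predictions = {
    "CNCI": {"t1", "t2", "t3"},
    "CPC": {"t2", "t3", "t4"},
    "Pfam": {"t2", "t3", "t5"},
    "CPAT": {"t2", "t3", "t6"},
}
consensus = consensus_lncrnas(predictions)
print(sorted(consensus))
```

Requiring agreement across all tools is conservative: it lowers the number of candidates but reduces the chance of retaining a transcript with coding potential.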
2.5. Differentially Expressed Genes (DEGs) Identified in Comparative Transcriptomic Analysis
The non-redundant transcripts obtained were used as a reference for Illumina sequence alignment and subsequent analysis. Eighteen cDNA libraries for Illumina sequencing were generated from six different stages, with three biological replicates per stage. DEG identification was independently performed using pair-wise comparisons between the developmental stage and baseline control. Based on the adopted cutoff (FDR < 0.01 and absolute fold change ≥ 2), the numbers of all genes displaying both significantly up- and down-regulated profiles are summarized in Table 1.
Table 1.
Baseline Control | DAF60 | DAF90 | DAF120 | DAF150 | DAF170 |
---|---|---|---|---|---|
DAF30 | 1871/1917 | 1628/1857 | 1931/2384 | 2373/2892 | 2632/3447 |
DAF60 | | 1768/2147 | 1687/2288 | 2425/2911 | 2676/3441 |
DAF90 | | | 410/813 | 866/1513 | 1491/2412 |
DAF120 | | | | 434/689 | 1167/1866 |
DAF150 | | | | | 643/1152 |
Note: The first column represents the baseline and the first line represents the treatment group. Red (first) and green (second) numbers represent significantly up- and down-regulated gene numbers, respectively. For example, ‘1871/1917’ in the second column and the second line means that a total of 1871 significantly up-regulated and 1917 down-regulated genes were identified in the comparison between DAF30_vs_DAF60 (the former was used as a baseline control, and the latter was the treatment group).
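The DEG cutoff stated above (FDR < 0.01 and absolute fold change ≥ 2) can be sketched as follows. This is an illustration only: it assumes that fold change is the treatment/baseline expression ratio, so that a ratio ≥ 2 counts as up-regulated and a ratio ≤ 0.5 as down-regulated. The record layout and the gene values are hypothetical.

```python
import math

FDR_CUTOFF = 0.01
MIN_ABS_FOLD_CHANGE = 2.0

def classify_deg(fold_change, fdr):
    """Return 'up', 'down', or None for one gene (fold_change = treatment / baseline)."""
    if fdr >= FDR_CUTOFF or fold_change <= 0:
        return None
    log2_fc = math.log2(fold_change)
    threshold = math.log2(MIN_ABS_FOLD_CHANGE)
    if log2_fc >= threshold:
        return "up"
    if log2_fc <= -threshold:
        return "down"
    return None

def count_degs(records):
    """Summarize (gene, fold_change, fdr) records as 'up/down', as in Table 1."""
    counts = {"up": 0, "down": 0}
    for _gene, fold_change, fdr in records:
        label = classify_deg(fold_change, fdr)
        if label:
            counts[label] += 1
    return f"{counts['up']}/{counts['down']}"

# Hypothetical records for one pair-wise comparison
records = [
    ("g1", 4.0, 0.001),   # up
    ("g2", 0.25, 0.005),  # down
    ("g3", 1.5, 0.0001),  # fold change too small
    ("g4", 8.0, 0.05),    # FDR too high
    ("g5", 0.5, 0.009),   # down (exactly at the threshold)
]
summary = count_degs(records)
print(summary)
```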
2.6. Functional Annotation and Categorization of DEGs
The annotation, pathway, and functional categorization of the DEGs in the comparisons of DAF30 versus DAF60, 90, 120, 150, and 170 were thoroughly analyzed using the COG, GO, KEGG, KOG, Pfam, Swiss-Prot, eggNOG, and Nr databases. The assignments indicated that at least 97.79% of the DEGs were functionally annotated in multiple databases (Table 2). Analysis of GO categories showed that the functional distribution of DEGs in comparable groups was similar. The BP categories of GO terms were primarily grouped into cellular, metabolic, and single-organism processes. The CC category was mainly assigned to cells, cell parts, and membranes. Catalytic, binding, and transporter activities were prominent in the MF category (Figure S2). KEGG pathway enrichment analysis, a pathway-based categorization of orthologous genes, was conducted to predict the functional profiles and biological significances of DEGs identified during the six different stages of A. latifolia fruit development and ripening. Plant hormone signal transduction, carbon metabolism, amino acid biosynthesis, starch and sucrose metabolism, and protein processing in the endoplasmic reticulum were the most significantly enriched metabolic pathways (Figure S3).
Table 2.
DEG_set a | Annotated b | COG c | GO d | KEGG e | KOG f | Pfam g | Swiss-Prot h | eggNOG i | Nr j |
---|---|---|---|---|---|---|---|---|---|
DAF 30_vs_60 | 3718 | 1736 | 2940 | 1455 | 2053 | 3306 | 2983 | 3664 | 3709 |
DAF 30_vs_90 | 3405 | 1542 | 2704 | 1234 | 1760 | 3045 | 2756 | 3326 | 3381 |
DAF 30_vs_120 | 4232 | 1993 | 3435 | 1687 | 2207 | 3823 | 3474 | 4142 | 4209 |
DAF 30_vs_150 | 5176 | 2477 | 4187 | 2093 | 2771 | 4663 | 4232 | 5093 | 5156 |
DAF 30_vs_170 | 5991 | 2832 | 4841 | 2386 | 3305 | 5383 | 4807 | 5893 | 5964 |
a DEG: differentially expressed gene. b Annotated: number of differentially expressed transcripts annotated. c COG: clusters of orthologous groups. d GO: gene ontology. e KEGG: Kyoto encyclopedia of genes and genomes. f KOG: eukaryotic orthologous groups. g Pfam: protein family. h Swiss-Prot: a manually annotated and reviewed protein sequences database. i eggNOG: evolutionary genealogy of genes, non-supervised orthologous groups. j Nr: non-redundant protein sequence database.
2.7. Identification of Genes in AsA Biosynthesis and Recycling Pathway during the Fruit Development and Ripening of A. latifolia
By combining second- and third-generation sequencing technologies, crucial regulatory genes associated with AsA biosynthesis and recycling pathways were identified. All genes in the L-galactose pathway, some key genes in the other three pathways, and genes in the AsA recycling pathway were successfully identified during fruit development and ripening of A. latifolia. The numbers of members of these gene families in different kiwifruit genotypes, including Hongyang (A. chinensis) v2, Hongyang (A. chinensis) v3, Red 5 (A. chinensis), White (A. eriantha), and A. latifolia, were compared, and the results are listed in Table 3. At least two members of each gene family were identified in A. latifolia, except for GalDH and GalLDH, each of which had a single member. Gene expansions were detected not only in PMM, GMP, and GGP within the L-galactose pathway, which forms the major route for ascorbic acid biosynthesis, but also in genes involved in the recycling of ascorbic acid, including AO, APX, and MDHAR (Table 3).
Table 3. Number of genes in each gene family, by genotype.
Gene Name | Hong Yang (A. chinensis) v2 | Hong Yang (A. chinensis) v3 | Red 5 (A. chinensis) | White (A. eriantha) | A. latifolia |
---|---|---|---|---|---|
PGI | 5 | 4 | 4 | 6 | 6 |
PMI | 2 | 5 | 4 | 5 | 2 |
PMM | 10 | 8 | 7 | 8 | 14 |
GMP | 3 | 1 | 3 | 3 | 8 |
GME | 2 | 3 | 2 | 4 | 3 |
GGP | 3 | 4 | 3 | 5 | 9 |
GPP | 2 | 2 | 2 | 2 | 3 |
GalDH | 1 | 1 | 1 | 1 | 1 |
GalLDH | 1 | 1 | 1 | 1 | 1 |
MIOX | 7 | 8 | 6 | 10 | 4 |
GalUR | 5 | 5 | 3 | 6 | 2 |
AO | 3 | 2 | 7 | 4 | 11 |
APX | 12 | 4 | 11 | 17 | 17 |
DHAR | 5 | 4 | 4 | 5 | 5 |
MDHAR | 7 | 6 | 7 | 8 | 10 |
Note: The number of different genes in Hong Yang (v2/v3), Red 5, and White were determined by the search function of Kiwifruit Genome Database (http://kiwifruitgenome.org/ accessed on 6 March 2022).
Changes in the relative expression of gene family members involved in AsA biosynthesis and recycling over time were investigated in this study and their expression profiles are summarized in the heat maps (Figure 4 and Figure 5), except for PMM (Al31169), GME (Al24721), GGP (Al33117), APX (Al27791), APX (Al28761), APX (Al30425), MDHAR (Al29807), and MDHAR (Al22533). In the AsA biosynthesis pathway, GMP (Al20899, Al23713, and Al22538), GME (Al21163), and GGP (Al15644, Al15571, and Al15715) were highly expressed during fruit development and ripening of A. latifolia (Figure 4). In the AsA recycling pathway, APX (Al24536, Al27026, Al26316, Al27343, and Al24110), MDHAR (Al21766 and Al21314), and DHAR (Al28325 and Al27846) showed high transcriptional levels (Figure 5).
The correlation between AsA accumulation and relative gene expression was measured and is summarized in Table S4. Significant (p < 0.05) positive correlations between AsA content and gene expression were observed for two PMI genes [Al20224, correlation coefficient (r) = 0.861; Al21940, r = 0.523], two PMM genes (Al18038, r = 0.601; Al25631, r = 0.496), one GMP gene (Al23435, r = 0.559), two GME genes (Al23259, r = 0.507; Al21163, r = 0.722), five GGP genes (Al15644, r = 0.505; Al16531, r = 0.567; Al15715, r = 0.632; Al15576, r = 0.528; Al19411, r = 0.625), two MIOX genes (Al19299, r = 0.669; Al25483, r = 0.580), two GalUR genes (Al26209, r = 0.574; Al25465, r = 0.595), two APX genes (Al27026, r = 0.470; Al29236, r = 0.56), one MDHAR gene (Al21379, r = 0.635), and six AO genes (Al18226, r = 0.713; Al15008, r = 0.691; Al15150, r = 0.676; Al18350, r = 0.694; Al29613, r = 0.630; Al17627, r = 0.580). In contrast, the expression levels of one PGI gene (Al16988, r = −0.517), one PMI gene (Al15376, r = −0.651), two GMP genes (Al20899, r = −0.508; Al21540, r = −0.470), one APX gene (Al24185, r = −0.765), and one DHAR gene (Al27909, r = −0.618) showed significant (p < 0.05) negative correlations with AsA content. The correlation analysis indicated that three biosynthesis pathways (L-galactose, galacturonate, and myo-inositol) and the recycling pathway were significantly associated with AsA accumulation during fruit development and ripening of A. latifolia.
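A correlation coefficient of the kind reported above can be computed as in the sketch below. The text does not state which coefficient was used, so Pearson's r is an assumption here, and the AsA and expression values are hypothetical.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical per-sample AsA content (mg/100 g FW) and expression of one gene
asa_content = [1150.0, 1060.0, 990.0, 1020.0, 1070.0, 1110.0]
expression = [35.2, 28.1, 20.4, 24.9, 27.3, 33.0]
r = pearson_r(asa_content, expression)
print(f"r = {r:.3f}")
```

A significance test (the p < 0.05 threshold in the text) would additionally depend on the number of samples and is not reproduced in this sketch.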
2.8. LncRNAs Participated in AsA-Related Gene Expression Regulation
There were 169 lncRNA candidates identified by the intersection of the CNCI, CPC, Pfam, and CPAT prediction results (Figure 3d), 91 of which may act on downstream target genes. In all downstream target genes, we found many genes involved in the AsA L-galactose pathway and recycling, such as PGI, PMM, GMP, GME, GGP, GPP, APX, MDHAR, DHAR, and AO (Table S5). Several lncRNAs can act on multiple target genes. For example, Al32641 and Al32633 have six target genes. In addition, many functional genes can be regulated by multiple lncRNAs, such as Al16988 (PGI), Al15576 (GGP), and Al15150 (AO). These results indicate that lncRNAs play important roles in AsA accumulation during fruit development and ripening of A. latifolia via complex regulatory networks.
2.9. Expression Patterns of Genes Involved in AsA Biosynthesis and Recycling during Fruit Development and Ripening of A. latifolia Verified by qRT-PCR Analysis
To investigate the molecular mechanisms regulating AsA biosynthesis during the fruit development and ripening of A. latifolia, the expression levels of AsA biosynthesis genes at different developmental stages were further examined by qRT-PCR, as illustrated in Figure 6. The L-galactose pathway, L-gulose, myo-inositol, and D-galacturonic acid pathways were identified. The overall gene expression levels of eight enzymes, including GalDH, GGP, GME, GPP, GMP, PGI, MIOX, and APX (Figure 6), exhibited a similar trend to the cumulative AsA content described above (Figure 2b). The L-galactose and myo-inositol pathways were the predominant pathways regulating the high AsA content in A. latifolia fruit development and ripening (Figure 6).
3. Discussion
3.1. The First High-Quality, Functionally Annotated Reference Transcriptome for A. latifolia
A. latifolia has been identified as a promising kiwifruit species with remarkably high AsA concentration [13,14,15,16]. The lack of a high-confidence transcriptome atlas of A. latifolia has greatly hindered investigation into the molecular genetic basis of this important cultivar. In this study, we successfully built a high-quality transcriptome for A. latifolia for the first time using PacBio SMRT sequencing technology corrected by RNA-seq (Figure 3), which will be a crucial resource for gene mining and for understanding gene functions in this species.
The reference transcriptome and draft genome data of A. latifolia have not yet been completely sequenced. It is crucial to annotate the transcripts for biological functions and metabolic pathways. Therefore, sequence-based alignments were conducted against eight functional databases (Figure 3) to understand the high-level functions and utilities of biological systems. Another important aspect of our study was the prediction of CDSs, TFs, and lncRNAs from the non-redundant transcripts of A. latifolia (Figure S1). LncRNAs represent a novel class of non-coding RNA that regulate a range of biological processes, such as plant growth, development, and stress responses [18]. In the present study, lncRNAs were identified using a combination of the CPC, CNCI, Pfam, and CPAT approaches (Figure 3d), which will be useful for subsequent studies on the biological functions of lncRNAs in A. latifolia.
The dinucleotide SSR type was the most frequently observed type in A. latifolia, which was in agreement with the expressed sequence tag (EST)-derived SSR distribution previously reported in Actinidia species [19]. SSR markers have proven to be the most favored molecular markers for estimating genetic diversity, phylogenetic relationships, genotype identification and discrimination, marker-phenotype association, and genetic map construction [20]. The FL transcriptome atlas contains a large amount of genetic information and is a rich source of SSR discovery [21]. Transcriptome-based SSR development has increased the potential for association with functional genes or even agronomic phenotypes because of the close linkage to expressed genes in the transcriptome [22]. This study reported the use of PacBio SMRT sequencing technology for discovery of a set of SSR loci in A. latifolia for the first time (Table S3). The SSR markers identified in this study will be a valuable resource for marker-assisted breeding of A. latifolia.
3.2. Dynamics of AsA Content Accompanying Fruit Development and Ripening of A. latifolia
AsA plays a plethora of roles in the biological functions of both plants and humans [2,3,4,5]. Besides cultivar-dependent differences [13,14,15,16], kiwifruits also show tissue- and development-specific variability in AsA content [23]. The highest accumulation of AsA was recorded at four weeks after anthesis in A. chinensis (50–200 mg 100 g−1 FW) and at six weeks after anthesis in A. eriantha (800 mg 100 g−1 FW) and A. deliciosa (80 mg 100 g−1 FW) [23]. A. chinensis var. deliciosa ‘Qinmei’ synthesizes AsA primarily during the early fruit development stage [24]. Zhang et al. [25] reported that A. chinensis var. chinensis ‘Hongyang’ kiwifruit exhibited a maximal AsA level at its immature green stage due to the high biosynthesis rate, which decreased as it ripened and then remained fairly stable until complete ripening.
In the present study, we quantified the dynamic changes in AsA by HPLC during different stages of fruit development and ripening of A. latifolia (Figure 2). Although no significant changes in AsA content were detected by Tukey’s HSD test during the fruit ripening stage (90–170 DAF), there was a rapid decrease during the initial expansion (30–60 DAF) and veraison (60–90 DAF) stages, followed by a progressive, albeit not significant, increase (Figure 2b). Overall, the dynamics of AsA accumulation in A. latifolia varied with fruit developmental stage, as in other kiwifruit varieties [23,24,25]. On the other hand, our results confirmed the extremely high AsA content of A. latifolia (1108.76 ± 35.26 mg 100 g−1 FW, Figure 2b), which was consistent with previous studies [13,14,15,16].
3.3. An Elucidation of the Molecular Mechanisms Regulating AsA Accumulation of A. latifolia
To date, much progress has been made toward understanding AsA biosynthesis and recycling in higher plants [26]. At least four distinct metabolic pathways, including the L-galactose, L-gulose, galacturonic, and myo-inositol pathways, form a complex network for AsA biosynthesis [1]. AsA biosynthesis and metabolism are complex reactions that depend on the co-expression and coordination of a cluster of genes [1,4,12]. All the genes encoding enzymes in the L-galactose pathway, and some of the genes encoding enzymes in the other three pathways, were identified in A. latifolia (Table 3), supporting that the predominant pathway of AsA biosynthesis in higher plants is the L-galactose pathway, which cooperates well with the other three alternative biosynthesis pathways [1,26].
In the present study, we quantified the global gene expression accompanying fruit development and ripening of A. latifolia and identified candidate genes associated with AsA biosynthesis and recycling (Figure 4 and Figure 5). Previous studies have demonstrated the correlation between GMP transcript level and AsA accumulation rate to some extent [23,27]. In this study, the GMP transcript level tended to be higher in the young fruits and followed a trend similar to that of AsA accumulation in early fruit developmental stages (Figure 6). GMP also plays an important role in cell wall biosynthesis and protein glycosylation; these two processes may therefore also contribute to the high GMP transcript levels in young fruits [27]. Over-expression of the GGP gene in Arabidopsis resulted in a more than four-fold increase in AsA content [28]. Catalyzing an early committed step of the L-galactose biosynthesis pathway, GGP contributes greatly to the rapid increase in AsA accumulation in several fruit species [23,29,30]. GGP has been considered the best regulatory control point for AsA biosynthesis [31]. In the present study, GGP showed relatively high expression levels in the early stages of fruit development (Figure 6), supporting the contribution of GGP to AsA biosynthesis. In addition, the gene expression level of GME was also correlated with AsA content (Figure 6), emphasizing the central role and positive regulation of GME in AsA accumulation in immature fruit [23]. In addition to the three upstream enzymes, we observed a positive correlation between the expression of GalDH, GPP, PGI, MIOX, and APX and the cumulative AsA content (Figure 6), suggesting that the high expression of these enzymes resulted in high AsA accumulation in A. latifolia. Overall, the current study provided insights into the molecular mechanisms regulating AsA accumulation in A. latifolia fruit, which are expected to be useful for breeding cultivars with super-high AsA content in the future.
4. Materials and Methods
4.1. Plant Materials and Sampling
This study was conducted using 5-year-old (in 2019) fruit-bearing A. latifolia kiwifruit trees in an experimental block of Sichuan Provincial Academy of Natural Resource Sciences, Deyang, China (31°30′ N, 104°23′ E). Fruit samples were randomly collected from the fruiting branches (five per tree) of six different trees for AsA content determination and gene expression analysis. Fruit sampling included six representative stages of fruit development and ripening of A. latifolia corresponding to 30, 60, 90, 120, 150, and 170 days after flowering (DAF). The time when more than 75% of the flower’s petals had fallen was set as 0 DAF. Three biological replicates per stage were obtained from ten fruits per replicate. The samples were stored in a cold chamber and transported to the laboratory within 2 h. Upon arrival at the laboratory, samples were cut into slices, immediately frozen in liquid nitrogen, and kept at −80 °C until subsequent analysis.
4.2. Chemicals and Solvents
HPLC-grade authentic standards of AsA and oxalic acid were used (Beijing Solarbio Science & Technology, Beijing, China). Ultrapure water with an electrical resistivity of 18.2 MΩ cm was prepared using a Milli-Q gradient water purification system (Millipore Corporation, Bedford, MA, USA) via a 0.22 μm filter. This purified water was used to prepare all solutions in this study.
4.3. Determination of AsA Using HPLC Coupled with UV Detection
AsA content was determined using HPLC coupled with UV detection, following the method described in [32] with minor modifications. In brief, 2.0 g of frozen sample was fully ground with 5.0 mL of 0.1% oxalic acid to a homogeneous slurry, and the extract was made up to a total volume of 25.0 mL. The extracts obtained were filtered through a 0.45 μm membrane before injection into the HPLC system. The AsA standard solution (1.0 mg/mL) was prepared by dissolving 25.0 mg AsA in 0.1% oxalic acid and diluting to 25.0 mL with the same solvent. HPLC analysis was performed using an Agilent 1260 HPLC instrument and a variable wavelength detector (Agilent, Santa Clara, CA, USA). The chromatographic separation was performed on a Zorbax StableBond Analytical SB-C18 column (250.0 × 4.6 mm, 5.0 μm). The mobile phase was a 0.1% oxalic acid solution. The flow rate of the mobile phase was kept constant at 1.0 mL/min, and the injection volumes of the samples and standard were 10.0 μL for quantitative analysis. The UV absorbance of AsA was determined at 265.0 nm, and AsA content was quantified using a calibration curve.
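The quantification step can be sketched as a linear calibration followed by a unit conversion: content (mg 100 g−1 FW) = C (mg/mL) × 25.0 mL / 2.0 g × 100. The sketch below assumes a linear calibration curve and no further dilution before injection; the standard concentrations and peak areas are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def asa_content_mg_per_100g(peak_area, slope, intercept,
                            extract_volume_ml=25.0, sample_mass_g=2.0):
    """Convert a sample peak area to AsA content in mg per 100 g FW."""
    conc_mg_per_ml = (peak_area - intercept) / slope
    return conc_mg_per_ml * extract_volume_ml / sample_mass_g * 100.0

# Hypothetical calibration standards (mg/mL) and peak areas (arbitrary units)
std_conc = [0.2, 0.4, 0.6, 0.8, 1.0]
std_area = [200.0, 400.0, 600.0, 800.0, 1000.0]
slope, intercept = fit_line(std_conc, std_area)

# Hypothetical sample peak area
content = asa_content_mg_per_100g(800.0, slope, intercept)
print(f"{content:.1f} mg/100 g FW")
```

The default extract volume and sample mass are the values given in the text (25.0 mL and 2.0 g).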
4.4. RNA Extraction and Quality Evaluation
Total RNA was extracted using a PureLink RNA Mini Kit (Invitrogen Inc., Carlsbad, CA, USA) and purified using an on-column PureLink DNase treatment (Invitrogen Inc.) according to the manufacturer’s instructions. RNA purity was determined by A260 absorbance using a Nanodrop 2000 spectrophotometer (Thermo Fisher Scientific, Inc., CA, USA). RNA concentration was quantified using a Qubit 2.0 fluorometer (Invitrogen Inc.), and integrity was assessed using an RNA Nano 6000 Assay Kit on an Agilent Bioanalyzer 2100 system (Agilent Technologies, Santa Clara, CA, USA). The samples were sent to Biomarker Technologies Co. Ltd. (Beijing, China) for sequencing using PacBio and Illumina RNA-Seq technology.
4.5. Construction of Iso-seq cDNA Libraries and PacBio Sequencing
The mRNA was enriched from 4.0 μg of total RNA using oligo-dT magnetic beads and reverse transcribed into cDNA using the SMARTer™ PCR cDNA Synthesis Kit (Clontech, now Takara; http://www.takarabio.com, accessed on 10 February 2022). The size-selected cDNA library was constructed according to the BluePippin Size Selection System protocol, as described by PacBio (PN 100-092-800-03), and sequenced on the PacBio Sequel platform.
4.6. Read Processing and Error Correction of PacBio Iso-seq Reads
Raw data acquired from SMRT sequencing were processed using SMRT Link v5.0. Circular consensus sequence (CCS) reads were obtained from the subread BAM files, and reads of inserts (ROIs) were classified as full-length non-chimeric (FLNC) or non-full-length according to whether the poly(A) tail signal and the 5′ and 3′ cDNA primers were all present. Short reads were discarded. Subsequently, the FLNC sequences were clustered at the isoform level using iterative clustering for error correction (ICE), generating one consensus isoform per cluster [33]. The consensus isoforms were then polished with the non-full-length reads using the Quiver algorithm. Finally, isoforms with a Quiver accuracy of at least 0.99 were considered high-quality isoforms and used for subsequent analyses.
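The two decision rules in this step (full-length classification and the 0.99 accuracy cutoff) can be sketched in Python as follows. This is only an illustration of the logic described above, not the SMRT Link implementation; the `Read` class, its field names, and the example records are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Read:
    """Hypothetical per-read summary of primer and poly(A) detection."""
    name: str
    has_5p_primer: bool
    has_3p_primer: bool
    has_polya: bool
    is_chimeric: bool = False


def is_flnc(read):
    """A read is FLNC when both primers and the poly(A) signal are present
    and the read is not flagged as chimeric."""
    return (read.has_5p_primer and read.has_3p_primer
            and read.has_polya and not read.is_chimeric)


def high_quality(isoform_accuracy, min_accuracy=0.99):
    """Keep isoforms whose polished accuracy reaches the cutoff."""
    return [name for name, acc in isoform_accuracy.items()
            if acc >= min_accuracy]


reads = [
    Read("read_1", True, True, True),
    Read("read_2", True, False, True),        # missing 3' primer
    Read("read_3", True, True, True, True),   # chimeric
]
flnc_names = [r.name for r in reads if is_flnc(r)]
hq_names = high_quality({"iso_a": 0.995, "iso_b": 0.98, "iso_c": 0.99})
print(flnc_names, hq_names)
```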
4.7. Gene Functional Annotation
All isoforms were subjected to functional annotation using multiple protein and nucleotide databases, including the National Center for Biotechnology Information (NCBI) non-redundant protein (Nr, cutoff E-value ≤ 1e−5) [34], gene ontology (GO, E-value ≤ 1e−10) [35], Kyoto Encyclopedia of Genes and Genomes (KEGG, E-value ≤ 1e−3) [36], clusters of orthologous groups (COG, E-value ≤ 1e−3) [37], eukaryotic orthologous groups (KOG, E-value ≤ 1e−3) [38], protein families (Pfam, E-value ≤ 0.01) [39], and a manually annotated and reviewed protein sequence database (Swiss-Prot, E-value ≤ 1e−5) [40].
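Because each database has its own E-value cutoff, it can help to view the thresholds as a lookup table applied to search hits. The Python sketch below does this under stated assumptions: the `(isoform_id, database, evalue)` tuple layout, the `keep_hits` function, and the example hits are hypothetical, while the cutoff values are those listed above.

```python
# E-value cutoffs per annotation database, as listed in the text.
EVALUE_CUTOFFS = {
    "Nr": 1e-5,
    "GO": 1e-10,
    "KEGG": 1e-3,
    "COG": 1e-3,
    "KOG": 1e-3,
    "Pfam": 0.01,
    "Swiss-Prot": 1e-5,
}


def keep_hits(hits):
    """Retain hits whose E-value is at or below the cutoff of their database.

    `hits` is an iterable of (isoform_id, database, evalue) tuples.
    """
    return [hit for hit in hits if hit[2] <= EVALUE_CUTOFFS[hit[1]]]


# Hypothetical search results for two isoforms.
example_hits = [
    ("iso_1", "Nr", 1e-20),
    ("iso_1", "GO", 1e-8),      # fails the stricter GO cutoff
    ("iso_2", "Pfam", 0.005),
    ("iso_2", "KEGG", 0.01),    # fails the KEGG cutoff
]
kept = keep_hits(example_hits)
print(kept)
```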
4.8. Transcript Structure Analysis
Potential coding sequence (CDS) regions within the transcripts were predicted using TransDecoder (https://github.com/TransDecoder/TransDecoder/releases, accessed on 13 May 2022). Simple sequence repeats (SSRs) within the transcriptome were identified using MISA (http://pgrc.ipk-gatersleben.de/misa/, accessed on 13 May 2022). LncRNAs were screened using the coding-non-coding index (CNCI) with default parameters [42] and the coding potential calculator (CPC) with the NCBI eukaryotes’ protein database (E-value < 1e−10) [41]. Each transcript was translated in three possible frames, and Pfam Scan with the parameters -E 0.001 --domE 0.001 was used to determine whether it contained a domain of a known protein family. Transcription factors (TFs) were predicted from putative protein sequences using iTAK software [43].
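One common way to combine several coding-potential screens is to keep only transcripts that every tool calls non-coding. The text does not state how the CNCI, CPC, and Pfam results were combined, so the intersection rule in the Python sketch below is an assumption, and the function name and transcript IDs are hypothetical.

```python
def lncrna_candidates(cnci_noncoding, cpc_noncoding, pfam_domain_hits):
    """Return transcripts called non-coding by both CNCI and CPC
    that also have no Pfam domain hit (assumed combination rule)."""
    return (set(cnci_noncoding) & set(cpc_noncoding)) - set(pfam_domain_hits)


# Hypothetical tool outputs, reduced to sets of transcript IDs.
cnci = {"t1", "t2", "t3", "t4"}   # non-coding according to CNCI
cpc = {"t2", "t3", "t4", "t5"}    # non-coding according to CPC
pfam = {"t4", "t6"}               # transcripts with a known protein domain

candidates = lncrna_candidates(cnci, cpc, pfam)
print(sorted(candidates))
```

A stricter or looser rule (for example, a union of calls or an added length filter) would change the candidate set, so the combination rule should be checked against the original analysis before reuse.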
4.9. Illumina cDNA Library Construction and Second-Generation Sequencing for Transcriptome of Fruit Development and Ripening Stages
Eighteen cDNA libraries (six representative stages × three biological replicates) were constructed and used for second-generation high-throughput sequencing. RNA extraction and quality detection, cDNA synthesis, library preparation, high-throughput sequencing, identification of DEGs, functional categorization, and pathway analysis of DEGs followed our previously published protocol [9].
4.10. Validation of DEGs by Quantitative Reverse Transcription PCR (qRT-PCR)
Specific primers for qRT-PCR, designed using Primer Premier software (version 5.0; Premier Biosoft, Palo Alto, CA, USA), are presented in Table S1. qRT-PCR and data normalization were performed as previously described [9].
5. Conclusions
The lack of comprehensive genome sequence information limits investigation of the molecular genetic basis of the extremely high ascorbic acid content of A. latifolia. In this study, we used PacBio SMRT sequencing technology to generate a high-quality, functionally annotated reference transcriptome for this kiwifruit species. This is the first A. latifolia FL transcriptome release covering fruit tissues from six stages of fruit development and ripening, and it will be a crucial resource for both basic and applied research on biotechnology assays and genetic improvement. Based on the transcriptome, pair-wise comparisons between developmental stages were performed and differentially expressed genes (DEGs) were detected. The regulatory genes associated with the AsA biosynthesis and recycling pathways, and the lncRNAs involved in AsA-related gene expression regulation, were identified. In addition, the expression patterns of the genes involved in AsA biosynthesis and metabolism were validated by qRT-PCR; these patterns help explain the high AsA content detected in A. latifolia fruit. Our study provides insights into the molecular mechanisms regulating AsA accumulation in A. latifolia fruit, which may facilitate the breeding of cultivars with super-high AsA content.
Acknowledgments
H.D. expresses her deep appreciation to her supervisor, Changyong Zhou, who gave her the greatest support in work and life when she was a Chengdu–Chongqing Twin Cities Economic Circle visiting scholar. Mingzhang Li is gratefully acknowledged for providing the plant materials.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms23105808/s1.
Author Contributions
Conceptualization, D.L. and H.X.; methodology, H.D., H.X., Y.G., X.L. (Xinglin Liu), L.L. and J.W.; software, H.D., H.X., K.X., X.L. (Xiulan Lv) and R.H.; writing—original draft preparation, H.D.; writing—review and editing, H.D.; funding acquisition, D.L. and H.X. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
All data supporting the findings of this study are available within the paper and within its Supplementary Materials published online.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded by the Sichuan Science and Technology Program (2016NZ0105, 2019YFSY0041, and 2022YFH0049).
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1. Mellidou I., Kanellis A.K. Genetic control of ascorbic acid biosynthesis and recycling in horticultural crops. Front. Chem. 2017;5:50. doi: 10.3389/fchem.2017.00050.
- 2. Gallie D.R. L-Ascorbic Acid: A Multifunctional molecule supporting plant growth and development. Scientifica. 2013;2013:795964. doi: 10.1155/2013/795964.
- 3. Akram N.A., Shafiq F., Ashraf M. Ascorbic acid-a potential oxidant scavenger and its role in plant development and abiotic stress tolerance. Front. Plant Sci. 2017;8:613. doi: 10.3389/fpls.2017.00613.
- 4. Granger M., Eck P. Dietary vitamin C in human health. Adv. Food Nutr. Res. 2018;83:281–310. doi: 10.1016/bs.afnr.2017.11.006.
- 5. Miranda-Massari J.R., González M.J., Marcial-Vega V.A., Soler J.D. A possible role for ascorbic acid in COVID-19. J. Restor. Med. 2020;10:1–7. doi: 10.14200/jrm.2020.0102.
- 6. Bvenura C., Sivakumar D. The role of wild fruits and vegetables in delivering a balanced and healthy diet. Food Res. Int. 2017;99:15–30. doi: 10.1016/j.foodres.2017.06.046.
- 7. Wallace T.C., Bailey R.L., Blumberg J.B., Burton-Freeman B., Chen C.O., Crowe-White K.M., Drewnowski A., Hooshmand S., Johnson E., Lewis R., et al. Fruits, vegetables, and health: A comprehensive narrative, umbrella review of the science and recommendations for enhanced public policy to improve intake. Crit. Rev. Food Sci. Nutr. 2020;60:2174–2211. doi: 10.1080/10408398.2019.1632258.
- 8. Huang S., Ding J., Deng D., Tang W., Sun H., Liu D., Zhang L., Niu X., Zhang X., Meng M., et al. Draft genome of the kiwifruit Actinidia chinensis. Nat. Commun. 2013;4:2640. doi: 10.1038/ncomms3640.
- 9. Liang D., Deng H., Deng Q., Lin L., Lv X., Wang J., Wang Z., Xiong B., Zhao X., Xia H. Dynamic changes of phenolic compounds and their associated gene expression profiles occurring during fruit development and ripening of the Donghong kiwifruit. J. Agric. Food Chem. 2020;68:11421–11433. doi: 10.1021/acs.jafc.0c04438.
- 10. Xia H., Wang X., Su W., Jiang L., Lin L., Deng Q., Wang J., Deng H., Hu R., Liao M., et al. Changes in the carotenoids profile of two yellow-fleshed kiwifruit cultivars during storage. Postharvest Biol. Technol. 2020;164:111162. doi: 10.1016/j.postharvbio.2020.111162.
- 11. Vissers M.C.M., Carr A.C., Pullar J.M., Bozonet S.M. The bioavailability of vitamin C from kiwifruit. Adv. Food Nutr. Res. 2013;68:125–147. doi: 10.1016/B978-0-12-394294-4.00007-9.
- 12. Ferguson A.R. Botanical description. In: Testolin R., Huang H., Ferguson A.R., editors. The Kiwifruit Genome, Compendium of Plant Genomes. Springer; Cham, Switzerland: 2016. pp. 1–14.
- 13. Du G., Li M., Ma F., Liang D. Antioxidant capacity and the relationship with polyphenol and vitamin C in Actinidia fruits. Food Chem. 2009;113:557–562. doi: 10.1016/j.foodchem.2008.08.025.
- 14. Rassam M., Laing W. Variation in ascorbic acid and oxalate levels in the fruit of Actinidia chinensis tissues and genotypes. J. Agric. Food Chem. 2005;53:2322–2326. doi: 10.1021/jf048197s.
- 15. Nishiyama I., Yamashita Y., Yamanaka M., Shimohashi A., Fukuda T., Oota T. Varietal difference in vitamin C content in the fruit of kiwifruit and other Actinidia species. J. Agric. Food Chem. 2004;52:5472–5475. doi: 10.1021/jf049398z.
- 16. Latocha P., Krupa T., Wołosiak R., Worobiej E., Wilczak J. Antioxidant activity and chemical difference in fruit of different Actinidia sp. Int. J. Food Sci. Nutr. 2010;61:381–394. doi: 10.3109/09637480903517788.
- 17. Amarasinghe S.L., Su S., Dong X., Zappia L., Ritchie M.E., Gouil Q. Opportunities and challenges in long-read sequencing data analysis. Genome Biol. 2020;21:30. doi: 10.1186/s13059-020-1935-5.
- 18. Wu L., Liu S., Qi H., Cai H., Xu M. Research progress on plant long non-coding RNA. Plants. 2020;9:408. doi: 10.3390/plants9040408.
- 19. Fraser L.G., Harvey C.F., Crowhurst R.N., De Silva H.N. EST-derived microsatellites from Actinidia species and their potential for mapping. Theor. Appl. Genet. 2004;108:1010–1016. doi: 10.1007/s00122-003-1517-4.
- 20. Kalia R.K., Rai M.K., Kalia S., Singh R., Dhawan A.K. Microsatellite markers: An overview of the recent progress in plants. Euphytica. 2011;177:309–334. doi: 10.1007/s10681-010-0286-9.
- 21. Deng K., Deng R., Fan J., Chen E. Transcriptome analysis and development of simple sequence repeat (SSR) markers in Zingiber striolatum Diels. Physiol. Mol. Biol. Plants. 2018;24:125–134. doi: 10.1007/s12298-017-0485-0.
- 22. Taheri S., Abdullah T.L., Yusop M.R., Hanafi M.M., Sahebi M., Azizi P., Shamshiri R.R. Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules. 2018;23:399. doi: 10.3390/molecules23020399.
- 23. Bulley S.M., Rassam M., Hoser D., Otto W., Schünemann N., Wright M., MacRae E., Gleave A., Laing W. Gene expression studies in kiwifruit and gene over-expression in Arabidopsis indicates that GDP-L-galactose guanyltransferase is a major control point of vitamin C biosynthesis. J. Exp. Bot. 2009;60:765–778. doi: 10.1093/jxb/ern327.
- 24. Li M., Ma F., Liang D., Li J., Wang Y. Ascorbate biosynthesis during early fruit development is the main reason for its accumulation in kiwi. PLoS ONE. 2010;5:e14281. doi: 10.1371/journal.pone.0014281.
- 25. Zhang J.Y., Pan D.L., Jia Z.H., Wang T., Wang G., Guo Z.R. Chlorophyll, carotenoid and vitamin C metabolism regulation in Actinidia chinensis “Hongyang” outer pericarp during fruit development. PLoS ONE. 2018;13:e0194835. doi: 10.1371/journal.pone.0194835.
- 26. Fenech M., Amaya I., Valpuesta V., Botella M.A. Vitamin C content in fruits: Biosynthesis and regulation. Front. Plant Sci. 2019;9:2006. doi: 10.3389/fpls.2018.02006.
- 27. Li M., Chen X., Wang P., Ma F. Ascorbic acid accumulation and expression of genes involved in its biosynthesis and recycling in developing apple fruit. J. Am. Soc. Hortic. Sci. 2011;136:231–238. doi: 10.21273/JASHS.136.4.231.
- 28. Dowdle J., Ishikawa T., Gatzek S., Rolinski S., Smirnoff N. Two genes in Arabidopsis thaliana encoding GDP-L-galactose phosphorylase are required for ascorbate biosynthesis and seedling viability. Plant J. 2007;52:673–689. doi: 10.1111/j.1365-313X.2007.03266.x.
- 29. Mellidou I., Keulemans J., Kanellis A.K., Davey M.W. Regulation of fruit ascorbic acid concentrations during ripening in high and low vitamin C tomato cultivars. BMC Plant Biol. 2012;12:239. doi: 10.1186/1471-2229-12-239.
- 30. Mellidou I., Chagné D., Laing W.A., Keulemans J., Davey M.W. Allelic variation in paralogs of GDP-L-galactose phosphorylase is a major determinant of vitamin C concentrations in apple fruit. Plant Physiol. 2012;160:1613–1629. doi: 10.1104/pp.112.203786.
- 31. Laing W.A., Martínez-Sánchez M., Wright M.A., Bulley S.M., Brewster D., Dare A.P., Rassam M., Wang D., Storey R., Macknight R.C., et al. An upstream open reading frame is essential for feedback regulation of ascorbate biosynthesis in Arabidopsis. Plant Cell. 2015;27:772–786. doi: 10.1105/tpc.114.133777.
- 32. Li H., Liu Z.W., Wu Z.J., Wang Y.X., Teng R.M., Zhuang J. Differentially expressed protein and gene analysis revealed the effects of temperature on changes in ascorbic acid metabolism in harvested tea leaves. Hortic. Res. 2018;5:65. doi: 10.1038/s41438-018-0070-x.
- 33. Gordon S.P., Tseng E., Salamov A., Zhang J., Meng X., Zhao Z., Kang D., Underwood J., Grigoriev I.V., Figueroa M., et al. Widespread polycistronic transcripts in fungi revealed by single-molecule mRNA sequencing. PLoS ONE. 2015;10:e0132628. doi: 10.1371/journal.pone.0132628.
- 34. Yangyang D., Jiangqi L., Songfeng W., Yunping Z., Chen Y., Fuchu H. Integrated Nr database in protein annotation system and its localization. Comput. Eng. 2006;32:71–74.
- 35. Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T., et al. Gene ontology: Tool for the unification of biology. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556.
- 36. Kanehisa M., Goto S., Kawashima S., Okuno Y., Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004;32:D277–D280. doi: 10.1093/nar/gkh063.
- 37. Tatusov R.L., Galperin M.Y., Natale D.A., Koonin E.V. The COG database: A tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000;28:33–36. doi: 10.1093/nar/28.1.33.
- 38. Koonin E.V., Fedorova N.D., Jackson J.D., Jacobs A.R., Krylov D.M., Makarova K.S., Mazumder R., Mekhedov S.L., Nikolskaya A.N., Rao B.S., et al. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol. 2004;5:R7. doi: 10.1186/gb-2004-5-2-r7.
- 39. Finn R.D., Bateman A., Clements J., Coggill P., Eberhardt R.Y., Eddy S.R., Heger A., Hetherington K., Holm L., Mistry J., et al. Pfam: The protein families database. Nucleic Acids Res. 2014;42:D222–D230. doi: 10.1093/nar/gkt1223.
- 40. Apweiler R., Bairoch A., Wu C.H., Barker W.C., Boeckmann B., Ferro S., Gasteiger E., Huang H., Lopez R., Magrane M., et al. UniProt: The universal protein knowledgebase. Nucleic Acids Res. 2004;32:D115–D119. doi: 10.1093/nar/gkh131.
- 41. Kong L., Zhang Y., Ye Z.Q., Liu X.Q., Zhao S.Q., Wei L., Gao G. CPC: Assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 2007;35:W345–W349. doi: 10.1093/nar/gkm391.
- 42. Sun L., Luo H., Bu D., Zhao G., Yu K., Zhang C., Liu Y., Chen R., Zhao Y. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucleic Acids Res. 2013;41:e166. doi: 10.1093/nar/gkt646.
- 43. Zheng Y., Jiao C., Sun H., Rosli H.G., Pombo M.A., Zhang P., Banf M., Dai X., Martin G.B., Giovannoni J.J., et al. iTAK: A program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol. Plant. 2016;9:1667–1670. doi: 10.1016/j.molp.2016.09.014.