Abstract
Coconut palm has two distinct types—“tall” and “dwarf”—which differ morphologically. Tall coconut varieties need 8–10 years to start flowering, while dwarf coconut varieties only require 3–5 years. We compared seedling and reproductive stage transcriptomes for both coconut types to determine potential molecular mechanisms underlying control of flowering time in coconut. Several key genes in the photoperiod pathway were differentially expressed between seedling and reproductive leaf samples in both tall and dwarf coconut. These genes included suppressor of overexpression of constans (SOC1), flowering locus T (FT), and Apetala 1 (AP1). Alternative splicing analysis of genes in the photoperiod pathway further revealed that the FT gene produces different transcripts in tall compared to dwarf coconut. The shorter alternative splice variant of FT [which included a 6 bp deletion, alternative 3′ splicing sites (A3SS)] was found to be exclusively present in dwarf coconut varieties but absent in most tall coconut varieties. Our results provide a valuable information resource as well as suggesting a probable mechanism for differentiation of flowering time onset in coconut, providing a target for future breeding work in accelerating time to flowering in this crop species.
Subject terms: Plant breeding, Plant signalling
Introduction
Coconut palm (Cocos nucifera L., 2n = 32) belongs to the monocotyledonous family Arecaceae (Palmaceae) and is an important tropical oil and fruit crop. Presently, coconut palm is cultivated in 93 countries, including Central and South America, East and West Africa, Southeast Asia and the Pacific island, where it accounts for over 12 million hectares1. Coconut palm is divided into “tall” and “dwarf” types based on distinct morphological and breeding features between the two groups. Tall coconuts take 8–10 years to complete the vegetative to reproductive transition, can grow to a height of about 20–30 m, and have medium-to-large nuts. Tall coconuts are hardy and can adapt to a wide range of environmental conditions. Dwarf coconuts are early-flowering (4–6 years after planting), can grow to a height of about 10–15 m, and can produce a number of small nuts2. The large difference in flowering time between these two types is of particular interest for breeding and production: flowering time limits not only generation speed but also time to first harvest after planting, such that shorter flowering times are almost always desirable. However, to date little is known about regulation of flowering time in coconut.
In the last several decades, a great deal of work has been done in elucidating the molecular mechanisms underlying the transformation from vegetative growth to reproductive growth in plants. Four flowering regulation pathways have been described in model plant Arabidopsis thaliana: the photoperiod pathway3, the vernalization pathway4, the gibberellin pathway5 and the autonomous pathway6. Time to flowering in Arabidopsis thaliana is predominantly determined by expression of the Flowering Locus T (FT) gene7. In Arabidopsis thaliana FT is negatively regulated by expression of Flowering Locus C (FLC), which is a transcriptional repressor, and positively regulated by expression of Constans (CO), which is a transcriptional activator7. Overexpression of CO up-regulates FT and causes an early flowering phenotype8, as does direct overexpression of FT9,10 CO is itself negatively regulated by Suppressor of Overexpression of Constans 1 (SOC1) and PhyB11, the latter of which is a photoreceptor12. PhyB loss of function mutants also therefore result in early flowering12. Flowering time phenotypes have also been observed as a result of mutation of other photoreceptor genes, such as CRY213. In total at least 29 major genes are involved in regulation of flowering time in Arabidopsis thaliana , including Flowering Locus D (FLD) and Leafy (LFY)14–18.
The function of FT has also been well studied in many species other than model plant Arabidopsis thaliana. In rice, a total of thirteen FT-like homologs were identified, of which three FT-like genes can promote flowering (OsFTL1-OsFTL3)18–21. Of these genes, OsFTL2 was found to be a mobile signal that moves from the leaf to the shoot apical meristem to induce flowering22,23. In barley, a total of five FT-like genes were found, of which HvFT1 was highly expressed under long-day conditions in transition from vegetative to reproductive growth, HvFT3 was identified as a candidate gene under a major flowering-time QTL, HvFT2 and HvFT4 were highly expressed at later developmental stages and HvFT5 showed low expression level under short-day conditions24; highlighting the possible complexity of regulatory mechanisms involving multiple FT copies. In rapeseed, which has a history of polyploidy, 164 gene copies were identified from 29 core Arabidopsis flowering time gene homologs, an average of 4.7 copies per Arabidopsis gene25; 101 genome regions in this species were significantly associated with flowering time26 and 12 haplotypes differentiated winter and spring-flowering rapeseed7,27. By contrast, although a total of four FT-like genes were isolated in poplar, only one (PtFT2) was shown to shorten juvenile phase and promote seasonal flowering28.
In coconut palm, we have as yet no information about the molecular basis of flowering time control, despite the importance of the flowering time trait in this long-lived agricultural tree species. In the present study, we used comparative transcriptomics to elucidate the molecular mechanisms underlying the differences in flowering time observed between tall and dwarf coconut types. Coconut orthologs of 174 Arabidopsis genes belonging to the flowering-time pathway were identified and characterized for expression in different tissues from seedling and reproductive stages. Among these genes, we detected and validated alternative splicing of CnFT transcripts as the putative causal mechanism for the observed flowering time difference between tall and dwarf coconut types.
Materials and methods
Plant materials and RNA isolation
The leaf tissues of seedling and mature stage of twococonut plants were used for RNA-seq: 10-year-old Yellow Dwarf (YD) individuals; 1-year-old YD seedlings; 10-year-old Hainan Tall (HT) individuals and 1-year-old HT seedlings. Each sample included a mixture of two biological replicates. The eight coconut individuals were growing in the coconut germplasm resource of the Coconut Research Institute of the Chinese Academy of Tropical Agricultural Sciences. The average temperature and humidity in this location are approximately 23.9 °C and 89%. Total mRNA was extracted from spear leaves of these coconut individuals according to the MRIP method29.
Illumina sequencing
Purified mRNA was separately fragmented with divalent cations under increased temperature. These short fragments were taken as templates to synthesize the first-strand cDNA using hexamer primers and superscript III (Invitrogen, Carlsbad, CA, USA). Second-strand cDNA was then synthesized in a solution containing buffer, dNTP, RNaseH and DNA polymerase I and subsequently purified using a QiaQuick PCR extraction kit (Qiagen). EB buffer was used to resolve these short fragments for end reparation and poly (A) addition. The sequence adaptors were linked to two ends of short cDNA sequences and suitably sized cDNA fragments were selected out for PCR amplification based on the agrose gel electrophoresis results. Finally, the library established was sequenced using an Illumina Hiseq 200. The paired-end library was developed according to the paired-end sample Preparation kit protocol (Illumina, USA).
Real-time qPCR assays
Real-time qPCR was performed following a standard SYBR Premix Ex Taq Kit (TaKaRa) protocol in 96-well optical plates (Axygen) using a volume of 10 μl. The primer pairs were designed by using Snapgene Viewer 1.0 software (Table 1). The reactions were incubated in 0.2 ml tubes of a Mastercycler Ep Realplex4 (Eppendorf) machine as follows: 95 °C for 5 s, 55 °C for 15 s and 68 °C for 20 s. The procedure ended by a melt-curve ramping from 60 to 95 °C for 20 min to check the PCR specificity. All qPCR reactions were carried out in biological and technical triplicates. The final Ct values were the means of nine values. The comparative expression levels of the selected expressed sequences were normalized to that of actin.
Table 1.
Gene ID | Forward primer | Reverse primer |
---|---|---|
AP1_CCG008583.1 | AGGGAAGACGGAGGTGAGAC | GGGAGAGAAGACGATGAGGG |
AP1_CCG008584.1 | GGGGAAGACGGAGGTGAGACG | CCCTTGGGGGAGAAGACGATG |
AP1_CCG022175.1 | CCAGGTCTCCGTCGTCATCT | CTCTCGTGCTTGGCATCCC |
AP1_CCG027514.1 | AGGGGGAAGACGGAGATGAA | GAGAAGACGATGACGGCGAC |
CO_CCG005523.2 | CGAAGGGAGGGTTATTGGGG | GTCGCAGGTGACGCAGAGC |
FT_CCG009539.1 | TTCATCAGGTCCGCATTTCTT | AGGGTCCACCATCACGAGC |
FT_CCG011565.1 | GTGGTAGGGAGGGTCATAGGC | GGACAGTTGGAGGGTTGATAGC |
Calculating gene expression level
RPKM (Reads Per kb per Million reads) was used to calculate gene expression level using the following formula30:
where C is the number of reads that exclusively aligned to one expressed sequence, N is the total number of reads that aligned to all expressed sequences, and L is the basic number in the CDS of the corresponding expressed sequence. The CDS and genome sequence (the first version of coconut genome) is available in the GigaDB database (GigaDB, RRID:SCR 004002, https://gigadb.org/dataset/100347).
The statistical significance of the differential expression was determined according to the method documented by Audic and Claveries31. The statistical significance p(x) can fit the Poisson distribution
Arabic alphabet x in the formula represents number of reads mapped to each unigene. λ refers to the number of real transcripts of the unigene.
When thousands of hypothesis tests are performed, the p-value suitable for a single test is not sufficient to guarantee a low rate of false discovery. Thus, an false discovery rate (FDR) control method was applied using multiple hypothesis testing to correct the p-value results32. Subsequently, the RPKM ratio was used to compute the fold change of gene expression for each pair of samples simultaneously. The differentially expressed genes were selected using a threshold of FDR ≤ 0.001 and an absolute value of log2ratio ≥ 133.
The functional annotation of differential expressed genes
The predicted genes were used for BLAST and annotation against NCBI non-redundant sequence database (Nr) at a cut-off value of 10–5, The differential expressed genes were also aligned by BLASTX to protein databases such as GO, KEGG, Swissprot, and InterPro database (transcription factor), and PRG database (PRG). The Blast2GO program was used to obtain GO annotations for differential expressed genes. The WEGO software was then used to perform GO functional classification of differential expressional genes. Meanwhile, Blastall software was used to align differential expressed genes against KEGG (Kyoto Encyclopedia of Genes and Genomes) database. KEGG enrichment was performed by using phyper of R package. Enrichment of KEGG pathway were performed at a cut-off FDR of 0.01.
Detection of alternative splicing
We used rMATS 4.0.1 software to detect alternative splicing events based on RNA-seq data and coconut genome sequence (GigaDB, RRID:SCR 004002, https://gigadb.org/dataset/100347)34. The p-value and False Discovery Rate (FDR) were used to determine the isoform ratio of one gene at a cut-off PDR of 0.05. Detailed information is shown in Fig. 1. To exclude the influence of insertion/deletion variation in AS detection, we used STAR to map RNA-seq clean reads and GATK4 to call indel variations. The detected AS events that overlapped with transcript indel variations were filtered out for further analysis.
PCR amplification and sequencing of 20 coconut individuals
Primers were designed according to the exon sequences flanking alternative splicing junctions using Snapgene Viewer 1.0 software. The primer sequences were listed in Table 2. PCR amplifications were performed in 50 μl reaction mixures containing a 500 ng cDNA sample from the oil palm mesocarp, 1 × PCR buffer, 2 mM MgCl2, 5 U of TaqDNA polymerase (TaKaRa, China), 0.5 μM of each primer, and 0.2 mM dNTP mix. The PCR program included denaturation at 98 °C for 30 s; followed by 30 cycles at 98 °C for 10 s, 55 °C for 30 s, and 72 °C for 60 s; and a final extension at 72 °C for 5 min. The PCR products were electrophoretically visualized on a 1% agarose gel.
Table 2.
Primer name | Primer sequence |
---|---|
FT_AS_475-F | GTGGTAGGGAGGGTCATAGGC |
FT_AS_475-R | GGGCATTGAAGTAGACCGCAG |
Results
Transcriptome sequencing for tall and dwarf coconut
Approximately 6.6 Gb of unmapped clean reads were generated for each of four samples: tall coconut seedlings, tall coconut mature trees, dwarf coconut seedlings and dwarf coconut mature trees (Table 3). All clean reads have been deposited in the European Nucleotide Archive (Submission Number: PRJEB34852). After mapping all clean reads to the coconut reference genome using the software HISAT, a total of 36,558 transcripts were predicted. Among them, a total of 7,037 transcripts were predicted to be non-coding RNA. The remaining 29,521 were identified as protein-coding transcripts, 25,114 (85%) of which were consistent with previously known proteins predicted based on the coconut genome and the remaining 4,407 of which were novel protein-coding transcripts.
Table 3.
Clean reads (Gb) | Number of reads (million) | Q20 (%) | |
---|---|---|---|
Tall coconut | |||
Seedling | 6.59 | 65.94 | 97.58 |
Mature | 6.57 | 65.74 | 97.52 |
Dwarf coconut | |||
Seedling | 6.63 | 66.34 | 97.62 |
Mature | 6.68 | 66.75 | 97.5 |
Analysis and functional annotation of differential expression genes between tall and dwarf coconut
Global expression changes were identified between seedlings and mature trees of tall and dwarf coconut using the software program PossionDis. Compared to seedling period of tall coconut, 1729 transcripts were up-regulated and 4,150 were down-regulated (abs (log2Foldchange) < 1 or FDR > 0.001) in mature period of tall coconut. Meanwhile, compare to seedling period of dwarf coconut, 1964 transcripts were up-regulated and 2009 were down-regulated (abs (log2Foldchange) < 1 or FDR > 0.001) in seedling period of dwarf coconut.
In order to annotate differentially expressed unigenes, unigene sequences were aligned against the NCBI non-redundant (Nr) protein database using a cut-off E-value of 10–5 (Fig. 2). A large proportion of differentially expressed unigenes were assigned into the metabolic process (tall coconut: 902 unigenes and dwarf coconut: 576 unigenes) and the cellular process (tall coconut: 655 unigenes and dwarf coconut: 438 unigenes) categories for the “biological process” categories, cell (tall coconut: 921 unigenes and dwarf coconut: 537 unigenes) and cell part (tall coconut: 918 unigenes and dwarf coconut: 537 unigenes) in the “cellular component” categories, and catalytic activity (tall coconut: 949 unigenes and dwarf coconut: 636 unigenes) and binding (tall coconut: 733 unigenes and dwarf coconut: 498 unigenes) in the molecular function categories. Only down-regulated unigenes were detected in some functional categories in mature trees of dwarf coconut compared to dwarf coconut seedlings, including immune system processes, positive regulation of biological processes and biological phases under the biological processes categories, membrane-enclosed lumen, extracellular regions, and virion in the cellular component categories, and transcription factor activity and protein binding in the molecular function categories. By contrast, in each of these functional categories, both up-regulated and down-regulated unigenes were detected in mature trees of tall coconut compared to tall coconut seedlings.
In order to further identify the active biochemical pathways of differentially expressed unigenes, we mapped these differential expressed genes (abs (log2Foldchange) < 1 or FDR > 0.001) to the reference canonical pathways in the Kyoto encyclopedia of genes and genomes (KEGG). A total of 2,580 differential expressed genes between seedling and mature stage of tall coconut were mapping the 115 KEGG pathways (Fig. 2). Moreover, in dwarf coconut, 1,704 differential expressed genes were mapped onto 113 KEGG pathways. Enrichment results for KEGG pathways showed that the highest enrichment factors were detected in the photosynthesis-antenna protein pathway in dwarf coconut, followed by the photosynthesis pathway (Fig. 3). Meanwhile, in tall coconut, the highest enrichment factors were detected in the photosynthesis-antenna protein pathway, followed by Mannose type O-glycan biosynthesis.
Meanwhile, Transcription factors were also predicted by aligning differential expressed genes with COG database. A total of 685 differential expressed genes in tall coconut were identified and mapped onto 46 gene families. While, in dwarf coconut, 509 differential expressed genes were identified to be transcription factors and mapped into 45 gene families. Meanwhile, differential expressed genes were also blasted against PRG database. A total of 340 differential expressed genes were identified to be resistant genes in tall coconut. Moreover, in dwarf coconut, 208 differential genes were identified to resistant genes.
Identification of candidate genes involved in flowering-time pathways and their expression profiles
In order to identify candidate genes involving in flowering-time pathways, protein sequences of 174 Arabidopsis genes involving in flowering time were downloaded from the National Center for Biotechnology information (NCBI). Detailed information for these genes is listed in Supplementary Table 1. Subsequently, these Arabidopsis protein sequences were aligned using BLAST with the “tblastn” function against the CDS (coding sequences) database of coconut35. A total of 198 genes in coconut were identified as homologues of flowering-time genes in Arabidopsis. Detailed information for these coconut genes is listed in Supplementary Table 2. The 198 genes were classified into the four flowering-time pathways [photoperiod pathway, autonomous pathway, vernalization pathway, and gibberellin (GA) pathway] based on their Arabidopsis annotations.
From this data we identified four light receptor genes, including one copy of CRY1 (cryptochrome 1), one copy of PHYA (phytochrome A), and two copies of PHYB (phytochrome B), all of which showed high expression levels both in seedlings and mature trees of tall and dwarf coconut (Fig. 4) and are known to regulate clock genes. We identified 12 putative clock genes in the coconut genome: two ZTL (zeitlupe), one ELF6 (early flowering 6), two ELF7 (early flowering 7), one FKF1 (flavin binding F-box 1), and six SOC1 genes. Among them, three SOC1 genes (CCG008583, CCG008584, and CCG008883) showed higher expression levels in mature trees compared to seedlings in both tall and dwarf coconut. CCG008883 in particular had FPKM values of 0.22 and 0.34 in tall and dwarf coconut seedlings respectively, but FPKM values of 55.16 and 50.12 in mature trees of tall and dwarf coconut respectively. SOC1 is known to induce CO gene expression in Arabidopsis: we also found three CO orthologs in coconut (CCG025784, CGG002149, and CCG005523), one of which (CCG005523) was highly expressed at both seedling and reproductive stages of tall and dwarf coconut, while the other CO genes had low expression levels at both developmental stages. CO genes are known up-regulators of the Flowering Locus T (FT) gene, which triggers the transition from vegetative to reproductive growth. We identified six FT homologs in coconut: CCG009539.1, CCG014033.1, CCG023905.1, CCG016739.1, CCG025329.1, and CCG011565.1. Among them, CCG011565.1 was more highly expressed at the reproductive stage than the seedling stage in both tall and dwarf coconut. Finally, FT can also enhance the expression of the AP1 gene. We found four AP1 homologs in coconut: CCG027514.1, CCG008584.1, CCG008583.1, and CCG022175.1. Among them, higher expression was detected for CCG027514.1 in mature trees compared to in seedlings of tall and dwarf coconut.
Meanwhile, quantitative real-time PCR (qPCR) was used to validate the gene expression profile of several genes involved in the photoperiod pathway: CO (CCG021491), FT (CCG009539 and CCG011565), and AP1 (CCG008583, CCG008584, CCG022175, and CCG027514) gene copies (Fig. 5). The qPCR results were similar to the transcriptome data results. Higher expression levels in CO (CCG021491), FT (CCG011565), and AP1 (CCG022175 and CCG027514) were detected in mature trees compared to in seedlings of tall and dwarf coconut.
Alternative splicing of Flowering Locus T (FT) and association with flowering time
A total of 11,376, 12,124, 11,676, and 12,186 novel transcripts were detected in mature tall coconut trees, tall coconut seedlings, mature trees of dwarf coconut, and dwarf coconut seedlings, respectively. These transcripts were analysed in conjunction with gene predictions to determine if alternative splicing events were occurring. We detected five types of alternative splicing: alternative 3′ splicing sites (A3SS), alternative 5′ splicing sites (A5SS), mutually exclusive exons (MXE), retained introns (RI), and skipped exons (SE). The largest proportion of novel alternatively spliced transcripts were generated by skipped exons (SE), followed by retained introns (RI) and alternative 3′ splicing sites (A3SS), with the fewest number of alternatively spliced transcripts generated by mutually exclusive exons (MXE) (Fig. 6).
Meanwhile, we examined in detail any alternative splicing events involving genes in the floweing time pathway. For FT, different transcripts were detected between tall and dwarf coconut. Due to alternatice 3′ splicing site (A3SS) in the fourth exon of FT, one six-nucleotide indel was detected between tall and dwarf coconut (Fig. 7A). In order to validate the relationship between the six-nucleotide indel and flowering time, we examined FT transcripts from 8 tall coconuts and 11 dwarf coconuts. Shorter transcripts (including the six-nucleotide deletion) were detected in all dwarf coconut samples, while 8/11 tall coconut samples showed longer transcripts (without the six-nucleotide deletion) (Fig. 7B and Supplementary File 2).
Discussion
Coconut (Cocos nucifera, 2n = 32), belonging to the genus Cocos in the monocotyledonous family Arecaceae (Palmaceae), is an important tropical oil and fruit crop. Coconut can be subdivided into two types according to morphological characteristics: tall coconuts, which flower in 8–10 years, and dwarf coconut, which only need 3–5 years to flower36. In order to elucidate the molecular mechanisms underlying differences in flowering time between tall and dwarf coconut, we compared transcriptome data from seedlings and mature trees of tall and dwarf coconuts. We found flowering in Cocos nucifera to be putatively regulated by the photoperiod pathway. In this pathway, FT is a crucial gene involved in signal transduction; overexpressing FT can result in an extremely early-flowering phenotype7. In our study, we detected alternative splicing of the FT gene between tall and dwarf coconut, as well as a number of key genes putatively involved in regulation of flowering time in coconut.
One of the most crucial phases in flowering plant development is the decision to flower, and flowering time has a major influence on plant fitness. The transition from the vegetative to the reproductive stage is controlled by several external factors, including photoperiod14, temperature, and abiotic stress, as well as by internal factors such as hormone production, C/N nutrient ratios and the age of the plant37. A complex set of interactions between the internal and external environment together decide when the plant transforms from the vegetative to the reproduction stage. In woody plants, it is difficult to investigate the molecular mechanisms controlling flowering time, due to their long life-cycles. Woody plants, due to their characteristic long life cycles and perenniality, generally have specific strategies to adapt to the ever-changing environment, such as bud dormancy. Despite this, several studies have shown a high level of conservation of genetic networks regulating flowering time between woody and herbaceous plants38–40. In our study, some coconut genes also showed high similarity with Arabidopsis genes known to be involved in the regulation of flowering time, including CRY1, PHYA, PHYB, ZTL, SOC1, ELF3, FKF1, ELF4, CO, and FT homologs. CRY1 has been validated to up-regulate expression of clock genes in the Arabidopsis photoperiod pathway, including ZTL (zeitlupe), SOC1 (suppressor of overexpression of constans), ELF3 (early flowering 3), FKF1 (flavin binding F-box 1), and ELF4 (early flowering 4)11. By contrast, PHYA and PHYB can suppress the expression of clock genes in the Arabidopsis photoperiod pathway20. We also observed SOC1 genes were up-regulated expressed in mature period compared to seedling period from tall and dwarf coconut.
Coconut is mainly cultivated in the tropical and subtropical regions, including Central and South America, East and West Africa, Southeast Asia and the Pacific Islands. Hence, it is not possible that flowering time could be regulated by the vernalization pathway in coconut. Based on our transcriptome data, we think that flowering time in coconut is most probably regulated by the photoperiod pathway. We observed high levels of expression of some crucial genes involved in the photoperiod pathway at the mature tree stage compared to the seedling stage.
Most genes in flowering plants contain introns which will be spliced from pre-mRNA in the nucleus before mature mRNA is formed. Splice site variation in the junction between introns and exons will result in multiple mRNA transcripts or isoforms from a single gene41. Alternative splicing can contribute to gene regulation and proteome diversity, and is also known to be involved in stress response42–45, development and reproductive growth46–48. In Arabidopsis, FT plays an important role in the transition from vegetative growth to reproduction growth49,50. Alternative splicing of FT in Chrysanthemum morifolium results in one FT gene encoding up to four different transcripts51. The expression level of these four FT isoforms depends on the developmental stage of flower, with alternative splicing associated with the degree of early flowering51. We also detected alternative splicing of a CnFT gene between tall and dwarf coconut. In dwarf coconut, which requires approximately 3–5 years to enter its reproductive stage, we detected only one transcript type, which lacked six nucleotides compared to the transcript type found to be most abundant in tall coconut (which requires 8–10 years to enter its reproductive stage). These results suggest that the FT transcript lacking the six nucleotides produce by A3SS may have a positive contribution to the early flowering phenotype in coconut. The other isoform of CnFT was detected in 8/11 tall coconut individuals: the remaining three may potentially result from open-pollination/heterozygosity resulting from cross-pollination with dwarf coconut varieties. Our results suggest that this alternative splice variant of FT could be a major player in flowering time differentiation between the dwarf and tall coconut types, but further investigation and validation is warranted to confirm this effect.
Supplementary information
Acknowledgements
This study has been supported by Hainan major science and technology project and the National Nonprofit Institute Research Grants of 1630152020005, 1630152019006 and 1630152017007, and the Species and variety Protection of the Ministry of Agriculture and Rural Affairs.
Author contributions
Y.X., W.X., H.F.—designed the study. R.L., J.Z.—performed transcriptome sequencing and RNA extraction. Z.L., S.G., Y.Z. and J.Z.—performed alternative splicing analysis. Y.X., A.S.M. and W.X.—drafted and revised the manuscript. Y.D. and X.S.—assisted in the data analysis.
Data availability
The raw data of transcriptome is from seedling and mature period of tall and dwarf coconut available in the European Nucleotide Archive (Submission Number: PRJEB34852). The CDS and genome sequence (the first version of coconut genome) is available in the GigaDB database (GigaDB, RRID:SCR 004002, https://gigadb.org/dataset/100347)35.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Wei Xia, Rui Liu and Jun Zhang.
Contributor Information
Haikuo Fan, Email: vanheco@163.com.
Yong Xiao, Email: xiaoyong1980@catas.cn.
Supplementary information
is available for this paper at 10.1038/s41598-020-68431-2.
References
- 1.Batugal, P., Rao, V., Oliver, J., eds. Coconut Genetic Resources. IPGRI-Regional Office for Asia, the Pacific and Oceania, Serdang, Malaysia (2005).
- 2.Patel J. Coconut breeding. Proc. Assoc. Econ. Biol. 1937;5:1–16. [Google Scholar]
- 3.Koornneef M, Hanhart CJ, van der Veen JH. A genetic and physiological analysis of late flowering mutants in Arabidopsis thaliana. Mol. Gen. Genet. 1991;229:57–66. doi: 10.1007/BF00264213. [DOI] [PubMed] [Google Scholar]
- 4.Clarke JH, Dean C. Mapping FRI, a locus controlling flowering time and vernalization response in Arabidopsis thaliana. Mol. Gen. Genet. 1994;242:81–89. doi: 10.1007/BF00277351. [DOI] [PubMed] [Google Scholar]
- 5.Langridge J. Effect of day-length and gibberellic acid on the flowering of Arabidopsis. Nature. 1957;180:36–37. [Google Scholar]
- 6.Sheldon CC, Burn JE, Perez PP, Metzger J, Edwards JA, Peacock WJ, Dennis ES. The FLE MADS box gene: a repressor of flowering in Arabidopsis regulated by vernalization and methylation. Plant Cell. 1999;11:445–458. doi: 10.1105/tpc.11.3.445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Schiessl SV, Huettel B, Kuehn D, Reihardt R, Snowdon RJ. Flowering time gene variation in Brassica species shows evolutionary principles. Front. Plant Sci. 2017;8:1742. doi: 10.3389/fpls.2017.01742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Samach A, Coupland G. Time measurement and the control of flowering in plants. BioEssays. 2000;22:38–47. doi: 10.1002/(SICI)1521-1878(200001)22:1<38::AID-BIES8>3.0.CO;2-L. [DOI] [PubMed] [Google Scholar]
- 9.Onouchi H, Coupland G. The regulation of flowering time of Arabidopsis in response to daylength. J. Plant Res. 1998;111:271–275. [Google Scholar]
- 10.Tanaka C, Itoh T, Iwasaki Y, Mizuno N, Nasuda S, Murai K. Direct interaction between VRN1 protein and the promoter region of the wheat FT gene. Genes Genet. Syst. 2017;93(1):25–29. doi: 10.1266/ggs.17-00041. [DOI] [PubMed] [Google Scholar]
- 11.Halliday KJ, Koornneef M, Whitelam GC. Phytochrome B and at least one other phytochrome mediate the accelerated flowering response of Arabidopsis thaliana L. to low red/far-red ratio. Plant Physiol. 1994;104:1311–1315. doi: 10.1104/pp.104.4.1311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Guo Y, Wang D, Jia W, Song J. Effects of seed vernalisation and photoperiod on flowering induction in the haplophyte Thellungiella halophila. Aust. J. Bot. 2012;60:743. [Google Scholar]
- 13.Lin C. Photoreceptors and regulation of flowering time. Plant Physiol. 2000;123:39–50. doi: 10.1104/pp.123.1.39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Baurle I, Dean C. The timing of developmental transitions in plants. Cell. 2006;125:655–664. doi: 10.1016/j.cell.2006.05.005. [DOI] [PubMed] [Google Scholar]
- 15.Boss PK, Bastow PM, Mylne JS, Dean C. Multiple pathways in the decision to flower: Enabling, promoting, and resetting. Plant Cell. 2004;16:18–31. doi: 10.1105/tpc.015958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jack T. Molecular and genetic mechanisms of floral control. Plant Cell. 2004;16:S1–17. doi: 10.1105/tpc.017038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Moon J, Lee H, Kim M, Lee I. Analysis of flowering pathway intergrators in Arabidopsis. Plant Cell Physiol. 2005;46:292–299. doi: 10.1093/pcp/pci024. [DOI] [PubMed] [Google Scholar]
- 18.Putterill J, Laurie R, Macknight R. It’s time to flower: The genetic control of flowering time. BioEssays. 2004;26:363–373. doi: 10.1002/bies.20021. [DOI] [PubMed] [Google Scholar]
- 19.Chardon F, Damerval C. Phylogenomic analysis of the PEBP gene family in cereals. J. Mol. Evol. 2005;61:1030–1033. doi: 10.1007/s00239-004-0179-4. [DOI] [PubMed] [Google Scholar]
- 20.Izawa T, Oikawa T, Sugiyama N, Tanisaka T, Yano M, Shimamoto K. Phytochrome mediates the external light signal to repress FT orthologs in photoperiodic flowering of rice. Genes Dev. 2002;6:2006–2020. doi: 10.1101/gad.999202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Paterson AH, Bowers JE, Peterson DG, Estill JC, Chapman BA. Structure and evolution of cereal genomes. Curr. Opin. Genet. Dev. 2003;13:644–650. doi: 10.1016/j.gde.2003.10.002. [DOI] [PubMed] [Google Scholar]
- 22.Salse J, Piegu B, Cooke R, Delseny M. New in silico insight into the synteny between rice (Oryza sativa L.) highlights reshuffling and identifies new duplications in the rice genome. Plant J. 2004;38:396–409. doi: 10.1111/j.1365-313X.2004.02058.x. [DOI] [PubMed] [Google Scholar]
- 23.Tamaki S, Matsuo S, Wong HL, Yokoi S, Shimamoto K. Hd3a protein is a mobile flowering signal in rice. Science. 2007;316:1033–1040. doi: 10.1126/science.1141753. [DOI] [PubMed] [Google Scholar]
- 24.Faure S, Higgins J, Turner A, Laurie DA. The Flowering Locus T-like gene family in barley (Hordeum vulgare) Genetics. 2007;176:599–609. doi: 10.1534/genetics.106.069500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Coelho CP, Minow MAA, Chalfun-Junior A, Colasanti J. Putative sugarcane FT/TFL 1 genes delay flowering time and alterreproductive in Arabidopsis. Front. Plant Sci. 2014;5:221. doi: 10.3389/fpls.2014.00221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Schiessl S, Iniguez-Luy F, Qian W, Snowdon R. Diverse regulatory factors associated with flowering time and yield responses in winter-type Brassica napus. BMC Genom. 2015;16(1):737. doi: 10.1186/s12864-015-1950-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Schiessl S, Samans B, Huttel B, Reinhard R, Snowdon RJ. Capturing sequence variation among flowering-time regulatory gene homologs in the allopolyploid crop species Brassica napus. Front. Plant Sci. 2014;5:404. doi: 10.3389/fpls.2014.00404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hsu CY, Liu Y, Luthe DS, Yuceer C. Poplar FT2 shortens the juvenile phase and promotes seasonal floering. Plant Cell. 2006;18:1846–1861. doi: 10.1105/tpc.106.041038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Xiao Y, Yang Y, Cao H, Fan H, Ma Z, Lei X, Mason AS, Xia Z, Huang X. Efficient isolation of high quality RNA from tropical palms for RNA-seq analysis. Plant Omics. 2012;5:584–589. [Google Scholar]
- 30.Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptome by RNA-seq. Nat. Methods. 2018;5:621–628. doi: 10.1038/nmeth.1226. [DOI] [PubMed] [Google Scholar]
- 31.Audic S, Claverie JM. The significance of digital gene expression profiles. Genome Res. 1997;7:986–995. doi: 10.1101/gr.7.10.986. [DOI] [PubMed] [Google Scholar]
- 32.Peiner A, Yekutieli D, Benjamini Y. Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics. 2003;19:368–375. doi: 10.1093/bioinformatics/btf877. [DOI] [PubMed] [Google Scholar]
- 33.Wu J, Zhang Y, Zhang H, Huang H, Folta KM. Whole genome wide expression profiles of Vitis amurensis grape responding to downy mildew by using Solex sequencing technology. BMC Plant Biol. 2010;10:234. doi: 10.1186/1471-2229-10-234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Shen S, Park JW, Lu Z, Lin L, Henry MD, Wu YN, Zhou Q, Xing Y. Robust and flexible detection of differential alternative splicing from replicate RNA-seq data. Proc. Acad. Sci. USA. 2014;11:E5593–E5601. doi: 10.1073/pnas.1419161111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Xiao Y, Xu P, Fan H, Baudouin L, Xia W, Bocs S, Xu J, Li Q, Guo A, Zhou L, Li J, Wu Y, Ma Z, Armero A, Issali AE, Liu N, Peng M, Yang Y. The genome draft of coconut (Cocos nucifera) Gigascience. 2017;6:1–11. doi: 10.1093/gigascience/gix095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Menon KPV, Pandalai KM. The Coconut. A Monograph. Indian Central Coconut Committee. Madras: Government Press; 1958. pp. 86–102. [Google Scholar]
- 37.Otiz-Marchena MI. Photoperiodic control of Carbon distribution during the floral transition in Arabidopsis. Plant Cell. 2014;26:565–584. doi: 10.1105/tpc.114.122721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dong X, Jiang X, Kuang G, Wang Q, Zhong M, Jim D, Hu J. Genetic control of flowering time in woody plants: Roses as an emerging model. Plant Divers. 2017;39:104–110. doi: 10.1016/j.pld.2017.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Mohamed R, Wang CT, Ma C, Shevchenko O, Dye SJ, Puzey JR, Etherington E, Sheng X, Meilan R, Strauss SH, Brunner AM. Populus CEN/TFL 1 regulates first onset of flowering, axillary meristem identity and dormancy release in Populus. Plant J. 2010;62:674–688. doi: 10.1111/j.1365-313X.2010.04185.x. [DOI] [PubMed] [Google Scholar]
- 40.Chai G, Wang Z, Tang X, Yu L, Qi G, Wang D, Yan X, Kong Y, Zhou G. R2R3-MYB gene pairs in Populus: Evolution and contribution to secondary wall formation and flowering time. J. Exp. Bot. 2014;65:4255–4269. doi: 10.1093/jxb/eru196. [DOI] [PubMed] [Google Scholar]
- 41.Barbazuk WB, Fu Y, McGinnis KM. Genome-wide analyses of alternative splicing in plants: Opportunities and changes. Genome Res. 2008;18:1381–1392. doi: 10.1101/gr.053678.106. [DOI] [PubMed] [Google Scholar]
- 42.Kwon YJ, Park MJ, Kim SG, Baldwin IT, Park CM. Alternative splicing and nonsense-mediated decay of circadian clock genes under environmental stress conditions in Arabidopsis. BMC Plant Biol. 2014;14:136. doi: 10.1186/1471-2229-14-136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Feng J, Li J, Gao Z, Lu Y, Yu J, Zheng Q. SKIP confers osmotic tolerance during salt stress by controlling alternative gene splicing in Arabidopsis. Mol. Plant. 2015;8:1038–1052. doi: 10.1016/j.molp.2015.01.011. [DOI] [PubMed] [Google Scholar]
- 44.Shikata H, Hanada K, Ushijima T, Nakashima M, Suzuki Y, Matsushita T. Phytochrome controls alternative splicing to mediate light responses in Arabidopsis. Proc. Natl. Acad. Sci. USA. 2014;111:18781–18786. doi: 10.1073/pnas.1407147112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Wang HF, You CJ, Chang F, Wang Y, Wang L, Qi J. Alternative splicing during Arabidopsis flower development results in constitutive and stage-regulated isoforms. Front. Genet. 2014;5:25. doi: 10.3389/fgene.2014.00025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zhang B, Liu ZX, Ma J, Song Y, Cheng FJ. Alternative splicing of the AGAMOUS orthologous gene in double flower of Magnolia stellata (Magnoliaceae) Plant Sci. 2015;241:277–285. doi: 10.1016/j.plantsci.2015.10.017. [DOI] [PubMed] [Google Scholar]
- 47.Yuan YX, Wu J, Sun RF, Zhang XW, Xu DH, Bonnema G. A naturally occurring splicing site mutation in the Brassica rapa FLC1 gene is associated with variation in flowering time. J. Exp. Bot. 2009;60:1299–1308. doi: 10.1093/jxb/erp010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Lee JH, Ryu HS, Chuang KS, Pose D, Kim S, Schmid M. Regulation of temperature-responsive flowering by MADS-box transcription factor repressors. Science. 2013;342:628–632. doi: 10.1126/science.1241097. [DOI] [PubMed] [Google Scholar]
- 49.Corbesier L, Vincent C, Jang S, Fornara F, Fan Q, Searle I. FT protein movement contributes to long-distance signaling in floral in floral induction of Arabidopsis. Science. 2007;316:1030–1033. doi: 10.1126/science.1141752. [DOI] [PubMed] [Google Scholar]
- 50.Notaguchi M, Abe M, Kirmura T, Daimon Y, Kobayashi T, Yamaguchi A. Long-distance, graft-transmissible action of Arabidopsis FLOWRING LOCUS T protein to promote flowering. Plant Cell Physiol. 2008;49:1645–1658. doi: 10.1093/pcp/pcn154. [DOI] [PubMed] [Google Scholar]
- 51.Mao Y, Sun J, Cao P, Zhang R, Fu Q, Chen S, Chen F, Jiang J. Functional analysis of alternative splicing of the FLOWERING LOCUS T orthologous gene in Chrysanthemum morifolium. Hortic. Res. 2016;3:16058. doi: 10.1038/hortres.2016.58. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw data of transcriptome is from seedling and mature period of tall and dwarf coconut available in the European Nucleotide Archive (Submission Number: PRJEB34852). The CDS and genome sequence (the first version of coconut genome) is available in the GigaDB database (GigaDB, RRID:SCR 004002, https://gigadb.org/dataset/100347)35.