Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2021 May 18;11:10516. doi: 10.1038/s41598-021-90082-0

Revealing the developmental dynamics in male strobilus transcriptome of Gnetum luofuense using nanopore sequencing technology

Chen Hou 1,2, Yuxin Tian 3,4, Yingli Wang 1,2, Huiming Lian 1,2, Dongcheng Liang 1,2, Shengqing Shi 5, Nan Deng 3,4,, Boxiang He 1,2,
PMCID: PMC8131605  PMID: 34006996

Abstract

Gnetum is a pantropical distributed gymnosperm genus. As being dioecious, Gnetum species apply female and male strobili to attract and provide nutrition to insect pollinators. Due to its unique gross morphology, a Gnetum male strobilus receives much attention in previous taxonomic and evolutionary studies. However, underlying molecular mechanisms that control male strobilus development and pollination adaptation have not been well studied. In the present study, nine full-length transcriptomes were sequenced from three developmental stages of the G. luofuense male strobili using Oxford Nanopore Technologies. In addition, weighted gene co-expression network analysis (WGCNA), and RT-qPCR analysis were performed. Our results show that a total of 3138 transcription factors and 466 long non-coding RNAs (lncRNAs) were identified, and differentially expressed lncRNAs and TFs reveal a dynamic pattern during the male strobilus development. Our results show that MADS-box and Aux/IAA TFs were differentially expressed at the three developmental stages, suggesting their important roles in the regulation of male strobilus development of G. luofuense. Results of WGCNA analysis and annotation of differentially expressed transcripts corroborate that the male strobilus development of G. luofuense is closely linked to plant hormone changes, photosynthesis, pollination drop secretion and reproductive organ defense. Our results provide a valuable resource for understanding the molecular mechanisms that drive organ evolution and pollination biology in Gnetum.

Subject terms: Flowering, Plant development, Plant morphogenesis

Introduction

The genus Gnetum comprises approximately 40 species that are widely distributed in pantropical forests1,2. The leaves of Gnetum are edible, its stems and bark are made into string, nets, and paper, and its seeds can be used for oil and drinks3,4. Reproductive organs of Gnetum consist of female and male strobili that are composed of several layers of involucre collars1,5 (Fig. 1). In a male strobilus, each layer of involucre collars is composed of exposed (e.g. G. gnemon L.) or hidden (G. luofuense C.Y. Cheng) sterile ovules at the top, surrounded by several layers of male reproductive units with attached microsporangia6. With regard to the hidden and exposed sterile ovules, genus Gnetum was classified into two sections i.e. Gnemonomorphi and Cylindrostachys in the monograph7. Morphological observation of male strobili has been conducted among different lineages of Gnetum e.g. African species8,9, Asian arboresent species3,5, and Asian lianoid species1012. These results provide important diagnostic characters to further delimit species among different lineages of this genus. As being dioecious, the majority of Gnetum species are entemophilous or ambophilous6,13. At anthesis, sterile ovules of G. gnemon L. and G. cuspidatum Blume not only produce sugary pollination drops with rotten scents to attract insects, but also provide nutrition to pollinators as a reward14,15.

Figure 1.

Figure 1

Different developmental stages of male strobili in Gnetum luofuense C.Y. Cheng. A white bar represents 1 cm. The photograph was made by C.H.

Despite of these previous efforts, some issues related to sexual reproduction and evolution about Gnetum male strobili remain unclear. First, previous studies shows that MADS-box genes are essential in organ identity and subsequent development of female and male strobili in G. gnemon1618. However, roles of other genes, e.g. Type I and TM8 genes in the sexual determination of male strobili have not been investigated. Second, photosynthetic capacity of Gnetum leaves have been tested to be low19,20, a male strobilus might undergo the improvement of photosynthesis to provide additional resource for the entire course of male strobilus development, but the validation of this statement requires robust evidence. Third, at anthesis, a male strobilus produces sugary pollination drops, the underlying mechanisms that generate the main components in a pollination drop remain poorly understood. Last but not the least, a male strobilus might apply an efficient mechanism to protect against external fungi, bacteria and pathogen, since their ovules are exposed to the surrounding area like other gymnosperms. Molecular mechanisms, again, have not been carefully investigated.

LncRNAs (long non-coding RNAs) take part in transcriptional and post-transcriptional gene regulation in almost all eukaryotic organisms2123, it might control the expression of genes involved in male strobilus development of G. luofuense. The roles of lncRNAs might differ across different development stage of male strobili but their potential roles in the regulation of functional genes, so far, have rarely been investigated. It is probably because the RNA-seq based on Illumina platforms produce short reads, preventing us from the assembly of full-length transcrptome. With the advent of third generation sequencing technology, Oxford Nanopore Technologies (ONT) can now be used to sequence complete full-length cDNAs24. To date, Nanopore sequencing has been widely used in plant genome sequencing)25,26, but rarely in full-length transcript sequencing and investigation of lncRNAs. A recent study has shown that ONT technology generates better quality raw data and provides more accurate data at transcription level than PacBio technology27. In the present study, we generated nine full-length transcriptomes from three developmental stages of G. luofuense male strobili using Nanopore sequencing technology. Differentially expressed lncRNAs, transcripts and TFs were investigated across the three developmental stages. Our efforts will be made to understand the molecular mechanisms that drive organ evolution and pollination biology in Gnetum.

Results

Assembly and functional annotation of full-length reads

In the present study, we defined three developmental stages of G. luofuense male strobilus—FA represents 1–10 days’ growth, FB represents 10–15 days’ growth, and FC represents 15–25 days’ growth (Fig. 1). A total of 30,261,170 clean reads were generated by Nanopore sequencing, with mean lengths ranging from 993 (FC03) to 1236 bp (FB02) (Table S1). Among these clean reads, 22,997,187 full length (FL) reads with clear primer sequences at both ends were identified. FL reads accounted for 78.05% (1,971,235; FA02) to 79.64% (1,485,322; FA03) of the clean reads (Table S2). All FL reads were clustered and polished, yielding a total of 257,886 consensus reads, ranging from 22,119 (FB01) to 33,351 (FC03) in individual samples (Table S3). To delete redundant reads, all consensus reads were mapped against the reference genome of G. luofuense with the mapping rates ranging from 99.26 to 99.40% (Table S4). After mapping, a total of 132,653 non-redundant full-length reads (nFLs) were obtained, ranging from 12,066 (FB01) to 16,689 (FC03) in individual samples (Table S5). A total of 45,036 transcription were functionally annotated by searching against the NCBI non-redundant protein sequence (NR), Swiss-Prot, Clusters of Orthologous Groups of proteins (KOG/COG), Gene ontology (GO), Protein Family (Pfam) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases (Table S6).

Identification of CDS and lncRNAs

We identified a total of 21,720 open reading frames (ORFs), among which 16,692 (76.85%) were complete ORFs with both start and stop codons. The length distribution of the complete ORFs is shown in Fig. 2A. Among the complete ORFs, the average length of the 5′ untranslated regions (UTRs) was 530 bp, and the average length of the 3′ UTR was 557 bp. A total of 15,439 coding sequences (CDS) were detected, with an average length of 394 bp. In addition, 466 lncRNAs with a mean length of 689 nt were identified using four separate methods (Fig. 2B). The lncRNAs comprised 349 lincRNAs (74.9%), 20 antisense lncRNAs (4.3%), 10 intronic lncRNAs (2.1%), and 87 sense lncRNAs (18.7%) (Fig. 2C). The differentially expressed lncRNAs are shown in Fig. 2D, of which we found the number between FA and FC is the largest (22 lncRNAs), while the smallest between FB and FC (one lncRNA). Furthermore, three hundred twenty-six genes were predicted to be regulated by 296 lncRNAs in cis, 218 genes regulated by 81 lncRNAs in trans. Networks of target genes with the regulation of lncRNAs in cis and in trans are shown in Fig. 2E. Moreover, KEGG pathway enrichment analysis was performed for those target genes (Fig. 2F). Our results show that target genes were primarily enriched in “photosynthesis (ko00195)”, “plant-pathogen interaction” (ko04626), “starch and sucrose metabolism” (ko00500), “plant hormone signal transduction” (ko04075), “flavonoid biosynthesis” (ko00941), “amino sugar and nucleotide sugar metabolism” (ko00520, 25).

Figure 2.

Figure 2

Identification of open reading frames (ORFs) and lncRNAs based on nine full-length transcriptome. (A) Length distribution of ORFs detected in all full-length transcripts. (B) Venn diagram showing the number of lncRNAs identified using four different approaches: CPC (Coding Potential Calculator), CNCI (Coding-Non-Coding Index), CPAT (Coding Potential Assessment Tool), and Pfam (Protein Family). (C) Functional classification and numbers of four lncRNA types. (D) Venn diagram showing the overlap in the differential expressed lncRNAs among the three different developmental stages of G. luofuense male strobili. (E) Representatives of predicted interaction networks among lncRNAs and their target genes. Solid lines and dotted lines represent the expression regulation by the lncRNAs in cis and in trans, respectively. (F) KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway annotations of cis- and trans-regulated genes regulated by the detected lncRNAs.

WGCNA analysis

Expression of all transcripts were quantified with CPM values and listed in Supplementary Dataset File 1. Based on the data, we performed weighted correlation network analysis (WGCNA), and our results show that seven modules of highly correlated TFs across the three developmental stages of G. luofuense male strobili (Fig. 3A, B). The three largest modules of enriched transcripts are shown in turquoise genes), brown (349), blue (340). At the FA stage, values of Pearson’s correlation coefficient in the brown and turquoise modules were both positive, but with the q value > 0.05. At the FC stage, however, positive values of correlation coefficient were found among blue, green, and red modules with all q values < 0.05. KEGG pathway enrichment analysis was performed for transcripts in the five modules (Fig. 3C). TFs in the turquoise module were primarily enriched in “photosynthesis (ko00195)”, “phenylpropanoid biosynthesis” (ko00940), and transcripts in the brown module were primarily enriched in “phenylpropanoid biosynthesis”, “DNA replication” (ko03030), and “plant hormone signal transduction” (ko04075). Transcripts in the blue module were primarily enriched in “starch and sucrose metabolism”, “phenylpropanoid biosynthesis”, and “plant hormone signal transduction”.

Figure 3.

Figure 3

Weighted correlation network analysis (WGCNA) of co-expressed transcripts. (A) Hierarchical clustering tree showing co-expression modules based on WGCNA analysis. Each branch in the phylogenetic tree corresponds to an individual gene, and highly interconnected genes are grouped into seven modules. The different colored rows below the phylogeny indicate differentially expressed transcripts in G. luofuense. (B) Module–trait relationship with physiological indexes. The number represents the correlation coefficient about modules with physiological indexes. The number in the bracket means p-value (C) KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway annotations of co-expressed genes in the largest five modules.

Detection of DETs

A total of 3119 differentially expressed transcripts (DETs) were detected (Supplementary Dataset File 2), with the largest number of (2786) between the FA and FC stages; 1263 transcripts were up-regulated and 1523 transcripts were down-regulated in FA versus FC (Fig. 4A). The smallest number of DETs (127) was detected between FB and FC: 45 up-regulated and 86 down-regulated. The number of shared DETs was the largest between FA versus FB and FA versus FC (667), and the number of shared DETs was the smallest between FB versus FC and FA versus FB (12) (Fig. 4B). Among the up-regulated genes, the set of all DETs was significantly enriched in the top three KEGG pathways (Fig. 4C): “glutathione metabolism” (ko00480), “glycolysis /gluconeogenesis” (ko00010), and “amino sugar and nucleotide sugar metabolism” (ko00520). Among the down-regulated genes, the set of all DETs was significantly enriched in the top three KEGG pathways (Fig. 4D): “phenylpropanoid biosynthesis”, “photosynthesis”, “starch and sucrose metabolism” (ko00500).

Figure 4.

Figure 4

Detected differentially expressed transcripts (DETs) and their annotation. (A) Numbers of DETs between different developmental stages of G. luofuense male strobili. (B) Venn diagram showing the overlap in DETs between different developmental stages of male strobili. (C) KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway annotations of up-regulated transcripts between FA and FC. (D) KEGG pathway annotation of down-regulated transcripts between FA and FC. (E) Two clusters identified based on expression levels of DETs from FA to FC. (F) GO enrichment amongst the two clusters with orange boxes denoting significant enrichment of GO annotation.

TF identification and RT-qPCR validation

A total of 3138 transcription factors (TFs) was detected, of which MYB-related, MADS-box, and bHLH TFs constituted the most abundant TFs (Fig. 5A, Supplementary Dataset File 3). We further investigated the differentially expressed TFs among the three developmental stages of G. luofuense male strobili. We found that bHLH, MYB and MADS-box MIKC were highly expressed at FA and FC, while TFs e.g. Y-subunit B, ABI3 and CBF/NF-Y were highly expressed at FB (Fig. 5B). In addition, we performed RT-qPCR experiments to validate the eight genes of interest, which encode the differentially expressed TFs (Fig. 5C). Results of RT-qPCR are largely congruent with the results of ONT sequencing, but there are exceptions such as genes TnS001008199g01 and TnS000980857g03 at the stage of FC and TnS000498063g52 at the stage of FB. It might because the sampling of RT-qPCR and ONT-sequencing slightly differed at the developmental stages of male strobili.

Figure 5.

Figure 5

(A) A partial list of transcription factors (top 20 gene families) identified in nine full-length transcriptome of G. luofuense male strobili. (B) Transcription factors highly expressed in FA, FB, and FC with numbers in brackets. (C) RT–qPCR of eight genes of interests that encode differentially expressed transcription. Expression of the ACTIN gene used as an internal control was also shown. Counts per million (CPM) values from Nanopore sequencing are indicated on the left y-axis, and relative RT-qPCR expression levels are indicated on the right y-axis.

Discussion

Reproductive organ development

MADS-box genes play an essential role in reproductive organ development of seed plants2830. Transcription factors encoded by MADS-box genes consists of two types, i.e. type I (SRF-like) and type II (MEF2-like). A previous study reports that DEF/GLO-like gene GGM15 are differentially expressed during a sterile ovule development of G. gnemon16. Another study shows that transcription factors (TFs) encoded by DEF/GLO-like genes GGM2 and GGM15, AGL6-like genes GGM9 and GGM11 were examined to form a heterodimer that determine the initiation of male strobilus development in G. gnemon31. In the present study, four type II genes (including three TM8-like genes and one SQUA-like gene identified in32) as well as one type I genes were differentially expressed across the three developmental stages of G. luofuense male strobili (Fig. 5C). These results corroborate that MADS-box genes are important in male strobilus development of G. luofuense.

Besides of MADS-box TFs, Aux/IAA TFs are probably also involved in a male strobilus development of G. luofuense. Transcriptional factors AUX/IAA are involved in the auxin-response regulation during plant growth3335. In angiosperms, it has been known that Aux/IAA TFs play an important role in gynoecium morphogenesis, ovule development, and formation of primary branch in Arabidopsis36. Another study shows that Aux/IAA genes take effects in reproductive organ development and responses to abiotic stress in rice37. In Gnetum, a previous study has shown that concentration of endogenous hormones gibberellin A3 (GA3) and zeatin riboside (ZR) dramatically increases while indole-3-acetic acid (IAA) declines over the course of male strobilus development in G. parvifolium12. Another study shows that six Aux/IAA genes are involved in the female strobilus development of G. luofuense38. In the present study, the results of present study that 13 DETs annotated with hormone and signal transduction are down-regulated between FA and FC (Fig. 4C). Besides, three Aux/IAA TFs were identified and validated to be differentially expressed across the male strobilus development (Fig. 5C). Thus, our results corroborate that plant hormone, e.g. indole-3-acetic acid might play an important role in the male strobilus development of G. luofuense.

Photosynthesis

Photosynthesis in reproductive organs is commonly seen in seed plants. A previous study shows that sepals and petals of tobacco flowers have the photosynthetic capability of fixing CO2 and pigment biosynthesis39. Another example shows that photosynthetic capacity of grapevine flowers gradually decrease across the procedure of flowering and finally cease at fruit periods40. In gymnosperms, female strobili of Pseudotsuga menziesi (Douglas-fir) were examined to possess considerable rates of photosynthesis41. Another study shows that female strobili of Pinus sylvestris (Scots pine) enclosed by aluminum foil yield lower seed weight than the control group42, corroborating the presence of photosynthesis in reproductive organs. Our results corroborate the statement that the DETs are annotated in the KEGG pathways e.g. photosynthesis and photosynthesis−antenna proteins (Fig. 4D). For an example, e.g. expression of genes in photosystem I, e.g. PsaD, PsaE, and PsaG are down-regulated from FA to FC, these genes are regulated by gene differentially expressed lncRNAs (Fig. 2). Accordingly, a scientific question was raised whether reproductive organs of Gnetum actually undergo photosynthesis? To address this question, an explicit study of measuring photosynthetic capacity of G. luofuense male strobili at different developmental stages is highly desirable in future studies.

Pollination drop secretion

Pollination drops in general function as a media to capture pollen grains and further transport them to interior nucellus for fertilization4345. Gnetum is an entemophilous tropical genus, and a few Asian species (e.g. G. gnemon, G. parvifolium and G. luofuense) are reported to be insect pollinated13,15,46. Sugary pollination drops in Gnetum are able to attract and provide rewards to pollinators like nectars in angiosperms4345. It has known pollination drops of Gnetum are rich in carbohydrates (sucrose, fructose, and glucose), proteins (degradome and secrotome) and amino acids, phosphate, and minerals6,47. Our results of WGCNA analysis shows that co-expressed transcripts in the blue module were enriched in KEGG pathways related to carbohydrate metabolism, e.g. starch and sucrose metabolism, glycolysis/gluconeogenesis, and fructose and mannose metabolism (Fig. 3C). These genes were shown to be highly expressed at the late developmental stages of male strobili. For examples, expression of gene TnS000345359g03 that encodes beta-fructofuranosidase is up-regulated from FA to FC. These results corroborate that G. luofuense male strobilus development is closely associated with sugar reproduction; the process might be related to pollination drop production as a response to insect pollinators.

Defense mechanism

As being reproductive organs, male strobili in G. luofuense demands protection against fungi and pathogens throughout their development. An effective manner in G. luofuense is to produce a physical barrier—an involucre collar to protect sterile ovules that secrete pollination drops at anthesis. A previous study shows that pollination drops are composed of defense-related proteins, e.g. thaumatin-like proteins, xylosidase, beta-glucodiase, and chitinases secreted from sterile ovules of G. gnemon and G. luofuense6,47. In the present study, we found three genes, i.e. TnS000958803g05, TnS000762467g04, and TnS000052095g04 were all up-regulated from FA to FC. Of these genes, genes TnS000958803g05 and TnS000762467g04 encode thaumatin-like proteins, which were validated to participate in ovule defense in hybrid yew (Taxus × media)48. The other gene TnS000052095g04 encodes chitinases, which was considered to be important in protection of female ovules among various species of Ephedra49 and Welwitschia50. Thus, this evidence corroborates active defense reaction against external organisms during the male strobilus development of G. luofuense.

Methods

Plant material sampling

Male strobili of G. luofuense were collected from one male individual at the Bamboo Garden at Sun Yat-sen University, Guangzhou, China on April 27, 2019 (voucher No. CH004, the herbarium was identified by Chen Hou and deposited in SYS, Guangzhou, China). Collection of plant samples was permitted by Sun Yat-sen University. Three developmental stages—Stage 1 (1–10 days), Stage 2 (10–15 days), and Stage 3 (15–25 days)—were identified (Fig. 1). Three replicate samples of each developmental stage were prepared from separate male strobili of G. luofuense. They were further named as FA (01–03), FB (01–03), and FC (01–03), for a total of nine sequencing samples. Identical samples of G. luofuense male strobili were also prepared for RT–qPCR analyses.

RNA extraction and nanopore sequencing

The collected male strobilus material was snap-frozen in liquid nitrogen and stored at − 20 ℃. A RNeasy Plant Mini Kit (Qiagen, Valencia, CA, USA, product No. 74903) was used to extract total RNA from the nine samples, and relic DNA was removed using RNase-free DNase (Qiagen). The concentration, purity, and integrity of extracted RNA was assessed using 1% agarose gel electrophoresis with NorthernMax gel buffer (Qiagen), a NanoDrop spectrophotometer (ThermoFisher Scientific, Wilmington, DE, USA), and an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA), respectively.

The synthesis of cDNA for Nanopore sequencing was performed according to the protocol from Oxford Nanopore Technologies, UK: 1 μg total RNA was prepared for cDNA libraries using cDNA-PCR Sequencing Kit (SQK-PCS109) protocol provided by Oxford Nanopore Technologies (ONT)51. Libraries were then created using a sequencing library preparation kit. We added defined PCR adapters directly to both ends of the first-strand cDNA. The establishment of cDNA libraries was subject to 14 cycles of PCR amplification with LongAmp Taq (NEB). The cycling parameters were set as 94 ℃ for 3 min, followed by 35 cycles of 94 ℃ for 30 s, 56 ℃ for 45 s, 72 ℃ for 1.5 min, and a final extension step of 72 ℃ for 10 min. The PCR products were then subjected to ONT adaptor ligation using T4 DNA ligase (NEB). Agencourt XP beads was used for DNA purification according to ONT protocol. The final cDNA libraries were added to FLO-MIN109 flowcells and libraries were then sequenced using a MinION Mk1B sequencer. Nanopore sequencing data from the nine male strobilus samples were all deposited in the NCBI Sequence Read Archive (SRA) under BioProject accession number PRJNA645614.

Data processing and genome mapping

Raw sequencing reads were analyzed using MinKNOW version 2.2 (Oxford, UK). Raw reads were filtered with the following settings: read quality score ≥ 7 and read length ≥ 500 bp. Ribosomal RNA was removed by searching against the Silva rRNA database (https://www.arb-silva.de). Full-length reads were identified when primers were detected at both ends of the cleaned reads. All full-length reads were clustered after mapping to the G. luofuense reference genome, https://datadryad.org/stash/dataset/10.5061/dryad.0vm3752) using minimap2 version 2.1.752. After each cluster was polished with pinfish (https://github.com/nanoporetech/pinfish), consensus reads were again mapped to the G. luofuense reference genome using minimap2. The mapped reads were further collapsed using the cDNA_Cupcake package with a minimum coverage of 85% and a minimum identity of 90%. Consensus reads with sequence differences at the 5′ ends were not considered to be redundant transcripts.

Functional annotation and classification

All detected genes, including novel genes, were annotated by BLASTX v.2.2.26 searches (E-value < 1 × 10−5) of Pfam (http://pfam.xfam.org/), NR (http://www.ncbi.nlm.nih.gov/), and Swiss-Prot (http://www.expasy.org/sprot/) databases, as well as by HMMER v.3.1b2 searches (E-value < 1 × 10−10) of the Pfam database53,54. In addition, GO enrichment analysis Clusters of Orthologous Groups of Proteins (KOG/COG, http://www.ncbi.nlm.nih.gov/COG/) was performed using the GOseq package implemented in R55, and KEGG enrichment analysis was performed using KEGG Orthology56,57 based Annotation System using KOBAS58.

Identification of CDS and lncRNAs

We identified CDS of polished non-redundant isoforms using the software TransDecoder version 5.0.259 (https://github.com/TransDecoder/TransDecoder/releases) with a default setting. Prediction of lncRNAs was performed using four methods, the Coding Potential Calculator (CPC)60, the Coding-Non-Coding Index (CNCI)61, the Coding Potential Assessment Tool (CPAT)62, and Pfam. Subsequent to filtering protein-coding reads, lncRNAs were identified as reads that possessed at least 200 nt and two exons. Identified lncRNAs were then classified as either lincRNA, antisense-lncRNA, sense-lncRNA, or intronic-lncRNA. Target genes that are regulated by identified lncRNAs were predicted using the software LncTar (www.cuilab.cn/lnctar)63; thus, two types of cis or trans target genes were defined as regulated by lncRNAs in this analysis64,65.

WGCNA analysis

Pearson correlation matrix and network topology analysis were applied to calculate the gene correlation across the nine samples using the R package WGCNA version 1.4266 with the following settings: CPM values > 1, fold change > 1, minimum module size of 30, and minimum height for merging modules of 0.055. Then, the adjacency was converted to a topological overlap matrix. A hierarchal clustering tree was constructed using the Dynamic Tree Cut package for R67.

DET detection analysis

Full-length reads were mapped to the G. luofuense reference genome mentioned above, and mapped reads with coverage above five were saved. Transcript expression was quantified as counts per million (CPM), where CPM = (reads mapped to transcript/total reads mapped in one sample) × 1,000,000. Differential expression analysis between each pair of developmental stages was performed using DESeq version 1.10.1 (https://bioconductor.org/packages/release/bioc/html/DESeq.html)68. The t-test was used to judge the statistical significance of expression difference, and the threshold of P-value was determined with the FDR in multiple testing. Three replicates at each developmental stages were set as an independent group for pairwise comparisons. The resulting P-values were adjusted using Benjamini and Hochberg’s69 approach in order to control the false discovery rate. Genes with FDR-adjusted p-values < 0.01 and fold changes ≥ 2 were defined as DETs. KEGG enrichment analyses were performed as described above. Out of these DETs, transcript expression of three replicates at each developmental stage was averaged, and denoted FA, FB, and FC.

TF identification and RT–qPCR validation

Transcription factors were identified and their genes were assigned to different families using iTAK version 1.7 (http://bioinfo.bti.cornell.edu/tool/itak)70. We selected eight differentially expressed TFs for gene expression validation by RTqPCR referred to the MIQE guidelines. Primers were designed using Primer Premier version 5.071, and information on the RT–qPCR protocol is presented in Supplementary Dataset File 1. The cDNA was synthesized from 2 μl of total RNA were extracted from the nine samples using a PrimeScript RT Master Mix (Perfect Real Time) (TaKaRa, China). For each sample, three independent analyses were performed, and the mean and standard deviation of the RT–qPCR gene expression values were calculated. The entire experiment was performed in Applied Biosystems Real-Time PCR Instruments-QuantStudio 6 (ThermoFisher Scientific, China). The amplification program consisted of 2 min of initial denaturation at 95 ℃, followed by 40 cycles of 20 s at 94 ℃, 20 s at 58 ℃, 20 s at 72 ℃. When producing melting curves, the amplification program was set at 30 s at 94 ℃, followed by 30 s at 65 ℃ and 30 s at 94 ℃. The G. luofuense actin gene was used as an endogenous control to estimate the relative expression of TFs using the ΔΔCt method72. Based on the slope of the standard curve, amplification efficiency was calculated in the website sever (https://www.thermofisher.com/uk/en/home/brands/thermoscientific/molecularbiology/molecular-biology-learning-center/molecular-biology-resource-library/thermoscientific-web-tools/qpcr-efficiency-calculator.html).When amplification efficiency was all close to 100%, relative expression values were calculated as −2∆∆Ct, of which ∆∆Ct = ∆Ct—the average values of three replicates of ∆Ct, and ∆Ct = Ct (genes of our interest) – Ct (actin gene).

Supplementary Information

Acknowledgements

We thank Guan-Shen Liu (Biomarker Technologies, Inc., Beijing, China) for providing valuable technical and analytical assistance. We acknowledge TopEdit LLC for the linguistic editing and proofreading during the preparation of this manuscript. We also thank the three anonymous viewers for their constructive suggestions and comments. This work was supported by the Natural Science Foundation of Guangdong Province, China (No. 2018A0303130163 to C.H.).

Author contributions

N.D. and B.H. conceived and designed the experiments. C.H., H.L., S.S., and Y.W. performed the experiments. Y.T., D.L., and N.D. analyzed the data. CH wrote the manuscript. All authors have read and approved the manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Nan Deng, Email: idengnan@me.com.

Boxiang He, Email: heboxiang@163.com.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-90082-0.

References

  • 1.Hou C, Humphreys AM, Thureborn O, Rydin C. New insights into the evolutionary history of Gnetum (Gnetales) Taxon. 2015;64:239–253. doi: 10.12705/642.12. [DOI] [Google Scholar]
  • 2.Price RA. Systematics of the Gnetales: a review of morphological and molecular evidence. Int. J. Plant Sci. 1996;157:40–49. doi: 10.1086/297402. [DOI] [Google Scholar]
  • 3.Markgraf, F. in Flora Malesiana Ser. 1, Vol. 4 (ed Steenis, C. G. G. J.) 336–347 (Djakarta: Noordhoff-Kolff, 1951).
  • 4.Kubitzki, K. in The families and genera of vascular plants (eds Kramer, K. U. & Green, P. S.) 383–386 (Springer, 1990).
  • 5.Endress PK. Structure and function of female and bisexual organ complexes in Gnetales. Int. J. Plant Sci. 1996;157:113–125. doi: 10.1086/297407. [DOI] [Google Scholar]
  • 6.Hou C, Saunders RM, Deng N, Wan T, Su Y. Pollination drop proteome and reproductive organ transcriptome comparison in Gnetum reveals entomophilous adaptation. Genes. 2019;10:800. doi: 10.3390/genes10100800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Markgraf, F. Monographie der Gattung Gnetum Ser. 3. Bull. Jard. Bot. Buitenzorg10, 407–511 (1930).
  • 8.Biye, E. H., Balkwill, K. & Cron, G. V. A clarification of Gnetum L. (Gnetaceae) in Africa and the description of two new species. Plant Syst. Evol.300, 263–272 (2014).
  • 9.Biye EH, Cron GV, Balkwill K. Morphometric delimitation of Gnetum species in Africa. Plant Syst. Evol. 2016;302:1067–1082. doi: 10.1007/s00606-016-1317-3. [DOI] [Google Scholar]
  • 10.Jörgensen A, Rydin C. Reproductive morphology in the Gnetum cuspidatum group (Gnetales) and its implications for pollination biology in the Gnetales. Plant Ecol. Evol. 2015;148:387–396. doi: 10.5091/plecevo.2015.1142. [DOI] [Google Scholar]
  • 11.Hou C, Wikström N, Strijk J, Rydin C. Resolving phylogenetic relationships and species delimitations in closely related gymnosperms using high-throughput NGS, Sanger sequencing and morphology. Plant Syst. Evol. 2016;302:1345–1365. doi: 10.1007/s00606-016-1335-1. [DOI] [Google Scholar]
  • 12.Lan Q, et al. Anatomy, microstructure and endogenous hormone changes in Gnetum parvifolium during anthesis. J. Syst. Evol. 2018;56:14–24. doi: 10.1111/jse.12263. [DOI] [Google Scholar]
  • 13.Gong YB, et al. Wind or insect pollination? Ambophily in a subtropical gymnosperm Gnetum parvifolium (Gnetales) Plant Spec. Biol. 2016;31:272–279. doi: 10.1111/1442-1984.12112. [DOI] [Google Scholar]
  • 14.Kato M, Inoue T. Origin of insect pollination. Nature. 1994;368:195. doi: 10.1038/368195a0. [DOI] [Google Scholar]
  • 15.Kato M, Inoue T, Nagamitsu T. Pollination biology of Gnetum (Gnetaceae) in a lowland mixed dipterocarp forest in Sarawak. Am. J. Bot. 1995;82:862–868. doi: 10.1002/j.1537-2197.1995.tb15702.x. [DOI] [Google Scholar]
  • 16.Becker A, Saedler H, Theissen G. Distinct MADS-box gene expression patterns in the reproductive cones of the gymnosperm Gnetum gnemon. Dev. Genes Evol. 2003;213:567–572. doi: 10.1007/s00427-003-0358-0. [DOI] [PubMed] [Google Scholar]
  • 17.Becker A, Winter KU, Meyer B, Saedler H, Theissen G. MADS-box gene diversity in seed plants 300 million years ago. Mol. Biol. Evol. 2000;17:1425–1434. doi: 10.1093/oxfordjournals.molbev.a026243. [DOI] [PubMed] [Google Scholar]
  • 18.Winter KU, et al. MADS-box genes reveal that gnetophytes are more closely related to conifers than to flowering plants. Proc. Natl. Acad. Sci. USA. 1999;96:7342–7347. doi: 10.1073/pnas.96.13.7342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Deng N, et al. Significance of photosynthetic characters in the evolution of Asian Gnetum (Gnetales) Front. Plant. Sci. 2019;10:39. doi: 10.3389/fpls.2019.00039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Feild TS, Balun L. Xylem hydraulic and photosynthetic function of Gnetum (Gnetales) species from Papua New Guinea. New Phytol. 2008;177:665–675. doi: 10.1111/j.1469-8137.2007.02306.x. [DOI] [PubMed] [Google Scholar]
  • 21.Liu J, Wang H, Chua NH. Long noncoding RNA transcriptome of plants. J. Plant Biotechnol. 2015;13:319–328. doi: 10.1111/pbi.12336. [DOI] [PubMed] [Google Scholar]
  • 22.Wang L, et al. Genome-wide identification and characterization of novel lncRNAs in Ginkgo biloba. Trees. 2018;32:1429–1442. doi: 10.1007/s00468-018-1724-x. [DOI] [Google Scholar]
  • 23.Ye J, et al. A global survey of full-length transcriptome of Ginkgo biloba reveals transcript variants involved in flavonoid biosynthesis. Ind Crop Prod. 2019;139:111547. doi: 10.1016/j.indcrop.2019.111547. [DOI] [Google Scholar]
  • 24.van Dijk EL, Jaszczyszyn Y, Naquin D, Thermes C. The third revolution in sequencing technology. Trends Genet. 2018;34:666–681. doi: 10.1016/j.tig.2018.05.008. [DOI] [PubMed] [Google Scholar]
  • 25.Deschamps S, Campbell MA. Utilization of next-generation sequencing platforms in plant genomics and genetic variant discovery. Mol. Breed. 2010;25:553–570. doi: 10.1007/s11032-009-9357-9. [DOI] [Google Scholar]
  • 26.Ansorge WJ. Next-generation DNA sequencing techniques. New Biotechnol. 2009;25:195–203. doi: 10.1016/j.nbt.2008.12.009. [DOI] [PubMed] [Google Scholar]
  • 27.Cui J, Lu Z, Xu G, Wang Y, Jin B. Analysis and comprehensive comparison of PacBio and nanopore-based RNA sequencing of the Arabidopsis transcriptome. Plant Methods. 2020;16:85. doi: 10.1186/s13007-020-00629-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gramzow L, Weilandt L, Theissen G. MADS goes genomic in conifers: towards determining the ancestral set of MADS-box genes in seed plants. Ann. Bot. 2014;114:1407–1429. doi: 10.1093/aob/mcu066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gramzow L, Theissen G. A hitchhiker's guide to the MADS world of plants. Genome Biol. 2010;11:214. doi: 10.1186/gb-2010-11-6-214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Theissen, G. et al. in Developmental Genetics and Plant Evolution, Vol. 65 (eds Cronk, Q. C. B. et al.) 173–205 (2002).
  • 31.Wang YQ, Melzer R, Theissen G. Molecular interactions of orthologues of floral homeotic proteins from the gymnosperm Gnetum gnemon provide a clue to the evolutionary origin of 'floral quartets'. Plant J. 2010;64:177–190. doi: 10.1111/j.1365-313X.2010.04325.x. [DOI] [PubMed] [Google Scholar]
  • 32.Hou C, Li L, Liu Z, Su Y, Wan T. Diversity and expression patterns of MADS-box genes in Gnetum luofuense—implication in functional diversity and evolution. Trop Plant Biol. 2020;13:36–49. doi: 10.1007/s12042-019-09247-x. [DOI] [Google Scholar]
  • 33.Wu WT, et al. Evolution analysis of the Aux/IAA gene family in plants shows dual origins and variable nuclear localization signals. Int J Mol Sci. 2017;18:107. doi: 10.3390/ijms18010107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Luo J, Zhou J-J, Zhang J-Z. Aux/IAA gene family in plants: molecular structure, regulation, and function. Int J Mol Sci. 2018;19:259. doi: 10.3390/ijms19010259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Liscum E, Reed J. Genetics of Aux/IAA and ARF action in plant growth and development. Plant Mol Biol. 2002;49:387–400. doi: 10.1023/A:1015255030047. [DOI] [PubMed] [Google Scholar]
  • 36.Simonini S, et al. A noncanonical auxin-sensing mechanism is required for organ morphogenesis in Arabidopsis. Genes Dev. 2016;30:2286–2296. doi: 10.1101/gad.285361.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Jain M, Khurana JP. Transcript profiling reveals diverse roles of auxin-responsive genes during reproductive development and abiotic stress in rice. FEBS J. 2009;276:3148–3162. doi: 10.1111/j.1742-4658.2009.07033.x. [DOI] [PubMed] [Google Scholar]
  • 38.Hou C, Deng N, Su YJF. PacBio long-read sequencing reveals the transcriptomic complexity and Aux/IAA gene evolution in Gnetum (Gnetales) Forests. 2019;10:1043. doi: 10.3390/f10111043. [DOI] [Google Scholar]
  • 39.Müller GL, Drincovich MF, Andreo CS, Lara MV. Role of photosynthesis and analysis of key enzymes involved in primary metabolism throughout the lifespan of the tobacco flower. J Exp. Bot. 2010;61:3675–3688. doi: 10.1093/jxb/erq187. [DOI] [PubMed] [Google Scholar]
  • 40.Lebon G, Brun O, Magné C, Clément C. Photosynthesis of the grapevine (Vitis vinifera) inflorescence. Tree Physiol. 2005;25:633–639. doi: 10.1093/treephys/25.5.633. [DOI] [PubMed] [Google Scholar]
  • 41.Rook D, Sweet G. Photosynthesis and photosynthate distribution in Douglas-fir strobili grafted to young seedlings. Can. J. Bot. 1971;49:13–17. doi: 10.1139/b71-002. [DOI] [Google Scholar]
  • 42.Linder S, Troeng E. The seasonal course of respiration and photosynthesis in strobili of Scots pine. Forest Sci. 1981;27:267–276. [Google Scholar]
  • 43.Nepi M, et al. Nectar and pollination drops: how different are they? Ann. Bot. 2009;104:205–219. doi: 10.1093/aob/mcp124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Nepi, M., von Aderkas, P. & Pacini, E. in Secretions and exudates in biological systems, Vol. 12 (eds Vivanco, J. M. & Baluška, F.) 155–185 (Springer, 2012).
  • 45.Nepi M, et al. Phylogenetic and functional signals in gymnosperm ovular secretions. Ann. Bot. 2017;120:923–936. doi: 10.1093/aob/mcx103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Corlett RT. Pollination in a degraded tropical landscape: a Hong Kong case study. J. Trop. Ecol. 2001;17:155–161. doi: 10.1017/S0266467401001109. [DOI] [Google Scholar]
  • 47.Prior N, et al. Complex reproductive secretions occur in all extant gymnosperm lineages: a proteomic survey of gymnosperm pollination drops. Plant Reprod. 2019;32:153–166. doi: 10.1007/s00497-018-0348-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.O'Leary SJ, Poulis BA, von Aderkas P. Identification of two thaumatin-like proteins (TLPs) in the pollination drop of hybrid yew that may play a role in pathogen defense during pollen collection. Tree Physiol. 2007;27:1649–1659. doi: 10.1093/treephys/27.12.1649. [DOI] [PubMed] [Google Scholar]
  • 49.von Aderkas P, et al. Degradome and secretome of pollination drops of Ephedra. Bot. Rev. 2015;81:1–27. doi: 10.1007/s12229-014-9147-x. [DOI] [Google Scholar]
  • 50.Wagner RE, et al. Proteomic evaluation of gymnosperm pollination drop proteins indicates highly conserved and complex biological functions. Plant Reprod. 2007;20:181–189. doi: 10.1007/s00497-007-0054-8. [DOI] [Google Scholar]
  • 51.Jain M, Olsen HE, Paten B, Akeson M. The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol. 2016;17:239. doi: 10.1186/s13059-016-1103-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Finn RD, Clements J, Eddy SR. HMMER web server: interactive sequence similarity searching. Nucl. Acids Res. 2011;39:29–37. doi: 10.1093/nar/gkr367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Albert VA, et al. The Amborella genome and the evolution of flowering plants. Science. 2013;342:1241089. doi: 10.1126/science.1241089. [DOI] [PubMed] [Google Scholar]
  • 55.R Core Team. R: A language and environment for statistical computing version 3.2.0. R Foundation for Statistical Computing, Vienna, Austria. Retrieved from https://www.R-project.org (2018).
  • 56.Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucl. Acids Res. 2000;28:27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28:1947–1951. doi: 10.1002/pro.3715. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Xie C, et al. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucl. Acids Res. 2011;39:316–322. doi: 10.1093/nar/gkr483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Haas BJ, et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013;8:1494. doi: 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kong L, et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucl. Acids Res. 2007;35:345–349. doi: 10.1093/nar/gkm391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Sun L, et al. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucl. Acids Res. 2013;41:e166. doi: 10.1093/nar/gkt646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Wang L, et al. CPAT: coding-potential assessment tool using an alignment-free logistic regression model. Nucl. Acids Res. 2013;41:e74. doi: 10.1093/nar/gkt006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Li JW, et al. LncTar: a tool for predicting the RNA targets of long noncoding RNAs. Brief Bioinform. 2015;16:806–812. doi: 10.1093/bib/bbu048. [DOI] [PubMed] [Google Scholar]
  • 64.Kung JT, Colognori D, Lee JT. Long noncoding RNAs: past, present, and future. Genetics. 2013;193:651–669. doi: 10.1534/genetics.112.146704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Yang L, Froberg JE, Lee JT. Long noncoding RNAs: fresh perspectives into the RNA world. Trends Biochem. Sci. 2014;39:35–43. doi: 10.1016/j.tibs.2013.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Langfelder P, Zhang B, Horvath S. Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics. 2007;24:719–720. doi: 10.1093/bioinformatics/btm563. [DOI] [PubMed] [Google Scholar]
  • 68.Anders S. Analysing RNA-Seq data with the “DESeq” package. Mol Biol. 2010;43:1–17. [Google Scholar]
  • 69.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R. Stat. Soc. B. 1995;57:289–300. [Google Scholar]
  • 70.Zheng Y, et al. iTAK: A program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol Plant. 2016;9:1667–1670. doi: 10.1016/j.molp.2016.09.014. [DOI] [PubMed] [Google Scholar]
  • 71.Lalitha S. Primer premier 5. Biotech Softw Internet Rep The Comput Softw J Sci. 2000;1:270–272. doi: 10.1089/152791600459894. [DOI] [Google Scholar]
  • 72.Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods. 2001;25:402–408. doi: 10.1006/meth.2001.1262. [DOI] [PubMed] [Google Scholar]
  • 73.Wan T, et al. A genome for gnetophytes and early evolution of seed plants. Nat. Plants. 2018;4:82–89. doi: 10.1038/s41477-017-0097-2. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES