Abstract
Somatic embryogenesis (SE), which occurs naturally in many plant species, serves as a model to elucidate cellular and molecular mechanisms of embryo patterning in plants. Decoding the regulatory landscape of SE is essential for its further application. Hence, the present study was aimed at employing Weighted Gene Correlation Network Analysis (WGCNA) to construct a gene coexpression network (GCN) for Arabidopsis SE and then identifying highly correlated gene modules to uncover the hub genes associated with SE that may serve as potential molecular targets. A total of 17,059 genes were filtered from a microarray dataset comprising four stages of SE, i.e., stage I (zygotic embryos), stage II (proliferating tissues at 7 days of induction), stage III (proliferating tissues at 14 days of induction), and stage IV (mature somatic embryos). This included 1,711 transcription factors and 445 EMBRYO DEFECTIVE genes. GCN analysis identified a total of 26 gene modules with the module size ranging from 35 to 3,418 genes using a dynamic cut tree algorithm. The module-trait analysis revealed that four, four, seven, and four modules were associated with stages I, II, III, and IV, respectively. Further, we identified a total of 260 hub genes based on the degree of intramodular connectivity. Validation of the hub genes using publicly available expression datasets demonstrated that at least 78 hub genes are potentially associated with embryogenesis; of these, many genes remain functionally uncharacterized thus far. In silico promoter analysis of these genes revealed the presence of cis-acting regulatory elements, “soybean embryo factor 4 (SEF4) binding site,” and “E-box” of the napA storage-protein gene of Brassica napus; this suggests that these genes may play important roles in plant embryo development. The present study successfully applied WGCNA to construct a GCN for SE in Arabidopsis and identified hub genes involved in the development of somatic embryos. 
These hub genes could be used as molecular targets to further elucidate the molecular mechanisms underlying SE in plants.
1. Introduction
The ability to produce embryos from undifferentiated somatic cells in vitro is a unique developmental pathway found within the plant kingdom. Since the first report of somatic embryo induction from callus cells of carrot [1, 2], this developmental pathway based on cellular totipotency has been studied extensively due to its biological and scientific significance; it has been recognized as a model system for studying early plant embryogenesis. Until now, most studies have focused on the mechanism of somatic embryo development at the morphological level [2–4] or the development of optimized protocols for the generation of somatic embryos from a range of explants [5–8].
Somatic embryogenesis (SE) involves a complex signaling network [9]; transcriptional regulation of a set of genes in response to stress caused by plant growth regulators, nutrients, certain stress conditions, and other signaling elements triggers cellular reprogramming and transformation of somatic cells into embryos [10, 11]. In 2007, Zeng et al. [12] developed the first draft gene regulatory network for early SE employing a set of transcriptionally regulated SE-related genes in cotton. Although a set of genes have been identified as markers for the initiation phase of SE [13, 14], for example, SOMATIC EMBRYOGENESIS RECEPTOR-LIKE KINASE1 (SERK1) [15, 16], LEAFY COTYLEDON (LEC) [17–21], BABY BOOM (BBM) [22], and WUSCHEL (WUS) [18, 23], the current scientific knowledge on the underlying regulatory landscape of SE is limited. The use of transcriptomics has uncovered a large number of differentially expressed genes (DEGs) during SE in many crops, including Arabidopsis [24], rice [25], bread wheat [26], cotton [27], maize [28], and coconut [29]. However, the functions of many of these genes in SE are still not understood.
Gene coexpression networks (GCNs) are increasingly used to understand the interactions among a set of transcriptionally regulated genes. There are many types of coexpression networks: signed/unsigned coexpression networks and weighted/unweighted coexpression networks [30]. In the present study, we have focused on weighted network construction as it is likely to produce more robust findings than unweighted networks [31]. Weighted Correlation Network Analysis (WGCNA) is one of the most popular clustering packages for GCN analysis [31, 32] and the first tool to be employed to construct GCNs from RNA-sequencing (RNA-seq) data. This coexpression tool is easy to use and can be used to find clusters (modules) of highly correlated genes and to identify biologically relevant associations between phenotypes/sample traits and modules from expression data [30]. Recently, WGCNA has been effectively used to identify stage-specific gene expression clusters associated with key stages of Arabidopsis zygotic embryo development [33]. In addition, this approach has been successfully used to discover the regulatory landscape of SE in rice [25] and several other biological pathways in plants [34–36]. Here, we have analyzed a transcriptome dataset covering four somatic embryo developmental stages in Arabidopsis using WGCNA to understand better the system-level functionality of the transcriptionally regulated genes in dicot SE.
2. Materials and Methods
2.1. Data Collection and Gene Filtering
The transcriptome data covering somatic embryo developmental stages of wild-type Arabidopsis were retrieved from the National Center for Biotechnology Information (NCBI) GEO database (GEO accession: GSE48915) [37]. The dataset consisted of four developmental stages (zygotic embryos, proliferating tissues at 7 days of induction, proliferating tissues at 14 days of induction, and mature somatic embryos) with two replicates for each stage. Subsequently, genes with variance greater than the first quartile (0.25 quantile) of variance were retained to eliminate low-expressed or nonvarying genes, and the retained genes were used in GCN analysis (https://horvath.genetics.ucla.edu/html/coexpressionnetwork/rpackages/wgcna/faq.html, accessed on 11 May 2022). In addition, DEGs between consecutive embryonic stages were identified by calculating the fold change (FC) in gene expression and assessing its significance with a simple t-test. An arbitrary FC cut-off of |log2 FC| ≥ 2.0 and a p value of <0.05 were used to reduce false discoveries.
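The filtering and DEG steps described above can be sketched as follows. This is a minimal illustration, not the authors' script: it assumes log2-scale expression values, uses the 0.25 variance quantile reported in Section 3.2, and assumes a pooled-variance Student's t-test with two replicates per stage (2 degrees of freedom), for which the two-sided p value has a closed form. The function names and the toy data are hypothetical.

```python
import math
import statistics


def variance_filter(expr, quantile=0.25):
    """Keep genes whose variance across samples exceeds the given quantile.

    expr maps gene id -> list of expression values (one per sample).
    A nearest-rank cut-off is used; R's quantile() interpolates, so the
    exact cut-off may differ slightly from an R-based analysis.
    """
    variances = {g: statistics.variance(v) for g, v in expr.items()}
    ordered = sorted(variances.values())
    cutoff = ordered[int(quantile * (len(ordered) - 1))]
    return {g: v for g, v in expr.items() if variances[g] > cutoff}


def log2_fc_and_p(stage_a, stage_b):
    """Return (log2 FC, two-sided p) for one gene, two replicates per stage.

    Assumes the inputs are already log2 values, so the log2 FC is the
    difference of means. With n1 = n2 = 2 the pooled t-test has df = 2.
    """
    assert len(stage_a) == 2 and len(stage_b) == 2
    diff = statistics.mean(stage_b) - statistics.mean(stage_a)
    pooled_var = (statistics.variance(stage_a) + statistics.variance(stage_b)) / 2
    se = math.sqrt(pooled_var)  # sqrt(pooled_var * (1/2 + 1/2))
    if se == 0:
        return diff, (1.0 if diff == 0 else 0.0)
    t = diff / se
    # Closed-form two-sided p value of Student's t with 2 degrees of freedom.
    p = 1 - abs(t) / math.sqrt(t * t + 2)
    return diff, p


def is_deg(stage_a, stage_b, fc_cutoff=2.0, p_cutoff=0.05):
    """Apply the |log2 FC| >= 2.0 and p < 0.05 thresholds from the text."""
    fc, p = log2_fc_and_p(stage_a, stage_b)
    return abs(fc) >= fc_cutoff and p < p_cutoff


# Toy data (hypothetical gene ids and values).
toy = {"g1": [1, 1, 1, 1], "g2": [1, 2, 1, 2], "g3": [1, 5, 1, 5], "g4": [0, 10, 0, 10]}
kept = variance_filter(toy)
fc, p = log2_fc_and_p([5.0, 5.2], [8.0, 8.2])
```

With only two replicates per stage, the t-test has very little power, which is one reason a fold-change cut-off is applied alongside the p value.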
2.2. GCN Construction
The “WGCNA” package in R [32] was employed to identify significant gene modules and hub genes in Arabidopsis somatic embryo transcriptomes. A gene coexpression similarity matrix was constructed between the expression profiles of the filtered genes using the Pearson correlation. The similarity matrix was then transformed into an adjacency matrix in which each entry encodes the connection strength between a pair of genes (“nodes”). The adjacency matrix defines a measure of node dissimilarity from which the nodes (genes) are clustered into network modules. Consequently, the GCN was developed using the automatic one-step network construction and module detection method with the following parameters:
The soft threshold value (power parameter) was decided by the scale-free topology fit index curve.
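The transformation from a Pearson similarity matrix to a weighted adjacency matrix can be illustrated with a short sketch. It assumes an unsigned network, in which the adjacency is the absolute correlation raised to the soft-threshold power (the source does not state the network type), and uses the power of 15 reported in Section 3.3 as the default. The function names and the toy profiles are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long numeric sequences.

    Assumes neither profile is constant; the variance filter removes such genes.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def adjacency_matrix(profiles, power=15):
    """Unsigned weighted adjacency: a_ij = |cor(x_i, x_j)| ** power."""
    genes = list(profiles)
    adj = {g: {} for g in genes}
    for gi in genes:
        for gj in genes:
            if gi == gj:
                adj[gi][gj] = 1.0
            else:
                adj[gi][gj] = abs(pearson(profiles[gi], profiles[gj])) ** power
    return adj


# Toy expression profiles across four samples (hypothetical values).
profiles = {
    "a": [1, 2, 3, 4],
    "b": [2, 4, 6, 8],   # perfectly correlated with "a"
    "c": [4, 3, 2, 1],   # perfectly anticorrelated with "a"
    "d": [1, 3, 2, 4],   # correlation 0.8 with "a"
}
adj = adjacency_matrix(profiles)
```

Raising the correlation to a high power keeps strong correlations close to 1 while pushing moderate ones towards 0 (here 0.8 becomes roughly 0.035), which is how soft thresholding suppresses weak connections without a hard cut-off.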
2.3. GCN Visualization
The constructed modular networks were exported to Cytoscape (version 3.7.2) for visualization; gene correlations with p value <0.05 were filtered as significant gene correlations and visualized. The modular networks were analyzed by the “network analyzer” tool in Cytoscape for a concise and informative representation of nodes and edges.
2.4. Validation of Network Modules
The robustness of the coexpression modules was assessed through module preservation and quality statistics, which were computed using the modulePreservation function in the WGCNA package [38]. The adjacency matrix of the network was taken as the reference, and the dataset was selected as test data with 200 permutations (nPermutations = 200). The stability of the modules was tested through the statistics median rank and Zsummary.
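As a rough illustration of how a permutation-based preservation statistic is read, the sketch below standardizes an observed statistic against its permuted values and applies the commonly cited Zsummary guideline (below 2: no evidence of preservation; 2 to 10: weak to moderate; above 10: strong). It does not reproduce the modulePreservation function; the function names and the example numbers are hypothetical.

```python
import statistics


def permutation_z(observed, permuted):
    """Standardize an observed statistic against its permutation distribution."""
    return (observed - statistics.mean(permuted)) / statistics.stdev(permuted)


def interpret_zsummary(z):
    """Map a Zsummary value onto the commonly used preservation categories."""
    if z > 10:
        return "strong"
    if z >= 2:
        return "weak to moderate"
    return "none"


# Hypothetical observed statistic and three permuted values.
z = permutation_z(10.0, [1.0, 2.0, 3.0])
label = interpret_zsummary(z)
```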
2.5. Inferring Module-Stage Relationships
Module-stage relationships of the GCN were evaluated through module eigengenes (MEs). The correlation relationships between the MEs and different somatic embryo developmental stages were analyzed and visualized through a heatmap. Gene significance was calculated based on the p value of the linear regression between the gene expression profile and the associated developmental stage.
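One way to picture the module-stage correlation is to correlate a module eigengene with a 0/1 indicator for each stage. The sketch below assumes that encoding (the source does not specify how stages were coded) and the sample layout of two replicates per stage; the eigengene values are invented for illustration, and in practice they would be the first principal component of the module's expression matrix.

```python
import math

# Sample order assumed for illustration: two replicates per stage.
STAGES = ["I", "I", "II", "II", "III", "III", "IV", "IV"]


def pearson(x, y):
    """Pearson correlation of two equally long numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def stage_correlations(eigengene, stages=STAGES):
    """Correlate one module eigengene with a binary indicator per stage."""
    result = {}
    for stage in sorted(set(stages)):
        indicator = [1.0 if s == stage else 0.0 for s in stages]
        result[stage] = pearson(eigengene, indicator)
    return result


# Hypothetical eigengene that is high only in the stage I samples.
eigengene = [0.6, 0.6, -0.2, -0.2, -0.2, -0.2, -0.2, -0.2]
r = stage_correlations(eigengene)
stage_specific = [s for s, value in r.items() if abs(value) > 0.8]
```

Here the eigengene passes the |r| > 0.8 threshold used in Section 3.4 for stage I only, which is the pattern described as “stage-specific”.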
2.6. Functional Enrichment Analysis
Functional enrichment analysis was performed to detect enriched biological processes in gene modules. Gene Ontology (GO) terms enriched in each module were elucidated using the “singular enrichment analysis” tool provided by agriGO v2.0 [39]. “Arabidopsis genome locus (TAIR10)” was used as the reference, and all other parameters were set as the default for the analysis. Overrepresented GO terms in each network module were identified using the hypergeometric test. To further explore the DEGs mapped to each gene module, the distribution of the following genes across modules was studied: SE-related marker genes [40], plant transcription factors (TFs) (http://planttfdb.cbi.pku.edu.cn/index.php), EMBRYO DEFECTIVE (EMB) genes [41], and genes encoding epigenetic regulators [42, 43].
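The hypergeometric test behind GO overrepresentation can be written in a few lines. The sketch below computes the upper-tail probability of seeing at least the observed number of annotated genes in a module; it is a generic illustration rather than agriGO's implementation, it applies no multiple-testing correction, and the parameter names and example counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, module_size, overlap):
    """P(X >= overlap) for a module drawn without replacement.

    population  : number of genes in the reference set
    annotated   : reference genes carrying the GO term
    module_size : number of genes in the module
    overlap     : module genes carrying the GO term
    """
    upper = min(annotated, module_size)
    total = comb(population, module_size)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, module_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Small worked case: 10 genes, 4 annotated, module of 3, 2 annotated in module.
# P = (C(4,2)*C(6,1) + C(4,3)*C(6,0)) / C(10,3) = 40 / 120
p_small = hypergeom_enrichment_p(10, 4, 3, 2)
```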
2.7. Identification and Validation of Hub Genes
Genes in each module were arranged based on gene connectivity. The top 10 genes of each module were considered as hub genes. The transcriptome dataset published by Wickramasuriya and Dunwell in 2015 was retrieved from the ArrayExpress database (E-MTAB-2403) [24] to study the expression of hub genes during SE.
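Ranking genes by connectivity and taking the top 10 per module can be sketched as below. The sketch assumes the weighted definition of intramodular connectivity (the sum of a gene's adjacencies to the other genes in its module); Table 1 reports integer degrees, so the study may instead have counted edges in the filtered network, in which case the sum would be replaced by a count. The function names and toy adjacencies are hypothetical.

```python
def intramodular_connectivity(adj, module_genes):
    """Sum of each gene's adjacencies to the other genes in the same module."""
    members = set(module_genes)
    return {g: sum(adj[g][h] for h in members if h != g) for g in members}


def top_hubs(adj, module_genes, k=10):
    """Return the k most connected genes; ties are broken by gene id."""
    kin = intramodular_connectivity(adj, module_genes)
    return sorted(kin, key=lambda g: (-kin[g], g))[:k]


# Toy symmetric adjacency for a three-gene module (hypothetical values).
toy_adj = {
    "a": {"a": 1.0, "b": 0.9, "c": 0.5},
    "b": {"a": 0.9, "b": 1.0, "c": 0.1},
    "c": {"a": 0.5, "b": 0.1, "c": 1.0},
}
hubs = top_hubs(toy_adj, ["a", "b", "c"], k=2)
```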
2.8. In Silico Analysis of Hub Genes
The promoter sequences of hub genes (1000 bp upstream from the transcription start site) were retrieved from “The Arabidopsis Information Resource” (TAIR) database and analyzed using the Multiple Em for Motif Elicitation (MEME) tool in the MEME Suite 5.3.3 [44]. The following parameters were used in the analysis: number of motifs: 10; motif site distribution: zero or once per occurrence (ZOOPS); minimum width: 6; maximum width: 50; and background model: zero-order model of sequences. Further, the biological significance of the predicted MEME motifs was investigated using the Gene Ontology for MOtifs (GOMo) version 5.3.3 [45] provided in the MEME Suite. Additionally, the retrieved promoter sequences were searched against the Plant cis-acting regulatory DNA elements (PLACE) database to identify overrepresented cis-acting regulatory elements (CREs; [46]).
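Searching promoter sequences for known cis-acting elements amounts to pattern matching with degenerate (IUPAC) nucleotide codes. The sketch below shows the idea for the generic E-box consensus CANNTG; it scans the forward strand only and is not the PLACE search itself. The function names and the example sequence are hypothetical.

```python
import re

# IUPAC nucleotide codes expressed as regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def iupac_to_regex(motif):
    """Translate an IUPAC motif such as CANNTG into a regular expression."""
    return "".join(IUPAC[base] for base in motif.upper())


def find_sites(promoter, motif):
    """Return (0-based start, matched sequence) for every site, overlaps included."""
    # The lookahead makes the match zero-width so overlapping sites are reported.
    pattern = re.compile("(?=(" + iupac_to_regex(motif) + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(promoter.upper())]


# Hypothetical promoter fragment containing two E-box-like sites.
sites = find_sites("TTCACGTGAACAATTGCC", "CANNTG")
```

Because CANNTG is its own reverse complement, a forward-strand scan is sufficient for this motif; asymmetric motifs would also need a scan of the reverse complement.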
3. Results
3.1. Hierarchical Clustering of Somatic Embryo Transcriptomes
In the present study, transcriptome datasets generated through microarray experiments were retrieved from the NCBI covering four somatic embryo developmental stages (with two replicates for each stage), referred to herein as stages I (zygotic embryos), II (proliferating tissues at 7 days of induction), III (proliferating tissues at 14 days of induction), and IV (mature somatic embryos). The hierarchical clustering of samples (Figure 1(a)) confirmed that the sample replicates of each stage have a higher degree of correlation with each other than with other developmental stages; sample outliers were not detected in the dataset. The clustering heatmap clearly distinguished four discrete clusters of related expression patterns corresponding to the stages of somatic embryo development (Figure 1(b)). Further, stage I showed a poor correlation with the other three stages. This suggests that stage I may have a distinct expression profile as compared to other somatic embryo developmental stages.
3.2. Filtering of Genes for the GCN Construction and Downstream Analysis
As recommended by Langfelder and Horvath [32], genes were filtered by the variance for the GCN construction; filtering genes for variance greater than 0.25 quantile identified a total of 17,059 genes (see Table S1). This included 445 EMB genes [41], 10 SE marker genes [40], and 1,711 Arabidopsis TFs (65.3%).
In addition, DEGs were identified by a pairwise ratio of expression between consecutive stages of development. A total of 2,244 genes were identified by threshold filtering based on |log2 FC| ≥ 2.0 and p value <0.05. The DEGs included 64 EMB genes [41], four SE marker genes [40], and 458 TFs (see Table S2). A total of 12 genes, including STRESS INDUCED FACTOR 2 (AT1G51850), LIGHT-HARVESTING-LIKE 3:1 (AT4G17600), BETA GLUCOSIDASE 28 (AT2G44460), FERREDOXIN C 1 (AT4G14890), and PHOTOSYSTEM II SUBUNIT Q (AT4G05180), were differentially expressed throughout SE (Figure 2(a)). In addition, a considerable number of genes were up- and downregulated during early embryo developmental stages (Figure 2(b)).
3.3. Construction of GCN
The expression profiles of the filtered 17,059 genes were used to construct a scale-free gene expression network with a soft threshold of 15 (Figure 3(a)). The dynamic hierarchical clustering approach integrated with the WGCNA pipeline distinguishes groups of genes with coexpression patterns and clusters them into network modules. In total, 26 distinct coexpression gene modules were detected with the module size ranging from 35 to 3,418 genes (Figures 3(b) and 3(c)); each module was assigned with a unique colour. The module comprising most genes was the turquoise (3,418 genes) followed by the blue (2,973 genes) and brown (2,437 genes) (Figure 3(b)). The expression profiles of coexpressed genes clustered in each module were summarized as MEs. Among the filtered genes, 13 genes that failed to fit within a distinct group were assigned to the grey module and removed from the downstream analysis. Module preservation analysis indicated high module preservation, confirming that the modules generated here can also be found in diverse independent datasets (Figure 3(b)). Each module was exported and visualized using Cytoscape.
3.4. Identification of Stage-Related Modules
The relationships between the gene modules and different somatic embryo developmental stages were determined by assessing the Pearson correlation coefficient (r) between the MEs and developmental stages. Module-trait correlation analyses revealed that multiple modules are related to SE (Figure 4(a)). A total of 18 modules were significantly associated with the somatic embryo developmental stages (|r| > 0.8 and p value ≤0.01; Figure 4), and these modules were “stage-specific,” i.e., each module was significantly associated with only one particular developmental stage of SE: tan, turquoise, dark-orange, and green with stage I; grey60, magenta, brown, and light-yellow with stage II; green-yellow, dark-grey, dark-green, orange, blue, light-green, and light-cyan with stage III; and pink, dark-turquoise, salmon, and yellow with stage IV. Gene significance, the correlation between modular gene expression and each stage, is shown in Figure 4(b).
3.5. Functional Enrichment Analysis of “Stage-Specific” Gene Modules
GO enrichment analysis performed on “stage-specific” modules showed that the genes in the green and turquoise modules, which exhibited a significant association with stage I, were mainly enriched in biological processes involved in postembryonic development, hormone-mediated signaling pathways, biosynthesis pathways (sterol and fatty acids), DNA methylation, and transcription regulation (Figure 5(a)). Genes in the brown, light-yellow, and magenta modules, which showed a significant association with stage II, were mainly enriched in biological processes involved in root and shoot development, ATP synthesis, response to metal ions, and DNA replication (Figure 5(b)), whereas genes in the blue and light-cyan modules, which showed a significant association with stage III, were enriched for biological processes involved in transition postembryonic and seed development, hormone- and sugar-mediated signaling pathways, cell differentiation, protein modification, and RNA processing (Figure 5(c)). Moreover, the yellow module, which showed a significant relationship with stage IV, was mainly enriched in biological processes involved in ion transport, postembryonic development, signal transduction, lipid localization, response to oxidative and water stress, and response to phytohormones (abscisic acid, gibberellin, cytokinin, and jasmonic acid) (Figure 5(d)).
3.6. Analysis of Hub Genes
Hub genes are nodes in a network often hypothesized to be functionally significant due to their high degree of intramodular connectivity. A total of 260 genes (top 10 genes of each module with high connectivity) were identified as potential hub genes; the hub gene with the highest degree of connectivity in each module is given in Table 1 (the complete list of hub genes is given in Table S3). GO enrichment analysis of the hub genes revealed that they are mainly enriched for biological processes such as metabolic processes (mRNA and cellular amino acid), oxidation-reduction, protein folding, and postembryonic development.
Table 1.
Gene identifier | Degree of connectivity | Gene module | Gene name | Description |
---|---|---|---|---|
AT1G27120 | 3327 | Turquoise | AT1G27120 (GALT4) | Galactosyltransferase family protein |
AT5G52820 | 2853 | Blue | NOTCHLESS (NLE) | WD-40 repeat family protein/notchless protein |
AT5G56090 | 2348 | Brown | CYTOCHROME C OXIDASE 15 (COX15) | Encodes a homolog of COX15 |
AT2G43100 | 2134 | Yellow | ISOPROPYLMALATE ISOMERASE 2 (IPMI2) | Isopropylmalate isomerase 2 |
AT1G71010 | 1958 | Green | FORMS APLOID AND BINUCLEATE CELLS 1C (FAB1C) | Encodes a protein that is predicted to act as a phosphatidylinositol-3P 5-kinase but lacks a FYVE domain |
AT2G29890 | 806 | Red | VILLIN 1 (VLN1) | Encodes a ubiquitously expressed villin-like protein |
AT2G45600 | 569 | Black | AT2G45600 | Alpha/beta-hydrolases superfamily protein |
AT1G72400 | 321 | Magenta | AT1G72400 | Hypothetical protein |
AT3G53980 | 311 | Pink | AT3G53980 | Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein |
AT5G65350 | 243 | Purple | HISTONE 3 11 (HTR11) | Histone 3 11 |
AT5G27560 | 182 | Green-yellow | AT5G27560 | DUF1995 domain protein, putative (DUF1995) |
AT5G54855 | 170 | Tan | AT5G54855 | Pollen Ole e 1 allergen and extensin family protein |
AT1G75630 | 156 | Salmon | VACUOLAR H+-PUMPING ATPASE 16 KDA PROTEOLIPID SUBUNIT 4 (AVA-P4) | Vacuolar H+-pumping ATPase 16 kD proteolipid (ava-p) mRNA |
AT1G74450 | 107 | Cyan | AT1G74450 | BPS1-like protein (DUF793) |
AT2G23940 | 107 | Midnight-blue | AT2G23940 | Transmembrane protein (DUF788) |
AT1G30460 | 82 | Light-cyan | CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 30 (CPSF30) | Encodes AtCPSF30, the 30-KDa subunit of cleavage and polyadenylation specificity factor |
AT1G06040 | 74 | Grey60 | SALT TOLERANCE (STO) | B-box zinc finger family protein that encodes a salt tolerance protein |
AT2G11560 | 58 | Light-green | AT2G11560 | Mutator-like transposase/similar to MURA transposase of maize |
AT3G55050 | 50 | Dark-green | D-CLADE TYPE 2C PROTEIN PHOSPHATASE 4 (PP2C.D4) | Protein phosphatase 2C family protein |
ATCG01070 | 48 | Dark-red | NAD(P)H-QUINONE OXIDOREDUCTASE SUBUNIT 4L (NDHE) | NADH dehydrogenase ND4L |
AT3G25950 | 48 | Royal-blue | AT3G25950 | TRAM, LAG1, and CLN8 (TLC) lipid-sensing domain containing protein |
AT1G65410 | 39 | Dark-turquoise | ATP-BINDING CASSETTE I13 (ABCI13) | Encodes a member of NAP subfamily of transporters |
AT5G02310 | 38 | Light-yellow | PROTEOLYSIS 6 (PRT6) | Encodes a component of the N-end rule pathway that targets protein degradation |
AT3G61130 | 37 | Dark-grey | GALACTURONOSYLTRANSFERASE 1 (GAUT1) | Encodes a protein with putative galacturonosyltransferase activity |
AT3G53350 | 34 | Dark-orange | ROP INTERACTIVE PARTNER 3 (RIP3) | Encodes a microtubule-binding protein |
AT5G43490 | 34 | Orange | AT5G43490 | Myb-like protein X |
Among the hub genes, only 234 genes were functionally annotated; of these, 13 were TFs: AUXIN RESPONSE FACTOR 9 (ARF9), FLOWERING BHLH 4 (FBH4), BASIC HELIX-LOOP-HELIX 39 (BHLH39), BASIC LEUCINE-ZIPPER 44 (bZIP44), bZIP19, ZIM-LIKE 2 (ZML2), AT5G60820, AT4G01270, KANADI 3 (KAN3), HOMEODOMAIN GLABROUS 4 (HDG4), CELL DIVISION CYCLE 5 (CDC5), NAC DOMAIN CONTAINING PROTEIN 80 (NAC080), and SALT TOLERANCE (STO). In addition, five genes encoding transposable elements (i.e., AT2G11560, AT3G33066, AT5G32430, AT3G42820, and AT4G28900) were identified.
In silico analysis of the promoter sequences (1000 bp upstream from the transcription start site) of the hub genes using the MEME tool identified four significant motifs ranging in length from 15 to 29 bp (Table 2). Motifs 1, 2, and 3 were detected across 229, 245, and 121 hub genes, respectively. Further analysis of the predicted motifs using the GOMo tool provided in the MEME Suite indicated that motifs 1 and 3 may be involved in DNA endoreduplication, polarity specification of the adaxial/abaxial axis, and hormone-mediated signaling pathways; motifs 1 and 3 appear to function in association with cytokinin and gibberellic acid, respectively.
Table 2.
Motif | Motif sequence | Consensus sequence | E-value | Motif width | Sites | Significant GO enriched terms (q value <0.05) |
---|---|---|---|---|---|---|
1 | NAVAAAAAAARAAARARAAARAAAAHMAA | [AGT]A[AG]AA[AC]AAAA[AG]A[AG][AG][AGC]A[AG]AA[AG][AG]A[AG]AA[ATC][AC]A[AGT] | 2.8e-077 | 29 | 229 | (i) GO:0042023: DNA endoreduplication (ii) GO:0009944: Polarity specification of adaxial/abaxial axis (iii) GO:0009735: Response to cytokinin stimulus (iv) GO:0009744: Response to sucrose stimulus (v) GO:0006468: Protein amino acid phosphorylation (vi) GO:0009965: Leaf morphogenesis |
2 | DTTTTTKTTTTKTTY | [AGT][TG]TTTT[TG]TTTT[TG][TG]T[TCG] | 6.0e-020 | 15 | 245 | |
3 | NDRRAGDDDRRWARRRARAGAADRRDAG | [AGC][GAT][AG][AG][AT][GA][AGT][ATG][AGT][GA][AG][AT]A[GA][AG][GAC]A[AG]A[GA]A[AG][GAT][AG][AG][GTA][AT][GT] | 3.6e-025 | 28 | 121 | (i) GO:0009944: Polarity specification of adaxial/abaxial axis (ii) GO:0048481: Ovule development (iii) GO:0010050: Vegetative phase change (iv) GO:0010051: Xylem and phloem pattern formation (v) GO:0042023: DNA endoreduplication (vi) GO:0035196: Production of miRNAs involved in gene silencing by miRNA (vii) GO:0009740: Gibberellic acid mediated signaling pathway |
4 | CMAYCTYCTCCDTCHBCATC | [CT][CA]A[TC]CT[CT]CT[CT]C[TAG][TGC][CT][ATC][GTC]C[AT][TC]C | 5.3e-007 | 20 | 44 | |
3.7. Validation of Hub Genes
A comparison of hub genes and DEGs showed that 31 hub genes are differentially expressed in SE (the expression values of differentially expressed hub genes are given in Table S4). Further, expression analysis of these genes using the Arabidopsis eFP browser demonstrated that two hub genes, AT1G19540 (Figure 6(a)) and AT5G44380 (Figure 6(b)), exhibit a seed-specific pattern of expression.
Moreover, analysis of the expression profiles of hub genes in the Arabidopsis somatic embryo transcriptome dataset (E-MTAB-2465) published by Wickramasuriya and Dunwell (2015) revealed that 62 hub genes are differentially expressed in somatic embryonic tissues compared to leaf tissues (|log2 FC| ≥ 2.0 and p value <0.05; Figure 7). Of these, 15 genes were identified as DEGs in the present analysis. For instance, CYSTEINE-RICH TRANSMEMBRANE MODULE 7 (ATHCYSTM7/AT2G33520), HEPTAHELICAL TRANSMEMBRANE PROTEIN2 (AT4G30850), INDOLE-3-ACETIC ACID INDUCIBLE 30 (IAA30/AT3G62100), RPS9C, VASCULATURE COMPLEXITY AND CONNECTIVITY (AT2G32280), AT2G21820, AT2G38900, and AT5G43770 showed a marked expression in somatic embryonic tissues as compared to leaf tissues. Expression analysis using the Arabidopsis eFP browser further showed that AT2G29300, AT2G21820, AT2G38900, AT5G43770, ATHCYSTM7, and AT1G19540 exhibit a seed-specific pattern of gene expression.
As expected, a few hub genes that are highly expressed in leaf tissues were repressed in somatic embryos, indicating the importance of gene regulation in SE (Figure 7); for instance, CELLULOSE SYNTHASE-LIKE B4 (AT2G32540), CHOLINE/ETHANOLAMINE KINASE 3 (AT4G09760), GLUTAMATE DECARBOXYLASE 2 (AT1G65960), ISOPROPYLMALATE ISOMERASE 2 (AT2G43100), PEROXIREDOXIN Q (PRXQ/AT3G26060), PHOTOSYNTHETIC NDH SUBCOMPLEX L 4 (PnsL4/AT4G39710), PLASTID RIBOSOMAL PROTEIN S20 (AT3G15190), STO (AT1G06040), SINAPOYLGLUCOSE 1 (SNG1/AT2G22990), THYLAKOID RHODANESE-LIKE (TROL/AT4G01050), TONOPLAST INTRINSIC PROTEIN 2 (TIP2/AT3G26520), AT3G50685, AT4G33666, AT5G16010, and AT5G54540 showed marked repression in somatic embryos compared to leaf tissues.
In summary, the present study identified a total of 78 hub genes as potential regulators of SE (Figure 8), including genes showing marked overexpression as well as repression in SE. Of these, 41 genes have not been functionally annotated thus far. The analysis of the promoter sequences of these uncharacterized hub genes using the PLACE database identified a total of 215 different plant CREs; ARR1AT, CAATBOX1, CACTFTPPCA1, DOFCOREZM, GATABOX, GT1CONSENSUS, POLLEN1LELAT52, and WRKY71OS were observed in all 41 functionally uncharacterized potential hub genes. Moreover, several CREs related to embryogenesis were identified (Figure 9). The functions of the predicted CREs are included in Table 3.
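The total of 78 is consistent with taking the union of the two validation sets reported above: 31 hub genes differentially expressed in the present dataset, 62 in the Wickramasuriya and Dunwell dataset, and 15 common to both. The short check below applies inclusion-exclusion to those counts; treating the total as this union is an inference from the numbers, not a statement made in the source, and the function name is hypothetical.

```python
def union_size(n_first, n_second, n_shared):
    """Size of the union of two sets, given their sizes and their overlap."""
    return n_first + n_second - n_shared


hub_degs_present = 31    # hub genes that are DEGs in the present dataset
hub_degs_external = 62   # hub genes that are DEGs in the external dataset
hub_degs_shared = 15     # hub genes that are DEGs in both datasets

total_candidates = union_size(hub_degs_present, hub_degs_external, hub_degs_shared)
```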
Table 3.
Cis-acting regulatory element | Function∗ |
---|---|
ABADESI1 | “ACGT” motif; transacting factor: TAF-1; responsive to ABA and desiccation. Expressed in seeds late during embryogenesis. Induced by ABA and osmotic stress in vegetative tissues. |
CACGTGMOTIF | “CACGTG motif”; essential for expression of beta-phaseolin gene during embryogenesis |
CANBNNAPA | Core of “(CA)n element” in storage protein genes; embryo- and endosperm-specific transcription of napin (storage protein) gene |
CARGNCAT | Noncanonical CArG motif (CC-Wx8-GG); a relevant cis-element for the response to AGL15 (AGAMOUS-like 15) in vivo |
DPBFCOREDCDC3 | DPBF-1 and 2 (Dc3 promoter-binding factor-1 and 2) binding core sequence; Dc3 expression is embryo-specific and induced by ABA |
DRE1COREZMRAB17 | “DRE1” core found in maize (Z.M.) rab17 gene promoter; in in vivo footprinting, “DRE1” was protected by a protein specifically in embryos, whereas in leaves it was protected when treated with ABA and drought; rab17 is expressed during late embryogenesis and is induced by ABA |
DRE2COREZMRAB17 | “DRE2”; core sequence in rab17 gene promoter. rab17 is expressed during late embryogenesis and is induced by ABA |
EBOXBNNAPA | “E-box” of napA storage-protein gene |
PYRIMIDINEBOXOSRAMY1A | Found in the promoter of alpha-amylase (Amy2/32b) gene which is induced in the aleurone layers in response to GA in embryo |
RYREPEATVFLEB4 | RY repeat motif; quantitative seed expression; binding site of Arabidopsis B3-domain-containing transcription factor FUS3, mediates abscisic acid-induced transcription |
SEF1MOTIF | “SEF1 (soybean embryo factor 1)” binding motif; regulates the expression of genes encoding for the beta-conglycinin seed storage proteins |
SEF3MOTIFGM | “SEF3 binding site”; regulates the expression of genes encoding for the beta-conglycinin seed storage proteins |
SEF4MOTIFGM7S | “SEF4 (soybean embryo factor 4)” binding motif; regulates the expression of genes encoding for the beta-conglycinin seed storage proteins |
TATABOX2 | “TATA box”; TATA box found in the beta-phaseolin promoter, which is required for accurate transcription initiation in the embryo stage |
∗Details of PLACE entries were retrieved from https://www.dna.affrc.go.jp/place/place_seq.shtml (accessed on 19 May 2022).
3.8. Distribution of Embryogenesis-Related Genes across Network Modules
Further exploration of genes mapped to each network module found that 10 key regulators of SE, including LEC1, FUSCA3 (FUS3), and ABSCISIC ACID INSENSITIVE 3 (ABI3), are present among the highly connected genes in the network (Table 4); the SE-related marker genes LEC2, SERK1, WUS, BBM, and WUSCHEL RELATED HOMEOBOX 2 (WOX2) showed low variance in the present dataset and thus were not included in the GCN analysis. We also observed that the majority of previously published EMB genes [41] are localized to the turquoise and blue modules, which showed significant association with stage I and stage III, respectively (Figure 10; see Table S5).
Table 4.
No. | Gene identifier | Module | Gene name | No. of interactors |
---|---|---|---|---|
1 | AT3G26790 | Turquoise | FUSCA3 (FUS3) | 3346 |
2 | AT5G13790 | Turquoise | AGAMOUS-LIKE 15 (AGL15) | 3308 |
3 | AT1G21970 | Turquoise | LEAFY COTYLEDON 1 (LEC1) | 3297 |
4 | AT5G45980 | Turquoise | WUSCHEL RELATED HOMEOBOX 8 (WOX8) | 3158 |
5 | AT3G24650 | Turquoise | ABSCISIC ACID INSENSITIVE 3 (ABI3) | 3115 |
6 | AT1G78080 | Brown | WOUND INDUCED DEDIFFERENTIATION 1 (WIND1) | 2111 |
7 | AT5G57390 | Turquoise | AINTEGUMENTA-LIKE 5 (AIL5) | 752 |
8 | AT4G37750 | Red | AINTEGUMENTA (ANT) | 688 |
9 | AT1G63470 | Red | AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 5 (AHL5) | 522 |
10 | AT5G65510 | Purple | AINTEGUMENTA-LIKE 7 (AIL7) | 216 |
In addition, we observed that 1,711 Arabidopsis TFs are distributed across all the gene modules except in light-green and royal-blue modules, with the highest number of TFs present in the turquoise module (the complete list of TFs included in the GCN is given in Table S6). Notably, AP2/EREBP (APETALA2/ethylene-responsive element binding proteins), bHLH (basic helix–loop–helix), bZIP, C2H2 (Cys2-His2), HB (homeobox), NAC (NAM, ATAF, and CUC), MYB (MYB-domain), C3H, and WRKY TF families were highly represented (Figure 11(a)). Of these, members of AP2/EREBP, bHLH, C2H2, HB, NAC, MYB, and WRKY TF families were involved in early SE (Figure 11(b)). Interestingly, TFs that are targets of several microRNAs (miRNAs) were also recovered from the GCN (Table S7).
Notably, several genes encoding epigenetic regulators were localized in network modules (Figure 12). These included 14 genes involved in DNA modification, 51 genes involved in histone modification, 34 genes involved in chromatin remodeling, 15 genes encoding polycomb-group proteins, and 55 genes associated with RNA silencing (see Table S8). Each of these genes directly interacted with numerous modular genes, forming a complex network.
4. Discussion
Plant embryogenesis is a meticulous developmental process that requires the regulation of multiple genes. A GCN serves as a map of statistically significant gene interactions that helps narrow down the transcriptome to the potential gene interactions involved in biological processes. Recently, Clercq et al. reported an integrated gene regulatory network for Arabidopsis covering TFs and target genes [47]. In the present study, WGCNA was employed to explore potential clusters of highly coregulated genes and hub genes associated with SE. Although WGCNA has been previously applied to construct a GCN for Arabidopsis zygotic embryogenesis (ZE) [33], to the best of our knowledge, this is the first report on the use of WGCNA to construct a GCN for Arabidopsis SE and to explore SE-related network modules and hub genes. The findings of this study provide new insights into the molecular mechanism of SE in plants.
The GCN constructed for SE comprised 26 network modules: black (674 genes), blue (2,973 genes), brown (2,437 genes), cyan (125 genes), dark-green (52 genes), dark-grey (39 genes), dark-orange (35 genes), dark-red (54 genes), dark-turquoise (52 genes), green (2,132 genes), green-yellow (189 genes), grey60 (79 genes), light-cyan (86 genes), light-green (59 genes), light-yellow (58 genes), magenta (338 genes), midnight-blue (117 genes), orange (35 genes), pink (357 genes), purple (271 genes), red (853 genes), royal-blue (56 genes), salmon (162 genes), tan (172 genes), turquoise (3,418 genes), and yellow (2,223 genes) modules. Among them, 18 modules showed strong associations with different stages of SE; module-trait relationship analysis revealed that four, four, seven, and four modules were significantly correlated with stages I, II, III, and IV of SE, respectively. This suggests that SE involves complex genetic networks.
Functional enrichment analysis using GO is one of the most widely used bioinformatic methods to classify genes into functionally related groups [48–50]. GO analysis of the coexpressed gene clusters (or network modules) showed that the initial stages of SE were mainly enriched with biological processes such as hormone-mediated signaling, biosynthesis pathways, ATP synthesis, DNA methylation, and replication. Notably, genes involved in lipid transport, postembryonic development, signal transduction, and seed dormancy were enriched in later stages of SE; this indicates the developmental shift in the maturation phase with the accumulation of embryo-specific food reserves, a process that aids in withstanding dormancy and postembryonic development [2, 10, 51]. Furthermore, genes related to stress responses (e.g., oxidative and water stress), phytohormones (e.g., cytokinin, abscisic acid, gibberellin, and jasmonic acid), and metabolic processes were enriched in all stages of somatic embryo development studied, from the initiation to maturation stage. These findings further confirmed the importance of cell-cell interactions [52], signaling [9, 13, 53], and transcriptional activation of stress responses [54, 55] during plant SE.
High-degree nodes, that is, the genes with high network connectivity in GCN modules (“hub genes”), may have important biological functions [36, 56–58]; often, they may serve as biological markers. Several studies have successfully employed WGCNA to mine hub genes controlling biological processes [34, 59–62]. The present study reports 260 potential hub genes related to SE based on the degree of connectivity. These genes may play pivotal roles in the regulation of SE. Importantly, 13 TFs encoded by hub genes were identified in the coexpression network. They were ARF9, NAC080, ZML2, bHLH39, KAN3, bZIP19, bZIP44, HDG4, FBH4, STO, CDC5, AT5G60820, and AT4G01270; the functional roles of many of these genes in the regulation of SE have not been reported. Previous studies have reported that ARF9 represses the expression of its target genes such as TOPLESS (TPL) and TPL-related proteins [63, 64]. Wójcikowska and Gaj observed stable expression of ARF9 during SE [65]. In addition, KAN3, a member of the GARP TF family, has also exhibited an embryonic expression pattern.
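To make the notion of intramodular connectivity concrete, the sketch below ranks the genes of a single module by the sum of their weighted adjacencies to all other genes in that module. It is a simplified illustration rather than the pipeline used in this study: the gene names and expression values are hypothetical, an unsigned adjacency of the form |r|^β is assumed, and the soft-thresholding power β = 6 is an assumed value chosen only for the example.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient r between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def intramodular_connectivity(expr, beta=6):
    """Sum of unsigned weighted adjacencies |r|**beta within one module.

    expr maps gene name -> expression profile; beta is an assumed
    soft-thresholding power, not a value taken from the study.
    """
    genes = list(expr)
    k = {g: 0.0 for g in genes}
    for i, gi in enumerate(genes):
        for gj in genes[i + 1:]:
            adjacency = abs(pearson(expr[gi], expr[gj])) ** beta
            k[gi] += adjacency
            k[gj] += adjacency
    return k


# Hypothetical module: geneA-geneC are tightly coexpressed, geneD is not.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "geneB": [1.1, 2.1, 2.9, 4.2, 5.1, 5.8],
    "geneC": [0.9, 1.8, 3.2, 3.9, 5.2, 6.1],
    "geneD": [3.0, 1.0, 4.0, 1.5, 5.0, 2.0],
}

k_within = intramodular_connectivity(expr)

# Genes with the highest intramodular connectivity are hub candidates.
ranked = sorted(k_within, key=k_within.get, reverse=True)
for gene in ranked:
    print(f"{gene}: kWithin = {k_within[gene]:.3f}")
```

Raising the correlation to a power suppresses weak correlations, so the weakly coexpressed geneD contributes little to, and receives little from, the rest of the module and ranks last.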
In addition, ROOT UV-B SENSITIVE 6 (RUS6; AT5G49820), which encodes a DUF647 (DOMAIN OF UNKNOWN FUNCTION 647)-containing protein; an ankyrin repeat-containing gene designated as AT5G65860; and GALT4 (AT1G27120), which encodes a hydroxyproline-O-galactosyltransferase (Hyp-O-GALT), were also identified as hub genes in the coexpression network. The members of the RUS gene family play diverse roles in plant development [66]. Interestingly, knockout mutants of RUS6 have shown a strong embryo-lethal phenotype. In Arabidopsis, ankyrin repeat-containing proteins have been classified into 16 groups [67], and of these, proteins with only ankyrin repeats have been associated with disease resistance, antioxidation, embryogenesis, and development [68–70]. For instance, T-DNA mutants of the EMB506 gene, which encodes a protein containing five ankyrin repeats, have shown defective embryo development at the globular-to-heart stage transition [70]. Moreover, Hyp-O-GALT enzymes are responsible for hydroxyproline glycosylation of arabinogalactan proteins, which are known to function in various aspects of plant growth and development including SE [71–73]. Although the hub genes identified in the present study are implicated in many plant developmental processes, the functions of many of them in SE remain to be elucidated. Hence, these genes could be potential targets for functional studies in the future.
Promoter analysis of the functionally uncharacterized hub genes using the PLACE database revealed the overrepresentation of two motifs in many of the promoter regions. These were EBOXBNNAPA (consensus sequence: CANNTG) and SEF4MOTIFGM7S (consensus sequence: [A/G]TTTTT[A/G]). Of these, EBOXBNNAPA (“E-box” motif) is a CRE found in the regulatory region of the napin gene napA of Brassica napus [74]; this gene encodes a storage protein. Moreover, CANNTG provides the binding site for bHLH TFs [75]. bHLH is one of the most frequently represented gene families in DEGs in ZE [76] and SE and is known to have diverse functions in plants [24] including cell proliferation [75]. The recognition sequence of the SEF4MOTIFGM7S motif is known to interact with SEF4, a protein expressed in immature soybean seeds that acts as a transcriptional activator of the β-conglycinin α subunit gene [77]. Hence, the uncharacterized hub genes that showed considerable expression in embryonic tissues are more likely to play a significant role in plant embryo development.
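The two consensus sequences given above can be located in a promoter with a simple pattern search. The following Python sketch is only illustrative and does not reproduce the PLACE-based analysis of this study: the promoter sequence is hypothetical, and the consensus [A/G]TTTTT[A/G] is written with the IUPAC code R (A or G).

```python
import re

# Consensus sequences as stated in the text; R is the IUPAC code for A or G.
MOTIFS = {
    "EBOXBNNAPA": "CANNTG",
    "SEF4MOTIFGM7S": "RTTTTTR",
}

# Minimal IUPAC-to-regex table covering only the symbols used above.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]", "R": "[AG]"}


def to_regex(consensus):
    """Compile a consensus into a lookahead regex so overlapping hits are found."""
    body = "".join(IUPAC[base] for base in consensus)
    return re.compile("(?=(" + body + "))")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def scan(seq):
    """Report 0-based motif start positions on the given (+) and reverse (-) strand.

    Positions on the "-" strand are counted along the reverse-complemented sequence.
    """
    seq = seq.upper()
    rev = reverse_complement(seq)
    hits = {}
    for name, consensus in MOTIFS.items():
        pattern = to_regex(consensus)
        hits[name] = {
            "+": [m.start() for m in pattern.finditer(seq)],
            "-": [m.start() for m in pattern.finditer(rev)],
        }
    return hits


# Hypothetical promoter fragment containing one E-box and one SEF4 site.
promoter = "GGCACGTGAAATTTTTATCC"
hits = scan(promoter)
for name, strands in hits.items():
    print(name, strands)
```

Because CANNTG can be palindromic (as in CACGTG), the same E-box may be reported on both strands. Short degenerate motifs such as these also occur frequently by chance, which is why overrepresentation relative to a background, rather than mere presence, is the informative quantity.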
Differential gene expression analysis of hub genes revealed that 78 genes could be considered as potential regulators of SE; of these, 15 genes were differentially expressed in transcriptome datasets derived from two independent studies related to SE [24, 37]. One of the genes identified was IAA30, which is a member of one of the families of auxin signaling proteins (Aux/IAA) [78]. iaa30 mutants have displayed significantly impaired SE efficiency, producing fewer somatic embryos per explant [76], which suggests a role for this gene in the initiation phase of SE. Moreover, IAA30 is a target of two important SE marker genes, LEC2 and AGL15 [79, 80]. In addition, two hub genes, AT1G19540 and AT5G44380, showed marked expression during seed development, suggesting their roles in embryogenesis.
To enhance our understanding of the regulatory mechanism of SE, the distribution of embryogenesis-related genes across the gene modules was examined. Horstman et al. reported that a LEC1–LEC2–FUS3–BBM–ABI3 network induces SE in Arabidopsis [81]. Moreover, Zheng et al. suggested that AGL15, a MADS-domain TF-encoding gene, may associate with LEC2, FUS3, and ABI3 during SE [82]. However, a recent study has found that the EAR motif in AGL15 is not essential to promote SE [83]. In the present analysis, 10 key regulators of SE including LEC1, ABI3, FUS3, AGL15, and three members of the AINTEGUMENTA-LIKE/PLETHORA (AIL/PLT) subfamily (ANT, AIL5, and AIL7) were identified in the coexpression network. Consistent with previous literature, members of the AP2/EREBP, bHLH, bZIP, MYB, HB, WRKY, NAC, C3H, and C2H2 TF families were overrepresented in the GCN [76, 84]. In addition, members of TF families that have not been reported, or have been reported only to a lesser extent, to be involved in SE were identified (i.e., SPB (SQUAMOSA promoter binding protein-like), GRAS (GRAS-domain), trihelix, G2-like, and CAMTA (CALMODULIN BINDING TRANSCRIPTION ACTIVATOR 3)). The members of the GRAS, trihelix, and CAMTA families are known to be involved in the regulation of stress responses [47, 85, 86].
Further, it is reported that miRNAs (e.g., miR156, miR159, miR162, miR164, miR166, miR167, miR168, miR169, miR171, miR319, miR393, and miR396) play an important role in SE [87–91]. Consistent with previous studies, several TFs targeted by miRNAs were recovered from the SE-related GCN. These included seven miR156/157-targeted genes of the SPB TF family, seven miR169-targeted genes of the CCAAT TF family, six miR396-targeted genes of the GRF TF family, five miR166/miR165-targeted genes of the HB TF family, five miR164-targeted genes of the NAC family, and five miR159/miR319-targeted genes of the TCP TF family. These miRNA-targeted TF-encoding genes may play a significant role in the regulation of SE responses.
Recent studies have uncovered critical roles of epigenetic modifications in the regulation of SE, in particular, DNA methylation/demethylation [92–94] and histone modifications [91, 95, 96]. Recently, an expression study on Arabidopsis embryos at single-cell resolution has provided evidence for distinct expression patterns for many epigenetic regulators across embryonic tissues [97]. Our coexpression network also revealed that many genes encoding epigenetic regulators, such as METHYLTRANSFERASE 1 (MET1), CHROMOMETHYLASE 3 (CMT3), DEMETER (DME), DEMETER-LIKE (DML1,-2), histone acetyltransferases (HISTONE ACETYLTRANSFERASE OF THE CBP FAMILY; HAC1,-4,-5,-12), histone deacetylases (HISTONE DEACETYLASE; HDA1,-2,-3,-5,-6,-8,-9,-14,-15,-17), and histone demethylases (JUMONJI DOMAIN-CONTAINING PROTEIN; JMJ14,-16,-22,-27,-29), were coexpressed with key genes involved in the regulation of SE.
The present study showed that the WGCNA pipeline could be used to identify biologically relevant modules of SE. However, our analysis has some limitations. The main limitations were the small sample size used in the analysis and the lack of an independent dataset to replicate the findings. Langfelder and Horvath [32] recommend using at least 15 samples to construct robust networks. However, high-quality, clean data could also result in biologically meaningful networks even with <15 samples. Therefore, further experiments are recommended to validate the hub genes discovered in the present study. Furthermore, the GCN built in the present study was based on microarray gene expression data. Although hybridization-based gene expression profiling approaches are high-throughput and relatively inexpensive, they have a number of limitations; most importantly, they provide only an indirect measure of the level of gene expression, can only be used to study the expression levels of genes that the arrays are designed to detect, and are subject to cross-hybridization biases [98]. Given these limitations, we recommend performing a GCN analysis on an expression dataset generated through high-throughput transcriptome sequencing (RNA-seq) with an appropriate number of replicates. Unlike microarrays, RNA-seq is not dependent on prior knowledge of the genome sequence, has higher sensitivity to genes expressed at either a low or a very high level, and has higher levels of reproducibility [99]. Therefore, it could generate a more suitable dataset for GCN analysis.
5. Conclusion
In this study, a GCN was successfully constructed for SE employing WGCNA. Gene modules and hub genes related to Arabidopsis somatic embryo development were successfully mined based on their statistical significance. The findings reported here provide a unique resource to advance the understanding of the regulation of SE at the molecular level.
Acknowledgments
The authors gratefully acknowledge the support from the University of Colombo, Sri Lanka.
Abbreviations
- ABI3: ABSCISIC ACID INSENSITIVE 3
- AGL: AGAMOUS-LIKE
- AIL: AINTEGUMENTA-LIKE
- BBM: BABY BOOM
- bHLH: Basic helix-loop-helix
- bZIP: BASIC LEUCINE-ZIPPER
- C2H2: Cys2-His2
- CRE: Cis-acting regulatory element
- DEG: Differentially expressed gene
- EMB: EMBRYO-DEFECTIVE
- FC: Fold change
- FUS3: FUSCA3
- GCN: Gene coexpression network
- GEO: Gene Expression Omnibus
- GO: Gene Ontology
- HB: HOMEOBOX
- IAA30: INDOLE-3-ACETIC ACID INDUCIBLE 30
- JMJ: JUMONJI DOMAIN-CONTAINING
- KAN3: KANADI 3
- LEC: LEAFY COTYLEDON
- ME: Module eigengene
- MEME: Multiple EM for Motif Elicitation
- miRNA: MicroRNA
- PLACE: Plant cis-acting regulatory DNA elements
- r: Pearson correlation coefficient
- RNA-Seq: RNA-sequencing
- RUS: ROOT UV-B SENSITIVE
- SE: Somatic embryogenesis
- STO: SALT TOLERANCE
- TF: Transcription factor
- WGCNA: Weighted Gene Correlation Network Analysis.
Data Availability
The datasets used to support the findings of this study are included within the article and within the supplementary information files.
Disclosure
A preprint has previously been published [100].
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Authors' Contributions
KKD participated in the design of the study, performed the GCN construction and analysis, and drafted the manuscript. JMD helped to interpret data and draft the manuscript. AMW conceived the study, participated in the design of the study, and helped to draft the manuscript. All authors have read and approved the final manuscript.
Supplementary Materials
References
- 1. Steward F. C., Mapes M. O., Mears K. Growth and organized development of cultured cells. II. Organization in cultures grown from freely suspended cell. American Journal of Botany. 1958;45(10):705–708. doi: 10.1002/j.1537-2197.1958.tb10599.x.
- 2. Zimmerman J. L. Somatic embryogenesis: a model for early development in higher plants. The Plant Cell. 1993;5(10):1411–1423. doi: 10.2307/3869792.
- 3. Capron A., Chatfield S., Provart N., Berleth T. Embryogenesis: pattern formation from a single cell. The Arabidopsis Book. 2009;7, article e0126 doi: 10.1199/tab.0126.
- 4. de Vries S. C., Weijers D. Plant embryogenesis. Current Biology. 2017;27(17):R870–R873. doi: 10.1016/j.cub.2017.05.026.
- 5. Etienne H. Protocol for Somatic Embryogenesis in Woody Plants. Berlin/Heidelberg: Springer-Verlag; 2005. Somatic embryogenesis protocol: coffee (Coffea arabica L) pp. 167–179.
- 6. Steinmacher D. A., Clement C. R., Guerra M. P. Somatic embryogenesis from immature peach palm inflorescence explants: towards development of an efficient protocol. Plant Cell, Tissue and Organ Culture. 2007;89(1):15–22. doi: 10.1007/s11240-007-9207-6.
- 7. Manrique-Trujillo S., Díaz D., Reaño R., Ghislain M., Kreuze J. Sweetpotato plant regeneration via an improved somatic embryogenesis protocol. Scientia Horticulturae. 2013;161:95–100. doi: 10.1016/j.scienta.2013.06.038.
- 8. Vinoth S., Gurusaravanan P., Jayabalan N. Optimization of somatic embryogenesis protocol in Lycopersicon esculentum L. using plant growth regulators and seaweed extracts. Journal of Applied Phycology. 2014;26(3):1527–1537. doi: 10.1007/s10811-013-0151-z.
- 9. Méndez-Hernández H. A., Ledezma-Rodríguez M., Avilez-Montalvo R. N., et al. Signaling overview of plant somatic embryogenesis. Frontiers in Plant Science. 2019;10:p. 77. doi: 10.3389/fpls.2019.00077.
- 10. Dantu P. K., Tomar U. K., Tripathi G. Cellular and Biochemical Science. New Delhi: IK International House Pvt Ltd; 2010. Somatic embryogenesis; pp. 892–908.
- 11. El-Esawi M. A. Nonzygotic embryogenesis for plant development. In: Anis M., Ahmad N., editors. Plant Tissue Culture: Propagation, Conservation and Crop Improvement. Singapore: Springer Singapore; 2016. pp. 583–598.
- 12. Zeng F., Zhang X., Cheng L., et al. A draft gene regulatory network for cellular totipotency reprogramming during plant somatic embryogenesis. Genomics. 2007;90(5):620–628. doi: 10.1016/j.ygeno.2007.07.007.
- 13. Smertenko A., Bozhkov P. Applied Plant Cell Biology. Berlin, Heidelberg: Springer; 2014. The life and death signalling underlying cell fate determination during somatic embryogenesis; pp. 131–178.
- 14. Cetz-Chel J. E., Loyola-Vargas V. M. Transcriptome profile of somatic embryogenesis. In: Loyola-Vargas V. M., Ochoa-Alejo N., editors. Somatic Embryogenesis: Fundamental Aspects and Applications. Cham: Springer International Publishing; 2016. pp. 39–52.
- 15. Hecht V., Vielle-Calzada J.-P., Hartog M. V., et al. The Arabidopsis somatic embryogenesis receptor kinase 1 gene is expressed in developing ovules and embryos and enhances embryogenic competence in culture. Plant Physiology. 2001;127(3):803–816. doi: 10.1104/pp.010324.
- 16. Yang X., Zhang X. Regulation of somatic embryogenesis in higher plants. Critical Reviews in Plant Science. 2010;29(1):36–57. doi: 10.1080/07352680903436291.
- 17. Gaj M. D., Zhang S., Harada J. J., Lemaux P. G. Leafy cotyledon genes are essential for induction of somatic embryogenesis of Arabidopsis. Planta. 2005;222(6):977–988. doi: 10.1007/s00425-005-0041-y.
- 18. Ikeda M., Takahashi M., Fujiwara S., Mitsuda N., Ohme-Takagi M. Improving the efficiency of adventitious shoot induction and somatic embryogenesis via modification of WUSCHEL and LEAFY COTYLEDON 1. Plants. 2020;9(11):p. 1434. doi: 10.3390/plants9111434.
- 19. Lotan T., Ohto M., Yee K. M., et al. Arabidopsis LEAFY COTYLEDON1 is sufficient to induce embryo development in vegetative cells. Cell. 1998;93(7):1195–1205. doi: 10.1016/S0092-8674(00)81463-4.
- 20. Stone S. L., Kwong L. W., Yee K. M., et al. LEAFY COTYLEDON2 encodes a B3 domain transcription factor that induces embryo development. Proceedings of the National Academy of Sciences. 2001;98(20):11806–11811. doi: 10.1073/pnas.201413498.
- 21. Stone S. L., Braybrook S. A., Paula S. L., et al. Arabidopsis LEAFY COTYLEDON2 induces maturation traits and auxin activity: implications for somatic embryogenesis. Proceedings of the National Academy of Sciences. 2008;105(8):3151–3156. doi: 10.1073/pnas.0712364105.
- 22. Boutilier K., Offringa R., Sharma V. K., et al. Ectopic expression of BABY BOOM triggers a conversion from vegetative to embryonic growth. The Plant Cell. 2002;14(8):1737–1749. doi: 10.1105/tpc.001941.
- 23. Zuo J., Niu Q.-W., Frugis G., Chua N.-H. The WUSCHEL gene promotes vegetative-to-embryonic transition in Arabidopsis. The Plant Journal. 2002;30(3):349–359. doi: 10.1046/j.1365-313X.2002.01289.x.
- 24. Wickramasuriya A. M., Dunwell J. M. Global scale transcriptome analysis of Arabidopsis embryogenesis in vitro. BMC Genomics. 2015;16(1):p. 301. doi: 10.1186/s12864-015-1504-6.
- 25. Indoliya Y., Tiwari P., Chauhan A. S., et al. Decoding regulatory landscape of somatic embryogenesis reveals differential regulatory networks between japonica and indica rice subspecies. Scientific Reports. 2016;6(1, article 23050) doi: 10.1038/srep23050.
- 26. Singla B., Tyagi A. K., Khurana J. P., Khurana P. Analysis of expression profile of selected genes expressed during auxin-induced somatic embryogenesis in leaf base system of wheat (Triticum aestivum) and their possible interactions. Plant Molecular Biology. 2007;65(5):677–692. doi: 10.1007/s11103-007-9234-z.
- 27. Zeng F., Zhang X., Zhu L., Tu L., Guo X., Nie Y. Isolation and characterization of genes associated to cotton somatic embryogenesis by suppression subtractive hybridization and macroarray. Plant Molecular Biology. 2006;60(2):167–183. doi: 10.1007/s11103-005-3381-x.
- 28. Salvo S. A. G. D., Hirsch C. N., Buell C. R., Kaeppler S. M., Kaeppler H. F. Whole transcriptome profiling of maize during early somatic embryogenesis reveals altered expression of stress factors and embryogenesis-related genes. PLoS One. 2014;9(10, article e111407) doi: 10.1371/journal.pone.0111407.
- 29. Rajesh M. K., Fayas T. P., Naganeeswaran S., et al. De novo assembly and characterization of global transcriptome of coconut palm (Cocos nucifera L.) embryogenic calli using Illumina paired-end sequencing. Protoplasma. 2016;253(3):913–928. doi: 10.1007/s00709-015-0856-8.
- 30. van Dam S., Võsa U., van der Graaf A., Franke L., de Magalhães J. P. Gene co-expression analysis for functional classification and gene–disease predictions. Briefings in Bioinformatics. 2017;19:575–592. doi: 10.1093/bib/bbw139.
- 31. Zhang B., Horvath S. A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology. 2005;4(1):p. Article17. doi: 10.2202/1544-6115.1128.
- 32. Langfelder P., Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9(1, article 559) doi: 10.1186/1471-2105-9-559.
- 33. Gao P., Xiang D., Quilichini T. D., et al. Gene expression atlas of embryo development in Arabidopsis. Plant Reproduction. 2019;32(1):93–104. doi: 10.1007/s00497-019-00364-x.
- 34. Shaik R., Ramakrishna W. Genes and co-expression modules common to drought and bacterial stress responses in Arabidopsis and rice. PLoS One. 2013;8(10, article e77261) doi: 10.1371/journal.pone.0077261.
- 35. Tai Y., Liu C., Yu S., et al. Gene co-expression network analysis reveals coordinated regulation of three characteristic secondary biosynthetic pathways in tea plant (Camellia sinensis). BMC Genomics. 2018;19(1):p. 616. doi: 10.1186/s12864-018-4999-9.
- 36. Zhu M., Xie H., Wei X., et al. WGCNA analysis of salt-responsive core transcriptome identifies novel hub genes in rice. Genes. 2019;10(9):p. 719. doi: 10.3390/genes10090719.
- 37. Becker M. G., Chan A., Mao X., et al. Vitamin C deficiency improves somatic embryo development through distinct gene regulatory networks in Arabidopsis. Journal of Experimental Botany. 2014;65(20):5903–5918. doi: 10.1093/jxb/eru330.
- 38. Langfelder P., Luo R., Oldham M. C., Horvath S. Is my network module preserved and reproducible? PLoS Computational Biology. 2011;7(1, article e1001057) doi: 10.1371/journal.pcbi.1001057.
- 39. Tian T., Liu Y., Yan H., et al. agriGO v2.0: a GO analysis toolkit for the agricultural community, 2017 update. Nucleic Acids Research. 2017;45(W1):W122–W129. doi: 10.1093/nar/gkx382.
- 40. Magnani E., Jiménez-Gómez J. M., Soubigou-Taconnat L., Lepiniec L., Fiume E. Profiling the onset of somatic embryogenesis in Arabidopsis. BMC Genomics. 2017;18(1):p. 998. doi: 10.1186/s12864-017-4391-1.
- 41. Meinke D. W. Genome-wide identification of EMBRYO-DEFECTIVE (EMB) genes required for growth and development in Arabidopsis. The New Phytologist. 2020;226(2):306–325. doi: 10.1111/nph.16071.
- 42. Xiang D., Venglat P., Tibiche C., et al. Genome-wide analysis reveals gene expression and metabolic network dynamics during embryo development in Arabidopsis. Plant Physiology. 2011;156(1):346–356. doi: 10.1104/pp.110.171702.
- 43. Pikaard C. S., Mittelsten S. O. Epigenetic regulation in plants. Cold Spring Harbor Perspectives in Biology. 2014;6(12, article a019315) doi: 10.1101/cshperspect.a019315.
- 44. Bailey T. L., Boden M., Buske F. A., et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Research. 2009;37(suppl_2):W202–W208. doi: 10.1093/nar/gkp335.
- 45. Buske F. A., Bodén M., Bauer D. C., Bailey T. L. Assigning roles to DNA regulatory motifs using comparative genomics. Bioinformatics. 2010;26(7):860–866. doi: 10.1093/bioinformatics/btq049.
- 46. Higo K., Ugawa Y., Iwamoto M., Korenaga T. Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Research. 1999;27(1):297–300. doi: 10.1093/nar/27.1.297.
- 47. De Clercq I., Van de Velde J., Luo X., et al. Integrative inference of transcriptional networks in Arabidopsis yields novel ROS signalling regulators. Nature Plants. 2021;7(4):500–513. doi: 10.1038/s41477-021-00894-1.
- 48. Consortium TGO. Creating the Gene Ontology resource: design and implementation. Genome Research. 2001;11(8):1425–1433. doi: 10.1101/gr.180801.
- 49. Ashburner M., Ball C. A., Blake J. A., et al. Gene Ontology: tool for the unification of biology. Nature Genetics. 2000;25(1):25–29. doi: 10.1038/75556.
- 50. Rue-Albrecht K., McGettigan P. A., Hernández B., et al. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data. BMC Bioinformatics. 2016;17(1):p. 126. doi: 10.1186/s12859-016-0971-3.
- 51. West M. A. L., Harada J. J. Embryogenesis in higher plants: an overview. The Plant Cell. 1993;5(10):1361–1369. doi: 10.2307/3869788.
- 52. Williams E. G., Maheswaran G. Somatic embryogenesis: factors influencing coordinated behaviour of cells as an embryogenic group. Annals of Botany. 1986;57(4):443–462. doi: 10.1093/oxfordjournals.aob.a087127.
- 53. Smertenko A., Bozhkov P. V. Somatic embryogenesis: life and death processes during apical–basal patterning. Journal of Experimental Botany. 2014;65(5):1343–1360. doi: 10.1093/jxb/eru005.
- 54. Zavattieri M. A., Frederico A. M., Lima M., Sabino R., Arnholdt-Schmitt B. Induction of somatic embryogenesis as an example of stress-related plant reactions. Electronic Journal of Biotechnology. 2010;13(1):1–9. doi: 10.2225/vol13-issue1-fulltext-4.
- 55. Jin F., Hu L., Yuan D., et al. Comparative transcriptome analysis between somatic embryos (SEs) and zygotic embryos in cotton: evidence for stress response functions in SE development. Plant Biotechnology Journal. 2014;12(2):161–173. doi: 10.1111/pbi.12123.
- 56. Qiu J., Du Z., Wang Y., et al. Weighted gene co-expression network analysis reveals modules and hub genes associated with the development of breast cancer. Medicine. 2019;98(6, article e14345) doi: 10.1097/MD.0000000000014345.
- 57. Liu Y., Gu H.-Y., Zhu J., Niu Y.-M., Zhang C., Guo G.-L. Identification of hub genes and key pathways associated with bipolar disorder based on weighted gene co-expression network analysis. Frontiers in Physiology. 2019;10:p. 1081. doi: 10.3389/fphys.2019.01081.
- 58. Zhu Z., Jin Z., Deng Y., et al. Co-expression network analysis identifies four hub genes associated with prognosis in soft tissue sarcoma. Frontiers in Genetics. 2019;10:p. 37. doi: 10.3389/fgene.2019.00037.
- 59. Du J., Wang S., He C., Zhou B., Ruan Y.-L., Shou H. Identification of regulatory networks and hub genes controlling soybean seed set and size using RNA sequencing analysis. Journal of Experimental Botany. 2017;68(8):1955–1972. doi: 10.1093/jxb/erw460.
- 60. Zhang X., Feng H., Li Z., et al. Application of weighted gene co-expression network analysis to identify key modules and hub genes in oral squamous cell carcinoma tumorigenesis. Oncotargets and Therapy. 2018;Volume 11:6001–6021. doi: 10.2147/OTT.S171791.
- 61. Wang Q., Zeng X., Song Q., Sun Y., Feng Y., Lai Y. Identification of key genes and modules in response to cadmium stress in different rice varieties and stem nodes by weighted gene co-expression network analysis. Scientific Reports. 2020;10(1):p. 9525. doi: 10.1038/s41598-020-66132-4.
- 62. Zhang F., Wang L., Bai P., et al. Identification of regulatory networks and hub genes controlling nitrogen uptake in tea plants [Camellia sinensis (L.) O. Kuntze]. Journal of Agricultural and Food Chemistry. 2020;68(8):2445–2456. doi: 10.1021/acs.jafc.9b06427.
- 63. Causier B., Ashworth M., Guo W., Davies B. The TOPLESS interactome: a framework for gene repression in Arabidopsis. Plant Physiology. 2012;158(1):423–438. doi: 10.1104/pp.111.186999.
- 64.Gulzar B., Mujib A., Malik M. Q., Sayeed R., Mamgain J., Ejaz B. Genes, proteins and other networks regulating somatic embryogenesis in plants. Journal, Genetic Engineering & Biotechnology . 2020;18(1):p. 31. doi: 10.1186/s43141-020-00047-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Wójcikowska B., Gaj M. D. Expression profiling of AUXIN RESPONSE FACTOR genes during somatic embryogenesis induction in Arabidopsis. Plant Cell Reports . 2017;36(6):843–858. doi: 10.1007/s00299-017-2114-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Perry N., Leasure C. D., Tong H., Duarte E. M., He Z.-H. RUS6, a DUF647-containing protein, is essential for early embryonic development in Arabidopsis thaliana. BMC Plant Biology . 2021;21(1):p. 232. doi: 10.1186/s12870-021-03011-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Becerra C., Jahrmann T., Puigdomènech P., Vicient C. M. Ankyrin repeat-containing proteins in Arabidopsis: characterization of a novel and abundant group of genes coding ankyrin-transmembrane proteins. Gene . 2004;340(1):111–121. doi: 10.1016/j.gene.2004.06.006. [DOI] [PubMed] [Google Scholar]
- 68.Yan J., Wang J., Zhang H. An ankyrin repeat-containing protein plays a role in both disease resistance and antioxidation metabolism. The Plant Journal . 2002;29(2):193–202. doi: 10.1046/j.0960-7412.2001.01205.x. [DOI] [PubMed] [Google Scholar]
- 69.Zhang H., Scheirer D. C., Fowle W. H., Goodman H. M. Expression of antisense or sense RNA of an ankyrin repeat-containing gene blocks chloroplast differentiation in arabidopsis. The Plant Cell . 1992;4(12):1575–1588. doi: 10.1105/tpc.4.12.1575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Albert S., Despres B., Guilleminot J., et al. The EMB506 gene encodes a novel ankyrin repeat containing protein that is essential for the normal development of Arabidopsis embryos. The Plant Journal . 1999;17(2):169–179. doi: 10.1046/j.1365-313X.1999.00361.x. [DOI] [PubMed] [Google Scholar]
- 71.Poon S., Heath R. L., Clarke A. E. A chimeric arabinogalactan protein promotes somatic embryogenesis in cotton cell culture. Plant Physiology . 2012;160(2):684–695. doi: 10.1104/pp.112.203075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Basu D., Tian L., Wang W., et al. A small multigene hydroxyproline-O-galactosyltransferase family functions in arabinogalactan-protein glycosylation, growth and development in Arabidopsis. BMC Plant Biology . 2015;15(1):p. 295. doi: 10.1186/s12870-015-0670-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Duchow S., Dahlke R. I., Geske T., Blaschek W., Classen B. Arabinogalactan-proteins stimulate somatic embryogenesis and plant propagation of Pelargonium sidoides. Carbohydrate Polymers . 2016;152:149–155. doi: 10.1016/j.carbpol.2016.07.015. [DOI] [PubMed] [Google Scholar]
- 74.Stålberg K., Ellerstöm M., Ezcurra I., Ablov S., Rask L. Disruption of an overlapping E-box/ABRE motif abolished high transcription of the napA storage-protein promoter in transgenic Brassica napus seeds. Planta . 1996;199(4):515–519. doi: 10.1007/BF00195181. [DOI] [PubMed] [Google Scholar]
- 75.Heim M. A., Jakoby M., Werber M., Martin C., Weisshaar B., Bailey P. C. The basic helix-loop-helix transcription factor family in plants: a genome-wide study of protein structure and functional diversity. Molecular Biology and Evolution. 2003;20(5):735–747. doi: 10.1093/molbev/msg088.
- 76.Gliwicka M., Nowak K., Balazadeh S., Mueller-Roeber B., Gaj M. D. Extensive modulation of the transcription factor transcriptome during somatic embryogenesis in Arabidopsis thaliana. PLoS One. 2013;8(7, article e69261). doi: 10.1371/journal.pone.0069261.
- 77.Allen R. D., Bernier F., Lessard P. A., Beachy R. N. Nuclear factors interact with a soybean beta-conglycinin enhancer. The Plant Cell. 1989;1(6):623–631. doi: 10.1105/tpc.1.6.623.
- 78.Liscum E., Reed J. W. Genetics of Aux/IAA and ARF action in plant growth and development. Plant Molecular Biology. 2002;49(3/4):387–400. doi: 10.1023/A:1015255030047.
- 79.Braybrook S. A., Stone S. L., Park S., et al. Genes directly regulated by LEAFY COTYLEDON2 provide insight into the control of embryo maturation and somatic embryogenesis. Proceedings of the National Academy of Sciences. 2006;103(9):3468–3473. doi: 10.1073/pnas.0511331103.
- 80.Wójcik A. M., Wójcikowska B., Gaj M. D. Current perspectives on the auxin-mediated genetic network that controls the induction of somatic embryogenesis in plants. International Journal of Molecular Sciences. 2020;21(4):p. 1333. doi: 10.3390/ijms21041333.
- 81.Horstman A., Li M., Heidmann I., et al. The BABY BOOM transcription factor activates the LEC1-ABI3-FUS3-LEC2 network to induce somatic embryogenesis. Plant Physiology. 2017;175(2):848–857. doi: 10.1104/pp.17.00232.
- 82.Zheng Y., Ren N., Wang H., Stromberg A. J., Perry S. E. Global identification of targets of the Arabidopsis MADS domain protein AGAMOUS-Like15. The Plant Cell. 2009;21(9):2563–2577. doi: 10.1105/tpc.109.068890.
- 83.Joshi S., Keller C., Perry S. E. The EAR motif in the Arabidopsis MADS transcription factor AGAMOUS-like 15 is not necessary to promote somatic embryogenesis. Plants. 2021;10(4):p. 758. doi: 10.3390/plants10040758.
- 84.Nowak K., Gaj M. D. Transcription factors in the regulation of somatic embryogenesis. In: Loyola-Vargas V. M., Ochoa-Alejo N., editors. Somatic Embryogenesis: Fundamental Aspects and Applications. Cham: Springer International Publishing; 2016. pp. 53–79.
- 85.Pant P., Iqbal Z., Pandey B. K., Sawant S. V. Genome-wide comparative and evolutionary analysis of calmodulin-binding transcription activator (CAMTA) family in Gossypium species. Scientific Reports. 2018;8(1):p. 5573. doi: 10.1038/s41598-018-23846-w.
- 86.Kaplan-Levy R. N., Brewer P. B., Quon T., Smyth D. R. The trihelix family of transcription factors - light, stress and development. Trends in Plant Science. 2012;17(3):163–171. doi: 10.1016/j.tplants.2011.12.002.
- 87.Siddiqui Z. H., Abbas Z. K., Ansari M. W., Khan M. N. The role of miRNA in somatic embryogenesis. Genomics. 2019;111(5):1026–1033. doi: 10.1016/j.ygeno.2018.11.022.
- 88.Alves A., Cordeiro D., Correia S., Miguel C. Small non-coding RNAs at the crossroads of regulatory pathways controlling somatic embryogenesis in seed plants. Plants. 2021;10(3):p. 504. doi: 10.3390/plants10030504.
- 89.Wójcik A. M., Gaj M. D. miR393 contributes to the embryogenic transition induced in vitro in Arabidopsis via the modification of the tissue sensitivity to auxin treatment. Planta. 2016;244(1):231–243. doi: 10.1007/s00425-016-2505-7.
- 90.Szyrajew K., Bielewicz D., Dolata J., et al. MicroRNAs are intensively regulated during induction of somatic embryogenesis in Arabidopsis. Frontiers in Plant Science. 2017;8. doi: 10.3389/fpls.2017.00018.
- 91.Nowak K., Morończyk J., Wójcik A., Gaj M. D. AGL15 controls the embryogenic reprogramming of somatic cells in Arabidopsis through the histone acetylation-mediated repression of the miRNA biogenesis genes. International Journal of Molecular Sciences. 2020;21(18):p. 6733. doi: 10.3390/ijms21186733.
- 92.Chen X., Xu X., Shen X., et al. Genome-wide investigation of DNA methylation dynamics reveals a critical role of DNA demethylation during the early somatic embryogenesis of Dimocarpus longan Lour. Tree Physiology. 2020;40(12):1807–1826. doi: 10.1093/treephys/tpaa097.
- 93.Grzybkowska D., Nowak K., Gaj M. D. Hypermethylation of auxin-responsive motifs in the promoters of the transcription factor genes accompanies the somatic embryogenesis induction in Arabidopsis. International Journal of Molecular Sciences. 2020;21(18):p. 6849. doi: 10.3390/ijms21186849.
- 94.Ji L., Mathioni S. M., Johnson S., et al. Genome-wide reinforcement of DNA methylation occurs during somatic embryogenesis in soybean. The Plant Cell. 2019;31(10):2315–2331. doi: 10.1105/tpc.19.00255.
- 95.Rodríguez-Sanz H., Moreno-Romero J., Solís M.-T., Köhler C., Risueño M. C., Testillano P. S. Changes in histone methylation and acetylation during microspore reprogramming to embryogenesis occur concomitantly with BnHKMT and BnHAT expression and are associated with cell totipotency, proliferation, and differentiation in Brassica napus. Cytogenetic and Genome Research. 2014;143(1-3):209–218. doi: 10.1159/000365261.
- 96.Wójcikowska B., Botor M., Morończyk J., et al. Trichostatin A triggers an embryogenic transition in Arabidopsis explants via an auxin-related pathway. Frontiers in Plant Science. 2018;9:p. 1353. doi: 10.3389/fpls.2018.01353.
- 97.Kao P., Schon M. A., Mosiolek M., Nodine M. D. Gene expression variation in Arabidopsis embryos at single-nucleus resolution. Development. 2021;148(13):p. dev199589. doi: 10.1242/dev.199589.
- 98.Bumgarner R. Overview of DNA microarrays: types, applications, and their future. Current Protocols in Molecular Biology. 2013;101(1):Unit 22.1. doi: 10.1002/0471142727.mb2201s101.
- 99.Zhao S., Fung-Leung W. P., Bittner A., Ngo K., Liu X. Comparison of RNA-Seq and microarray in transcriptome profiling of activated T cells. PLoS One. 2014;9(1, article e78644). doi: 10.1371/journal.pone.0078644.
- 100.de Silva K. K., Dunwell J. M., Wickramasuriya A. M. Weighted Gene Correlation Network Analysis (WGCNA) of Arabidopsis somatic embryogenesis (SE) and identification of key gene modules to uncover SE-associated hub genes. 2022.
Data Availability Statement
The datasets used to support the findings of this study are included within the article and within the supplementary information files.