Skip to main content
International Journal of Molecular Sciences logoLink to International Journal of Molecular Sciences
. 2020 Jan 29;21(3):861. doi: 10.3390/ijms21030861

Comparative Transcriptome Analysis of Different Dendrobium Species Reveals Active Ingredients-Related Genes and Pathways

Yingdan Yuan 1,2, Bo Zhang 1,2,3, Xinggang Tang 1,2, Jinchi Zhang 1,2,*, Jie Lin 1,2,*
PMCID: PMC7037882  PMID: 32013237

Abstract

Dendrobium is widely used in traditional Chinese medicine, which contains many kinds of active ingredients. In recent years, many Dendrobium transcriptomes have been sequenced. Hence, weighted gene co-expression network analysis (WGCNA) was used with the gene expression profiles of active ingredients to identify the modules and genes that may associate with particular species and tissues. Three kinds of Dendrobium species and three tissues were sampled for RNA-seq to generate a high-quality, full-length transcriptome database. Based on significant changes in gene expression, we constructed co-expression networks and revealed 19 gene modules. Among them, four modules with properties correlating to active ingredients regulation and biosynthesis, and several hub genes were selected for further functional investigation. This is the first time the WGCNA method has been used to analyze Dendrobium transcriptome data. Further excavation of the gene module information will help us to further study the role and significance of key genes, key signaling pathways, and regulatory mechanisms between genes on the occurrence and development of medicinal components of Dendrobium.

Keywords: Dendrobium, comparative transcriptome, active ingredients, different tissues

1. Introduction

Dendrobium is a perennial epiphytic herb of the genus Orchidaceae, with more than 1500 species in the world which are mostly growing in tropical and subtropical Asia and eastern Australia [1,2]. In China, there are more than 80 species of Dendrobium that had been reported in studies, mainly distributed in south of the Qinling Mountains [3,4]. As a kind of ornamental and medicinal plants, more and more researchers have paid attention to it in recent years [5]. Also, Dendrobium is a traditional Chinese herbal medicine, that has been used for benefiting stomach and clearing heat, nourishing yin, and promoting fluid [6]. The medicinal ingredients of Dendrobium are very complex, not only including polysaccharides and alkaloids, but also flavonoids, various amino acids, and trace mineral elements. However, the most two important components are polysaccharides and alkaloids [7,8,9,10].

With the development of high-throughput technology, a variety of data sources have been derived, including gene expression microarray, RNA-seq, metabolomic and CHIP-seq, these have become powerful tools for studying plant growth, development, and physiology at the transcriptional and metabolic levels [11,12,13]. In order to explore the key genes and regulatory pathways for the synthesis of its medicinal ingredients, more researchers have done different kinds of Dendrobium transcriptome research [14,15,16]. However, it has been difficult to systematically explain the relationship between gene expression or metabolite changes and trait differences [17]. The analysis of correlation networks can bridge the gap between single gene interpretation and systematic biology research by mining the link between genes and gene products [18,19,20], such as integrating a single gene into a co-expression network based on pairwise gene expression correlation [21]. Gene co-expression analysis has been used to discover new candidate genes [22,23], identify key modulators of immune responses [24], and reconstruct regulatory pathways [25]. Genes belonging to the same co-expression sub-network (or module) are likely to be functionally related [26,27,28], participate in similar biological processes, or be part of the same pathway [25].

Weighted gene co-expression network analysis (WGCNA) is one of the most useful methods based on gene co-expression networks. It focuses on the set of genes other than on a single gene in the observed gene expression data, and it alleviates the multiple detection problems inherent in chip data analysis and can be used in unweighted correlation networks [29]. Compared with many other analysis methods, WGCNA has the advantages of summarizing and standardizing the methods and functions of integrated R packages, including methods of weighted and unweighted correlation networks [30]. WGCNA is used in combination with gene chip data, transcriptome data, and metabolome data for metabolic regulation network simulation, mining inter-genetic interactions, screening functional genes, etc. It has been extensively studied in plant growth, tissue and organ development, pigmentation and fragrance synthesis [31,32,33,34].

To reveal the underlying molecular mechanism of the active ingredients of Dendrobium, we downloaded three datasets from the NCBI Sequence Reading Archive (SRA) to identify highly connected hub genes and important modules. This study used three different Dendrobium species and different tissues as materials to perform transcriptome sequencing data, combined with polysaccharide and alkaloid content data, and used WGCNA analysis to construct a co-expression gene network. Correlation analysis was performed between the gene module and the polysaccharide and alkaloid data, and hub genes related to the main medicinal ingredients were discovered, in order to provide new clues for further research on the molecular mechanism of medicinal ingredients of Dendrobium. For the first time, a co-expression network analysis of transcriptome genes in Dendrobium was constructed, and modules with high correlation in secondary metabolism were analyzed, laying a foundation for the discovery of functional genes of medicinal ingredients.

2. Results

2.1. Determination of Total Alkaloid and Polysaccharide Contents in Different Species and Different Tissues

Polysaccharides and alkaloids are the main medicinal components of Dendrobium. Therefore, we determined the polysaccharide and total alkaloid in three different tissues of three Dendrobium species (Figure 1). The biennial Dendrobium officinale has the highest content in the comparison of different tissues and species. For polysaccharide content, the stem is the main enrichment tissue, while for total alkaloid, the leaf is the main enrichment tissue. In the stem, the polysaccharide content of Dendrobium varied from 23.34 to 37.41%. In the leaf, the total alkaloid content of Dendrobium varied from 0.0291 to 0.0421%.

Figure 1.

Figure 1

Polysaccharide and total alkaloid contents variation in different tissues of different species. (a) Polysaccharide content. (b) Total alkaloid content. There are three replicates of each sample. 2-Dh: two-year-old Dendrobium huoshanese, 2-Do: two-year-old Dendrobium officinale, 2-Dm: two-year-old Dendrobium moniliforme.

2.2. Differential Expression Analysis

In this study, we calculated the FPKM of each sample to regulate the expression and investigate the gene expression differences among different tissues of different Dendrobium species. We compared the three data sets from different comparison groups using a Venn diagram (Figure 2a–c). In addition, we compared the number of DEGs between different species in the same tissue, and many genes were differentially expressed only in one or two comparisons. Then the edgeR was used to test the differential expression of the repeated count data. We used “FDR < 0.05 & | log2FC | ≥1” as the criterion for significant differences in gene expression. When log2FC > 1, DEG is considered to be up-regulated. In contrast, for log2FC < −1, it is considered a downward adjustment. We did up and down-regulated analysis of DEGs for three different tissues and three different Dendrobium species, and the results are shown in Figure 3a–i.

Figure 2.

Figure 2

Venn diagram of differentially expressed genes (DEGs) in different comparisons. All DEGs are grouped into three comparison groups represented by three circles. The overlapping portions of the different circles represent the number of DEGs common to these comparison groups. (a) Venn diagram of three kinds of Dendrobium leaves. (b) Venn diagram of three kinds of Dendrobium roots. (c) Venn diagram of three kinds of Dendrobium stems.

Figure 3.

Figure 3

Volcano plots of the DEGs in different comparisons. Red dots indicate significant up-regulation of genes, and blue dots indicate significant down-regulation of genes. Black dots represent non-DEGs. (a) Dm_L vs. Dh_L volcano; (b) Dm_L vs. Do_L volcano; (c) Do_L vs. Dh_L volcano; (d) Dm_R vs. Dh_R volcano; (e) Dm_R vs. Do_R volcano; (f) Do_R vs. Dh_R volcano; (g) Dm_S vs. Dh_S volcano; (h) Dm_S vs. Do_S volcano; (i) Do_S vs. Dh_S volcano.

According to the transcriptome data, all DEGs were used for hierarchical cluster analysis of transcription abundance in three different tissues. The heatmap of DEGs between different tissues and different Dendrobium species show similar transcriptome profiles for Dh_R, Dh_L, Dh_S, Do_R, Do_L, Do_S, Dm_R, Dm_L, and Dm_S (Figure 4a). A total of 35,159 DEGs were identified and analyzed using criteria of log10 (FPKM+1) and p < 0.05. The trend of the specific expression level is shown in the number under the color bar at the top left. On the left is the gene cluster tree. The closer the degree of separation between the two genes, the closer their expression is. In order to reflect the main trends and tissue-specific expression of different Dendrobium species, all DEGs were clustered into ten expression profiles (Figure 4b) using the K-means method and hierarchical clustering with similar regulation model and log2 (foldchange). DEGs belonging to cluster 3 were more highly expressed in leaves than in other tissues. Except for subcluster 1 and subcluster 3, all the other subclusters have just one gene trend.

Figure 4.

Figure 4

Expression profiles of all DEGs. (a) The heatmap of DEGs and the FPKM distribution of all unigenes obtained by hierarchical cluster analysis. Each column in the figure represents a sample, and each row represents a gene. The colors in the graph indicate the magnitude of gene expression (log10 (FPKM + 1)) in the sample. Red indicates that the gene is highly expressed in the sample, and the blue indicates that the gene expression is low. (b) K-means clustering analysis of gene expression profiles. The blue line represents the expression model. The gray lines are the expression profiles of each DEGs. The x-axis represents different tissues of different Dendrobium plants. The y-axis represents log2 (ratio).

2.3. GO and KEGG of DEGs Annotation and Enrichment Analysis

GO is a database established by the Gene Ontology Consortium. Its purpose is to standardize biological terms about genes and gene products in different databases, to define and describe gene and protein functions. Using the GO database, genes can be classified according to biological process, cellular component, and molecular function. The GO annotation statistics were performed on the differentially expressed genes in pairs, and one of the samples was used as a control. The obtained results can be used to plot the GO annotation bar graph of up and down DEGs (Figure 5). Figure 5 only shows the comparison of the stems of different Dendrobium species, and the other two tissues are shown in the Supplementary Figure S1.

Figure 5.

Figure 5

Gene Ontology (GO) annotations of up and down regulated DEGs. The bottom x-axis indicates the number of DEGs annotated to a GO term, the upper x-axis indicates the proportion of DEGs annotated to a GO terms to the total number of all GO annotated DEGs; and the y-axis represents each detailed classification of GO. (a) Dm_S vs Dh_S GO annotation; (b) Dm_S vs. Do_S GO annotation; (c) Do_S vs. Dh_S GO annotation.

KEGG focuses on biochemical pathways, especially genes involved in protein, carbohydrate, and energy metabolism. In the KEGG enrichment analysis, we mapped all these genes to a reference pathway in the KEGG database to determine the biological pathways in which these genes may be involved (Figure 6). In terms of the KEGG pathways, “Dm_S vs. Dh_S” comparisons are involved in 235 pathways (Figure 6a). In “Dm_S vs. Do_S” comparisons, 223 pathways were involved (Figure 6b). Finally, in “Do_S vs. Dh_S” comparisons, 237 pathways were involved (Figure 6c). Among these pathways, “flavonoid biosynthesis” was enriched in the Dm_S vs. Dh_S comparison. “Lipopolysaccharide biosynthesis” related to polysaccharide was the most enriched pathways in the Do_S vs. Dh_S comparison. Notably, we observed terpenoid biosynthesis in the secondary metabolites also enriched in the Do_S vs. Dh_S comparison. The KEGG pathways of different Dendrobium species of the other two tissues are shown in the Supplementary Figure.

Figure 6.

Figure 6

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment of DEGs. The x-axis represents the pathway name, and the y-axis represents the enrichment ratio (sample number/background number). (a) Dm_S vs. Dh_S; (b) Dm_S vs. Do_S; (c) Do_S vs. Dh_S. All pathways in the figure with asterisks indicate significant KEGG enrichment, with three asterisks indicating p-value < 0.001, two asterisks indicating p-value < 0.01, one asterisks indicating p-value < 0.05.

2.4. WGCNA Analysis

2.4.1. Construction of Gene Co-Expression Network of Dendrobium

Consequently, we used the WGCNA package tool to construct a co-expression module that expressed the expression of 8056 genes in 27 Dendrobium samples. The heat map of the cluster dendrogram and tissues and species traits are shown in Figure 7a. One of the most critical parameters is the power value, which mainly affects the independence and average connectivity of the co-expression modules. First, we selected the appropriate power value. When the power value was 18, the independence reached 0.6, and the average connectivity was higher. Therefore, the power values and results of constructing co-expression modules indicate that 19 different co-expression modules were identified in Dendrobium. These co-expression modules were constructed and are shown in different colors (Figure 7). The size of these modules depends on the number of genes they contain. The number of genes and module names are shown in Table 1.

Figure 7.

Figure 7

Clustering dendrogram. (a) Clustering dendrogram of 27 samples and heatmaps of species and tissues traits. The expression is from low to high, and the color transitions from white to red. (b) Clustering dendrogram of DEGs, with dissimilarity based on the topological overlap, together with assigned module colors. The clustered branches represent different modules, and each line represents one gene.

Table 1.

The number of genes in 19 constructed modules.

Module Colors Node Number
Blue 562
Brown 892
Cyan 1132
darkgreen 1016
darkolivegreen 362
darkorange 665
floralwhite 184
greenyellow 440
lightcyan 180
lightgreen 178
lightpink4 40
lightsteelblue1 193
paleturquoise 470
plum2 57
royalblue 275
saddlebrown 179
salmon4 1015
sienna3 109
yellowgreen 107

2.4.2. Interaction Analysis of Co-Expression Module

Subsequently, we analyzed the interactions between the 19 co-expression modules (Figure 8). The heatmap shows the topological overlap matrix (TOM) of all genes in the analysis. Light color indicates low overlap, and dark red indicates high overlap. Except for some high-brightness areas, the overall difference between the modules is not significant, which indicates that the gene expression between the modules is relatively independent and has a high scale independence. The correlations between module eigengene and traits: tissues and species, were analyzed and data are shown in Figure 8b. Two modules are significantly associated with species: Lightgreen (p-value = 1 × 10−5, cor = 0.73) and Lightsteelblue1 (p-value = 2 × 10−6, cor = −0.76). The modules that highly correlated to tissues were Yellowgreen (p-value = 1 × 10−6, cor = 0.79), Salmon4 (p-value = 9 × 10−7, cor = 0.79), Blue (p-value = 3 × 10−6, cor = 0.77), and Sienna3 (p-value =2 × 10−10, cor = 0.9).

Figure 8.

Figure 8

Co-expression network analysis across different tissues and different species. (a) Visualizing the gene network using a heatmap plot. The heatmap depicts the topological overlap matrix (TOM) among all genes in the analysis. (b) Module-trait associations. Each row corresponds to a module characteristic gene (eigengene), and each column corresponds to a trait. Each cell contains a corresponding correlation and p-value. According to the color legend, the table is color-coded by correlation.

We analyzed the connectivity of eigengenes to find the interactions between these constructed co-expression modules. First of all, cluster analysis of these eigengenes was performed (Figure 9a). These 19 clusters were divided into two clusters, including 5 modules (modules 3, 8, 9, 11, and 14) and the remaining 14 modules. According to that, the connectivity effect between different modules is obviously different. (Figure 9b).

Figure 9.

Figure 9

Analysis of connectivity of eigengenes in different module. (a) Cluster analysis of eigengenes. (b) The heatmap of connectivity of eigengenes.

2.4.3. Functional Enrichment Analysis of Genes in Interested Modules

We performed pathway enrichment analysis on the genes of each module separately, explored their related functions, and obtained many meaningful KEGG pathway and GO terms (Figure 10). It is important to identify the most significant modules related to medicinal components. Sienna3 module showed a significantly high correlation with polysaccharide. In the KEGG pathway, it is significantly enriched to fructose and mannose metabolism (ko00051), starch and sucrose metabolism (ko00500), and galactose metabolism (ko00052). The top three GO terms, shown in expanded size, are glucosyltransferase activity (GO:0046527) and starch synthase activity (GO:0009011). These KEGG pathways and GO terms are all related to polysaccharide biosynthesis, and polysaccharides are the main medicinal components of Dendrobium.

Figure 10.

Figure 10

GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment of module sienna3. (a) Top 30 of KEGG pathway enrichment. (b) Thumbnails view of directed acyclic graph (DAG) on molecular function of module sienna3.

In addition, the KEGG pathways regulating Dendrobium polysaccharides and secondary metabolism biosynthesis were also identified in three other modules, which are the blue module, the lightsteelblue1 module, and the salmon4 module (Figure 11). In blue module, genes were significantly enriched in these pathways: anthocyanin biosynthesis (ko00942), drug metabolism—cytochrome P450 (ko00982) and steroid biosynthesis (ko00100). In lightsteelblue1, the significantly enriched pathways are as follows: terpenoid backbone biosynthesis (ko00900), diterpenoid biosynthesis (ko00904), galactose metabolism (ko00052), and glycolysis/gluconeogenesis (ko00010). In salmon4, genes were significantly enriched in these pathways: the pentose phosphate pathway (ko00030), terpenoid backbone biosynthesis (ko00900), and diterpenoid biosynthesis (ko00904). The enriched pathways in these three modules are significantly related to the synthesis of polysaccharides and secondary metabolites of Dendrobium. Therefore, sienna3, blue, lightsteelblue1 and salmon4 modules were defined as important module and extracted for further analysis.

Figure 11.

Figure 11

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment of interested module. (a) Module blue; (b) module lightsteelblue1; (c) module salmon4.

2.4.4. Module Visualize and Hub Genes

We performed genes trend expression analysis of four modules of interest (Figure 12). Interestingly, by analyzing the trend expression of genes in sienna3 module, we found that they are specifically expressed only in the stem, which suggests that they play an important role in the stem. Through further analysis of this module, several hub genes were discovered, such as beta-glucosidase and granule-bound starch synthase (GBSSI). This indicates that the specific expression of these polysaccharide-related genes in stems may be related to the content of polysaccharides in stems significantly higher than in other tissues.

Figure 12.

Figure 12

Expression profile and transcriptional regulatory network associated with the tissue-specific modules. (a), (c), (e), and (g) heatmaps showing genes in module that were significantly over-represented in Dm_L, Dm_R, Dm_S, Do_L, Do_R, Do_S, Dh_L, Dh_R, Dh_S and predicted transcriptional regulatory network associated with the gene sets showing expression patterns at Dm_L, Dm_R, Dm_S, Do_L, Do_R, Do_S, Dh_L, Dh_R, and Dh_S. Heatmaps show the expression profile of all the co-expressed genes (number given on the top) in the modules (labelled on top). Candidate hub genes are shown in rectangular shapes. Purple in figure (b, d and h) represents genes related to polysaccharides. Red in figure (d and f) represents genes related to secondary metabolites. The shapes, from big to small, indicate the weights from big to small. Green in figure d represents bHLH transcription factors. Green in figure (f) represents bZIP transcription factors and yellow represents MYB transcription factors.

In the blue module, Do_S shows specific expression in this module, several hub genes were detected in each important module, with each gene interacting with many other genes. These genes include bHLH, beta-glucosidase, alpha-L-fucosidase, and other genes related to Dendrobium polysaccharide and secondary metabolism. In salmon4 module, Do_S shows specific expression in this module, hub genes include CCR4 (cinnamoyl -CoA reductase), bZIP, and MYB transcription factors were identified. In lightsteelblue1, a large number of genes were annotated to ketohexokinase (KHK). KHK, also known as fructokinase, which is a key enzyme in fructose metabolism. Fructose is a kind of plant polysaccharide, which suggests that this module is related to polysaccharide biosynthesis.

3. Discussion

Traditional biological research focuses on elucidating the effects of individual functional elements (such as DNA, mRNA, and protein) on life activities at the molecular level. Although those methods are of great significance for revealing the genetic mechanisms of specific traits, it can only partially explain the cause of a certain life activities. With the rapid development of sequencing technology, traditional biological research cannot fully and effectively explore the biological significance contained in massive data. As a research method of systems biology, the network is widely used in the exploration of life sciences with the help of data of genome, transcriptome, and metabolome. Compared with other regulatory networks, WGCNA can screen for genes related to specific traits and perform modular classification from large samples to obtain highly biologically significant co-expression modules, which has proven to be an efficient data mining method [35].

WGCNA has been widely used in plants in recent years. In order to obtain key expression modules and key hub genes related to drought resistance in Brassica napus L., WGCNA was used to analyze Brassica napus transcriptome data in multiple samples (48 transcriptome data), the well-watered and droughted networks contained 17 and 20 modules, respectively, suggesting that there are additional expression patterns in the droughted network because of rearrangement of the transcriptome in response to the drought treatment. [36]. In the study of Fragaria L. flowers [37], researchers generated different tissue- and stage-transcriptomic profiling of woodland strawberry (Fragaria vesca) flower development, they discovered a developing receptacle-specific module exhibiting similar molecular features to those of young floral meristems and hub genes of the strawberry homologs of a number of meristem regulators, including LOST MERISTEM and WUSCHEL in the developing receptacle network. [37]. Analysis of the pollen transcriptome of three male sterile lines using weighted gene co-expression network analysis revealed that two modules were significantly associated with male sterility and many hub genes that were differentially expressed in the sterile lines [38]. Farcuh et al. used WGCNA to investigate sugar metabolism during leaf and fruit development of two Japanese plum varieties, and identified 11 key sugar metabolism-related genes, the results showed that sugar metabolism was reprogrammed in a non-climacteric bud mutant of a climacteric plum fruit and showed an increase in sorbitol synthesis [39]. In Ginkgo biloba, a total of 12 gene modules were revealed to be involved in flavonoid metabolism structural genes and transcription factors by constructing co-expression networks, they reveal that some hub genes operate during the biosynthesis by identifying transcription factors (TFs) and structure genes and seven key hub genes were also identified by analyzing the correlation between gene expression level and flavonoids content [40]. Through these studies, it was found that many of the hub genes obtained by WGCNA analysis were indeed very important genes. In order to obtain the hub gene related to the synthesis and regulation of Dendrobium polysaccharides and alkaloids, we also performed WGCNA analysis.

Therefore, we constructed a Dendrobium gene co-expression network using a WGCNA approach and identified co-expression modules using transcriptome data from three kinds of Dendrobium species and three different tissues. Correlation analysis between co-expression modules and two traits (species and tissues) was carried out, and four highly significant active ingredients-related modules (p-value < 0.05) were identified. These modules consist of highly connected functional genes, and different modules appear to be involved in individual functions [41]. Meanwhile, KEGG pathway enrichment analysis of modules associated with polysaccharide and secondary metabolism indicated that these pathways in different Dendrobium species and different tissues are related to each other at the transcriptomic level. In sienna3 module, we found that several hub genes related to polysaccharide biosynthesis, such as KHK (ketohexokinase) which is a key enzyme in fructose metabolism. In the expression trend of this module, the stem has a clear advantage, indicating that the polysaccharide content is indeed concentrated on the Dendrobium stem, which is consistent with our previous determination of the polysaccharide content. Meanwhile, in blue module, we found that not only the hub genes related to polysaccharide, but also related to secondary metabolism. In the expression trend, we found that the stem of Dendrobium officinale to be the highest. This result is consistent with the trend in the determination of polysaccharides and alkaloids. The results also indicated that one component can be regulated by multiple modules, and one module can simultaneously be associated with multiple components.

Module hub genes are generally considered representative of a given module in a biological network. Previous studies reported that MYB-bHLH-WDR (MBW) ternary complexes comprise the essential regulatory machinery for catechin and anthocyanin biosynthesis [42]. In the present study, transcription factors MYB and bHLH were identified as hub genes in modules related to secondary metabolism. In addition, several genes involved in polysaccharide and secondary metabolism biosynthesis (CCR4: cinnamoyl -CoA reductase and KHK: ketohexokinase) were identified in modules. MYB transcription factors play an important role in the regulation of phenylpropane biosynthesis. Phenylpropane synthesis is upstream of the regulation of flavonoid biosynthesis, indicating that MYB is also an important transcription factor for the synthesis of flavonoids. In gentian, both GtMYBP3 and GtMYBP4 can activate the gene expression of flavonol synthesis, and then significantly increase the flavonol content in seedlings [43]. Ginkgo GbMYBF2 inhibits the expression of CHS (chalcone synthase), F3H (flavanone 3-hydroxylase), FLS (flavonol synthase, Flavonol synthase and ANS genes) on the phenylpropane synthesis pathway, thereby reducing the content of flavonoids and anthocyanins [44]. Salvia miltiorrhiza SmMYB39 affects the synthesis of rosmarinic acid by regulating the expression of key enzyme genes of the phenylpropane metabolic pathway [45]. Current studies indicate that plant bHLH transcription factors are involved in regulating various signal transduction and anabolic pathways, such as light signal transduction, hormone synthesis, glandular and root hair development, and stress [46,47,48,49].

In this study, the WGCNA method was used for the first time in Dendrobium, and the modules related to specific tissues and genes related to specific traits were identified. Hub genes were further analyzed to find related genes and predict gene functions [30]. Combining the WGCNA method and RNA-Seq data can be used to better mine the genes and transcription factors related to traits. In strawberry, modules related to tissue specificity such as torus were found in strawberry, and 7 hub genes were identified in torus tissue [37]; in tomato, genes related to vitamin C biosynthesis were found [50]; co-expression modules related to acidity and genes related to anthocyanin synthesis were found in apple [33,51]. In addition, specific modules in other tissues such as roots, leaves, flowers, and fruits at other periods can be excavated to find relevant metabolic processes and important genes and potential transcription factors.

4. Materials and Methods

4.1. Plant Material

Dendrobium plants were cultivated artificially in a greenhouse located in Anhui Tongjisheng Biotechnology Company, Lu’an, China. Seed germination, the growth of protocorm-like bodies protocols and the condition of pots used for planting were described by our previous study [15]. Two-year-old D. huoshanense, D. officinale, and D. moniliforme were selected to provide three replicates of each sample. Stems, leaves, and roots were collected from three different kinds of Dendrobium plants for RNA extraction. Then, after drying, they were used to determine polysaccharide and total alkaloid contents in March, 2017. The same samples were used for transcriptome sequencing and polysaccharide and total alkaloid contents determination.

4.2. Polysaccharide and Total Alkaloid Contents Determination

The method for determining the polysaccharide content is consistent with the Chinese Pharmacopoeia (version 2010), and the details are described in our previous study [15]. The method of total alkaloids has been modified based on the Bush et al. method and has been described in detail in our previous studies [52,53]. A part of data of this part has been published [14,15].

4.3. Identification of Orthologous Genes

Datasets for three kinds of Dendrobium species were obtained from the NCBI Sequence Read Archive (SRA) (https://www.ncbi.nlm.nih.gov/sra) with accessing number SRP122499, SRP139000, and SRP150489. There was a total of 27 samples of Dendrobium that were taken from three different species (Dendrobium huoshanense, Dendrobium officinale, and Dendrobium moniliforme) and three different tissues (stems, leaves, and roots). The data for these samples were submitted by our own laboratory. The OrthoMCL pipeline [54] was used with standard settings to identify potential orthologous genes in Dendrobium, and subsequently aligned using the MUSCLE algorithm [55].

4.4. Differential Gene Expression Analysis

The gene expression levels of all samples were estimated by RSEM version 1.2.15 [56], and the bowtie2 parameter setting mismatch was 0. The R Bioconductor package edgeR [57] was used to identify differentially expressed genes (DEGs) in two samples. DEGs were used for KEGG and GO enrichment analyses, which were performed using the KOBAS version 2.0.12 with default settings and Goatools version 0.5.9 (https://github.com/tanghaibao/Goatools) [58,59].

4.5. WGCNA Analysis

4.5.1. Construction of Weighted Gene Co-Expression Networks and Identification of Modules

We use thresholds for different expression data to calculate the number of genes. The WGCNA algorithm was applied to the evaluation of gene expression. The flashClust toolkit (R language) was used to perform cluster analysis on samples and set appropriate thresholds. After background correction and quantile normalization, the top 50% of genes (8056 genes) with the most variant in the analysis of variance were selected for WGCNA analysis. The WGCNA package in R was used to construct the co-expression network of the 8056 genes after verification [30].

The gradient independence method was used to test the scale independence and average connectivity of different power modules (the power value ranging from 1 to 20). According to the scale-independent conditions, the appropriate power value was 0.6. After the power value determined, the module was constructed by the WGCNA algorithm, and the genetic information corresponding to each module was extracted. For high reliability of the results, the minimum number of genes per module was set to 50.

4.5.2. Interactions Analysis of Co-Expression Modules

The interaction relationship between different co-expression modules was calculated by WGCNA, and the topological overlap matrix (TOM) was constructed using the correlation expression values. The resulting topological overlap is a biologically meaningful measure of gene similarity based on the co-expression relationship between two genes. Using each TOM as input, the flashClust function was used for hierarchical cluster analysis. Then the dynamicTreeCut algorithm (minModuleSize = 30) was used to detect the module as a branch of the dendrogram [30]. Random colors were assigned to the modules, and the first principal component was used to calculate the module characteristic genes of each module. The module characteristic genes can be regarded as the representative of the gene expression pattern in this module, and the module combining highly relevant characteristic genes (merge-CutHeight = 0.07). By calculating the Pearson correlation between the module characteristic genes and the interest characteristics, and using the module characteristic genes to estimate the module-trait relationship, the samples are classified according to the corresponding traits, types, and tissues, and the module with a correlation coefficient ≥ | 0.75 | and p-value ≤ 0.01 was selected for further analysis. Through the heat map of the gene network topological overlap, the structure of co-expression module was visualized. Then through the hierarchical clustering tree diagram of the characteristic genes and the heat map of the corresponding characteristic gene network, the relationship between the modules was summarized.

4.5.3. Functional Enrichment Analysis of the Key Module

We performed GO and KEGG enrichment analysis for key modules with most genes by using the Database for Annotation, Visualization and Integrated Discovery (DAVID, https://david.ncifcrf.gov/summary.jsp) [60] and KEGG Orthology Based Annotation System (KOBAS, https://bio.tools/kobas). Analysis results were extracted out under the condition of p < 0.05 after correction [61,62].

4.5.4. Identification of Hub Genes

The hub gene is usually used as an abbreviation for a highly connected gene, which has a high degree of connectivity in co-expression modules. Module membership (MM) is defined as the correlation between the expression of each gene and its module eigengenes (MEs). Gene significance (GS) is defined as the correlation of each gene to the trait of interest. In the module of interest, the genes with the highest MM and GS were set as candidate genes for further analysis [63]. We chose the intramodular hub genes by external traits based GS > 0.2 and MM > 0.8 with a threshold of p-value < 0.05 [64]. In our study, based on the size of the module, we classified the top 10% of the most connected genes in the module as hub genes. We construct and visualize gene–gene interaction networks using Cytoscape tool [65].

5. Conclusions

Weighted gene co-expression network associated with traits was constructed, and 19 tissue and species-specific modules were obtained. The biological significance of the tissue and species-specific module was revealed, the regulatory genes related to the medicinal components of Dendrobium were identified, and a local biologically significant network was constructed. The results of this study can provide important experimental data and theoretical basis for the further analysis of the genetic mechanism of the medicinal ingredients of Dendrobium.

Supplementary Materials

Supplementary materials can be found at https://www.mdpi.com/1422-0067/21/3/861/s1.

Author Contributions

Conception and design of the research: Y.Y. and X.T.; acquisition of data: B.Z.; statistical analysis: X.T.; drafting the manuscript: Y.Y. and J.Z.; revision of manuscript for important intellectual content: J.L. All authors read and approved the final manuscript.

Funding

This project was funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), National Foundation of Forestry Science and Technology Popularization (Grant No. [2015] 17), Major Fund for Natural Science of Jiangsu Higher Education Institutions (Grant No. 15KJA220004).

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Wood H.P. The Dendrobiums. ARG Gantner Verlag Timber Press; Portland, OR, USA: 2006. [Google Scholar]
  • 2.Zotz G. The systematic distribution of vascular epiphytes—A critical update. Bot. J. Linn. Soc. 2013;171:453–481. doi: 10.1111/boj.12010. [DOI] [Google Scholar]
  • 3.Lian W., Xu J. The chromosome number in Dendrobium I. Ten species. J. Wuhan Bot. Res. 1989;7:112–114. [Google Scholar]
  • 4.Zhitao N., Shuying Z., Jiajia P., Ludan L., Jing S., Xiaoyu D. Comparative analysis of Dendrobium plastomes and utility of plastomic mutational hotspots. Sci. Rep. 2017;7:2073. doi: 10.1038/s41598-017-02252-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Tang H., Zhao T., Sheng Y., Zheng T., Fu L., Zhang Y. Dendrobium officinale kimura et migo: A review on its ethnopharmacology, phytochemistry, pharmacology, and industrialization. Evid. Based Complement. Altern. Med. 2017;2017:19. doi: 10.1155/2017/7436259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chang C.-C., Ku A.F., Tseng Y.-Y., Yang W.-B., Fang J.-M., Wong C.-H. 6, 8-di-c-glycosyl flavonoids from Dendrobium huoshanense. J. Nat. Prod. 2010;73:229–232. doi: 10.1021/np900252f. [DOI] [PubMed] [Google Scholar]
  • 7.Ng T.B., Liu J., Wong J.H., Ye X., Sze S.C.W., Tong Y., Zhang K.Y. Review of research on Dendrobium, a prized folk medicine. Appl. Microbiol. Biotechnol. 2012;93:1795–1803. doi: 10.1007/s00253-011-3829-7. [DOI] [PubMed] [Google Scholar]
  • 8.Jin Q., Jiao C., Sun S., Song C., Cai Y., Lin Y., Fan H., Zhu Y. Metabolic analysis of medicinal Dendrobium officinale and Dendrobium huoshanense during different growth years. PLoS ONE. 2016;11:e0146607. doi: 10.1371/journal.pone.0146607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hsieh Y.S., Chien C., Liao S.K., Liao S.F., Hung W.T., Yang W.B., Lin C.C., Cheng T.J., Chang C.C., Fang J.M., et al. Structure and bioactivity of the polysaccharides in medicinal plant Dendrobium huoshanense. Bioorg. Med. Chem. 2008;16:6054–6068. doi: 10.1016/j.bmc.2008.04.042. [DOI] [PubMed] [Google Scholar]
  • 10.Luo J.P., Deng Y.Y., Zha X.Q. Mechanism of polysaccharides from Dendrobium huoshanense. On streptozotocin-induced diabetic cataract. Pharm. Biol. 2008;46:243–249. doi: 10.1080/13880200701739397. [DOI] [Google Scholar]
  • 11.Kaufmann K., Wellmer F., Muino J.M., Ferrier T., Wuest S.E., Kumar V., Serrano-Mislata A., Madueno F., Krajewski P., Meyerowitz E.M., et al. Orchestration of floral initiation by apetala1. Science. 2010;328:85–89. doi: 10.1126/science.1185244. [DOI] [PubMed] [Google Scholar]
  • 12.Lou Q., Liu Y., Qi Y., Jiao S., Tian F., Jiang L., Wang Y. Transcriptome sequencing and metabolite analysis reveals the role of delphinidin metabolism in flower colour in grape hyacinth. J. Exp. Bot. 2014;65:3157–3164. doi: 10.1093/jxb/eru168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Verdonk J.C., Haring M.A., Tunen A.J.V., Schuurink R.C. Odorant1 regulates fragrance biosynthesis in petunia flowers. Plant. Cell. 2005;17:1612–1624. doi: 10.1105/tpc.104.028837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Yuan Y., Yu M., Jia Z., Song X., Liang Y., Zhang J. Analysis of Dendrobium huoshanense transcriptome unveils putative genes associated with active ingredients synthesis. BMC Genom. 2018;19:978. doi: 10.1186/s12864-018-5305-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Yuan Y., Zhang J., Kallman J., Liu X., Meng M., Lin J. Polysaccharide biosynthetic pathway profiling and putative gene mining of Dendrobium moniliforme using RNA-seq in different tissues. BMC Plant. Biol. 2019;19:521. doi: 10.1186/s12870-019-2138-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Guo X., Li Y., Li C., Luo H., Wang L., Qian J., Luo X., Xiang L., Song J., Sun C., et al. Analysis of the Dendrobium officinale transcriptome reveals putative alkaloid biosynthetic genes and genetic markers. Gene. 2013;527:131–138. doi: 10.1016/j.gene.2013.05.073. [DOI] [PubMed] [Google Scholar]
  • 17.Khaitovich P., Weiss G., Lachmann M., Hellmann I., Enard W., Muetzel B., Wirkner U., Ansorge W., Pääbo S. A neutral model of transcriptome evolution. PloS Biol. 2004;2:e132. doi: 10.1371/journal.pbio.0020132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Barabási A.L., Oltvai Z.N. Network biology: Understanding the cell’s functional organization. Nat. Rev. Genet. 2004;5:101–113. doi: 10.1038/nrg1272. [DOI] [PubMed] [Google Scholar]
  • 19.Carter S.L., Brechbuhler C.M., Griffin M., Bond A.T. Gene co-expression network topology provides a framework for molecular characterization of cellular state. Bioinformatics. 2004;20:2242–2250. doi: 10.1093/bioinformatics/bth234. [DOI] [PubMed] [Google Scholar]
  • 20.Yook S.H., Oltvai Z.N., Barabási A.L. Functional and topological characterization of protein interaction networks. Proteomics. 2004;4:928–942. doi: 10.1002/pmic.200300636. [DOI] [PubMed] [Google Scholar]
  • 21.Langfelder P., Horvath S. Eigengene networks for studying the relationships between co-expression modules. BMC Syst. Biol. 2007;1:54. doi: 10.1186/1752-0509-1-54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Cai B., Li C.H., Huang J. Systematic identification of cell-wall related genes in populus based on analysis of functional modules in co-expression network. PLoS ONE. 2014;9:e95176. doi: 10.1371/journal.pone.0095176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Van Dam S., Cordeiro R., Craig T., van Dam J., Wood S.H., de Magalhães J.P. Genefriends: An online co-expression analysis tool to identify novel gene targets for aging and complex diseases. BMC Genom. 2012;13:535. doi: 10.1186/1471-2164-13-535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Tully J.P., Hill A.E., Ahmed H.M., Whitley R., Skjellum A., Mukhtar M.S. Expression-based network biology identifies immune-related functional modules involved in plant defense. BMC Genom. 2014;15:421. doi: 10.1186/1471-2164-15-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ma S., Gong Q., Bohnert H.J. An Arabidopsis gene network based on the graphical gaussian model. Genome Res. 2007;17:1614–1625. doi: 10.1101/gr.6911207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Oldham M.C., Horvath S.S., Geschwind D.H. Conservation and evolution of gene co-expression networks in human and chimpanzee brains. Proc. Natl. Acad. Sci. USA. 2006;103:17973–17978. doi: 10.1073/pnas.0605938103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Takeshi O., Shinpei H., Masayuki S., Motoshi S., Hiroyuki O., Kengo K. Coxpresdb: A database of coexpressed gene networks in mammals. Nucleic Acids Res. 2007;36:D77–D82. doi: 10.1093/nar/gkm840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lee J.M., Sonnhammer E.L. Genomic gene clustering analysis of pathways in eukaryotes. Genome Res. 2003;13:875–882. doi: 10.1101/gr.737703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Presson A.P., Sobel E.M., Papp J.C., Suarez C.J., Whistler T., Rajeevan M.S., Vernon S.D., Horvath S. Integrated weighted gene co-expression network analysis with an application to chronic fatigue syndrome. BMC Syst. Biol. 2008;2:95. doi: 10.1186/1752-0509-2-95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Langfelder P., Horvath S. Wgcna: An R package for weighted correlation network analysis. BMC Bioinform. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Amrine K.C., Blanco-Ulate B., Cantu D. Discovery of core biotic stress responsive genes in Arabidopsis by weighted gene co-expression network analysis. PLoS ONE. 2015;10:e0118731. doi: 10.1371/journal.pone.0118731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Borges R.M., Bessiere J.M., Ranganathan Y. Diel variation in fig volatiles across syconium development: Making sense of scents. J. Chem. Ecol. 2013;39:630–642. doi: 10.1007/s10886-013-0280-5. [DOI] [PubMed] [Google Scholar]
  • 33.El-Sharkawy I., Liang D., Xu K. Transcriptome analysis of an apple (Malus × Domestica) yellow fruit somatic mutation identifies a gene network module highly associated with anthocyanin and epigenetic regulation. J. Exp. Bot. 2015;66:7359–7376. doi: 10.1093/jxb/erv433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Smita S., Katiyar A., Pandey D.M., Chinnusamy V., Bansal K.C. Identification of conserved drought stress responsive gene-network across tissues and developmental stages in rice. Bioinformation. 2013;9:72–78. doi: 10.6026/97320630009072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhao W., Langfelder P., Fuller T., Dong J., Li A., Hovarth S. Weighted gene coexpression network analysis: State of the art. J. Biopharm. Stat. 2010;20:281–300. doi: 10.1080/10543400903572753. [DOI] [PubMed] [Google Scholar]
  • 36.Greenham K., Guadagno C.R., Gehan M.A., Mockler T.C., Weinig C., Ewers B.E., McClung C.R. Temporal network analysis identifies early physiological and transcriptomic indicators of mild drought in Brassica rapa. Elife. 2017;6:e29655. doi: 10.7554/eLife.29655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hollender C.A., Kang C., Darwish O., Geretz A., Matthews B.F., Slovin J., Alkharouf N., Liu Z. Floral transcriptomes in woodland strawberry uncover developing receptacle and anther gene networks. Plant. Physiol. 2014;165:1062–1075. doi: 10.1104/pp.114.237529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hu J., Chen G., Zhang H., Qian Q., Ding Y. Comparative transcript profiling of alloplasmic male-sterile lines revealed altered gene expression related to pollen development in rice (Oryza sativa l.) BMC Plant. Biol. 2016;16:175. doi: 10.1186/s12870-016-0864-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Farcuh M., Li B., Rivero R.M., Shlizerman L., Sadka A., Blumwald E. Sugar metabolism reprogramming in a non-climacteric bud mutant of a climacteric plum fruit during development on the tree. J. Exp. Bot. 2017;68:5813–5828. doi: 10.1093/jxb/erx391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ye J., Cheng S., Zhou X., Chen Z., Kim S.U., Tan J., Zheng J., Xu F., Zhang W., Liao Y., et al. A global survey of full-length transcriptome of Ginkgo biloba reveals transcript variants involved in flavonoid biosynthesis. Ind. Crop. Prod. 2019;139:11547. doi: 10.1016/j.indcrop.2019.111547. [DOI] [Google Scholar]
  • 41.Oldham M.C., Konopka G., Iwamoto K., Langfelder P., Kato T., Horvath S., Geschwind D.H. Functional organization of the transcriptome in human brain. Nat. Neurosci. 2008;11:1271. doi: 10.1038/nn.2207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Xu W., Dubos C., Lepiniec L. Transcriptional control of flavonoid biosynthesis by MYB–bHLH–WDR complexes. Trends Plant. Sci. 2015;20:176–185. doi: 10.1016/j.tplants.2014.12.001. [DOI] [PubMed] [Google Scholar]
  • 43.Nakatsuka T., Saito M., Yamada E., Fujita K., Kakizaki Y., Nishihara M. Isolation and characterization of gtmybp3 and gtmybp4, orthologues of R2R3-MYB transcription factors that regulate early flavonoid biosynthesis, in gentian flowers. J. Exp. Bot. 2012;63:6505–6517. doi: 10.1093/jxb/ers306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Xu F., Ning Y., Zhang W., Liao Y., Li L., Cheng H., Cheng S. An R2R3-MYB transcription factor as a negative regulator of the flavonoid biosynthesis pathway in ginkgo biloba. Funct. Integr. Genom. 2014;14:177–189. doi: 10.1007/s10142-013-0352-1. [DOI] [PubMed] [Google Scholar]
  • 45.Zhang S., Ma P., Yang D., Li W., Liang Z., Liu Y., Liu F. Cloning and characterization of a putative R2R3 MYB transcriptional repressor of the rosmarinic acid biosynthetic pathway from salvia miltiorrhiza. PLoS ONE. 2013;8:e73259. doi: 10.1371/journal.pone.0073259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Qi T., Huang H., Wu D., Yan J., Qi Y., Song S., Xie D. Arabidopsis della and jaz proteins bind the WD-repeat/bHLH/MYB complex to modulate gibberellin and jasmonate signaling synergy. Plant. Cell. 2014;26:1118–1133. doi: 10.1105/tpc.113.121731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Nakata M., Mitsuda N., Herde M., Koo A.J., Moreno J.E., Suzuki K., Howe G.A., Ohme-Takagi M. A bhlh-type transcription factor, ABA-inducible bhlh-type transcription factor/ja-associated myc2-like1, acts as a repressor to negatively regulate jasmonate signaling in Arabidopsis. Plant. Cell. 2013;25:1641–1656. doi: 10.1105/tpc.113.111112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Fan M., Bai M.-Y., Kim J.-G., Wang T., Oh E., Chen L., Park C.H., Son S.-H., Kim S.-K., Mudgett M.B. The bHLH transcription factor hbi1 mediates the trade-off between growth and pathogen-associated molecular pattern–triggered immunity in arabidopsis. Plant. Cell. 2014;26:828–841. doi: 10.1105/tpc.113.121111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Liu W., Tai H., Li S., Gao W., Zhao M., Xie C., Li W.X. B hlh 122 is important for drought and osmotic stress resistance in Arabidopsis and in the repression of aba catabolism. N. Phytol. 2014;201:1192–1204. doi: 10.1111/nph.12607. [DOI] [PubMed] [Google Scholar]
  • 50.Gao C., Ju Z., Li S., Zuo J., Fu D., Tian H., Luo Y., Zhu B. Deciphering ascorbic acid regulatory pathways in ripening tomato fruit using a weighted gene correlation network analysis approach. J. Integr. Plant. Biol. 2013;55:1080–1091. doi: 10.1111/jipb.12079. [DOI] [PubMed] [Google Scholar]
  • 51.Bai Y., Dougherty L., Cheng L., Zhong G.-Y., Xu K. Uncovering co-expression gene network modules regulating fruit acidity in diverse apples. BMC Genom. 2015;16:612. doi: 10.1186/s12864-015-1816-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Bush L.P., Wilkinson H.H., Schardl C.L. Bioprotective alkaloids of grass-fungal endophyte symbioses. Plant. Physiol. 1997;114:1–7. doi: 10.1104/pp.114.1.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Yuan Y., Yu M., Zhang B., Liu X., Zhang J. Comparative nutritional characteristics of the three major chinese dendrobium species with different growth years. PLoS ONE. 2019;14:e0222666. doi: 10.1371/journal.pone.0222666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Li L., Stoeckert C.J., Roos D.S. Orthomcl: Identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13:2178–2189. doi: 10.1101/gr.1224503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Edgar R.C. Muscle: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Dewey C.N., Li B. Rsem: Accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinform. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Robinson M.D., Mccarthy D.J., Smyth G.K. Edger: A bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Tang H., Klopfenstein D., Pedersen B., Flick P., Sato K., Ramirez F., Yunes J., Mungall C. Goatools: Tools for gene ontology. Zenodo. 2015 doi: 10.5281/zenodo31628. [DOI] [Google Scholar]
  • 59.Mao X., Cai T., Olyarchuk J.G., Wei L. Automated genome annotation and pathway identification using the KEGG orthology (ko) as a controlled vocabulary. Bioinformatics. 2005;21:3787. doi: 10.1093/bioinformatics/bti430. [DOI] [PubMed] [Google Scholar]
  • 60.Dennis G., Sherman B.T., Hosack D.A., Yang J., Gao W., Lane H.C., Lempicki R.A. David: Database for annotation, visualization, and integrated discovery. Genome Biol. 2003;4:R60. doi: 10.1186/gb-2003-4-9-r60. [DOI] [PubMed] [Google Scholar]
  • 61.Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T. Gene ontology: Tool for the unification of biology. Nat. Genet. 2000;25:25. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Xie C., Mao X., Huang J., Ding Y., Wu J., Dong S., Kong L., Gao G., Li C.-Y., Wei L. Kobas 2.0: A web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 2011;39:W316–W322. doi: 10.1093/nar/gkr483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Fuller T.F., Ghazalpour A., Aten J.E., Drake T.A., Lusis A.J., Horvath S. Weighted gene coexpression network analysis strategies applied to mouse weight. Mamm. Genome. 2007;18:463–472. doi: 10.1007/s00335-007-9043-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Horvath S., Dong J. Geometric interpretation of gene coexpression network analysis. PLoS Comput. Biol. 2008;4:e1000117. doi: 10.1371/journal.pcbi.1000117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from International Journal of Molecular Sciences are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES