Abstract
Intricate signal networks and transcriptional regulators translate the recognition of pathogens into defense responses. In this study, we carried out a gene co-expression analysis of all currently publicly available microarray data, which were generated in experiments that studied the interaction of the model plant Arabidopsis thaliana with microbial pathogens. This work was conducted to identify (i) modules of functionally related co-expressed genes that are differentially expressed in response to multiple biotic stresses, and (ii) hub genes that may function as core regulators of disease responses. Using Weighted Gene Co-expression Network Analysis (WGCNA) we constructed an undirected network leveraging a rich curated expression dataset comprising 272 microarrays that involved microbial infections of Arabidopsis plants with a wide array of fungal and bacterial pathogens with biotrophic, hemibiotrophic, and necrotrophic lifestyles. WGCNA produced a network with scale-free and small-world properties composed of 205 distinct clusters of co-expressed genes. Modules of functionally related co-expressed genes that are differentially regulated in response to multiple pathogens were identified by integrating differential gene expression testing with functional enrichment analyses of gene ontology terms, known disease associated genes, transcriptional regulators, and cis-regulatory elements. The significance of functional enrichments was validated by comparisons with randomly generated networks. Network topology was then analyzed to identify intra- and inter-modular gene hubs. Based on high connectivity, and centrality in meta-modules that are clearly enriched in defense responses, we propose a list of 66 target genes for reverse genetic experiments to further dissect the Arabidopsis immune system. Our results show that statistical-based data trimming prior to network analysis allows the integration of expression datasets generated by different groups, under different experimental conditions and biological systems, into a functionally meaningful co-expression network.
Introduction
Plants have evolved complex sensing, signaling, and defense mechanisms to cope with a broad range of pathogens [1]. Although some plant defense strategies are specific to the type of invading organism (e.g., fungi, bacteria) and its pathogenic lifestyle (i.e., biotrophic, necrotrophic, or hemibiotrophic), other responses are thought to be common to diverse biotic stresses [1,2]. Core responses to pathogens may result from cross-talk between hormone-related pathways [3,4], such as ethylene (ET), jasmonic acid (JA), salicylic acid (SA) and abscisic acid (ABA), and/or from signaling mediated by calcium [5,6], reactive oxygen species (ROS; [7]), and phosphorylation cascades [8]. Transcription factors convey these signaling cues and activate or repress the expression of genes involved in immune responses and metabolic processes.
Because most plant immune responses are under transcriptional control, transcriptome profiling approaches, including hybridization based microarray and RNA-seq, are effective tools to monitor at the genome-scale the activation or suppression of specific regulatory and metabolic pathways during the interactions between plants and microorganisms [9–11]. The expression profiles of individual genes can be integrated into co-expression networks where genes are clustered as a function of pairwise gene expression correlations [12]. Gene co-expression analyses have been conducted to reconstruct regulatory pathways [13], discover novel candidate genes [14,15], and identify key modulators of immune responses [16]. Genes belonging to the same co-expression sub-network (or module) are likely to be functionally related [17–19], participate in similar biological processes, or be part of the same pathway [13].
Gene co-expression networks have been shown to have small-world and scale-free properties [13,20,21], which are common topological properties of many other biological networks, including protein-protein interaction and metabolic networks [22]. In a scale-free network most nodes have few interactions, while few nodes, the hubs, are highly interconnected. In comparison to scale-free networks, random networks have a uniform number of edges per node [22]. A critical property of scale-free networks is that random perturbations do not alter the overall stability of the system, whereas perturbations to the most highly connected hubs are severely destabilizing [23]. The centrality to network architecture and high degree of connectivity of molecular hubs, including those in co-expression networks, tend to be associated with essential roles in biological processes [24–26]. Co-expression network hubs were also shown to evolve more slowly than genes with fewer co-expression partners, suggesting that changes in sequence or expression level of hub genes can be deleterious, which further supports their functional centrality [20].
In this study, we constructed an undirected weighted gene co-expression network leveraging a rich curated expression dataset comprising 32 publicly available microarray experiments that involved infections of Arabidopsis plants with a wide array of single-cell eukaryotic and bacterial pathogens with different parasitic lifestyles. We annotated the network components to identify modules enriched in genes known to be associated with disease responses. We applied a permutation analysis to determine whether the observed functional enrichments were significant. Network topology was then analyzed to identify intra- and inter-modular hub genes, which are potentially key components of the core responses that Arabidopsis activates when challenged by multiple and diverse biotic stresses.
Results and Discussion
Data collection and processing
Fig. 1 provides a schematic workflow of the analysis, from data collection and processing, network construction, to module identification and characterization. All publicly available microarray experiments involving wild-type Arabidopsis plants challenged with biotic agents were collected generating a comprehensive dataset that included multiple infectious agents (Table 1; S1 Table). Additional variation across experiments included different Arabidopsis accessions, infected tissues, and developmental stages (S1 Table). Hierarchical clustering of the 167 infected microarrays based on gene expression fold-changes revealed strong similarities between biological replicates, with no other experimental condition driving sample clustering across experiments (S1 Fig.).
Table 1. Summary of experiments included in this study.
Number of microarrays 1 | Infection type 2 | Infectious agents 3 |
---|---|---|
70 | BB | Agrobacterium tumefaciens (21), Pseudomonas syringae (49), |
7 | BH | Xanthomonas campestris |
4 | BN | Ralstonia solanacearum |
32 | FB | Blumeria graminis (8), Glovinomyces cichoracearum (16), Fusarium oxysporum (4), Golovinomyces orontii (4), |
21 | FN | Alternaria brassicicola (3), Botrytis cinerea (6), Rhizoctonia solani (6), Sclerotinia sclerotiorum (6) |
27 | OB | Hyaloperonospora arabidopsidis (14), Hyaloperonospora parasitica (5), Phytophthora parasitica (8), |
6 | PR | Plasmodiophora brassicae |
1 Number of ATH1 arrays that were hybridized with cDNA from pathogen-infected samples
2 BB: bacterial biotrophs; BH: bacterial hemibiotrophs; BN: bacterial necrotrophs; FB: fungal biotrophs; FN: fungal necrotrophs; OB: oomycete biotrophs; PR: protist.
3 Values in parenthesis correspond to the number of arrays for each pathogen.
To identify genes responsive to pathogen infections, significant differential expression (DE) was tested for each probeset within each experiment as described in the Materials and Methods section. DE testing was applied to all experimental conditions that had at least three biological replicates of the healthy control and infected samples. The distribution of fold-changes corresponding to significant DE probesets was used to identify genes potentially differentially expressed in those experiments with less than three biological replicates, while the 4,745 probesets that did not display significant fold-changes in any of the experiments were excluded from the final dataset (Fig. 1). These excluded probesets were (i) housekeeping genes, (ii) genes that are not differentially expressed in response to any of the pathogens, and (iii) genes with large variation in expression between biological replicates, which may have masked differential regulation. To remove potential noise from the dataset, all comparisons that were not called significant or outside the fold-change thresholds were converted to a log2 fold-change value of zero. This approach ensured that weak fluctuations of expression were more likely to be true biological signal instead of measurement error or bias uncorrected by gcrma normalization.
Construction of the co-expression network of pathogen responsive genes
A gene expression correlation network was constructed with the Weighted Gene Co-expression Network Analysis (WGCNA) method implemented in the WGCNA R package [27]. WGCNA identifies modules composed of genes that are connected based on the topological overlap mapping metric (TOM), a neighborhood proximity measurement that quantifies the degree of shared network neighbors [12]. Unlike unweighted network approaches where gene connections are dichotomized based on a priori selected correlation cutoff, weighted networks do not depend on a hard-threshold. Given the complexity of the multi-infection dataset, using a hard threshold would likely result in loss of information and sensitivity [28]. Therefore, a soft-thresholding power of 5 with a scale-free model fitting index R 2 > 0.688 (S2 Fig.) was applied to (i) maximize scale-free topology, while (ii) maintaining a high mean number of connections and (iii) eliminating small correlations.
As a result, an undirected weighted network with scale-free topology composed of 205 modules of Arabidopsis genes with correlated expression during pathogen infections was obtained (Fig. 2a; S2 Table). WGCNA assigned to each module a unique color label that was used as specific module identifier in the analyses described below [27]. The modules were composed on average of 87.7 genes (median gene number per module = 44), with the largest module (‘turquoise’) containing 802 genes. Four probesets were not grouped in any of the 205 modules and were added to the ‘grey’ (or improper) module.
To allow additional graph-theoretical calculations, including the identification of gene hubs (see below), the weighted network was converted into an unweighted network preserving all connections with TOM > 0.1 [29]. The unweighted network displayed a hierarchical and modular structure as suggested by the empirical distribution of the probability (P(k)) of finding nodes containing k edges (S3 Fig.). The properties of the unweighted co-expression network included a mean path length (distance between any two nodes) of 3.8 and mean clustering coefficient of 0.80, indicating that the network also had small-world characteristics [22]. The most highly connected nodes (top 5%) from both weighted and unweighted (TOM > 0.1) networks corresponded to members of multiple modules (46 and 38, respectively).
Computing correlations between modules indicated the presence of a higher-order structure in the unweighted co-expression network. Inter-modular correlations were determined by correlation analyses of the module eigengenes (i.e., the first right-singular vector of TOM values of a given module), which have been shown to be robust indicators of entire module trends [25]. One hundred fifty-two modules (75%) were merged into 30 meta-modules based on pairwise eigengene adjacency (a eigen I,J> 0.75); see Materials and Methods; Fig. 2b). The meta-module sizes ranged from 46 genes (2 modules) to 2,460 genes (16 modules; median meta-module size: 170.5 genes, 3 modules). Some meta-modules of interest include the TOOL (‘thistle1’, ‘orangered’, ‘orangered4’, ‘lightpink2’), LTDL (‘lightgreen’, ‘tomato’, ‘darkseagreen’, ‘lightcoral’), LT (‘lavenderblush3’, ‘thistle3’), and PB (‘plum’, ‘bisque4’) meta-modules, all enriched for disease response functions (Fig. 2b, c).
Evaluation of module significance by gene ontology term enrichment
Enrichment analyses of gene ontology (GO) terms within modules and meta-modules were conducted [30] to provide a biological interpretation of the constructed gene network. One hundred and fifty-one modules were enriched (P-value < 0.01) in GO terms related to specific Biological Processes (BP), while 82 modules showed over-representation (P-value < 0.01) of Molecular Function GO terms (MF; Fig. 2a and S3 Table). To validate the significance of the observed GO term enrichments, the same analysis was carried out in 100 randomly generated networks (Fig. 3). The permutation was conducted with random assignment of probesets to modules of the same size as in the WGCNA-created network. For both BP (Fig. 3a) and MF (Fig. 3b) GO terms, the number of enriched modules in the “real” network was significantly higher (BP: P-value = 0; MF: P-value = 0) than in the random networks. In addition, a significantly higher number of enriched GO terms (BP: P-value = 0; MF: P-value = 0) was found in the “real” network when compared to the random networks.
The modules with the largest number of enriched terms included enriched GO categories associated with biotic stress responses, signaling, and transcriptional regulation (S3 Table). The ‘orangered4’ module, composed of 122 genes and a member of the TOOL meta-module, had an over-representation of BP terms related to immune and defense responses (e.g., GO:0002376, GO:0045087. GO:0006955, GO:0006952, GO:0031347, and GO:0030968). This module also displayed a high number of enriched MF categories (38 terms) and most of them were related to signaling and stress responses, such as “protein kinase activity” (GO:0004672; GO:0004674), “phosphotransferase activity” (GO:0016773) and “nucleotide binding” (GO:0032559; GO:0030554; GO:0017076). Significant differential expression was detected in 60% of the 20,374 comparisons between control and infected samples (122 probesets x 167 experiments) that compose the ‘orangered4’ module, with (77.6%) of the fold changes significantly up-regulated by both biotrophs and necrotrophs (S4 Fig.). These results suggest that members of the ‘orangered4’ module are part of a general biotic stress response that Arabidopsis activates when challenged by different types of pathogens.
The ‘thistle1’ module, another member of the TOOL meta-module, also included genes associated with plant responses to biotic stress, such as “responses to chitin” (GO:0010200), a pathogen-associated molecular pattern. This module was also enriched in MF terms related to transcription factor activities (GO:0003700 and GO:0001071). Nineteen percent of the members of ‘thistle1’ were significantly up-regulated in all comparisons between healthy and pathogen infected samples; these included genes expected to be involved in signaling and responses to pathogen infections, such as WRKY and WRKY-associated transcription factors (At4g23810, probeset 254231_at, and At1g80840, probeset 261892_at), calcium binding-proteins [5] (At5g54490, probeset 248164_at; At2g41100, probeset 267083_at), a FAD-binding Berberine family protein (At4g20830, probeset 254432_at), and JA biosynthesis and signaling (At1g72520, probeset 260399_at; At1g19180, probeset 256017_at). Members of ‘thistle1’ and ‘orangered4’ were also found co-expressed in another Arabidopsis disease response co-expression study using different methods, further supporting the TOOL meta-module creation, and its enrichment for disease response activity [16].
Not all modules were enriched in GO terms directly associated with responses to disease, suggesting that transcriptional reprogramming in response to pathogens involves other metabolic processes besides defense reactions. For example, the ‘darkgreen’ module displayed the largest number of enriched BP terms (215 terms at P < 0.001), with the most significant ones related to photosynthesis and growth-related processes (GO:0015979, GO:0019684 and GO:006091). The only MF term that was significantly enriched in the ‘darkgreen’ module was the tetrapyrrole binding activity (GO: 0046906), further suggesting that this module is involved in photosynthesis. About half of the comparisons (45.4% of 32,899) in the ‘darkgreen’ module show DE in response to biotic stresses and most of them (74%) down-regulated (S4 Fig.). Among the ‘darkgreen’ genes, At5g64040 (probeset 247320_at) that encodes a subunit of photosystem I reaction center displayed the greatest down-regulation across all infections analyzed (median log2 fold change: -0.86; S2 Table). Global down-regulation of photosynthetic genes is considered a common plant response to pathogen and insect attack [31–35]. Inhibition of the photosynthetic activity causes a switch from primary metabolism towards non-assimilatory (e.g., carbon-consuming) pathways, which can enhance the production of anti-microbial metabolites and the expression of defense-related genes [36].
Identification of modules enriched for known pathogen responsive genes
To further explore the functional enrichment in the 205 co-expression modules, genes involved in plant hormone networks were identified based on the annotations of the Arabidopsis Hormone Database (AHD). The AHD includes 1,318 genes encoding proteins involved in the biosynthesis, modification, and signaling of plant hormones; most of which are experimentally supported by forward and/or reverse genetic evidence [37]. An emphasis on ET, SA, JA and ABA pathways was given in this study, as they play crucial roles during plant-microbe interactions [38,39]. Among the 18,065 probesets that compose the network, 133, 104, 132 and 314 probesets were associated with ET, SA, JA, and ABA pathways, respectively [40]. A total of 179 DE probesets were assigned to multiple signaling pathways by AHD annotations [37]. Hormone enrichment was tested with the classic Fisher’s exact test. Modules significantly enriched (P-value < 0.01) in ET (4 modules), SA (5 modules), JA (5 modules), and ABA (4 modules) associated genes were identified (S4 Table). The ‘lightgreen’ and ‘palevioletred2’ modules were enriched in genes from three hormone pathways including ET, SA and JA (Fig. 2b). Importantly, all the modules enriched in ET, SA and/or JA hormone-related genes were also enriched in other genes with GO terms associated with responses to pathogens further supporting the validity of the constructed network. Five of these enriched modules were part of four meta-modules of co-expressed genes (i.e., TOOL, LTDL, LT, and PB; Fig. 2b, c). For example, ‘orangered4’ and ‘thistle1’ had an over-representation of hormone-associated genes (Fig. 2c; S4 Table) and in GO terms related to signaling and disease responses (see previous section). Not all the members of the modules enriched in hormone-related genes have AHD annotations; however some genes could potentially be part of the hormone networks. Functional coherence between co-expressed genes can be exploited to assign putative functions to the genes inside these modules that are still reported as coding proteins with “unknown function”, such as those provided in S5 Table.
Besides hormone-related functions, module enrichment in other well-characterized responses to pathogen infection were also evaluated, such as those involving genes coding for pathogenesis-related (PR) proteins [41]. PR proteins have diverse functions in plant defense, including signaling (e.g., PR-1), hydrolysis of fungal cell walls (e.g., PR-2 and PR-4) and contact toxicity (e.g., PR-5 [42]). Remarkably, PR-1 (At2g14610, probeset 266385_at), PR-2 (At3g57260, probeset 251625_at), and PR-5 (At1g75040, probeset 259925_at) were tightly co-expressed and belonged to the ‘coral4’ module, which was enriched in 67 BP terms, including “immune system process” (GO:0002376). ‘Coral4’ formed a clade in the eigengene dendrogram (Fig. 2b) with the PB meta-module and the ‘thistle2’ and ‘green1’ modules. PR-4 (At3g04720, probeset 258791_at) was part of the ‘darkviolet’ module, which was enriched in ET pathway-associated genes (Fig. 2b, S3 Table). A more extensive list of experimentally verified disease response-related genes, and the modules they are associated with, are available in S6 Table.
Modules were also analyzed for enrichment in specific transcription factor (TF) families. TF families that control the expression of defense-related genes, hormone-dependent or independent, include, among others, the ethylene-responsive-element-binding factor (ERF), basic-domain leucine-zipper (bZIP), WRKY, and MYB proteins [43]. The ‘black’ module was enriched (P-value < 0.01) for TF families, particularly APETALA2 (AP2)/Ethylene-Responsive Element Binding Proteins (EREBP), Homeobox, MADS, MYB, NAC, and bZIP (S4 Table). Most of these genes displayed small fold-changes in expression as a consequence of pathogen infection (S2 Table). Of the 25.7% differentially regulated points in the ‘black’ module, 53% were down-regulated in response to pathogens (S4 Fig.). Given that transcription factors generally have complex relationships including interactions with themselves, the genes they regulate, and other transcription factors (e.g., auto-regulation, feed-forward loops, multi-component loops, regulator chains, single input chains, and multiple input chains; [44,45]), it is likely that direct correlations in co-expression networks cannot be used to identify transcription factor relationships [46].
A similar approach was used to determine if the identified modules were significantly enriched in particular cis-regulatory elements using annotations from the Arabidopsis cis-regulatory element database (AtcisDB; [47,48]). The meta-module LTDL, characterized by a significant overrepresentation of genes associated with the ET and SA pathways (Fig. 2b, c), was also enriched in motifs known to be under hormone regulation. These included the ET/JA-responsive motifs, G-box (99 genes; P = 0.012) and GCC-box (55 genes; P = 0.02; [49,50]). In addition, W-box motifs (264 genes; P = 0.02), which are regulated by SA-induced WRKY TFs, were present in this meta-module [51–53]. The W-box motif is present in the promoter of a large variety of genes involved in immune responses [54–56], even several WRKY factors themselves contain this cis-regulatory element [57]. An enrichment in W-box motifs was also found in the LT meta-module (98 genes, P = 0.02), which contains ‘thistle3’, a module enriched for SA-associated genes (see previous section). The ABA-responsive elements, ATHB1 (9 genes, P = 0.03), and DPBF1&2 binding site motifs (94 genes, P = 0.01), were significantly enriched in the LT meta-module as well. Both motifs were shown to be present in genes with functions in stress tolerance [58].
Network topology of the four disease responsive and hormone-associated gene enriched meta-modules
To determine the local co-expression network topology, and to identify the main hub genes within the four hormone-enriched and disease-related meta-modules, the connectivity of each meta-module was evaluated (Table 2). The TOOL meta-module displayed a non-random network topology (Table 2) with most (92%) of the top 5% of connected genes up-regulated by both biotrophic and necrotrophic infections (Fig. 4, S7 Table). Fifty-six percent of the probesets in the TOOL meta-module were strongly connected (TOM > 0.1). These connected nodes consisted of 94 ‘orangered4’, 37 ‘thistle1’, 22 ‘orangered’, and 1 ‘lightpink2’ probesets. The three most connected genes in the TOOL meta-module corresponded to (i) At5g13190 (probeset 250289_at, ‘orangered4’ module), which encodes a LPS-Induced Tumor necrosis Alpha Factor (LITAF) domain protein shown to negatively regulate programmed cell death [59], (ii) At2g11520 (probeset 263274_at, ‘orangered4’ module) encoding a cytoplasmic calmodulin-binding receptor-like kinase [60], and (iii) At1g57630 (probeset 246405_at, ‘orangered4’ module) encoding a transmembrane Toll-Interleukin Receptor (TIR) domain protein [61]. The transmembrane pathogen receptor At1g57630 was not only one of the most highly connected nodes in the TOOL meta-module, but was also up-regulated by most of the experiments (75%) and displayed the highest median fold-change across experiments (log2 fold-change: 2.44).
Table 2. Summary of network properties of the TOOL, LTDL, LT, and PB meta-modules.
Module | TOOL | LTDL | LT | PB |
---|---|---|---|---|
Module size 1 | 277 (155) | 345 (145) | 167 (111) | 156 (59) |
Density | 0.07 | 0.04 | 0.07 | 0.06 |
Centralization | 0.08 | 0.08 | 0.09 | 0.08 |
Heterogeneity | 0.50 | 0.80 | 0.42 | 0.60 |
Mean clustering coefficient | 0.12 | 0.10 | 0.10 | 0.11 |
Mean connectivity | 19.50 | 13.82 | 10.95 | 8.73 |
Factorizability | 0.70 | 0.76 | 0.67 | 0.70 |
1 Numbers in parenthesis correspond to the number of nodes with at least one connection with TOM > 0.1.
The LTDL meta-module also displayed non-random network topology (Table 2) with the top 5% of connected (81.2%) genes up-regulated in response to biotrophic and necrotrophic pathogens (S5 Fig. a). Most interconnected nodes were members of the ‘lightgreen’ (92), ‘lightcoral’ (35), ‘darkseagreen’ (15), and ‘tomato’ (2) modules. The three most connected nodes were (i) At1g61370 (probeset 264756_at, ‘lightgreen’ module) encoding a S-locus lectin protein kinase that, (ii) At3g55470 (probeset 251790_at, ‘lightgreen’ module) coding for a calcium-dependent lipid-binding (CaLB domain) family protein, and (iii) At3g25250 (probeset 257840_at, ‘lightgreen’ module) encoding the oxidative signal-inducible kinase 1 (OXI1); all of these genes have been associated with defense signaling [62–66]. In particular, OXI1 is essential in the signal transduction pathway linking ROS signaling and downstream responses to biotic and abiotic cues [66].
In the LT meta-module (Table 2; S5 Fig. b), the three most connected nodes were At5g58730 (probeset 247788_at; ‘lightsteelblue1’ module) coding for a myo-inositol kinase and two genes encoding proteins involved in defense responses by callose deposition, At4g12120 (probeset 254857_at, ‘lightsteelblue1’ module) and At1g15430 (probeset 262571_at, ‘lightsteelblue1’ module)]. Most top hub nodes in the PB meta-module (Table 2; S5 Fig. c) did not have functions obviously linked to pathogen responses, but included a gene coding for the UDP-dependent glycosyltransferase UGT76B1 (At3g11340, probeset 256252_at, ‘plum’ module), which was shown to be involved in resistance against P. syringae, a biotrophic bacterium, and susceptibility to A. brassicicola, a necrotrophic fungus [67].
Identification of hub genes with putative roles in responses to biotic stress
Unweighted node connectivity information was used to identify hub genes within the four defense-related meta-modules, LTDL, PB, TOOL, and LT (S4 and S7 Tables). Nodes were ranked by their connectivity and 46 putative hub genes corresponding to the top 5% of the most connected nodes within each meta-module were selected for further analysis (S7 Table). Calculations of betweenness centrality (a measure of how important a node is for joining other connections in the network) confirmed that these hub genes had the highest overall intra-modular centrality (S7 Table) [68]. Of these 46 hub genes, 37%, 17.4%, 28.2%, 17.4% were members of the LTDL, PB, TOOL, and LT meta-modules, respectively. These hub genes were highly inter-connected (TOM > 0.1) to at least one other hub gene from the same or a different meta-module (Fig. 5a). High interconnectivity between the hub genes implies that the processes they are involved in are potentially co-regulated. The most interconnected hub gene with 41 strong connections (TOM > 0.1) was At1g71100 (probeset 259749_at; ‘lightgreen’ module), a gene encoding a putative ribose 5-phosphate isomerase (RSW10 [69]).
Further exploiting the property of high inter-connectivity between the hub genes, the co-expression network was mined for genes that were connected to more than 75% (TOM > 0.1) of the hub genes from the four meta-modules analyzed. Twenty additional hub genes from six other modules were identified; the majority belonged to the ‘midnightblue’ module (55%), followed by ‘blue’ (15%), ‘green’ (10%), ‘lightcyan’ (10%), ‘cyan’ (5%) and ‘pink’ (5%) modules (Fig. 5a).
In the final set of 66 hub genes, we found a significant enrichment in responsive functions to biotic stress (Fig. 5b, S8 Table). The top enriched BP terms were associated with immune system processes and defense responses (GO:0002376 and GO:0006952), and their regulation (GO:0002682). Remarkably, most hub genes (80.3%) were up-regulated by all types of pathogens (Fig. 5c, S7 Table), which suggests that they may represent core components of Arabidopsis responses to biotic stress. The proteins encoded by the top 15 highest expressed genes (among all types of pathogen infection; S7 Table) included: (i) the WRKY15 TF (AT5G13080, probeset 245976_at, ‘lightgreen’ module), which was shown to be involved in Arabidopsis resistance to necrotrophic and biotrophic pathogens [70,71], (ii) UGT76B1 (mentioned in the previous section), a known modulator of SA/JA-dependent pathways [67], (iii) the calmodulin-like protein CML37 (At5g42380, probeset 249197_at, ‘lightgreen’ module’), which is induced in response to abiotic and biotic stresses [72], and (iv) the MYB15 TF (At3g23250, probeset 257919_at, ‘lightgreen’ module) that is involved in ABA-mediated stress responses [73]. A smaller fraction of hub genes was down-regulated (19.7%) by both biotrophs and necrotrophs (Fig. 5c, S7 Table). Most of the down-regulated genes (75%) corresponded to genes involved in larger system-wide processes including photosynthesis and cell cycle control, such as the genes coding for the high chlorophyll fluorescence protein HCF243 (At3G15095, probeset 257236_at, ‘pink’ module) and the ribonucleotide reductase RNR1 (At5G02250, probeset 251040_at, ‘lightsteelblue1’ module) (S7 Table).
Conclusions
In this study we applied a novel approach to describe the complexity of plant responses to biotic stresses at the transcriptional level by taking advantage of the large amount of expression data available for the model plant Arabidopsis. We show that statistical-based data trimming prior to weighted network analysis can be used to integrate expression datasets generated using different experimental approaches. This allowed us to identify modules of functionally related co-expressed genes whose transcriptional regulations are responsive to multiple pathogen infections. We show that functional enrichments in the identified modules are significantly greater than in randomly generated networks. We identified hub genes that can serve as candidates for reverse genetic studies because of their high degree of connectivity within and across meta-modules enriched for immune response functions. Co-expression associations reported here can be further exploited to infer potential functions for those genes coding for proteins with unknown activities.
Materials and Methods
Metadata collection and curation
Microarray data were obtained from ArrayExpress and imported into R (www.r-project.org/) in September 2013 using the ArrayExpress data loading package v. 1.20.0 [74]. All datasets used in this study were generated using the Affymetrix Arabidopsis ATH1 Genome Array (ATH1) platform (GEO accession number GPL198), which contains more than 22,500 probesets representing approximately 24,000 genes (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL198). The final dataset included 272 microarrays comprising 52 comparisons (31 with more than 3 biological replicates) between pathogen-infected (167 microarrays) and control samples (105 microarrays). Bacterial (4 species), fungal (8 species), oomycete (3 species), and protist (1 species) pathogens were grouped based on their parasitic life style as biotrophs (10 species), hemibiotrophs (1 species) or necrotrophs (5 species; Table 1).
Prior to statistical testing, hybridization intensities were normalized across all experiments using gcrma v. 2.32.0 [75]. Differential expression for experiments that included at least three healthy control samples was tested with the Limma R package v. 3.16.8 [76]. All analyses described in this work were conducted on the 18,065 probesets, out of the total 22,810 spotted on the ATH1 array, whose expression was significantly changed at least by one pathogen infection treatment. The empirical probability distribution of the fold-changes associated with significant DEs (P<0.05) was used to define a log2 fold-change threshold (α + = 0.2777194, α_ = -0.2784232) to identify probesets potentially differentially expressed in those experiments with less than three control biological replicates. To remove potential noise, all fold-change values associated with comparisons that were not considered significant by Limma or potentially significant by threshold-filtering were converted to ‘zero’, which in log2 scale corresponds to complete absence of differential regulation between treatments, with the function f(C) = I C (X) where:
In order to maintain one data point per biological replicate of the pathogen-treated samples, the retained fold-change values were calculated by subtracting the median normalized log2 expression values of healthy control samples from the respective pathogen-treated samples.
Weighted co-expression network construction
Co-expression analysis was conducted using the Weighted Correlation Network Analysis (WGCNA) v. 1.3.6 [27,77]. Instead of creating multiple networks and comparing their modules, which would have required a significant reduction of the dataset due to the high heterogeneity of the final dataset (varying numbers of replicates, diverse infecting pathogens, different time points of infection…), we generated with basic WGCNA network creation functions (flashClust() and cutreeDynamic(), [27,77]) a single co-expression network that incorporated all filtered expression values. Multiple branch cutting approaches were tested and dynamic cutting with the ‘hybrid’ parameter was selected because hard threshold cutting has been shown to be too rigid for dendrograms with potentially complicated transcriptomic data [78]. The ‘dynamic’ cutting can identify nested clusters and has been shown to better detect outliers [78]. A threshold of minimum module size of 20 genes was imposed. For network visualization and topological analyses, the unweighted network was extracted using a hard threshold of TOM > 0.1 and imported into Cytoscape v. 2.8 [29].
Module enrichment tests
Gene Ontology (GO) enrichment in modules was carried out with topGO v. 2.12.0 [79] using the annotation of the ATH1 array from the ath1121501.db R package v. 2.9.0 [41]. The classic Fisher’s exact test method was applied. Each module was tested for enrichment in terms of the Molecular Function (MF) and the Biological Process (BP) categories. GO terms with expected values of less than one gene (i.e., determined by chi-square test) were not included in the analyses, in order to eliminate false positives from ill-annotated terms.
Hormone-related gene and transcription factor enrichments were also determined with a classic Fisher’s exact test. The list of genes annotated as involved in hormone biosynthesis, signaling, or modification were obtained from the Arabidopsis Hormone Database (ATHD; [37]). Transcription factor annotations were obtained from the Plant Transcription Factor Database (PlnTFDB) [80]. All modules were tested individually for enrichment in each category (S4 Table).
The Fisher’s exact test was used to assess significant enrichment of each cis-regulatory element type independently with contingency tables consisting of total counts of element presence in the Arabidopsis genome (AtcisDB; [47,48]), total counts of element presence in meta-modules, total predicted cis-regulatory elements in the Arabidopsis genome, and total count of cis-regulatory elements in meta-modules.
Module analysis
Module eigengenes were calculated for each module (defined as the first right singular vector of the module, constructed with the TOM matrix), calculated using the moduleEigengenes() function in WGCNA. Pairwise eigengene correlation distances defined as , where I and J are modules and E I and E J are vectors of eigengene values from each module, were used to cluster the modules to create a tree of putative relatedness of function based on module trend [12]. Modules were clustered in meta-modules based on the 0.25 cutoff (corresponding to a correlation of 0.75) for the module eigengene dendrogram (Fig. 2b). Cytoscape v. 2.8 [29] was used for network visualizations and to calculate topological properties. Cytoscape statistics are reported in S7 Table. Topologies were visualized using the ‘shortest path’ method after removing edges associated with TOM values < 0.1. Equation (1)
Supporting Information
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was partially supported by funding to DC from the California Department of Food and Agriculture (project 11-0148-SA) and from the College of Agricultural and Environmental Sciences (UC Davis). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Chisholm ST, Coaker G, Day B, Staskawicz BJ. Host-microbe interactions: shaping the evolution of the plant immune response. Cell. 2006;124: 803–814. [DOI] [PubMed] [Google Scholar]
- 2. Dodds PN, Rathjen JP. Plant immunity: towards an integrated view of plant-pathogen interactions. Nat Rev Genet. 2010;11: 539–548. 10.1038/nrg2812 [DOI] [PubMed] [Google Scholar]
- 3. Pieterse CM, Leon-Reyes A, Van der Ent S, Van Wees SC. Networking by small-molecule hormones in plant immunity. Nat Chem Biol. 2009;5: 308–316. 10.1038/nchembio.164 [DOI] [PubMed] [Google Scholar]
- 4. Spoel SH, Dong X. Making sense of hormone crosstalk during plant immune responses. Cell Host Microbe. 2008;3: 348–351. 10.1016/j.chom.2008.05.009 [DOI] [PubMed] [Google Scholar]
- 5. Dodd AN, Kudla J, Sanders D. The language of calcium signaling. Annu Rev Plant Biol. 2010;61: 593–620. 10.1146/annurev-arplant-070109-104628 [DOI] [PubMed] [Google Scholar]
- 6. Schulz P, Herde M, Romeis T. Calcium-dependent protein kinases: hubs in plant stress signaling and development. Plant Physiol 2013;163: 523–530. 10.1104/pp.113.222539 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Torres MA, Jones JD, Dangl JL. Reactive oxygen species signaling in response to pathogens. Plant Physiol 2006;141: 373–378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Asai T, Tena G, Plotnikova J, Willmann MR, Chiu WL, Gomez-Gomez L, et al. MAP kinase signalling cascade in Arabidopsis innate immunity. Nature. 2002;415: 977–983. [DOI] [PubMed] [Google Scholar]
- 9. Tao Y, Xie Z, Chen W, Glazebrook J, Chang HS, Han B, et al. Quantitative nature of Arabidopsis responses during compatible and incompatible interactions with the bacterial pathogen Pseudomonas syringae . Plant Cell. 2003;15: 317–330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Boddu J, Cho S, Kruger WM, Muehlbauer GJ. Transcriptome analysis of the barley-Fusarium graminearum interaction. Mol Plant Microbe Interact. 2006;19: 407–417. [DOI] [PubMed] [Google Scholar]
- 11. Wise RP, Moscou MJ, Bogdanove AJ, Whitham SA. Transcript profiling in host-pathogen interactions. Annu Rev Phytopathol. 2007;45: 329–369. [DOI] [PubMed] [Google Scholar]
- 12. Langfelder P, Horvath S. Eigengene networks for studying the relationships between co-expression modules. BMC Syst Biol. 2007;1: 54 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Ma S, Gong Q, Bohnert HJ. An Arabidopsis gene network based on the graphical Gaussian model. Genome Res. 2007;17: 1614–1625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Cai B, Li CH, Huang J. Systematic identification of cell-wall related genes in Populus based on analysis of functional modules in co-expression network. PLoS One. 2014;9: e95176 10.1371/journal.pone.0095176 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. van Dam S, Cordeiro R, Craig T, van Dam J, Wood SH, de Magalhães JP. GeneFriends: an online co-expression analysis tool to identify novel gene targets for aging and complex diseases. BMC Genomics. 2012;13: 535 10.1186/1471-2164-13-535 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Tully JP, Hill AE, Ahmed HM, Whitley R, Skjellum A, Mukhtar MS. Expression-based network biology identifies immune-related functional modules involved in plant defense. BMC Genomics. 2014;15: 421 10.1186/1471-2164-15-421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Oldham MC, Horvath S, Geschwind DH. Conservation and evolution of gene coexpression networks in human and chimpanzee brains. Proc Natl Acad Sci U S A. 2006;103: 17973–17978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Obayashi T, Hayashi S, Shibaoka M, Saeki M, Ohta H, Kinoshita K. COXPRESdb: a database of coexpressed gene networks in mammals. Nucleic Acids Res. 2008;36: D77–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Lee JM, Sonnhammer EL. Genomic gene clustering analysis of pathways in eukaryotes. Genome Res. 2003;13: 875–882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Jordan IK, Marino-Ramirez L, Wolf YI, Koonin EV. Conservation and coevolution in the scale-free human gene coexpression network. Mol Biol Evol. 2004;21: 2058–2070. [DOI] [PubMed] [Google Scholar]
- 21. Lee I, Ambaru B, Thakkar P, Marcotte EM, Rhee SY. Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana . Nat Biotechnol. 2010;28: 149–156. 10.1038/nbt.1603 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Barabasi A-L, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004;5: 101–113. [DOI] [PubMed] [Google Scholar]
- 23. Albert R, Jeong H, Barabasi AL. Error and attack tolerance of complex networks. Nature. 2000;406: 378–382. [DOI] [PubMed] [Google Scholar]
- 24. Jeong H, Mason SP, Barabasi AL, Oltvai ZN. Lethality and centrality in protein networks. Nature. 2001;411: 41–42. [DOI] [PubMed] [Google Scholar]
- 25. Reverter A, Chan EK. Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks. Bioinformatics. 2008;24: 2491–2497. 10.1093/bioinformatics/btn482 [DOI] [PubMed] [Google Scholar]
- 26. Carlson MR, Zhang B, Fang Z, Mischel PS, Horvath S, Nelson SF. Gene connectivity, function, and sequence conservation: predictions from modular yeast co-expression networks. BMC Genomics. 2006;7: 40 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9: 559 10.1186/1471-2105-9-559 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4: 1544–6115. [DOI] [PubMed] [Google Scholar]
- 29. Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011;27: 431–432. 10.1093/bioinformatics/btq675 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Alexa A, Rahnenfuhrer J, Lengauer T. Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006;22: 1600–1607. [DOI] [PubMed] [Google Scholar]
- 31. Bilgin DD, Zavala JA, Zhu J, Clough SJ, Ort DR, DeLucia EH. Biotic stress globally downregulates photosynthesis genes. Plant Cell Environ. 2010;33: 1597–1613. 10.1111/j.1365-3040.2010.02167.x [DOI] [PubMed] [Google Scholar]
- 32. Bolton MD. Primary metabolism and plant defense—fuel for the fire. Mol Plant Microbe Interact 2009;22: 487–497. 10.1094/MPMI-22-5-0487 [DOI] [PubMed] [Google Scholar]
- 33. Kerchev PI, Fenton B, Foyer CH, Hancock RD. Plant responses to insect herbivory: interactions between photosynthesis, reactive oxygen species and hormonal signalling pathways. Plant Cell Environ. 2012;35: 441–453. 10.1111/j.1365-3040.2011.02399.x [DOI] [PubMed] [Google Scholar]
- 34. Major IT, Nicole MC, Duplessis S, Seguin A. Photosynthetic and respiratory changes in leaves of poplar elicited by rust infection. Photosynth Res. 2010;104: 41–48. 10.1007/s11120-009-9507-2 [DOI] [PubMed] [Google Scholar]
- 35. Roberts MR, Paul ND. Seduced by the dark side: integrating molecular and ecological perspectives on the influence of light on plant defence against pests and pathogens. New Phytol. 2006;170: 677–699. [DOI] [PubMed] [Google Scholar]
- 36. Kangasjarvi S, Neukermans J, Li S, Aro EM, Noctor G. Photosynthesis, photorespiration, and light signalling in defence responses. J Exp Bot. 2012;63: 1619–1636. 10.1093/jxb/err402 [DOI] [PubMed] [Google Scholar]
- 37. Jiang Z, Liu X, Peng Z, Wan Y, Ji Y, He W, et al. AHD2.0: an update version of Arabidopsis Hormone Database for plant systematic studies. Nucleic Acids Res. 2011;39: D1123–1129. 10.1093/nar/gkq1066 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Bari R, Jones JD. Role of plant hormones in plant defence responses. Plant Mol Biol. 2009;69: 473–488. 10.1007/s11103-008-9435-0 [DOI] [PubMed] [Google Scholar]
- 39. Pieterse CM, Van der Does D, Zamioudis C, Leon-Reyes A, Van Wees SC. Hormonal modulation of plant immunity. Annu Rev Cell Dev Biol. 2012;28: 489–521. 10.1146/annurev-cellbio-092910-154055 [DOI] [PubMed] [Google Scholar]
- 40.Carlson MR Affymetrix Arabidopsis ATH1 Genome Array annotation data (chip ath1121501). R package version 2.7.0. 2012. Available: www.bioconductor.org
- 41. Stintzi A, Heitz T, Prasad V, Wiedemann-Merdinoglu S, Kauffmann S, Geoffroy P, et al. Plant ‘pathogenesis-related’ proteins and their role in defense against pathogens. Biochimie. 1993;75: 687–706. [DOI] [PubMed] [Google Scholar]
- 42. van Loon LC, Rep M, Pieterse CM. Significance of inducible defense-related proteins in infected plants. Annu Rev Phytopathol. 2006;44: 135–162. [DOI] [PubMed] [Google Scholar]
- 43. Singh K, Foley RC, Onate-Sanchez L. Transcription factors in plant defense and stress responses. Curr Opin Plant Biol. 2002;5: 430–436. [DOI] [PubMed] [Google Scholar]
- 44. Blais A, Dynlacht BD. Constructing transcriptional regulatory networks. Genes Dev. 2005;19: 1499–1511. [DOI] [PubMed] [Google Scholar]
- 45. Lee TI, Rinaldi NJ, Robert F, Odom DT, Bar-Joseph Z, Gerber GK, et al. Transcriptional regulatory networks in Saccharomyces cerevisiae . Science. 2002;298: 799–804. [DOI] [PubMed] [Google Scholar]
- 46. Ma S, Shah S, Bohnert HJ, Snyder M, Dinesh-Kumar SP. Incorporating motif analysis into gene co-expression networks reveals novel modular expression pattern and new signaling pathways. PLoS Genet. 2013;9: e1003840 10.1371/journal.pgen.1003840 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Davuluri RV, Sun H, Palaniswamy SK, Matthews N, Molina C, Kurtz M, et al. AGRIS: Arabidopsis gene regulatory information server, an information resource of Arabidopsis cis-regulatory elements and transcription factors. BMC Bioinformatics. 2003;4: 25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Yilmaz A, Mejia-Guerra MK, Kurz K, Liang X, Welch L, Grotewold E. AGRIS: the Arabidopsis Gene Regulatory Information Server, an update. Nucleic Acids Res. 2011;39: D1118–1122. 10.1093/nar/gkq1120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Fujimoto SY, Ohta M, Usui A, Shinshi H, Ohme-Takagi M. Arabidopsis ethylene-responsive element binding factors act as transcriptional activators or repressors of GCC box-mediated gene expression. Plant Cell. 2000;12: 393–404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Sessa G, Meller Y, Fluhr R. A GCC element and a G-box motif participate in ethylene-induced expression of the PRB-1b gene. Plant Mol Biol. 1995;28: 145–153. [DOI] [PubMed] [Google Scholar]
- 51. Eulgem T, Rushton PJ, Schmelzer E, Hahlbrock K, Somssich IE. Early nuclear events in plant defence signalling: rapid gene activation by WRKY transcription factors. EMBO J. 1999;18: 4689–4699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Du L, Chen Z. Identification of genes encoding receptor-like protein kinases as possible targets of pathogen- and salicylic acid-induced WRKY DNA-binding proteins in Arabidopsis. Plant J 2000;24: 837–847. [DOI] [PubMed] [Google Scholar]
- 53. Yu D, Chen C, Chen Z. Evidence for an important role of WRKY DNA binding proteins in the regulation of NPR1 gene expression. Plant Cell. 2001;13: 1527–1540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Pandey SP, Somssich IE. The role of WRKY transcription factors in plant immunity. Plant Physiol 2009;150: 1648–1655. 10.1104/pp.109.138990 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Chen W, Provart NJ, Glazebrook J, Katagiri F, Chang HS, Eulgem T, et al. Expression profile matrix of Arabidopsis transcription factor genes suggests their putative functions in response to environmental stresses. Plant Cell. 2002;14: 559–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Maleck K, Levine A, Eulgem T, Morgan A, Schmid J, Lawton KA, et al. The transcriptome of Arabidopsis thaliana during systemic acquired resistance. Nat Genet. 2000;26: 403–410. [DOI] [PubMed] [Google Scholar]
- 57. Eulgem T, Somssich IE. Networks of WRKY transcription factors in defense signaling. Curr Opin Plant Biol. 2007;10: 366–371. [DOI] [PubMed] [Google Scholar]
- 58. Shinozaki K, Yamaguchi-Shinozaki K. Molecular responses to dehydration and low temperature: differences and cross-talk between two stress signaling pathways. Curr Opin Plant Biol. 2000;3: 217–223. [PubMed] [Google Scholar]
- 59. He S, Tan G, Liu Q, Huang K, Ren J, Zhang X, et al. The LSD1-interacting protein GILP is a LITAF domain protein that negatively regulates hypersensitive cell death in Arabidopsis. PLoS One. 2011;6: e18750 10.1371/journal.pone.0018750 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Yang T, Chaudhuri S, Yang L, Chen Y, Poovaiah BW. Calcium/calmodulin up-regulates a cytoplasmic receptor-like kinase in plants. J Biol Chem. 2004;279: 42552–42559. [DOI] [PubMed] [Google Scholar]
- 61. Michelmore RW, Christopoulou M, Caldwell KS. Impacts of resistance gene genetics, function, and evolution on a durable future. Annu Rev Phytopathol. 2013;51: 291–319. 10.1146/annurev-phyto-082712-102334 [DOI] [PubMed] [Google Scholar]
- 62. Navarro L, Zipfel C, Rowland O, Keller I, Robatzek S, Boller T, et al. The transcriptional innate immune response to flg22. Interplay and overlap with Avr gene-dependent defense responses and bacterial pathogenesis. Plant Physiol. 2004;135: 1113–1128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Hass C, Lohrmann J, Albrecht V, Sweere U, Hummel F, Yoo SD, et al. The response regulator 2 mediates ethylene signalling and hormone signal integration in Arabidopsis. EMBO J. 2004;23: 3290–3302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Ascencio-Ibanez JT, Sozzani R, Lee TJ, Chu TM, Wolfinger RD, Cella R, et al. Global analysis of Arabidopsis gene expression uncovers a complex array of changes impacting pathogen response and cell cycle during geminivirus infection. Plant Physiol. 2008;148: 436–454. 10.1104/pp.108.121038 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Ramirez V, Lopez A, Mauch-Mani B, Gil MJ, Vera P. An extracellular subtilase switch for immune priming in Arabidopsis. PLoS Pathog. 2013;9: e1003445 10.1371/journal.ppat.1003445 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Rentel MC, Lecourieux D, Ouaked F, Usher SL, Petersen L, Okamoto H, et al. OXI1 kinase is necessary for oxidative burst-mediated signalling in Arabidopsis. Nature. 2004;427: 858–861. [DOI] [PubMed] [Google Scholar]
- 67. von Saint Paul V, Zhang W, Kanawati B, Geist B, Faus-Kessler T, Schmitt-Kopplin P, et al. The Arabidopsis glucosyltransferase UGT76B1 conjugates isoleucic acid and modulates plant defense and senescence. Plant Cell. 2011;23: 4124–4145. 10.1105/tpc.111.088443 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Assenov Y, Ramirez F, Schelhorn SE, Lengauer T, Albrecht M. Computing topological parameters of biological networks. Bioinformatics. 2008;24: 282–284. [DOI] [PubMed] [Google Scholar]
- 69. Howles PA, Birch RJ, Collings DA, Gebbie LK, Hurley UA, Hocart CH, et al. A mutation in an Arabidopsis ribose 5-phosphate isomerase reduces cellulose synthesis and is rescued by exogenous uridine. Plant J. 2006;48: 606–618. [DOI] [PubMed] [Google Scholar]
- 70. Chen X, Liu J, Lin G, Wang A, Wang Z, Lu G. Overexpression of AtWRKY28 and AtWRKY75 in Arabidopsis enhances resistance to oxalic acid and Sclerotinia sclerotiorum . Plant Cell Rep. 2013;32: 1589–1599. 10.1007/s00299-013-1469-3 [DOI] [PubMed] [Google Scholar]
- 71. Encinas-Villarejo S, Maldonado AM, Amil-Ruiz F, de los Santos B, Romero F, Pliego-Alfaro F, et al. Evidence for a positive regulatory role of strawberry (Fragaria x ananassa) Fa WRKY1 and Arabidopsis At WRKY75 proteins in resistance. J Exp Bot. 2009;60: 3043–3065. 10.1093/jxb/erp152 [DOI] [PubMed] [Google Scholar]
- 72. Vanderbeld B, Snedden WA. Developmental and stimulus-induced expression patterns of Arabidopsis calmodulin-like genes CML37, CML38 and CML39 . Plant Mol Biol. 2007;64: 683–697. [DOI] [PubMed] [Google Scholar]
- 73. Ding Z, Li S, An X, Liu X, Qin H, Wang D. Transgenic expression of MYB15 confers enhanced sensitivity to abscisic acid and improved drought tolerance in Arabidopsis thaliana . J Genet Genomics. 2009;36: 17–29. 10.1016/S1673-8527(09)60003-5 [DOI] [PubMed] [Google Scholar]
- 74. Kauffmann A, Rayner TF, Parkinson H, Kapushesky M, Lukk M, Brazma A, et al. Importing ArrayExpress datasets into R/Bioconductor. Bioinformatics. 2009;25: 2092–2094. 10.1093/bioinformatics/btp354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Wu Z, Irizarry RA, Gentleman R, Murillo FM, Spencer F. A Model Based Background Adjustment for Oligonucleotide Expression Arrays. Amer Statist Assoc. 2004;99: 909–917 [Google Scholar]
- 76. Smyth GK. Limma: linear models for microarray data In: Gentleman RR., Carey VJ, Huber W, Irizarry RA, Dudoit S, editors. Bioinformatics and Computational Biology Solutions using R and Bioconductor. New York: Springer; 2005. pp. 397–420. [Google Scholar]
- 77. Langfelder P, Horvath S. Fast R Functions for Robust Correlations and Hierarchical Clustering. J Stat Softw. 2012;46: 11 [PMC free article] [PubMed] [Google Scholar]
- 78. Langfelder P, Zhang B, Horvath S. Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics. 2008;24: 719–720. [DOI] [PubMed] [Google Scholar]
- 79.Alexa A, Rahnenfuhrer J, Gene set enrichment analysis with topGO. R package version 2.18.0. 2010. Available: www.bioconductor.org
- 80. Jin J, Zhang H, Kong L, Gao G, Luo J. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res. 2014;42: D1182–1187. 10.1093/nar/gkt1016 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.