Abstract
Background
Recently, the research community has seen an influx of data relating to transcriptional regulatory interactions of Corynebacteria, organisms that are highly relevant to fields of systems biology, biotechnology, and human medicine. Information derived from DNA microarray experiments, computational predictions, and literature has opened the way for the graph-based analysis, visualization, and reconstruction of transcriptional regulatory networks across entire organisms. The reference database for corynebacterial gene regulatory networks CoryneRegNet provides methods for data storage and data exchange in a well-structured manner. Additional information on the model organism Escherichia coli K12 obtained from RegulonDB has been integrated. Generally, gene regulatory networks can be visualized as graphs by drawing directed edges between nodes, where a node represents a gene and an edge corresponds to a typed regulatory interaction. Cytoscape is an open-source software project whose aim is to provide graph-based visualization and analysis for biological networks. Its architecture allows the development and integration of user-made plugins to enhance core functionalities.
Results
We introduce two novel plugins for the Cytoscape environment designed to enhance in silico studies of procaryotic transcriptional regulatory networks. Our plugins leverage the information from the cornyebacterial reference database CoryneRegNet with the graph analysis capabilities of Cytoscape. CoryneRegNetLoader queries the CoryneRegNet database to extract a gene regulatory network represented as a directed graph. Additional information is stored as node/edge attributes within the graph. COMA facilitates consistency checks for gene expression studies given a gene regulatory network. COMA tests whether all gene expression levels correlate properly with the given network topology.
Conclusion
The plugins facilitate in silico studies of procaryotic transcriptional gene regulation, particularly in Corynebacteria and E. coli, by combining the knowledge from the corynebacterial reference database with the graph analysis capabilities of Cytoscape, which is one of the most-widely used tools for biological network analyses.
Background
Recently, the research community has seen an influx of data relating to transcriptional regulatory interactions of microorganisms. Information derived from DNA microarray hybridization experiments, computational predictions, and literature-based text mining has opened the way for the graph-based analysis of gene regulatory networks across entire organisms [1,2]. The large-scale reconstruction of these networks allows for in silico simulation of cell behavior in response to changing environmental conditions [3].
Corynebacteria are highly relevant to fields of systems biology, biotechnology, and human medicine [4]. The ontology-based reference database for corynebacterial transcription factors and regulatory networks CoryneRegNet [5-8] provides methods for the storage, analysis, and reconstruction of gene regulatory networks of corynebacteria. Data on the model organism Escherichia coli K12 has been imported from RegulonDB release 5.0 [9,10] and integrated together with corynebacterial data (refer to [8] for a detailed description of the integrated data). The reconstructed gene regulatory networks can be visualized as graphs by drawing typed, directed edges between nodes, where a node represents a gene and an edge corresponds to a known gene regulatory interaction (activation or repression).
Cytoscape [11-13] is an open-source software project whose aim is to provide graph-based visualization and analysis for all biological networks. Its architecture allows the development and integration of user-made plugins to enhance core functionalities. The easy to use plugin interface has encouraged the community to develop a number of Cytoscape plugins, which are publicly available at [14].
In this short article, we introduce two novel plugins for the Cytoscape environment. The plugins facilitate in silico studies of procaryotic transcriptional gene regulation, particularly in Corynebacteria and E. coli, by combining the knowledge from the corynebacterial reference database with the graph analysis capabilities of Cytoscape, which is one of the most-widely used tools for biological network analyses.
Implementation
Both plugins are implemented in Java 5 in compliance with the Cytoscape plugin interface mechanism by extending the class CytoscapePlugin and creating Java JAR files. A tutorial on Cytoscape plugins may be found at [15]. The plugins are publicly available at the Cytoscape plugin site at [14] (COMA in section 'Analysis' and CoryneRegNetLoader in section 'Network and Attribute I/O') as JAR files, which have to be copied into the Cytoscape plugins directory. They can also be installed more easily by using the 'Plugins'/'Manage plugins' menu of Cytoscape. In order to run CoryneRegNetLoader internet access is necessary. The plugins are compatible with Cytoscape version 2.5.x and have also been tested with the current beta release 2.6. The URL for a documentation and a step-by-step description of an application example is given in Availability and Requirements. The source code of both plugins is included in the JAR files.
Results and Conclusion
CoryneRegNet plugin
The CoryneRegNetLoader is able to extract transcriptional regulatory networks stored in CoryneRegNet and to import them into Cytoscape. A directed graph is then created, with genes as nodes and regulatory interactions as directed edges. For each node, CoryneRegNetLoader imports the up-to-date gene and protein annotation data, which is stored as node attributes. For each edge, the plugin retrieves the evidence of the underlying protein-DNA interaction and whether the target genes are up- or downregulated. This data is stored as edge attributes. In [16] we described the SOAP-based Web Service server of CoryneRegNet and presented an application example that shows how the community profits from an integrated visualization of corynebacterial network data. With this plugin, we take a step forward and provide an interface to a more powerful environment for integrated biological network analyses.
COMA plugin
The COMA plugin implements a method to facilitate consistency checks for gene expression studies given a gene regulatory network. It checks whether the expression levels of all genes are as expected from the network topology.
We discretize the expression levels of genes to overexpressed ('+') and underexpressed ('-') respectively. Furthermore, we do the same for the activation or repression of a target gene by a regulator. Let g ∈ {+, -} be the gene expression of a gene G. Let t ∈ {+, -} be the gene expression of a transcription factor T, which regulates G. Finally, let r ∈ {+, -} be the known gene regulatory interaction of gene G by transcription factor T. Subsequently consider the algebraic signs in the following equation:
t·g = r |
If the equation is incorrect, for instance '+' · '-' = '+', this is a putative inconsistency with the given regulatory network; also refer to [5]. Based on this, for every gene G of a given microarray experiment with expression g, all transcription factors T that regulate G are considered, and for all transcription factors the expression level t and the type of the regulatory interaction r is determined. Subsequently, COMA applies the above explained inconsistency test. Table 1 summarizes the results for all possible combinations of t, g, and r.
Table 1.
TF expr. t | Target gene expr. g | Regulation type r | Contradiction? |
+ | + | - | yes |
+ | - | + | yes |
- | + | + | yes |
- | - | - | yes |
+ | + | + | no |
+ | - | - | no |
- | + | - | no |
- | - | + | no |
This table illustrates all gene expression/regulation combinations for a transcription factor T (with relative expression level t), a target gene G (with relative expression level g), and a regulatory interaction of type r. For the first four rows, COMA would suggest that the gene regulation type r contradicts the given gene expression levels for the transcription factor T and the target gene G.
Although COMA was designed to work with the CoryneRegNetLoader, it can be used with any directed biological network as long as there is a Cytoscape edge attribute of type boolean, integer, or float. This attribute is used to distinguish between up- and downregulations and can be defined by the user. Gene expression values can be imported by using the Cytoscape microarray import feature, for instance. Generally, any node attribute of type integer or float can be used for COMA. Each putative inconsistent edge is listed in a result table. If the user selects a certain node-pair, both nodes including the edge between them are highlighted in the Cytoscape graph. Figure 1 show a subset of the complete gene regulatory network of C. glutamicum, extracted from CoryneRegNet by utilizing the CoryneRegNetLoader plugin. The COMA plugin has been applied to the graph by using the results of a gene expression study provided by Brune et. al [17], which is also available in Cytoscape format as Additional file 1. With this plugin, Cytoscape now provides hints for missing regulatory interactions.
Both plugins together facilitate in silico the study and visualization of procaryotic transcriptional gene regulation, particularly in Corynebacteria and E. coli, by combining the manually curated knowledge from the CoryneRegNet database with the power of the graph analysis and visualization capabilities of Cytoscape.
Since various chemical mechanisms take part in the regulation of the genes, in a future version of the COMA plugin we plan to consider also non-transcriptional gene regulatory interactions. Additionally, we plan to extend COMA to check for coherent and incoherent feed-forward loops [18] in a given network. We will also conside further rules to validate microarray data in the context of known regulatory networks (see e.g. [19]).
Availability and Requirements
Project name: CoryneRegNetLoader and COMA
Project home page: http://cytoscape.org/plugins and http://www.coryneregnet.de
Documentation and application example: https://www.cebitec.uni-bielefeld.de/groups/gi/software/coryneregnet/v4/CytoscapePlugins/index.html
Operating system(s): Platform independent
Programming language: Java 5
License: Academic Free License (AFL)
Any restrictions to use by non-academics: User should contact Jan.Baumbach@CeBiTec.Uni-Bielefeld.DE.
Authors' contributions
JB and LA developed and implemented the plugins. Both authors contributed to writing and both read and approved the final manuscript.
Supplementary Material
Acknowledgments
Acknowledgements
LA wishes to thank John 'Scooter' Morris and Thomas Ferrin (CGL, UCSF) for supervising, JB wishes to thank them for their cooperation and for their assistence with the Cytoscape plugin mechanism.
Furthermore, JB acknowledges Sven Rahmann (Dortmund University) and Andreas Tauch (CeBiTec, Bielefeld University) for supervising and for their help with the CoryneRegNet database. JB thanks Heladia Salgado-Osorio for providing RegulonDB data. JB also thanks the Boehringer Ingelheim Fond for partly funding the cooperation between the CeBiTec and the UCSF. LA received support from NIH grant T32 GM067547.
Contributor Information
Jan Baumbach, Email: jan.baumbach@cebitec.uni-bielefeld.de.
Leonard Apeltsin, Email: Leonard.Apeltsin@ucsf.edu.
References
- Brune I, Brinkrolf K, Kalinowski J, Pühler A, Tauch A. The individual and common repertoire of DNA-binding transcriptional regulators of Corynebacterium glutamicum, Corynebacterium efficiens, Corynebacterium diphtheriae and Corynebacterium jeikeium deduced from the complete genome sequences. BMC Genomics. 2005;6:86. doi: 10.1186/1471-2164-6-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perez-Rueda E, Collado-Vides J. The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12. Nucleic Acids Res. 2000;28:1838–1847. doi: 10.1093/nar/28.8.1838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herrgård MJ, Covert MW, Palsson B. Reconstruction of microbial transcriptional regulatory networks. Curr Opin Biotechnol. 2004;15:70–77. doi: 10.1016/j.copbio.2003.11.002. [DOI] [PubMed] [Google Scholar]
- Wendisch VF, Bott M, Kalinowski J, Oldiges M, Wiechert W. Emerging Corynebacterium glutamicum systems biology. J Biotechnol. 2006;124:74–92. doi: 10.1016/j.jbiotec.2005.12.002. [DOI] [PubMed] [Google Scholar]
- Baumbach J. CoryneRegNet 4.0 – A reference database for corynebacterial gene regulatory networks. BMC Bioinformatics. 2007;8:429. doi: 10.1186/1471-2105-8-429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baumbach J, Brinkrolf K, Czaja L, Rahmann S, Tauch A. CoryneRegNet: An ontology-based data warehouse of corynebacterial transcription factors and regulatory networks. BMC Genomics. 2006;7:24. doi: 10.1186/1471-2164-7-24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baumbach J, Brinkrolf K, Wittkop T, Tauch A, Rahmann S. CoryneRegNet 2: An Integrative Bioinformatics Approach for Reconstruction and Comparison of Transcriptional Regulatory Networks in Prokaryotes. Journal of Integrative Bioinformatics. 2006;3:24. [Google Scholar]
- Baumbach J, Wittkop T, Rademacher K, Rahmann S, Brinkrolf K, Tauch A. CoryneRegNet 3.0-An interactive systems biology platform for the analysis of gene regulatory networks in corynebacteria and Escherichia coli. J Biotechnol. 2007;129:279–289. doi: 10.1016/j.jbiotec.2006.12.012. [DOI] [PubMed] [Google Scholar]
- Gama-Castro S, Jiménez-Jacinto V, Peralta-Gil M, Santos-Zavaleta A, Peñaloza-Spinola MI, Contreras-Moreira B, Segura-Salazar J, Muñiz-Rascado L, Martínez-Flores I, Salgado H, Bonavides-Martínez C, Abreu-Goodger C, Rodríguez-Penagos C, Miranda-Ríos J, Morett E, Merino E, Huerta AM, Treviño-Quintanilla L, Collado-Vides J. RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation. Nucleic Acids Res. 2008:D120–D124. doi: 10.1093/nar/gkm994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salgado H, Gama-Castro S, Peralta-Gil M, Diaz-Peredo E, Sanchez-Solano F, Santos-Zavaleta A, Martinez-Flores I, Jimenez-Jacinto V, Bonavides-Martinez C, Segura-Salazar J, Martinez-Antonio A, Collado-Vides J. RegulonDB (version 5.0): Escherichia coli K-12 transcriptional regulatory network, operon organization, and growth conditions. Nucleic Acids Res. 2006:D394–D397. doi: 10.1093/nar/gkj156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, Hanspers K, Isserlin R, Kelley R, Killcoyne S, Lotia S, Maere S, Morris J, Ono K, Pavlovic V, Pico AR, Vailaya A, Wang PL, Adler A, Conklin BR, Hood L, Kuiper M, Sander C, Schmulevich I, Schwikowski B, Warner GJ, Ideker T, Bader GD. Integration of biological networks and gene expression data using Cytoscape. Nat Protoc. 2007;2:2366–2382. doi: 10.1038/nprot.2007.324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cytoscape website http://cytoscape.org/
- Cytoscape plugins website http://cytoscape.org/plugins/
- Cytoscape plugin tutorial website http://www.cytoscape.org/cgi-bin/moin.cgi/Cytoscape Plugin Tutorial
- Neuweger H, Baumbach J, Albaum S, Bekel T, Dondrup M, Hueser A, Kalinowski J, Oehm S, Puehler A, Rahmann S, Weile J, Goesmann A. CoryneCenter – An online resource for the integrated analysis of corynebacterial genome and transcriptome data. BMC Syst Biol. 2007;1:55. doi: 10.1186/1752-0509-1-55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brune I, Werner H, Huser A, Kalinowski J, Puhler A, Tauch A. The DtxR protein acting as dual transcriptional regulator directs a global regulatory network involved in iron metabolism of Corynebacterium glutamicum. BMC Genomics. 2006;7:21. doi: 10.1186/1471-2164-7-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mangan S, Alon U. Structure and function of the feed-forward loop network motif. Proc Natl Acad Sci USA. 2003;100:11980–11985. doi: 10.1073/pnas.2133841100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gutiérrez-Ríos RM, Rosenblueth DA, Loza JA, Huerta AM, Glasner JD, Blattner FR, Collado-Vides J. Regulatory network of Escherichia coli: consistency between literature knowledge and microarray profiles. Genome Res. 2003;13:2435–2443. doi: 10.1101/gr.1387003. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.