Skip to main content
eLife logoLink to eLife
. 2015 Mar 17;4:e04346. doi: 10.7554/eLife.04346

Linking traits based on their shared molecular mechanisms

Yael Oren 1, Aharon Nachshon 1, Amit Frishberg 1, Roni Wilentzik 1, Irit Gat-Viks 1,*
Editor: Naama Barkai2
PMCID: PMC4362207  PMID: 25781485

Abstract

There is growing recognition that co-morbidity and co-occurrence of disease traits are often determined by shared genetic and molecular mechanisms. In most cases, however, the specific mechanisms that lead to such trait–trait relationships are yet unknown. Here we present an analysis of a broad spectrum of behavioral and physiological traits together with gene-expression measurements across genetically diverse mouse strains. We develop an unbiased methodology that constructs potentially overlapping groups of traits and resolves their underlying combination of genetic loci and molecular mechanisms. For example, our method predicts that genetic variation in the Klf7 gene may influence gene transcripts in bone marrow-derived myeloid cells, which in turn affect 17 behavioral traits following morphine injection; this predicted effect of Klf7 is consistent with an in vitro perturbation of Klf7 in bone marrow cells. Our analysis demonstrates the utility of studying hidden causative mechanisms that lead to relationships between complex traits.

DOI: http://dx.doi.org/10.7554/eLife.04346.001

Research organism: mouse

eLife digest

Many individuals who have diabetes also have other diseases that affect the heart and blood vessels. It is not uncommon for human diseases to occur together like this; and understanding the relationships between diseases and other traits can make it easier to diagnose conditions. Furthermore, it can also help researchers develop treatments that are more precisely targeted to each condition and cause fewer side effects.

Two conditions or traits tend to occur together if they are caused by mutations in the same gene or genes; or if they involve processes within cells that share the same proteins and other molecules. However, in most cases the genes and molecular mechanisms involved are not yet known so it is more difficult to work out how the traits are connected.

Computing techniques make it possible to assess the relationships between hundreds or thousands of traits at the same time. These high volume analyses can also allow scientists to identify less obvious relationships that might be missed in more traditional types of study.

Here, Oren et al. created a new computer algorithm to identify related traits, their shared genetic basis, and the molecular mechanisms behind them. The algorithm is called GEMOT and uses a three-step approach to sift through a large amount of data. Oren et al. tested GEMOT using a database of 1738 documented traits—including diseases and behaviors—in laboratory mice.

Oren et al. identified many clusters of traits in the mice and the underlying genetic and molecular mechanisms that link them. For example, they found that a mutation in a gene called Klf7 affected the expression of other genes that are involved in making new cells in the bone marrow. In turn, these changes influenced 17 different behaviors in the mice after they were injected with the painkiller morphine.

In humans, the same genes that underlie behaviors related to morphine treatment have been linked to the survival rate of patients with a form of brain cancer. This suggests that—alongside providing pain-relief—morphine may influence how the tumor grows. The algorithm developed by Oren et al. can now be used to further explore the impact of the environment on the relationships between traits.

DOI: http://dx.doi.org/10.7554/eLife.04346.002

Introduction

Epidemiological and clinical research has identified a profusion of correlated physiological traits, as well as a remarkably high incidence of co-occurrence and comorbidity among disorders. Various studies have shown that such connections among diseases are typically attributable to a common underlying genetic or molecular mechanism (Rzhetsky et al., 2007; Oti et al., 2008; Barabási et al., 2011; Cotsapas et al., 2011; Lee et al., 2012; Cross-Disorder Group of the Psychiatric Genomics Consortium, 2013). Disclosure of unexpected relationships among disease phenotypes and understanding of their common mechanisms opens the way to improved disease classification and treatment. In particular, it may allow a drug approved for one disease to be used for the treatment of another disease, and provide us with the means to anticipate undesired off-target effects (e.g., Ashburn and Thor, 2004; Dudley et al., 2011).

Advanced computational methods have made it possible to study the mechanisms underlying trait connections in an unbiased manner. One approach is to derive trait connectivity based on trait–trait comorbidity, co-occurrence, and correlations (Hidalgo et al., 2009; Shi et al., 2010; Blednov et al., 2012). As an example, Figure 1—figure supplement 1A illustrates a sample network and Figure 1—figure supplement 1B depicts a group of correlated traits in this network. Relying entirely on trait information, however, makes it difficult to identify the shared mechanisms and to distinguish shared molecular mechanisms from shared environmental influences. Alternatively, a common way to improve predictions is by integrating relationships between genes and traits, using gene–trait correlations, associations, or causal mutations (Rzhetsky et al., 2007; Cotsapas et al., 2011; Baker et al., 2012; Hwang et al., 2012; Gat-Viks et al., 2013). Such pairwise gene–trait connections were used to construct two-layer clusters (‘biclusters’) consisting of groups of traits linked to the same group of genes. For example, Figure 1—figure supplement 1C depicts a bicluster for the sample network in Figure 1—figure supplement 1A. Notably, although such ‘gene-based’ approaches provide a list of putative non-environmental mechanisms, their utilization has two major drawbacks. First, these approaches assign a single mechanistic layer whereas in fact what is affected by genetic variation is a number of molecular components (such as transcripts), which affect the physiological traits secondarily; thus, the model should include a series of mechanistic layers and the propagation of information between them. Secondly, gene-based approaches do not distinguish between reactive, independent, and causative relationships, whereas molecular components should be related causatively to the group of traits. For example, although transcript g2 is reactive to the p4 trait but does not cause it (Figure 1—figure supplement 1A), it is still part of a two-layer model (Figure 1—figure supplement 1C). Thus, a valid and reliable methodology for understanding similarities among distinct traits should identify a series of layers and causative relationships.

We have developed GEMOT, a methodology for constructing a causative model of trait–trait connections. GEMOT addresses the above challenges by constructing three-layer modules in which each module consists of a group of molecular mechanisms (here, gene transcripts within a ‘transcripts layer’) translating between a genetic layer and a layer of traits. In particular, a GEMOT module is focused specifically on causative, non-environmental relationships rather than on relationships of any kind, and accordingly it includes only transcripts that are part of causative relationships (referred to as ‘driver transcripts’, see Figure 1A, Figure 1—figure supplement 1D). Naively, systematic identification of GEMOT modules could be obtained by a detailed reconstruction of all relationships among variants, traits and transcripts (e.g., Neto et al., 2010; Hageman et al., 2011; Wang and van Eeuwijk, 2014). However, such a detailed reconstruction is a major computational and statistical burden, especially considering the large number of components. This problem is avoided in GEMOT by the use of a stepwise heuristic approach that drastically reduces the search space and allows scalability to large networks.

Figure 1. Schematic illustration of the GEMOT algorithm.

(A) An overview of GEMOT for the scenario depicted in Figure 1—figure supplement 1A. GEMOT incorporates 3 stages (I, II and III) that are schematically described in BD, EF and GI, respectively. Stage I: (B) High and low link-potential scores for pairs of variants and traits. A typical calculation for variant–trait pairs (left: v1p1, right: v1p2). Shown are variant–transcript associations (y-axis) and transcript–trait correlations (x-axis) for each transcript (black dots). GEMOT uses a threshold (horizontal dashed line) to compare the distribution of transcript–trait correlation in high and low transcript–variant associations scores. Link potential is high (left) when the distributions of correlation differ between the high and low association range more than expected by chance; link potential is low (right) where no difference is observed. Notably, a high link potential reflects the potential that some transcripts may translate between a variant and a trait, although such transcripts are not yet specified. (C) Bipartite graph construction. GEMOT constructs a graph whose two parts are variants (squares) and traits (diamonds); edge weights are the link-potential scores (solid lines, high; dashed lines, low). (D) Bipartite module identification. GEMOT identifies ‘heavy’ biclusters in the bipartite graph (in this example, 1 module). Stage II: (E) Transcript link score. The input is provided by all calculated correlation and association scores (such as the three plots on the left). On the right: given a transcript, GEMOT aggregates and ranks all its scores in a horizontal track (red, correlations; blue, associations) and uses the distribution of ranks to score the transcript for significance. (F) Tripartite module construction. GEMOT adds high-scoring transcripts from E, referred to as ‘candidate transcripts’ (circles), to the module. Stage III: (G) Causality p value scores. GEMOT uses a statistical score to assess causative relationships (blue, significant; white, non-significant) for each transcript (row) and trait (column) in a module. Non-causative relationships attain non-significant scores (cartoon examples on the left). (H) Module refinement. Starting with the causality p value scores for the tripartite module (from G), GEMOT eliminates non-causative transcripts (left) and non-affected traits (right) in an iterative manner. (I) The resulting GEMOT module (arrows, causative relationships).

DOI: http://dx.doi.org/10.7554/eLife.04346.003

Figure 1.

Figure 1—figure supplement 1. An example of GEMOT methodology compared to alternative methods.

Figure 1—figure supplement 1.

(A) A sample group of related traits and their related mechanisms. The illustration exemplifies a scenario of two causative gene transcripts (g1 and g3) translating between a certain genetic variant (v1) and 2 traits (p1, p4). An additional transcript g2 is reactive to trait p4 and an additional trait p3 is affected by v1 independently of any driver transcript. Traits p4 and p6 are influenced by the same environmental factor. (B) Trait-based grouping methods, applied on the data from A. All four traits are grouped according to either shared environmental factors or shared molecular factors, but the underlying mechanisms are not specified. (C) Gene-based methods, applied on the data from A. The model consists of a group of traits (p1, p3, p4) that share enriched relationships to a group of transcripts (v1, g1g3). Although environment-based relationships are purposely omitted here, the model still includes various types of relationships among transcripts and traits. (D) Our 3-layer GEMOT methodology, applied on the sample shown in A. The GEMOT module consists of causative transcripts called ‘driver transcripts’ (g1 and g3, middle) translating between a certain genomic interval (variant v1) and a group of traits affected by the drivers (p1, p4). The model avoids components that are related only through reactive or independent relationships (p3, g2) or by environmental factors (p6). Diamonds, traits; squares, genetic variants; circles, transcripts; empty circles, driver transcripts.

Figure 1—figure supplement 2. Evaluation of methods for identifying (broad sense) causative relationships.

Figure 1—figure supplement 2.

(A) Causal models M1–M5. (B) Shown is an error matrix, where each cell i, j, represents whether prediction in row i is TP, TN, FP or FN, when the data were generated using the model in column j. (C and D) AUC (top) and FDR (bottom) scores (y-axis) when using synthetic models of types M1–M3 (C) or when using synthetic models M1–M5 (D). Scores are presented across varying levels of σ (x-axis, using fixed λ = 4; left panels) or across varying values of λ (x-axis, using fixed σ = 0.6; right panels). Presented are four methods (color coded): an AIC-based method (Lee et al., 2009), parametric and non-parametric QTLHot (Neto et al., 2013), and its modification in GEMOT (‘Materials and methods’). The results indicate that GEMOT performs similarly or better than the existing methods in identifying broad-sense causality.

We applied GEMOT to a large dataset of 1738 traits spanning a broad spectrum of physiological, biochemical, clinical, and behavioral traits that were measured across the genotyped BXD mouse recombinant inbred strains (a cross between the C57BL/6J and DBA/2J strains [Wang et al., 2003]). We used measurements of transcript levels in bone marrow-derived myeloid cells (Gerrits et al., 2009). The modules were used to uncover shared driver transcripts underlying collections of closely related or distinct traits. Notably, many of the findings were supported by independent knowledge or data. We also demonstrated the tissue specificity of modules, based on a post-processing analysis of gene expression in additional tissues. Our study highlights the power of causative reconstruction combined with grouping of complex traits to reveal a comprehensive picture of phenome connections.

Results

The GEMOT algorithm

Global identification of traits that share common causal transcripts and genetic mechanisms requires a reliable reconstruction of a global causal network among variants, transcripts and traits—a notoriously difficult problem, particularly in the case of a large collection of traits and high throughput data. We designed GEMOT, a scalable algorithm for the systematic construction of three-layer modules, each consisting of a group of traits, their shared causal driver transcripts and a genetic variant. GEMOT is based on the notion of ‘linked relationships’ between a variant, a transcript, and a trait. Such relationships incorporate a transcript that is both associated with a variant and correlated with a trait, without a direct evaluation of the causative relationship between the three components. In particular, it relies on the observation that causative relationships are expected also to be linked relationships (but not vice versa). It is therefore possible to start by constructing candidate modules based on the potential of variants and traits to be linked through transcripts. The internal structure is then constructed within each of these modules. Notably, the linked relationships are exploited to avoid global reconstruction of the particular types of relationships, which are then confirmed only at the validation stage.

GEMOT's input is a collection of traits, genotyping, and molecular data across a population of individuals. GEMOT consists of three main stages (see ‘Materials and methods’, Figure 1A). In stage I, GEMOT identifies ‘bipartite modules’ consisting of a single genomic interval and multiple traits that are connected by linked relationships through certain transcripts (e.g., traits p1, p3, p4 and variant v1 in Figure 1A, left). In stage II, candidate transcripts are added to the modules solely on the basis of their linked relationships (e.g., candidate transcripts g1g3, Figure 1A, middle); the resulting ‘tripartite modules’, however, are not limited to causative relationships. Finally, in stage III, GEMOT refines the tripartite modules by investigating the causal relationships within them and eliminating non-causative transcripts. The output GEMOT module, therefore, finally consists of the validated driver transcripts, the trait(s) they affect, and a single causal genomic interval (e.g., driver transcripts g1, g3; traits p1, p4; variant v1 in Figure 1A, right). Overall, each of the first two stages is aimed at filtering relevant objects for the next stage: candidate modules are selected on the basis of their potential to include candidate transcripts (stage I), and candidate transcripts are selected on the basis of an efficient score that is expected to be high in true driver transcripts (stage II). The final stage (stage III) is aimed at validating the causative relationships in the candidate modules from the previous stage. In the following we provide additional details about the three GEMOT stages.

  • I. We start by calculating associations between each transcript and each variant (a ‘variant–transcript association’) and correlations between each transcript and each trait (a ‘transcript–trait correlation’). We combine these two measures in a statistical test to identify variant–trait pairs that have high potential to be linked through many transcripts. We call this scoring scheme a ‘link potential’ (Figure 1B). From these link-potential scores we construct a bipartite graph whose two parts are variants and traits, where edge weights are the link-potential scores (Figure 1C). We use a biclustering approach (the REL algorithm; [Gat-Viks et al., 2010]) to identify within this graph heavy ‘bipartite modules’, each consisting of a single genomic interval (harbouring a consecutive list of variants) and a collection of traits (Figure 1D). Importantly, such bipartite modules do not represent pleiotropy in general, but only pleiotropy that is likely mediated through transcripts.

  • II. We next apply a statistical test to identify transcripts whose linked relationships in the module are higher than expected by chance. This is done by evaluating the correlations and associations of transcripts with the module's traits and genomic interval respectively, computing ‘transcript link scores’, and using it to filter promising ‘candidate transcripts’ (Figure 1E). We then add the candidate transcripts to the bipartite modules, thus forming ‘tripartite modules’ (Figure 1F).

  • III. In the third stage the aim is to investigate the internal relationships within a module, thus allowing identification of driver transcripts and their affected traits. Recent methods allow structural reconstruction of a causality network (e.g., Neto et al., 2010; Hageman et al., 2011; Wang and van Eeuwijk, 2014), and can therefore be applied on each module to reveal its internal structure. However, since such methods may fail owing to a lack of scalability to large networks, we use an alternative approach that aims only to identify the relationships among layers, without the need to infer the causative relationships within each of the layers. To that end, we devise the following 2-step procedure: We first assess the causality among all candidate transcripts and traits in a module, assuming a single representative variant for the module's genomic interval (Figure 1G). We use a ‘causality p value’ score to assess the quality of such causative relationships. Next, the modules are refined by the iterative removal of transcripts and traits whose causality p values are non-significant (Figure 1H). The output is a list of ‘GEMOT modules’, each consisting of a single genomic interval, a group of validated ‘driver transcripts’, and their affected traits (Figure 1I).

Notably, the GEMOT algorithm is a unified pipeline that integrates several independent procedures, including biclustering, causality testing and network reconstruction. In this study we use the ReL biclustering algorithm (Gat-Viks et al., 2010); the causality testing proposed by Neto et al. (2013); and a tailored network reconstruction scheme. However, the GEMOT pipeline is general and can be applied using alternative procedures (e.g., biclustering [Tanay et al., 2002]; network reconstruction [Neto et al., 2010; Hageman et al., 2011; Wang and van Eeuwijk, 2014]; causality testing [Lee et al., 2009; Neto et al., 2013]).

Application of GEMOT in phenotypically diverse recombinant inbred mouse strains

We applied GEMOT to study phenome connections using gene expression in myeloid Gr-1+ cells (Gerrits et al., 2009) and 1738 traits across recombinant inbred BXD mice (see ‘Materials and methods’). Using GEMOT, we found 40 bipartite modules, 11 tripartite modules, and 8 mature GEMOT modules with non-overlapping sets of drivers (permutation-based FDR < 10−3, ‘Materials and methods’; see Figure 2A, Figure 2—figure supplements 1, 2 and Supplementary file 1A,B). For comparison, no GEMOT modules were found in 100 permutation tests, and the average number of bipartite and tripartite modules in 100 permutation tests was 32 and 0.06, respectively (Figure 2—figure supplement 2). As expected, the observed link potential scores in mice were much higher than would be expected by chance (p < 10−10, permutation test; Figure 2—figure supplement 3 and ‘Materials and methods’). It is highly unlikely, therefore, that our GEMOT modules were generated at random (p < 0.01). Notably modules nos. 1–3, 6–8 show strong correlations between traits, whereas the remaining modules show moderate or weak trait–trait correlations (Figure 2—figure supplement 4).

Figure 2. GEMOT modules in BXD mouse strains.

(A) Shown is a module identifier (column 1), its genomic interval (column 2), the numbers of driver transcripts and traits in the module (columns 3 and 4, respectively), and the main characteristic of its traits (column 5, see description in Supplementary file 1A). (B) Global visualization of the GEMOT modules. The graph presents the genomic intervals (squares, bottom), the transcripts (circles, middle) and traits (diamonds, top) of all eight resulting GEMOT modules. The transcripts, which are unique to each module, are connected to their module's variants and traits. Sets of traits known to be related to the same process are enclosed in a box and labeled (top). The module's genetic and transcripts layers are color coded as in A; the traits are color coded based on their gender: female (white), male (gray) or both (black). Notably, some modules have overlapping collections of traits, or their traits relate to the same process.

DOI: http://dx.doi.org/10.7554/eLife.04346.006

Figure 2.

Figure 2—figure supplement 1. Linkage disequilibrium.

Figure 2—figure supplement 1.

Scatter plots of linkage disequilibrium (correlation coefficient, y-axis) between all variants in the genome (x-axis) and the single representative genetic variant of a single module (marked by arrows). The analysis was applied only on those BXD mouse strains that were profiled in the myeloid gene-expression dataset (Gerrits et al., 2009) and were therefore used by the GEMOT algorithm in this study. Notably, for each module, none of the genomic positions outside its genomic interval was in absolute linkage disequilibrium that is larger than 0.75. Within the module's genomic interval, linkage disequilibrium reached 1, as expected.
Figure 2—figure supplement 2. Application of GEMOT on real and on permuted data.

Figure 2—figure supplement 2.

(A) Overview of the GEMOT algorithm applied on real data (in myeloid tissue) compared to permuted data (see ‘Materials and methods’). (B) A larger number of modules for real data than for permuted data. The histogram shows numbers of modules in the permuted dataset (x-axis, 100 repeats). Numbers of modules in the real data are indicated by gray arrows. The three plots record the numbers of modules that were constructed during stages I to III of the GEMOT algorithm (left to right, respectively).
Figure 2—figure supplement 3. Higher link potential scores for real data than for permuted data.

Figure 2—figure supplement 3.

Shown is a histogram representing the frequency (y-axis) of different link-potential ranges (x-axis) generated on the basis of real data (myeloid tissue, gray) compared to permuted data (black; see ‘Materials and methods’).
Figure 2—figure supplement 4. Trait- trait correlations in GEMOT modules.

Figure 2—figure supplement 4.

(A) Shown is the average trait–trait absolute correlation coefficient (y-axis) in each of the GEMOT modules (x-axis). (B). Matrices of trait–trait Pearson correlation coefficient (blue/red scale indicates negative/positive correlations) for three exemplified modules: modules nos. 2 (left), 3 (middle) and 4 (right).
Figure 2—figure supplement 5. Characterization of GEMOT modules in BXD mouse strains.

Figure 2—figure supplement 5.

(A and B) Evaluation of pairs of traits predicted to be in the same group based on the GEMOT methodology (blue), the trait-based hierarchical clustering approach (‘HC’, green), the gene-based INVAMOD method (red), and random trait pairs (black). Results are presented for GEMOT's 40 bipartite modules compared to the top 40 predictions from each of the existing methods. (A) Absolute correlation coefficients between the two traits in a pair (x-axis) vs their best average absolute-correlation with a gene transcript (y-axis). (B) Percentages of matched biological annotations (terms) within the description of 2 traits in a pair (y-axis) vs their trait–trait absolute correlation (x-axis). The higher the percentage of matched annotation terms, the higher the agreement between trait-pairing predictions and prior knowledge. The plots were generated using the moving average of a window of 200 trait pairs. In both cases, GEMOT outperformed the other methods along a wide range of trait–trait correlation values. (C and D) Evaluation of pairs of traits predicted to be in the same group, presented as in plots A and B but using the 8 GEMOT modules and eight top-scoring groups from each of the existing methods. As with the 40 bipartite modules (in plots A and B), GEMOT outperformed the other methods along a wide range of trait–trait correlation values. (E) Correlations between trait pairs. Shown are the absolute correlation coefficients between trait pairs from modules of different grouping methods (left to right: random, GEMOT, HC, and INVAMOD; x-axis). Results are presented for GEMOT's 40 bipartite modules (left) and 8 GEMOT modules (right); in each plot we use the same number of top-scoring groups from each of the compared methods. In each box the central mark is the median; edges are the 25th and 75th percentiles; whiskers extend to the 10th and 90th percentiles. The plot highlights the wide range of trait–trait correlations generated by GEMOT and the gene-based methods.

Given that our findings were significant, we next aimed to obtain a global perspective on the resulting modules. To that end we generated a graph of the modules, where the transcripts layer of each module is connected to the module's traits and genomic interval (Figure 2B). On this graph we marked groups of similar traits, such as different behavioral and physiological responses after ethanol stimulation, or the results of different thermal nociception tests. Notably, the trait collections of the different modules are partly overlapping. For example, five of the modules relate to morphine responses (nos. 2, 5–8) and two modules relate to thermal nociception (nos. 3 and 5). Nevertheless, although some of the modules have overlapping traits, there is no overlap between the drivers and variants of the different modules (Figure 2A and Supplementary file 1B), demonstrating GEMOT's ability to predict several distinct underlying mechanisms for the same collection of traits. Furthermore, a focused examination revealed that the traits of each of the modules shared a unique characteristic. For example, module nos. 6 and 2 relate to place preference following morphine injection, but at distinct time intervals (Figure 2); similarly, module nos. 1 and 3 relate to an anxiety assay, but with and without ethanol stimulation, respectively. Taken together, our results indicated high-level organization of overlapping collections of traits, while each module reflects a unique molecular and genetic signature that underlies a different trait characteristic.

Notably, some of the resulting modules consisted of multiple traits that are related to the same process, whereas others consisted of a collection of distinct traits (Figure 2 and Supplementary file 1B). For example, a module related to morphine response (module no. 2) consists of 17 different traits that were measured following treatment of mice with morphine at different time points and in various behavioral assays. Similarly, a module related to an anxiety assay (module no. 1) consists of two different traits that were measured following treatment with ethanol at different time points. In contrast, and consistently with our goal of identifying novel relationships among traits, module nos. 3, 4 and 5 suggest previously unknown connections between traits.

We next characterized pairs of traits within each group of traits (‘trait pairs’) to show that the quality of these pairs is not lower than in existing methods. We focused on three main properties of trait pairs: the correlation among traits in a pair; the correlation between a trait pair and the transcripts; and the knowledge-based relationships among traits. As a reference we demonstrate these properties for modules that were generated using three alternative methods: (i) the trait-based hierarchical clustering approach (denoted ‘HC’; [Hastie et al., 2009], as in Figure 1—figure supplement 1B); (ii) the gene-based INVAMOD algorithm, which identify pleiotropic genetic variants and their associated groups of traits in an agglomerative manner (Gat-Viks et al., 2013; as illustrated in Figure 1—figure supplement 1C); and (iii) a set of randomly sampled trait pairs (5% of all possible trait pairs). In particular, the bipartite modules were compared to the top 40 groups from each method (Figure 2—figure supplement 5A,B). Similar results were obtained when the eight GEMOT modules were compared to the top eight groups from each of the alternative methods (Figure 2—figure supplement 5C,D). For each property, we first explain the metric of evaluation and then present the results with GEMOT and with the alternative methods.

Correlation among traits

GEMOT's correlations among trait pairs were much stronger than expected by chance (p < 10−200), and were comparable to those obtained by the gene-based INVAMOD approach but weaker than those from the trait-based HC approach (p < 10−160; Figure 2—figure supplement 5E). This result was consistent with the fact that HC, but not GEMOT or INVAMOD, directly optimizes for such correlations.

Shared transcripts

For each trait pair we searched for a potential shared transcript that showed the best correlation (on average) with both traits. We found that GEMOT's predicted trait pairs were supported by the best-correlated transcripts across a wide range of trait–trait correlations (Figure 2—figure supplement 5A,C). As expected, in all methods the higher the correlation between traits, the stronger the correlations with the shared transcripts. This analysis indicated that GEMOT outperforms the gene-based and random methods (p < 10−30 and p < 10−50, respectively) and is comparable to the trait-based method.

Knowledge-based relations among traits

We next aimed at determining whether trait pairs predicted by GEMOT were supported by previously known connections between traits. Evaluation of such distinct trait pairs in light of previously known trait connections is a general problem, and no suitable systematic annotation is currently available. To tackle this problem we constructed an unbiased matched-annotation set of connections, and used it in our analysis as the gold standard. To systematically cover, independently of the correlation and co-association measures, the qualitative knowledge about connections among traits, we adopted the descriptive title of each trait in the GeneNetwork Phenotype Database (Wang et al., 2003). If the descriptions of two traits included at least one shared biological term, the traits were considered as ‘matched-term traits’ and were included in our gold standard set. Using this matched-annotation set, we found that all methods attained a similar proportion of matched-term trait; the higher the correlation between traits—the higher the proportion of matched terms, as expected (Figure 2—figure supplement 5B,D).

In the following we demonstrate three reconstructed modules, demonstrating GEMOT's ability to identify a model for a collection of tightly related traits, and for a previously uncharacterized combination of traits.

Discovery of a model for a collection of tightly related traits

The morphine module (module no. 2, see Figure 3) exemplifies the ability of GEMOT to suggest an underlying mechanism for an entire group of traits known to be tightly related. The module consisted of a collection of 17 behavioral assays in the recombinant mouse strains, all carried out to measure their responses to injection of morphine (50 mg/kg, over different periods of time). Measured parameters included distance traveled, place preference, and vertical activity ([Philip et al., 2010]; Supplementary file 1A). All module traits showed a strong positive correlation with each other (|r| values ranged from 0.6 to 0.99, Figure 2—figure supplement 4) and shared similar peaks within the genomic interval of module no. 2 (Figure 3A, top).

Figure 3. Characteristics of module no. 2.

(A) Genetic associations. Shown are the association scores (y-axis) across the genomic positions of chromosome 1 (x-axis) for four module traits (top) and for seven selected drivers (bottom) in myeloid cells. The position of the Klf7 gene is marked below the x-axis. (BE) Characterization of driver groups I and II. (B) A matrix of traits (rows) vs drivers (columns), where the blue/red scale indicates negative/positive Pearson correlation coefficients among them. (C) The transcript causality p value scores (y-axis) are shown for each module transcript (log scale), assuming a representative variant in the module's genomic interval (rs13475891 in chr1:62 Mbp). (D) The histogram represents the Pearson correlation coefficient (y-axis) of Klf7 with the remaining drivers (x-axis). (E) Genetic effect size of variant rs13475891 on the driver transcripts. The histogram represents the average expression levels of DBA2-carrying individuals minus the B6-carrying individuals (y-axis) for each module driver (x-axis). (F) Distribution of the causality −log p value scores of Idh1 (blue) and of Klf7 (black) on each of the remaining drivers in the module. Causality p values were calculated by positioning Idh1 or Klf7 between the module variant and a driver from this module. (G) Validation of the effect size of Klf7 perturbation of knockdown (left) or overexpression (right) on other drivers in bone-marrow hematopoietic stem cells (y-axis). The ‘effect size’ of perturbation (either knockdown or overexpression) on a certain transcript g is defined as the difference between the (log-scaled) expression of g in normal cells to that in the perturbed cells. In each panel, the first and second columns refer to groups I and II, respectively. **a significant t-test p value for determining whether the mean effect size is different from zero (FDR < 0.06). (H) Scatter plot, where for each driver in group I (a black square) the y-axis shows the transcript causality score (for morphine-response traits in this module; p values), and the x-axis shows the significance of Klf7 knockdown effect on its transcription level (t-test p value). The plot indicates that in group I, drivers with highly significant causality on behavioral responses to morphine were also significantly influenced by Klf7 knockdown. (I) Overall model illustration of module no. 2.

DOI: http://dx.doi.org/10.7554/eLife.04346.012

Figure 3.

Figure 3—figure supplement 1. Characterization of module no. 2 drivers.

Figure 3—figure supplement 1.

(A) Driver–driver correlations in module no. 2. A matrix of drivers (columns) vs drivers (rows), where the blue/red scale indicates negative/positive Pearson correlation coefficients. (B) Causality p value score profiles. A matrix of traits (columns) vs driver transcripts (rows), where the blue/white scale indicates the significance of causative relationships among the trait, the driver, and a representative variant in the module's genomic interval (rs13475891 in chr1:62 Mbp). (C) Relationship between module no. 2 drivers and human glioblastoma survival. The figure depicts the significance of differences in Kaplan–Meier survival plots (y-axis; −log p value) between groups of patients with glioblastoma, where groups are based on their levels of expression of a driver in module no. 2 (x-axis). Black/white indicates grouping based on up-regulation/down-regulation vs intermediate regulation of a driver.
Figure 3—figure supplement 2. Relationships among components in module no. 2.

Figure 3—figure supplement 2.

(A) Left: Klf7 expression levels (y-axis) of all individuals carrying the C57BL/6J allele (left column) and DBA/2J allele (right column) for the representative variant rs13475891 of module no. 2. Right: Scatter plot, where for each individual the y-axis shows the measured level of a morphine-related trait (distance travelled in 0–180 min in males and females) and the x-axis shows the Klf7 expression. (B) Plots are shown as in A, but for the Idh1 transcript. (C) Level of a morphine-related trait (distance travelled in 0–180 min in males and females; y-axis) of all individuals carrying the C57BL/6J allele (left column) and the DBA/2J allele (right column). The plot shows higher levels of the morphine-related trait in C57BL/6J-carrying individuals. (D) Effect size of variant rs13475891 on the traits in module no. 2. The histogram represents average expression levels of DBA/2J-carrying individuals minus the B6-carrying individuals (y-axis) for each module trait (x-axis). The plot indicates a higher level of the morphine-related trait in C57BL/6J-carrying individuals, as exemplified in C.
Figure 3—figure supplement 3. Causality of cis-associated transcripts.

Figure 3—figure supplement 3.

(A) Causality p value scores of Klf7 and Idh1 on several drivers in module no. 2. The histogram represents the causality p-value scores (y-axis) of Idh1 (blue) and Klf7 (black) on the module's transcripts (x-axis). The plot indicates a stronger influence of Klf7 than of Idh1. Causality p-value scores were calculated as −log p values in a causality test (see ‘Materials and methods’), where the expression levels of Klf7 (or Idh1) were positioned between the genetic variant of module no. 2 and the gene expression of another driver transcript in this module. (B) Causality p-value scores (y-axis) of Klf7 (left column) and Idh1 (right column) on drivers in groups I (blue) and II (red), indicating a stronger influence of Klf7 than of Idh1 in both driver groups.
Figure 3—figure supplement 4. Causative relationships of module no. 2 across multiple tissues.

Figure 3—figure supplement 4.

Significance of causative relationships (blue, high; white, low) among traits (rows) vs the expression of driver transcripts (columns) in nine different tissues or cell types (color coded). In all cases a representative variant in the genomic interval of the module was used for analysis of causality (see Supplementary file 1C). Myeloid cells are shown at the top, indicating that causality relationships in this module are observed in myeloid cells, but hardly if at all in other tissues. Source data: 1Gerrits et al., 2009; 2Geisert et al., 2009; 3Mozhui et al., 2012; 4Gatti et al., 2007; 5Alberts et al., 2011.
Figure 3—figure supplement 5. Causative relationships of module no. 2 across time points.

Figure 3—figure supplement 5.

A plot of causality −log p values of Klf7 (y-axis), assuming a fixed genetic variant in the Klf7 gene and various behavioral assays (blue, place preference; green, distance travelled; orange, vertical activity) that were performed at various time points (y-axis) after morphine injection (data from Philip et al., 2010). The plot indicates that the causative influences of Klf7 on the three different behavioral assays act mainly between 30 and 120 min after morphine injection.

Module no. 2 consists of a group of 32 driver transcripts (e.g., Klf7, p35, Lrrk2), all associated with the module's genotype and strongly correlated with each other and with the module traits (Figure 3A, bottom; Figure 3B; Figure 3—figure supplement 1A). For all driver transcripts, the causative relationships were much preferable to the alternative relationships (p value ≤ 0.005, permutation-based FDR < 6 × 10−5, ‘Materials and methods’; Figure 3C and Figure 3—figure supplement 1B). We found two main groups of drivers (Figure 3D,E, Figure 3—figure supplement 1A). The first (denoted ‘group I’) consists of 21 genes whose transcript levels are negatively correlated with the morphine traits. In this group, individuals carrying the DBA/2J allele in the module's variant have higher gene expression values than those of individuals carrying the C57BL/6J allele (e.g., Klf7 in Figure 3—figure supplement 2A). The second driver group (denoted ‘group II’) has the opposite correlation with the morphine traits and the opposite genetic effect (e.g., Idh1, in Figure 3—figure supplement 2B). These observations coincide with the fact that for all module traits, individuals carrying the C57BL/6J-allele have higher trait values (Figure 3—figure supplement 2C,D). Notably, causality p values in group I are more significant than in group II (p < 0.05, t-test; Figure 3C); one possible reason is that the two groups might relate to distinct mechanisms that differ in their causality strength.

The role of the Klf7 gene in morphine module no. 2 is particularly interesting. Klf7 and Idh1 are the only two cis-associated module drivers and their causative role in mediating morphine traits is highly significant. We hypothesized that the cis-associated variation in gene-expression levels may lead to variation in the trans-associated module drivers. To test this hypothesis we used the causality p value score (‘Materials and methods’), but utilized a cis-associated gene (Klf7 or Idh1) as the transcript positioned between the module's genomic interval and another module transcript. Using these scores we found that the causative p values of Klf7 on the remaining module transcripts were substantially more significant than the causative p values of Idh1 on those transcripts (Figure 3F, Figure 3—figure supplement 3A, p < 10−7, K-S test). This finding holds for each of the driver groups I and II (Figure 3—figure supplement 3B), suggesting that Klf7, but not Idh1, likely affects the other module drivers, which in turn affect behavioral activity in response to morphine.

The finding of positive and negative correlations of Klf7 with the drivers in groups I and II is particularly intriguing because it suggests that Klf7 is an activator of group I and a repressor of group II (Figure 3D). To validate the suggested central role of Klf7 in mediating variation in other drivers, we analyzed the influences of knockdown and overexpression of Klf7 on gene expression in bone-marrow hematopoietic stem cells from the C57BL/6J mouse strain (termed Klf7KD and Klf7OE, using three and four biological repeats, respectively; data were taken from [Schuettpelz et al., 2012]). Indeed, whereas knockdown of Klf7 led to a down-regulation of group I transcripts (p < 1.5 × 10−5, FDR < 5 × 10−5, t-test), it had no influence in group II (p > 0.32; Figure 3G, left), in agreement with the predicted role of Klf7 as an activator and repressor of groups I and II, respectively. Furthermore, we found that the more significant the influence of Klf7 knockdown on a driver in group I, the stronger the causative role of the driver on behavioral morphine traits (Figure 3H). Overexpression of Klf7 had the opposite effect, with significant down-regulation of transcripts in group II (p < 0.05, FDR < 0.06, t-test) and even a small increase of transcripts in group I (p < 0.002, FDR < 0.005; Figure 3G, right), consistently with our model. Taken together, the two lines of evidence—both natural and experimental perturbations—indicated that Klf7 is a key driver mediating the effects of additional drivers in groups I and II, which in turn affect morphine response diversity (Figure 3I).

Module no. 2 may affect morphine responses through a variety of mechanisms. For example, the p35/Cdk5 driver directly phosphorylates the opioid receptor (Xie et al., 2009; Pareek et al., 2012), and the morphine adduct MO-GSH is controlled by the Idh1 and Ggt1 drivers (Correia et al., 1984; Kumagai et al., 1990; Muller and Do, 2012). Furthermore, morphine treatment may exert its action through cell migration and cell invasion processes (Gach et al., 2011): the p35 driver, as part of the p35/Cdk5 complex, affects the Rac/Cdc42 complex through PAK inhibition, resulting in altered cell migration (Nikolic et al., 1998), while the Mmp13 driver alters cell migration in response to morphine treatment because of its ability to degrade collagen (Gach et al., 2011; Wang et al., 2013). Both the Klf7 and the Cdk5/p35 drivers activate p27 by expression or phosphorylation (Laub et al., 2001; Smaldone et al., 2004), and p27 in turn affects the Rho GTPases Rac and RhoA, which then alter cell invasiveness and infiltration (Kawauchi et al., 2006). In glioblastoma, for example, the p27/Rho pathway affects infiltration of tumor cells (Ruiz-Ontañon et al., 2013). In agreement with this prediction, using the Rembrandt database (Madhavan et al., 2009) we found that all four top-ranked Klf7-mediated drivers attain significant effects on survival of patients with glioblastoma (Kaplan–Meier p < 1 × 10−11, 1.4 × 10−4, 1 × 10−4 and 0.01 for the four drivers Ntng2, p35, Rexo2 and Lrrk2, respectively; in all cases, FDR < 0.05, Figure 3—figure supplement 1C), supporting the role of module no. 2 in cell invasiveness. Further experimental studies are required in order to test these suggested pathways and search for additional mechanisms.

Our findings in module no. 2 agree well with previous studies showing that several driver genes participate in the morphine response. For example, the Klf7 transcript is up-regulated in response to morphine (being one of the top ten up-regulated genes [Suzuki et al., 2003]). Both the p35 driver and its activated protein Cdk5 were up-regulated in response to acute morphine but down-regulated on exposure to chronic morphine (Ferrer-Alcón et al., 2003). The coregulated Idh1 and Ggt1 drivers (Muller and Do, 2012) are responsible for the synthesis and degradation, respectively, of the reduced form of glutathione (GSH). A conjugated form of morphine and GSH (MO-GSH) attains higher morphine reactivity (Correia et al., 1984; Kumagai et al., 1990) that may alter the morphine responses. However, whereas key roles of Klf7 have been reported primarily in neurons (Bieker, 2001; Laub et al., 2005), here we found a causative effect of Klf7 on behavioral responses to morphine that was specific to myeloid tissue (Figure 3—figure supplement 4). We cannot as yet explain this observation; however, p35/Cdk5-mediated neutrophil secretion (Rosales et al., 2004) and the cytokine-mediated regulation of Klf7 by morphine in lymphocytes (Suzuki et al., 2003) potentially provide an explanation for this tissue specificity. Furthermore, morphine injection leads to a reduction in neutrophil infiltration 30–120 min after treatment (Clark et al., 2007), in agreement with the timing of causative relationships between Klf7 and behavioral assays at 30–120 min after morphine injection (Figure 3—figure supplement 5). Taken together, our results suggest that in vivo behavioral responses to morphine are affected not only by neuronal activity, but also through certain components of the immune system.

Discovery of novel connections among traits

In the following we demonstrate GEMOT's ability to identify a model for previously uncharacterized connections, with either strong (module no. 3) or weak (module no. 4) correlations among traits.

Module no. 3 (Figure 4A) shows the ability of GEMOT to group a variety of distinct traits. This module consists of a genomic interval in chr4:133–142 Mbp and five correlated traits, namely pain response (thermal nociception [Philip et al., 2010]), lens weight (Zhou and Williams, 1999), eye weight (with or without correction for brain weight [Zhou and Williams, 1999]), and ethanol response (place preference [Cunningham, 1995]). All traits were found to be strongly intercorrelated, with |r| values ranging from 0.67 to 0.9 (Figure 2—figure supplement 4). Eya3 and Cd52 were proposed as cis-associated drivers, a suggestion further supported by the known role of Eya3 in eye development (Tadjuidje and Hegde, 2013) and the involvement of Cd52 in pain signaling (Poh et al., 2012). Figure 4—figure supplement 1 shows that the significant causative role of these drivers can be found along the entire myeloid pathway (including stem cells [LinSca-1+c-Kit+], common progenitors of the myeloid and erythroid lineages [LinSca-1c-Kit+], erythroid [TER-119+] and myeloid [Gr-1+] lineages), but not when using data from the lymphoid, eye, or brain tissues. This suggests that Eya3 and Cd52 play a role in pain processes and eye conditions mainly through their functionality in myeloid cells.

Figure 4. Characteristics of module nos. 3 (A) and 4 (B).

Left: Genetic associations. Association scores (y-axis) across the genomic positions (x-axis) for the module's driver transcripts, based on gene-expression data in myeloid cells. Middle: Matrix of correlations among traits (columns) vs driver transcripts (rows); the blue/red scale indicates negative/positive correlation coefficients. Right: Matrix of traits (columns) vs driver transcripts (rows), where the blue/white scale indicates the significance of their causative relationships based on gene-expression data from the myeloid tissue. A histogram depicting transcript causality p value scores is shown as in Figure 3C. A representative variant in the module's genomic interval is assumed (see Supplementary file 1C).

DOI: http://dx.doi.org/10.7554/eLife.04346.018

Figure 4.

Figure 4—figure supplement 1. Causative relationships of module no. 3 in multiple tissues.

Figure 4—figure supplement 1.

Significance of causative relationships (blue, significant; white, non-significant) among traits (columns) vs the expression of driver transcripts (rows) in nine different tissues or cell types (left to right matrices). A representative variant in the module's genomic interval is assumed (see Supplementary file 1C). Notably, causative relationships in this module are clearly observed along the entire myeloid lineage (stem cells [LinSca-1+c-Kit+], common progenitors of the myeloid and erythroid lineages [LinSca-1c-Kit+], erythroid [TER-119+], and myeloid [Gr-1+] lineages), but not in lymphoid tissues or other tissues under study.
Figure 4—figure supplement 2. Causative relationships of module no. 4 in multiple tissues.

Figure 4—figure supplement 2.

Profiles of causality p values across module no. 4, shown as in Figure 4—figure supplement 1. A representative variant in the module's genomic interval is assumed (see Supplementary file 1C). Causative relationships in this module are clearly observed in myeloid tissue, but not in the other tissues under study.

Module no. 4 (Figure 4B) demonstrates GEMOT's ability to identify a group of traits that show weak correlations among themselves, but share the same driver transcripts. The module consists of a genomic interval in chr11:8–19 Mbp and five traits: the measures of two hippocampal structures (volume and age-related lesions) from two distinct publications (Jucker et al., 2000; Martin et al., 2006), locomotor response to cocaine (Philip et al., 2010), number of liver tumors (Lee et al., 1995), and number of haematopoietic stem cells (Liang et al., 2007). The module's traits show weak and moderate correlations, with |r| values ranging from 0.01 to 0.58 (Figure 2—figure supplement 4). Of six suggested drivers, Ythdf2 and Aldh6a1 were predicted as the top drivers (Figure 4B, right panel). The validity of this prediction is supported by the known involvement of Aldh6a1 in brain structure (Marcadier et al., 2013), and is further supported by a recent report (Meyer and Jaffrey, 2014) that N6-methyladenosine (m6A) modification of RNA, whose readers are Ythdf13, causes an altered locomotor response to cocaine. In this module, a significant causative role was found in myeloid cells, but not in brain, eye, liver, or lymphoid tissue (Figure 4—figure supplement 2), suggesting a novel function of myeloid cells in regulating neurobiological and behavioral traits—an intriguing possibility that warrants future investigation.

Systematic evaluation of the GEMOT algorithm using synthetic data analysis

We next investigated the ability of GEMOT to identify the subsets of traits that are caused by the same transcripts. We first focused on studying GEMOT's utility for small sub-networks of co-mapped components, where each sub-network consists of a subset of traits caused by the same transcripts, as well as additional transcripts and traits that are independently or reactively related (Figure 5A). Such sub-networks mimic the tripartite modules that serve as input at the third stage of the GEMOT algorithm. A single synthetic data collection consisted of genotyping, phenotyping, and gene expression for 100 such sub-networks with two characteristic parameters: number of traits and noise level; in all cases we used 100 individuals (‘Materials and methods’). Using these synthetic data, GEMOT performance was compared to that of three alternative network reconstruction methods: Hageman et al., 2011; Wang and van Eeuwijk, 2014 (‘QPSO’); and Neto et al., 2010 (‘QTLNet’) (see parameter selection in Figure 5—figure supplement 2).

Figure 5. Comparative performance analysis on simulated models.

(A) Illustration of a sub-network model, in which all components are mapped to the same genetic variant v1 but not necessarily through the same relationships. In particular, the model includes k + 2 traits ,with k traits p1,..,pk that share the same underlying transcripts g1, g2, g3, and two additional traits pk + 1 and pk + 2 that are affected through other mechanisms (see detailed in Figure 5—figure supplement 1A). (B and C) Accuracy assessments of the synthetic sub-network depicted in A. Accuracy (y-axes) is compared across methods and different data parameters. Results are shown in models of different noise levels (x-axis, log-scaled; B) with either 5 (left) or 7 (right) traits, or over different numbers of traits (x-axis; C) with either a low noise level = 0.5 (left) or a high noise level = 2 (right). The accuracy metric evaluates whether p1,..,pk, but not pk + 1,..,pk + 2, share the same mechanisms, as detailed in Figure 5—figure supplement 1B. Plots depict alternative network construction methods (color coded, see Figure 5—figure supplement 2), indicating that GEMOT has an advantage over existing methods with noise levels ranging between 0.25 and 1, which is the relevant range for the mouse data in this study (see Figure 5—figure supplement 3).

DOI: http://dx.doi.org/10.7554/eLife.04346.021

Figure 5.

Figure 5—figure supplement 1. Accuracy assessment of predicted shared mechanisms in synthetic (co-mapped) sub-networks.

Figure 5—figure supplement 1.

(A) Detailed illustration of the synthetic network ‘Net-1’ (a simplified illustration of the same network is shown in Figure 5A). Among the k + 2 traits, k traits PC={p1C,...,pkC} are jointly affected by the same transcripts. (B) Accuracy assessment is summarized in an error matrix, where each cell i, j shows the prediction in row i for the mechanism of trait pt is TP, TN, FP or FN, while the actual class of trait pt is in column j. The two actual classes are ptPC and ptPC, as exemplified in A.
Figure 5—figure supplement 2. Evaluation of the compared network reconstruction methods.

Figure 5—figure supplement 2.

(A) Accuracy metrics (y-axes, according to the definition in Figure 5—figure supplement 1B) for three sub-network models (Figure 5—figure supplements 1A, 4A; color coded) with noise level 0.5 and 7 traits, shown across different method parameters (x-axes). The plots indicate that for Wang and van Eeuwijk, 2014 (top), Hageman et al., 2011 (middle), and Neto et al., 2010 (bottom), the best performances are attained with τ = 0.1, α = 0.05, and α = 0.25, respectively. These method parameters were used throughout the study. (B) Shown is the running time (y-axis, log scaled) across compared methods (color coded) and different numbers of traits (x-axis). The running time refers to construction of 100 sub-networks of type ‘Net-1’ (as demonstrated in Figure 5A) with noise level 0.5. The reported running time are from a Linux machine with 2.6 GHz AMD Opteron 6238 processors.
Figure 5—figure supplement 3. Relevant range of synthetic data parameters.

Figure 5—figure supplement 3.

Shown are the absolute correlation coefficients between transcripts and traits (A), between transcripts and variants (B), and between traits and variants (C) for different datasets (x-axis). Datasets (from left to right): all traits, transcripts and variants in mouse myeloid data (Gerrits et al., 2009) (first column); the components of module no. 2 (second column); synthetic sub-networks with noise level = 0.5 (third column); and synthetic sub-networks with noise level = 2 (fourth column). In each box the central mark is the median; edges are the 25th and 75th percentiles; whiskers extend to the most extreme data points not considered outliers; and outliers are plotted individually. Notably, synthetic sub-networks with high noise level = 2 resemble the background distribution in mouse data. In contrast, synthetic networks with lower noise levels (noise level = 0.5) show a signal that is higher than the background distribution, similarly to the inter-relations within the desired GEMOT modules.
Figure 5—figure supplement 4. Accuracy assessment in additional sub-networks.

Figure 5—figure supplement 4.

(A) Detailed illustration of two synthetic networks, ‘Net-2’ and ‘Net-3’. (B) Accuracy metrics (y-axes, according to the error matrix in Figure 5—figure supplement 1B) are shown over different numbers of traits (x-axes). Results are shown for Net-2 with noise level = 0.5 (left), for Net-3 with noise level = 0.5 (middle), and for Net-2 with noise level = 2 (right). Plots depict alternative methods of network construction (color coded).
Figure 5—figure supplement 5. Accuracy assessment in a large biological network.

Figure 5—figure supplement 5.

Performance is evaluated for the ability to group traits based on their co-mapping to the same variant (denoted naive grouping, A) and for the ability to identify groups of traits sharing the same causal transcripts (grouping-by-causality, B). (A) Naive grouping evaluation. Left: An error matrix, where each cell i, j shows the predicted class in row i for a trait pt is TP, TN, FP or FN while the actual class of trait pt is in column j. The actual class for a trait pt is either ptP(s) or ptP(s), where P(s) is the collection of all traits in a sub-network s. The predicted classes are defined similarly as ptQ(s) and ptQ(s), where Q(s) is the predicted group of traits that attain the largest intersection with the traits in a sub-network s. Right: The error matrix (from left panel) is used to calculate the accuracy metric (y-axis) across different numbers of traits (x-axis). Plots depict alternative grouping methods (color coded). The lower performance of GEMOT is expected, since it is designed for grouping by causality, rather than a naive grouping of traits. (B) grouping-by-causality evaluation. An error matrix (left) and an accuracy plot that utilizes this error matrix (middle), are presented as in A but for the case of grouping-by-causality. PC(s) is the subset of traits that share the same causal transcripts within a sub-network s, as exemplified in Figure 5—figure supplement 1A. Right: The error matrix (from left panel) is used to calculate the FDR metric (y-axis) across different numbers of traits (color coded) and alternative grouping methods. The plots clearly show that GEMOT's ability to group by causal relationships has an advantage over existing methods. Metrics: Accuracy = (TPR + TNR)/2, FDR = FPa/(FPa + TP) (see ‘Materials and methods’).

Performance was evaluated using an accuracy metric, which reflects the ability of a method to discern the correct subset of traits sharing the same transcripts (e.g., traits p1,...,pk but not pk + 1 and pk + 2 in Figure 5A and Figure 5—figure supplement 1B for details). Figure 5B presents the accuracy for synthetic data collections of varying levels of noise (using the sub-networks from Figure 5A with 5 or 7 traits). GEMOT displayed the best accuracy for noise levels ranging between 0.25 and 1, with lower accuracy for higher noise levels. Analysis of various network properties in both mouse and synthetic data shows that sub-networks with a noise level that do not exceed 1 are more likely to represent real biological modules (Figure 5—figure supplement 3). Unlike the compared methods, GEMOT's accuracy remained high with an increasing number of traits (Figure 5C); similar results were obtained for alternative network structures (Figure 5—figure supplement 4). Overall, in our simulation, GEMOT outperformed the compared algorithms in handling a growing number of traits and in identifying the correct groups of traits when using biologically-relevant parameters. These results do not rule out the possibility that for other tissues, conditions or organisms, utilizing the alternative methods as part of the third stage of the GEMOT algorithm may enhance its performance.

We next aimed to characterize GEMOT's utility for a large biological network that included groups of traits that share the same causal transcripts. Accordingly, each synthetic network included 100 traits, 200 transcripts and 100 variants, featuring five co-mapped sub-networks. A singe data collection consists of 100 networks, each containing five co-mapped sub-networks that carry the same number of traits (‘Materials and methods’). We compared GEMOT to two alternative trait-grouping methods: the trait-based iterative clique enumeration (ICE) approach (Shi et al., 2010) and the gene-based INVAMOD approach (Gat-Viks et al., 2013) (‘Materials and methods’). Notably, network construction methods (e.g., Neto et al., 2010; Hageman et al., 2011; Wang and van Eeuwijk, 2014) could not be compared owing to an unrealistic running time in the case of large networks. The analysis suggested that although all compared methods successfully discern all traits in a sub-network (Figure 5—figure supplement 5A), GEMOT attains higher accuracy in discerning those traits that share the same driver transcripts (Figure 5—figure supplement 5B). Notably, the GEMOT algorithm is tailored for identification of causative relationships, unlike the compared methods, explaining why GEMOT succeeded in discriminating the correct subsets of co-regulated traits.

Discussion

We set out to identify the molecular and genetic mechanisms underlying connections among groups of traits. To that end, we combined module identification (Gat-Viks et al., 2010) with causality testing (Neto et al., 2013) in a unified pipeline that relies on the definition of linked relationships so that candidate modules can be filtered out prior to the validation stage. Our results in mice highlighted three types of high-order organization of traits. (i) Groups of tightly related traits that share the same transcripts mechanisms (modules 1, 2, 6, 7, 8, e.g., Figure 3). (ii) Groups of distinct traits that share the same transcripts mechanism, but not necessarily high correlations among them (modules 3, 4, 5, e.g., Figure 4). (iii) Different groups commonly have overlapping traits, but typically differ in their underlying mechanisms (Figure 2B).

Our study emphasizes the need for methodologies for constructing causative models that underlie connections among traits. Whereas previous trait-grouping methods have used genetic or molecular data separately, and thus did not validate causative transcripts (e.g., Shi et al., 2010; Gat-Viks et al., 2013; as illustrated in Figure 1—figure supplement 1B,C), the GEMOT method aims at directly filtering and validating such relationships. Our simulations showed that GEMOT is superior to these methods in identifying trait groups that share the same underlying transcripts (Figure 5—figure supplement 5B). Another strategy is to use network reconstruction methods to construct groups of related traits (e.g., Neto et al., 2010; Hageman et al., 2011; Wang and van Eeuwijk, 2014). These methods can be applied in the case of either the complete biological network or the sub-networks within the tripartite modules. Whereas these methods are limited in their scalability and may be particularly inefficient when applied on a large number of components, our approach can be scaled to larger networks, but can construct the network only partially. For example, in comparing both running time and accuracy under increased network sizes we found GEMOT to be more scalable than the alternative network construction methods (Figure 5C and Figure 5—figure supplements 2B, 4B).

Our methodology opens the door to a variety of future research directions. One possibility is that GEMOT will be applied on a compendium of molecular data from multiple tissues. In such cases, GEMOT predictions can be used to simultaneously identify both the biological mechanism and its relevant tissue. Second, GEMOT is applicable to a variety of molecular data types in addition to gene-expression data. For example, its application in blood cytokines or plasma lipids (Teslovich et al., 2010) is expected to make it possible to identify molecular factors acting at the cell–cell communication level. Similarly, future extensions of GEMOT may provide the means to include environmental factors as part of the module. Third, monitoring of additional strains would allow discriminating between several alternative genomic intervals for the same module, which may arise due to linkage disequilibrium between chromosomally distinct loci. Fourth, characterization of GEMOT modules that share a similar collection of traits but have different genomic intervals may reflect gene–gene interactions that lead to connections among traits. Finally, GEMOT can potentially be further improved by the construction of internal causative relationships within the transcripts and the traits layers. For example, some drivers may control other drivers, which in turn affect a collection of traits (as exemplified in the case of Klf7 in Figure 3). It should be noted, however, that GEMOT cannot distinguish cases of multiple drivers that are part of the same regulatory circuit from cases of multiple drivers that act through several distinct circuits. Rather, its predictions provide biological or clinical hypotheses for additional experimental investigations.

Overall, our approach paves the way to the simultaneous study of several mechanistic layers underlying connections among traits, providing a multilayered view of phenotypic connections. Because the GEMOT methodology is a general one and can be applied to the study of other taxa, this approach may facilitate our understanding of the molecular mechanisms underlying human disease.

Materials and methods

The GEMOT algorithm

GEMOT is designed to identify three-layer modules in which driver components translate between a single genomic interval and a collection of traits. As input, GEMOT takes three types of objects: (1) a collection of traits across a population of individuals; (2) genotyped genetic variants for these individuals; and (3) high-throughput gene expression data across the same population.

Our algorithm incorporates three stages (Figure 1A). The first stage constructs candidate bipartite modules consisting of a group of traits and a genomic interval. In the second stage, candidate transcripts are added to each module from the previous stage, thus forming tripartite modules. The final stage validates the actual drivers and refines the modules accordingly. The GEMOT code is available at http://csgi.tau.ac.il/gemot/.

GEMOT stage I: construct bipartite modules

In the following we first define the construction of a bipartite graph and then explain the identification of bipartite clusters within this graph as previously described (Gat-Viks et al., 2010). We define a bipartite graph whose two parts correspond to genetic variants and traits, and in which the edges reflect the potential of a variant and a trait to have significant linked relationships (Figure 1C). Edge weights are calculated as follows (Figure 1B): First, for each pair of a genetic variant and a transcript, we evaluate the genetic association between the expression of the transcript and the candidate genetic variant. This yields a ‘variant–transcript association score’. In this study, for the case of homozygous recombinant inbred strains the association score is a −log t-test p value for the different gene-expression values between the strains carrying the two possible variant alleles. For other cases, such as an outbred population, other standard association scores can be applied (Falconer and Mackay, 1996). Secondly, for each pair of a transcript and a trait, we calculate the absolute Pearson correlation coefficient across genetic backgrounds. We term this score the ‘transcript–trait correlation score’. Finally, for each genetic variant and each trait we compare the distribution of transcript–trait correlations in high and low transcript–variant association scores (a statistical t-test). We assign such a t-test p value to five different transcript–variant association cutoffs (the five cutoffs partition the association range into 6 equally sized groups) and record the top −log t-test p value across these five cutoffs. We refer to the recorded −log p values as ‘link potentials’ and use them as edge weights in the bipartite graph.

Within this graph we use the ReL software package (Gat-Viks et al., 2010) to identify significant biclusters (Figure 1D). Briefly, the ReL algorithm starts with a set of seed clusters consisting of one trait and one variant whose link potential exceeds a certain initialization cutoff, cs. A trait or a variant can be included in a cluster if and only if its average link potential exceeds an improvement cutoff ci (here, cs = 180, ci = 90). Each bipartite cluster is subject to iterative improvements by addition or removal of traits and variants based on this cutoff. We refer to the bipartite clusters as ‘bipartite modules’ and further improve them in the following stages. Notably, ReL provides the same results when applied with or without Boneferroni correction for the gene–variant association −log p values and link potentials scores, since the construction of the biclusters is robust to an additive rescaling of these scores.

GEMOT stage II: construct tripartite modules

In this stage a rough list of candidate transcripts is constructed for each module (Figure 1E). To this end, for each transcript in a given module we rank its correlations and associations with all traits and variants in the input dataset and record its ranks of associations and correlations within the module. We next compare the distribution of recorded ranks in this module with the distribution of all ranks. The two distributions are compared using a Kolmogorov–Smirnov (K-S) test, a nonparametric test that may be used to compared two samples. We refer to the K-S p value for a transcript in a module as a ‘transcript link score’. Only transcripts with significant link scores are added to their module. Such transcripts are called ‘candidate transcripts’ and the resulting extended modules are referred to as ‘tripartite modules’ (Figure 1F).

GEMOT stage III: refine module (validate drivers)

In the following we first define the causality test and then describe the procedures for identifying driver transcripts and for module refinement.

Causality testing

The input for a causality test is a triplet of objects—a variant, a transcript, and a trait—each measured across the same population of individuals. Several types of relationships exist, including causative, reactive, and independent, and the goal of causality testing is to identify causative relationships among a given triplet of objects. Our causality analysis here follows the formulation of Neto et al. (2013). In this formulation, a likelihood score is first calculated for the causality model (denoted ‘M1’) as well as for three alternative models (denoted M2–M5, where M4 and M5 are modeled together, Figure 1—figure supplement 2A). A likelihood ratio score is then calculated between each of the three alternative models (as null hypotheses) and the causative model (as an alternative hypothesis), whereas the three respective p values are calculated on the basis of a theoretically derived distribution of the likelihood ratios. The final ‘causality p value’ is the maximal obtained for the three calculated p values.

Importantly, in the present study we introduce two important modifications in the scheme suggested by Neto et al. (2013). First, the distribution of likelihood ratios was evaluated empirically by randomly reshuffling the transcript levels of the candidate transcript and recomputing the likelihood ratio (repeating the procedure 100 times). The causality p value of a given triplet of objects was calculated according to the distribution of these reshuffling-based likelihood ratio scores (rather than using a theoretically derived distribution).

Second, we observed that in a GEMOT module a transcript may have a causal effect on a trait even in the presence of additional transcripts through which the variant influences the trait. Accordingly, in the GEMOT framework, both models M1 and M4 should be considered as causative relationships (a ‘broad’ view of causality), as opposed to the classical definition of causative relationships according to model M1 only (Figure 1—figure supplement 2A). In the present study we therefore utilized the broad-sense causative relationships. We observed that samples of model M4 attained significant likelihood ratios of M1 against models M2 and M3, and we therefore defined the (broad sense) ‘causality p value score’ as the maximal across the p values attained for M1 against M2 or M3. Our simulation study (see ‘Materials and methods’) indicated that GEMOT attains similar performance to that of the compared methods for models M1–M3, but outperforms the existing methods when adding the remaining models (Figure 1—figure supplement 2B–D).

Driver validation and module refinement

Transcripts that are causally related to traits can be identified by constructing a detailed causal network of relationships among all the components in the tripartite module. This, however, is a difficult task in the case of large tripartite modules harboring numerous components. We therefore devised an algorithm to identify the driver transcripts and their affected traits without needing to construct the module's entire network. The algorithm proceeds as follows: For each module, stage III starts with calculation of the causality scores for each pair of candidate transcripts and traits in the module, assuming a fixed single ‘representative variant’ from the module's genomic interval (Figure 1G). In this study, the representative variant was the one that attained the best average association scores across the module's candidate driver transcripts. Next, an iterative module refinement is applied in two steps. In step (i) we reveal the driver transcripts that are causally related to many of the module traits (Figure 1H, middle). In particular, we used Fisher's method (Sokal and Rohlf, 1990) to calculate the overall causality p value of a transcript (denoted ‘transcript causality p value’); only those transcripts with significant transcript causality p values were retained. The transcript causality p value is determined based on chi-squared distribution with 2k degrees of freedom: χ2k2(gi)=2j=1,..,kln(pi,j), where gi is the transcript; j = 1,...,k are the module traits; and pij is the causality p value for the transcript gi and a trait j assuming the module's representative variant. Next, in step (ii) those traits that do not have at least one transcript with a significant causality score are filtered out from among the transcripts that were selected in step (i) (Figure 1H, right). This process is repeated iteratively until convergence. Thereafter, the retained transcripts are called ‘driver transcripts’ and modules that contain at least one driver are termed ‘GEMOT modules’ (Figure 1I).

Mouse data

All mice data was taken from a previously produced body of work. We applied our analysis to data obtained from homozygous BXD recombinant inbred mouse strains (Peirce et al., 2004) generated by crossing C57BL/6J and DBA/2J inbred strains for many generations. Microarray expression data in myeloid cells across 24 BXD strains have been measured (Gerrits et al., 2009). To identify high-quality candidates we selected 5786 genes whose variation in expression across BXD strains, based on average intensities of the genes, was higher than expected. Expected variance values were calculated using a sliding window along the genes' average intensities. Gene-expression values were log10-transformed and normalized by Z-score normalization. All 2885 traits and 3796 genetic variants across BXD strains were downloaded from the WebQTL dataset (Wang et al., 2003). Trait values were normalized by Z-score normalization. Given the strains in the gene expression and trait datasets, we restricted our analysis to 1738 traits that had at least 15 strains in common. Other compared cell types or tissues (Gatti et al., 2007; Geisert et al., 2009; Gerrits et al., 2009; Alberts et al., 2011; Mozhui et al., 2012) were similarly preprocessed. Supplementary file 1C records the particular representative variants that were used for causality testing in each predicted GEMOT module.

To assess the corresponding false discovery rates, we generated negative controls based on a permutation test in which the transcript levels of each transcript were randomly shuffled and the GEMOT modules were recomputed (a process that was repeated 100 times). In each repeat, a variety of statistics (such as the number of identified modules) were recorded. The permutation-based ‘false discovery rate’ (FDR) is the ratio of the averaged number of statistics that were declared significant using the permuted data to the number of statistics that were declared significant using the original (non-permuted) data. In this study, GEMOT was applied using transcript link score cutoff = 10−95 for identifying candidate transcripts (stage II, permutation-based FDR < 0.09) and transcript causality p value cutoff = 0.005 (stage III, permutation-based FDR < 6 × 10−5).

Causality testing–performance evaluation

To investigate the performance of the causality score we simulated triplets of objects, each consisting of a variant v, a transcript g and a trait p. In all such triplets we assume 100 homozygous individuals. The genotyping of each variant was generated by sampling a vector of values 0 and 1 from a binomial distribution (with p = 0.5). Based on these genotyping values, the values of g and p were generated according to the following five different models (denoted M1–M5), as depicted in Figure 1—figure supplement 2A: M1, a causative model v → g → p; M2, an independent model v → g, v → p; M3, a reactive model v → p → g; M4, v → g → p and v → p; and M5, v → p → g and v → g. Each arrow in these five models was simulated as a linear expression with a normally distributed error term. For example, based on model M1, data were generated as g = αv + ε, p = λg + ε, εN(0, σ2); similarly, model M4 was generated as g = αv + ε, p = αv + λg + ε, εN(0, σ2). A single ‘synthetic collection’ consisted of 250 relationships from each of the five models, a total of 1250 samples. Results are displayed for many different collections, each generated using different combination of λ and σ values with α = 0.5. GEMOT's causality p values were compared to those obtained by two alternative methods: QTLHot (Neto et al., 2013) and an AIC-based method (Lee et al., 2009).

For a given significance threshold we evaluated the ability to identify causal relationships using true positive (TP), true negative (TN), false positive (FP) and false negative (FN) counts, which were defined according to the broad-sense definition of causality (Figure 1—figure supplement 2B). The area under the receiver operating characteristic (ROC) curve (the AUC) was calculated accordingly, where the higher the AUC the better the method. In addition, a balanced false discovery rate (FDR) can be used as a criterion for comparisons of methods, computed as FDR = FPa/(FPa + TP) where FPa accounts for imbalanced data by dividing FP by the ratio between the negative and positive synthetic datasets, calculated as FPa = FP/π0, π0 = (FP + TN)/(TP + FN). The method with lowest FDR is regarded as the best method (when all methods use the same p value cutoff). Notably, GEMOT attains similar performance to that of the compared methods for models M1–M3 (Figure 1—figure supplement 2C), but outperforms the existing methods when adding synthetic M4 and M5 samples (Figure 1—figure supplement 2D).

GEMOT performance analysis

The synthetic data analysis is focused on two simulations with increasing complexity of the input network: (i) sub-network analysis, in which the input is a sub-network where all components are co-mapped to the same variant; and (ii) network analysis, in which the input is a large network comprising several co-mapped sub-networks.

Sub-network analysis

A synthetic co-mapped sub-network consists of one variant v and m transcripts GC={g1C,...,gmC} that are associated with the variant v and are causally related to k traits PC={p1C,...,pkC}, that is, v{g1C,...,gmC}{p1C,...,pkC}. In addition, the sub-network includes a single transcript g1R that is reactive to phenotype p1R (thus, vp1Rg1R) and a pair of a transcript g1I and a trait p1I that are independently affected by v such that vp1I and vg1I. Altogether, a sub-network consists of the triplet (v, P, G), where P={PC,p1R,p1I} and G={GC,g1R,g1I}. In this study we analyzed three sub-network structures, referred to as Net-1 (Figure 5 and Figure 5—figure supplement 1A), Net-2 (Figure 5—figure supplement 4A, left), and Net-3 (Figure 5—figure supplement 4A, right); unless stated otherwise, the results refer to Net-1. In all cases, the total ‘number of traits’ in a sub-network is k + 2. The simulation data was generated by using for each edge aiaj in these sub-networks a linear coefficient β such that aj = βai + ε(β), ε(β) ∼N(0, σ2), σ = qεβ, where qε represents the relative proportion of noise referred to as the ‘noise level’. Observe that ε(β) depends on both the linear coefficient of the relationship β and on the noise level qε. We generated ‘synthetic collections’ of 100 sub-networks (100 individuals in each case); each collection is constructed for a given network structure (sub-networks Net-1, Net-2, Net-3), a certain total number of traits (k + 2 = 5, 7, 12, or 17 traits) and a certain noise level (qε = 0.25, 0.5, 1, 2, 4). In all cases we used β = 1.5 for relationships vp and vg and β = 0.6 for relationships gp and pg.

We compared GEMOT's performance on co-mapped sub-networks to that of alternative network construction algorithms, including the QTLNet method (Neto et al., 2010), the QPSO methodology (Wang and van Eeuwijk, 2014), and a Bayesian construction proposed by Hageman et al. (2011); for all methods, we used the parameters that gave the best results for the tested networks (Figure 5—figure supplement 2A). We evaluated the performance of each method using the true positive (TP), true negative (TN), false positive (FP) and false negative (FN) counts as defined in Figure 5—figure supplement 1B, which depicts whether the network construction correctly identified shared transcripts for the traits in PC. Relying on these counts, the balanced accuracy score was calculated as (TPR + TNR)/2 (here, TNR = TN/(TN + FP) and TPR = TP/(TP + FN)). AUC could not be computed because the three compared approaches do not provide a measure of statistical significance.

Network analysis

The synthetic network consisted of two parts: first, five co-mapped sub-networks s1,...,s5 of the form (v(sL), P(sL), G(sL)), and second, the remaining network comprising additional transcripts, variants, and traits. In total, the network consisted of 100 variants, 100 traits, and 200 transcripts that were used to generate synthetic data across 100 individuals. The data for the five sub-networks were generated as in the sub-network analysis (‘Materials and methods’), using varying numbers of traits. The remaining data were generated as follows: (i) genotyping of each variant was generated independently, as described in the causality-testing simulation (‘Materials and methods’); (ii) for each trait and individual, data values were sampled from a standard normal distribution; and (iii) gene-expression data were generated while maintaining the correlation between transcripts of the non-module components and between them and the module component. More specifically, for each individual i and transcript j that are not in a sub-network, the gene-expression data zij were generated in two steps: first, to maintain correlation among all non-sub-network transcripts we sampled the real data in murine myeloid cells (data taken from Gerrits et al., 2009) to generate a 200 × 200 covariance matrix, and then used this matrix to generate synthetic data values xij which approximately have the same covariance matrix. Next, to improve the correlation of the xij values with at least one co-mapped sub-network, we calculated the gene-expression data as zij = xij + cyik, where yik is the synthetic gene expression for an individual i and the (arbitrarily selected) kth transcript in the sub-network (in all cases, c = 1).

The synthetic network was used to test the ability of GEMOT to identify the groups PC(su) (here, u = 1,..,5), which are the groups of traits with the same causative mechanism behind them. Furthermore, we tested an existing trait-based approach (Iterative Clique Enumeration (ICE); Shi et al., 2010) and a gene-based approach (INVAMOD, Gat-Viks et al., 2013). Two different measures were used for performance evaluation: first we evaluated grouping of traits based on their co-mapping to the same variant (Figure 5—figure supplement 5A). Secondly, we evaluated the ability of each method to identify groups of traits sharing the same causal transcripts (Figure 5—figure supplement 5B).

Acknowledgements

We thank Prof. Marcelo Ehrlich for valuable discussions and comments, and Avital Brodt for artwork. This research is supported by the Israeli Centers of Research Excellence (I-CORE): Gene Regulation in Complex Human Disease, Center No. 41/11 (YO, RW); Israel Science Foundation (ISF) fund no. 1643/13 (YO, RW), ISF-Broad fund no. 1168/14 (AN, AF), and the Edmond J Safra Center for Bioinformatics at Tel-Aviv University (IGV, RW). IGV is a Faculty Fellow of the Edmond J Safra Center for Bioinformatics at Tel Aviv University and an Alon Fellow.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grants:

  • Israel Science Foundation (ISF) 1643/13 to Yael Oren, Roni Wilentzik.

  • Israeli Centers for Research Excellence (I-CORE) to Yael Oren, Roni Wilentzik.

  • Tel Aviv University Edmond J. Safra Center for Bioinformatics to Roni Wilentzik, Irit Gat-Viks.

  • Broad Foundation 1168/14 to Aharon Nachshon, Amit Frishberg.

Additional information

Competing interests

The authors declare that no competing interests exist.

Author contributions

YO, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

AN, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

IG-V, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

AF, Analysis and interpretation of data, Drafting or revising the article.

RW, Analysis and interpretation of data, Drafting or revising the article.

Additional files

Supplementary file 1.

(A) Traits in the GEMOT modules of the BXD mouse strains. Shown is a module identifier (column 1) and details about a trait within it, including the trait title (column 2), PubMed record of the publication from which it is taken (column 3), the trait index in the GeneNetwork database (Wang et al., 2003; columns 4), and a short title of the trait as indicated in Figure 3B (column 5). (B) Drivers in the GEMOT modules of the BXD mouse strains. Shown is a module identifier (column 1), and details about a driver within it, including the gene symbol (column 2), its entrez identifier (column 3), genomic position (column 4), the type of association in the module (column 5), and its gene causality p value score (column 6). (C) Representative variants for causality testing in the tripartite modules of the BXD mouse strains. Shown is a tripartite module identifier and its GEMOT module identifier, if relevant (columns 1 and 2, respectively), the genomic position of the module's genomic interval (column 3), and the name and position of the representative variant (columns 4 and 5, respectively).

DOI: http://dx.doi.org/10.7554/eLife.04346.027

elife04346s001.doc (231KB, doc)
DOI: 10.7554/eLife.04346.027

Major datasets

The following previously published datasets were used:

Gerrits A, Li Y, Tesson BM, Bystrykh LV, Weersing E, Ausema A, Dontje B, Wang X, Breitling R, Jansen RC, de Haan G, 2009, Expression quantitative trait loci are highly sensitive to cellular differentiation state, GSE18067; http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE18067, Publicly available at NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/).

Wang J, Williams RW, Manly KF, 2003, WebQTL: web-based complex trait analysis, http://www.genenetwork.org/webqtl/main.py, BXD Trait Collection. Publicly available at Genenetwork.

References

  1. Alberts R, Chen H, Pommerenke C, Smit AB, Spijker S, Williams RW, Geffers R, Bruder D, Schughart K. Expression QTL mapping in regulatory and helper T cells from the BXD family of strains reveals novel cell-specific genes, gene-gene interactions and candidate genes for auto-immune disease. BMC Genomics. 2011;12:610. doi: 10.1186/1471-2164-12-610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ashburn TT, Thor KB. Drug repositioning: identifying and developing new uses for existing drugs. Nature Reviews Drug Discovery. 2004;3:673–683. doi: 10.1038/nrd1468. [DOI] [PubMed] [Google Scholar]
  3. Baker EJ, Jay JJ, Bubier JA, Langston MA, Chesler EJ. GeneWeaver: a web-based system for integrative functional genomics. Nucleic Acids Research. 2012;40:D1067–D1076. doi: 10.1093/nar/gkr968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barabási AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nature Reviews Genetics. 2011;12:56–68. doi: 10.1038/nrg2918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bieker JJ. Krüppel-like factors: three fingers in many pies. The Journal of Biological Chemistry. 2001;276:34355–34358. doi: 10.1074/jbc.R100043200. [DOI] [PubMed] [Google Scholar]
  6. Blednov YA, Mayfield RD, Belknap J, Harris RA. Behavioral actions of alcohol: phenotypic relations from multivariate analysis of mutant mouse data. Genes, Brain, and Behavior. 2012;11:424–435. doi: 10.1111/j.1601-183X.2012.00780.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Clark JD, Shi X, Li X, Qiao Y, Liang D, Angst MS, Yeomans DC. Morphine reduces local cytokine expression and neutrophil infiltration after incision. Molecular Pain. 2007;3:28. doi: 10.1186/1744-8069-3-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Correia MA, Krowech G, Caldera-Munoz P, Yee SL, Straub K, Castagnoli N., Jr Morphine metabolism revisited. II. Isolation and chemical characterization of a glutathionylmorphine adduct from rat liver microsomal preparations. Chemico-biological Interactions. 1984;51:13–24. doi: 10.1016/0009-2797(84)90016-4. [DOI] [PubMed] [Google Scholar]
  9. Cotsapas C, Voight BF, Rossin E, Lage K, Neale BM, Wallace C, Abecasis GR, Barrett JC, Behrens T, Cho J, De Jager PL, Elder JT, Graham RR, Gregersen P, Klareskog L, Siminovitch KA, van Heel DA, Wijmenga C, Worthington J, Todd JA, Hafler DA, Rich SS, Daly MJ, FOCiS Network of Consortia Pervasive sharing of genetic effects in autoimmune disease. PLOS Genetics. 2011;7:e1002254. doi: 10.1371/journal.pgen.1002254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cross-Disorder Group of the Psychiatric Genomics Consortium Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet. 2013;381:1371–1379. doi: 10.1016/S0140-6736(12)62129-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cunningham CL. Localization of genes influencing ethanol-induced conditioned place preference and locomotor activity in BXD recombinant inbred mice. Psychopharmacology. 1995;120:28–41. doi: 10.1007/BF02246142. [DOI] [PubMed] [Google Scholar]
  12. Dudley JT, Deshpande T, Butte AJ. Exploiting drug-disease relationships for computational drug repositioning. Briefings in Bioinformatics. 2011;12:303–311. doi: 10.1093/bib/bbr013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Falconer DS, Mackay TFC. Introduction to quantitative genetics. 4th edition. Pearson Prentice Hall; 1996. [Google Scholar]
  14. Ferrer-Alcón M, La Harpe R, Guimón J, García-Sevilla JA. Downregulation of neuronal cdk5/p35 in opioid addicts and opiate-treated rats: relation to neurofilament phosphorylation. Neuropsychopharmacology. 2003;28:947–955. doi: 10.1038/sj.npp.1300095. [DOI] [PubMed] [Google Scholar]
  15. Gach K, Wyrębska A, Fichna J, Janecka A. The role of morphine in regulation of cancer cell growth. Naunyn-Schmiedeberg's Archives of Pharmacology. 2011;384:221–230. doi: 10.1007/s00210-011-0672-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gat-Viks I, Chevrier N, Wilentzik R, Eisenhaure T, Raychowdhury R, Steuerman Y, Shalek AK, Hacohen N, Amit I, Regev A. Deciphering molecular circuits from genetic variation underlying transcriptional responsiveness to stimuli. Nature Biotechnology. 2013;31:342–349. doi: 10.1038/nbt.2519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gat-Viks I, Meller R, Kupiec M, Shamir R. Understanding gene sequence variation in the context of transcription regulation in yeast. PLOS genetics. 2010;6:e1000800. doi: 10.1371/journal.pgen.1000800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gatti D, Maki A, Chesler EJ, Kirova R, Kosyk O, Lu L, Manly KF, Williams RW, Perkins A, Langston MA, Threadgill DW, Rusyn I. Genome-level analysis of genetic regulation of liver gene expression networks. Hepatology. 2007;46:548–557. doi: 10.1002/hep.21682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Geisert EE, Lu L, Freeman-Anderson NE, Templeton JP, Nassr M, Wang X, Gu W, Jiao Y, Williams RW. Gene expression in the mouse eye: an online resource for genetics using 103 strains of mice. Molecular Vision. 2009;15:1730–1763. [PMC free article] [PubMed] [Google Scholar]
  20. Gerrits A, Li Y, Tesson BM, Bystrykh LV, Weersing E, Ausema A, Dontje B, Wang X, Breitling R, Jansen RC, de Haan G. Expression quantitative trait loci are highly sensitive to cellular differentiation state. PLOS Genetics. 2009;5:e1000692. doi: 10.1371/journal.pgen.1000692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hageman RS, Leduc MS, Korstanje R, Paigen B, Churchill GA. A Bayesian framework for inference of the genotype-phenotype map for segregating populations. Genetics. 2011;187:1163–1170. doi: 10.1534/genetics.110.123273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hastie T, Tibshirani R, Friedman J. The elements of statistical learning. New York: Springer; 2009. [Google Scholar]
  23. Hidalgo CA, Blumm N, Barabasi AL, Christakis NA. A dynamic network approach for the study of human phenotypes. PLOS Computational Biology. 2009;5:e1000353. doi: 10.1371/journal.pcbi.1000353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hwang T, Atluri G, Xie M, Dey S, Hong C, Kumar V, Kuang R. Co-clustering phenome-genome for phenotype classification and disease gene discovery. Nucleic Acids Research. 2012;40:e146. doi: 10.1093/nar/gks615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Jucker M, Bondolfi L, Calhoun ME, Long JM, Ingram DK. Structural brain aging in inbred mice: potential for genetic linkage. Experimental Gerontology. 2000;35:1383–1388. doi: 10.1016/S0531-5565(00)00190-X. [DOI] [PubMed] [Google Scholar]
  26. Kawauchi T, Chihama K, Nabeshima Y, Hoshino M. Cdk5 phosphorylates and stabilizes p27kip1 contributing to actin organization and cortical neuronal migration. Nature Cell Biology. 2006;8:17–26. doi: 10.1038/ncb1338. [DOI] [PubMed] [Google Scholar]
  27. Kumagai Y, Todaka T, Toki S. A new metabolic pathway of morphine: in vivo and in vitro formation of morphinone and morphine-glutathione adduct in guinea pig. The Journal of Pharmacology and Experimental Therapeutics. 1990;255:504–510. [PubMed] [Google Scholar]
  28. Laub F, Aldabe R, Friedrich V, Jr, Ohnishi S, Yoshida T, Ramirez F. Developmental expression of mouse Krüppel-like transcription factor KLF7 suggests a potential role in neurogenesis. Developmental biology. 2001;233:305–318. doi: 10.1006/dbio.2001.0243. [DOI] [PubMed] [Google Scholar]
  29. Laub F, Lei L, Sumiyoshi H, Kajimura D, Dragomir C, Smaldone S, Puche AC, Petros TJ, Mason C, Parada LF, Ramirez F. Transcription factor KLF7 is important for neuronal morphogenesis in selected regions of the nervous system. Molecular and Cellular Biology. 2005;25:5699–5711. doi: 10.1128/MCB.25.13.5699-5711.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lee E, Cho S, Kim K, Park T. An integrated approach to infer causal associations among gene expression, genotype variation, and disease. Genomics. 2009;94:269–277. doi: 10.1016/j.ygeno.2009.06.002. [DOI] [PubMed] [Google Scholar]
  31. Lee GH, Bennett LM, Carabeo RA, Drinkwater NR. Identification of hepatocarcinogen-resistance genes in DBA/2 mice. Genetics. 1995;139:387–395. doi: 10.1093/genetics/139.1.387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lee KW, Woon PS, Teo YY, Sim K. Genome wide association studies (GWAS) and copy number variation (CNV) studies of the major psychoses: what have we learnt? Neuroscience and Biobehavioral Reviews. 2012;36:556–571. doi: 10.1016/j.neubiorev.2011.09.001. [DOI] [PubMed] [Google Scholar]
  33. Liang Y, Jansen M, Aronow B, Geiger H, Van Zant G. The quantitative trait gene latexin influences the size of the hematopoietic stem cell population in mice. Nature Genetics. 2007;39:178–188. doi: 10.1038/ng1938. [DOI] [PubMed] [Google Scholar]
  34. Madhavan S, Zenklusen JC, Kotliarov Y, Sahni H, Fine HA, Buetow K. Rembrandt: helping personalized medicine become a reality through integrative translational research. Molecular Cancer Research. 2009;7:157–167. doi: 10.1158/1541-7786.MCR-08-0435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Marcadier JL, Smith AM, Pohl D, Schwartzentruber J, Al-Dirbashi OY, FORGE Canada Consortium. Majewski J, Ferdinandusse S, Wanders RJ, Bulman DE, Boycott KM, Chakraborty P, Geraghty MT. Mutations in ALDH6A1 encoding methylmalonate semialdehyde dehydrogenase are associated with dysmyelination and transient methylmalonic aciduria. Orphanet Journal of Rare Diseases. 2013;8:98. doi: 10.1186/1750-1172-8-98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Martin MV, Dong H, Vallera D, Lee D, Lu L, Williams RW, Rosen GD, Cheverud JM, Csernansky JG. Independent quantitative trait loci influence ventral and dorsal hippocampal volume in recombinant inbred strains of mice. Genes, Brain, and Behavior. 2006;5:614–623. doi: 10.1111/j.1601-183X.2006.00215.x. [DOI] [PubMed] [Google Scholar]
  37. Meyer KD, Jaffrey SR. The dynamic epitranscriptome: N6-methyladenosine and gene expression control. Nature Reviews Molecular Cell Biology. 2014;15:313–326. doi: 10.1038/nrm3785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Mozhui K, Lu L, Armstrong WE, Williams RW. Sex-specific modulation of gene expression networks in murine hypothalamus. Frontiers in Neuroscience. 2012;6:63. doi: 10.3389/fnins.2012.00063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Muller P, Do KA. Bayesian inference for gene expression and proteomics. 2012. [Google Scholar]
  40. Neto EC, Broman AT, Keller MP, Attie AD, Zhang B, Zhu J, Yandell BS. Modeling causality for pairs of phenotypes in system genetics. Genetics. 2013;193:1003–1013. doi: 10.1534/genetics.112.147124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Neto EC, Keller MP, Attie AD, Yandell BS. Causal graphical models in systems genetics: a unified framework for joint inference of causal network and genetic architecture for correlated phenotypes. The Annals of Applied Statistics. 2010;4:320–339. doi: 10.1214/09-AOAS288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Nikolic M, Chou MM, Lu W, Mayer BJ, Tsai LH. The p35/Cdk5 kinase is a neuron-specific Rac effector that inhibits Pak1 activity. Nature. 1998;395:194–198. doi: 10.1038/26034. [DOI] [PubMed] [Google Scholar]
  43. Oti M, Huynen MA, Brunner HG. Phenome connections. Trends in Genetics. 2008;24:103–106. doi: 10.1016/j.tig.2007.12.005. [DOI] [PubMed] [Google Scholar]
  44. Pareek T, Zipp L, Letterio JJ. Cdk5: an emerging kinase in pain signaling. Brain Disorders and Therapy. 2012 doi: 10.4172/2168-975X.S1-003. S:1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Peirce JL, Lu L, Gu J, Silver LM, Williams RW. A new set of BXD recombinant inbred lines from advanced intercross populations in mice. BMC Genetics. 2004;5:7. doi: 10.1186/1471-2156-5-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Philip VM, Duvvuru S, Gomero B, Ansah TA, Blaha CD, Cook MN, Hamre KM, Lariviere WR, Matthews DB, Mittleman G, Goldowitz D, Chesler EJ. High-throughput behavioral phenotyping in the expanded panel of BXD recombinant inbred strains. Genes, Brain, and Behavior. 2010;9:129–159. doi: 10.1111/j.1601-183X.2009.00540.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Poh KW, Yeo JF, Stohler CS, Ong WY. Comprehensive gene expression profiling in the prefrontal cortex links immune activation and neutrophil infiltration to antinociception. The Journal of neuroscience. 2012;32:35–45. doi: 10.1523/JNEUROSCI.2389-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Rosales JL, Ernst JD, Hallows J, Lee KY. GTP-dependent secretion from neutrophils is regulated by Cdk5. The Journal of Biological Chemistry. 2004;279:53932–53936. doi: 10.1074/jbc.M408467200. [DOI] [PubMed] [Google Scholar]
  49. Ruiz-Ontañon P, Orgaz JL, Aldaz B, Elosegui-Artola A, Martino J, Berciano MT, Montero JA, Grande L, Nogueira L, Diaz-Moralli S, Esparís-Ogando A, Vazquez-Barquero A, Lafarga M, Pandiella A, Cascante M, Segura V, Martinez-Climent JA, Sanz-Moreno V, Fernandez-Luna JL. Cellular plasticity confers migratory and invasive advantages to a population of glioblastoma-initiating cells that infiltrate peritumoral tissue. Stem cells. 2013;31:1075–1085. doi: 10.1002/stem.1349. [DOI] [PubMed] [Google Scholar]
  50. Rzhetsky A, Wajngurt D, Park N, Zheng T. Probing genetic overlap among complex human phenotypes. Proceedings of the National Academy of Sciences of USA. 2007;104:11694–11699. doi: 10.1073/pnas.0704820104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Schuettpelz LG, Gopalan PK, Giuste FO, Romine MP, van Os R, Link DC. Kruppel-like factor 7 overexpression suppresses hematopoietic stem and progenitor cell function. Blood. 2012;120:2981–2989. doi: 10.1182/blood-2012-02-409839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Shi Z, Derow CK, Zhang B. Co-expression module analysis reveals biological processes, genomic gain, and regulatory mechanisms associated with breast cancer progression. BMC Systems Biology. 2010;4:74. doi: 10.1186/1752-0509-4-74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Smaldone S, Laub F, Else C, Dragomir C, Ramirez F. Identification of MoKA, a novel F-box protein that modulates Krüppel-like transcription factor 7 activity. Molecular and Cellular Biology. 2004;24:1058–1069. doi: 10.1128/MCB.24.3.1058-1069.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Sokal RR, Rohlf FJ. 1990. Biometry.
  55. Suzuki S, Chuang LF, Doi RH, Chuang RY. Identification of opioid-regulated genes in human lymphocytic cells by differential display: upregulation of Kruppel-like factor 7 by morphine. Experimental Cell Research. 2003;291:340–351. doi: 10.1016/S0014-4827(03)00408-7. [DOI] [PubMed] [Google Scholar]
  56. Tadjuidje E, Hegde RS. The Eyes Absent proteins in development and disease. Cellular and Molecular Life Sciences. 2013;70:1897–1913. doi: 10.1007/s00018-012-1144-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Tanay A, Sharan R, Shamir R. Discovering statistically significant biclusters in gene expression data. Bioinformatics. 2002;18(Suppl 1):S136–S144. doi: 10.1093/bioinformatics/18.suppl_1.S136. [DOI] [PubMed] [Google Scholar]
  58. Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, Pirruccello JP, Ripatti S, Chasman DI, Willer CJ, Johansen CT, Fouchier SW, Isaacs A, Peloso GM, Barbalic M, Ricketts SL, Bis JC, Aulchenko YS, Thorleifsson G, Feitosa MF, Chambers J, Orho-Melander M, Melander O, Johnson T, Li X, Guo X, Li M, Shin Cho Y, Jin Go M, Jin Kim Y, Lee JY, Park T, Kim K, Sim X, Twee-Hee Ong R, Croteau-Chonka DC, Lange LA, Smith JD, Song K, Hua Zhao J, Yuan X, Luan J, Lamina C, Ziegler A, Zhang W, Zee RY, Wright AF, Witteman JC, Wilson JF, Willemsen G, Wichmann HE, Whitfield JB, Waterworth DM, Wareham NJ, Waeber G, Vollenweider P, Voight BF, Vitart V, Uitterlinden AG, Uda M, Tuomilehto J, Thompson JR, Tanaka T, Surakka I, Stringham HM, Spector TD, Soranzo N, Smit JH, Sinisalo J, Silander K, Sijbrands EJ, Scuteri A, Scott J, Schlessinger D, Sanna S, Salomaa V, Saharinen J, Sabatti C, Ruokonen A, Rudan I, Rose LM, Roberts R, Rieder M, Psaty BM, Pramstaller PP, Pichler I, Perola M, Penninx BW, Pedersen NL, Pattaro C, Parker AN, Pare G, Oostra BA, O'Donnell CJ, Nieminen MS, Nickerson DA, Montgomery GW, Meitinger T, McPherson R, McCarthy MI, McArdle W, Masson D, Martin NG, Marroni F, Mangino M, Magnusson PK, Lucas G, Luben R, Loos RJ, Lokki ML, Lettre G, Langenberg C, Launer LJ, Lakatta EG, Laaksonen R, Kyvik KO, Kronenberg F, König IR, Khaw KT, Kaprio J, Kaplan LM, Johansson A, Jarvelin MR, Janssens AC, Ingelsson E, Igl W, Kees Hovingh G, Hottenga JJ, Hofman A, Hicks AA, Hengstenberg C, Heid IM, Hayward C, Havulinna AS, Hastie ND, Harris TB, Haritunians T, Hall AS, Gyllensten U, Guiducci C, Groop LC, Gonzalez E, Gieger C, Freimer NB, Ferrucci L, Erdmann J, Elliott P, Ejebe KG, Döring A, Dominiczak AF, Demissie S, Deloukas P, de Geus EJ, de Faire U, Crawford G, Collins FS, Chen YD, Caulfield MJ, Campbell H, Burtt NP, Bonnycastle LL, Boomsma DI, Boekholdt SM, Bergman RN, Barroso I, Bandinelli S, Ballantyne CM, Assimes TL, Quertermous T, Altshuler D, Seielstad M, Wong TY, Tai ES, Feranil AB, Kuzawa CW, Adair LS, Taylor HA, Jr, Borecki IB, Gabriel SB, Wilson JG, Holm H, Thorsteinsdottir U, Gudnason V, Krauss RM, Mohlke KL, Ordovas JM, Munroe PB, Kooner JS, Tall AR, Hegele RA, Kastelein JJ, Schadt EE, Rotter JI, Boerwinkle E, Strachan DP, Mooser V, Stefansson K, Reilly MP, Samani NJ, Schunkert H, Cupples LA, Sandhu MS, Ridker PM, Rader DJ, van Duijn CM, Peltonen L, Abecasis GR, Boehnke M, Kathiresan S. Biological, clinical and population relevance of 95 loci for blood lipids. Nature. 2010;466:707–713. doi: 10.1038/nature09270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wang H, van Eeuwijk FA. A new method to infer causal phenotype networks using QTL and phenotypic information. PLOS ONE. 2014;9:e103997. doi: 10.1371/journal.pone.0103997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Wang J, Williams RW, Manly KF. WebQTL: web-based complex trait analysis. Neuroinformatics. 2003;1:299–308. doi: 10.1385/NI:1:4:299. [DOI] [PubMed] [Google Scholar]
  61. Wang JR, Li XH, Gao XJ, An SC, Liu H, Liang J, Zhang K, Liu Z, Wang J, Chen Z, Sun W. Expression of MMP-13 is associated with invasion and metastasis of papillary thyroid carcinoma. European Review for Medical and Pharmacological Sciences. 2013;17:427–435. [PubMed] [Google Scholar]
  62. Xie WY, He Y, Yang YR, Li YF, Kang K, Xing BM, Wang Y. Disruption of Cdk5-associated phosphorylation of residue threonine-161 of the delta-opioid receptor: impaired receptor function and attenuated morphine antinociceptive tolerance. The Journal of Neuroscience. 2009;29:3551–3564. doi: 10.1523/JNEUROSCI.0415-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Zhou G, Williams RW. Eye1 and Eye2: gene loci that modulate eye size, lens weight, and retinal area in the mouse. Investigative Ophthalmology & Visual Science. 1999;40:817–825. [PubMed] [Google Scholar]
eLife. 2015 Mar 17;4:e04346. doi: 10.7554/eLife.04346.028

Decision letter

Editor: Naama Barkai1

eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.

Thank you for sending your work entitled “Linking traits based on their shared molecular mechanisms: A systems phenomics approach” for consideration at eLife. Your article has been favorably evaluated by Aviv Regev (Senior editor), a Reviewing editor, and 2 reviewers.

The Reviewing editor and the reviewers discussed their comments before we reached this decision, and the Reviewing editor has assembled the following comments to help you prepare a revised submission.

There is an overall appreciation for methods, such as the proposed GEMOT, that predict causative mutations leading to transcript variations that in turn drive phenotypes. However, to convince that the present method provides an advantage over existing approaches, three additions are required:

1) GEMOT performance should be compared to existing methods. This includes the qtlnet software (PMID 23288936, 21218138, and others) as well as similar methods (PMID 19540336, 21310061, 21242536, 25144184, 15711545, 25114278, 24013639, etc.).

2) Simulations should be used to evaluate the method against a 'truth' standard.

3) With regards to the Klf7 results: a consistent model describing the effect of each allele (B6, DBA, overexpression, knockout) should be presented to provide a clear hypothesis of how Klf7 effect is generated. (The interpretation of the Klf7 effects is unclear in the manuscript. Firstly, what is the nature of variation between the two Klf7 alleles? It has a cis-associated eQTL so, for example, which strain shows higher expression and can be linked to behavioral response to morphine? In the perturbation experiments, what is the strain background? Are the Klf7 knockout and overexpression effects on “driver genes” (up or down-regulation) consistent with corresponding Klf7 expression differences in the BxD population? That is, do Klf7 knockout effects look like a low-expressing Klf7 strain, and do Klf7 overexpression effects look like a high-expressing Klf7 strain? Figure 4 is also confusing as it appears to combine the knockout and overexpression experiments into summary scores. It would be useful to clearly see how these data support the inferred BxD model.)

4) The manuscript would be more convincing if the authors identify a group of traits that show low Pearson correlation among themselves, but share the same driver genes.

5) Considering the many methods already available (see above), the motivation for the present study should be better clarified.

6) The manuscript can be difficult to read at times, primarily due to a reliance on its own jargon. In some cases new nomenclature might be necessary, but many of the new terms seem to be very similar (if not identical) to widely-used, existing terms. In such cases, the paper should conform to standard nomenclature or define why such standards will not suffice. Notable examples are: “variant-gene associations” instead of eQTL; “linked relationships” instead of Pearson correlation and “link potential” instead of averaged correlation; “bipartite module” instead of pleiotropy. Furthermore, “gene drivers” and the “drivers layer” can probably be more simply referred to as “transcripts” and a “transcript layer” with appropriate context. “Causality score” appears to be a P value; why not refer to it as such? In some cases the new nomenclature can be potentially misleading, such as in the Klf7 discussion where a P value is referred to as a “perturbation effect”. Effect size and significance are different concepts with different biological interpretations.

7) High-dimensional approaches like GEMOT are prone to generating false positive associations, especially in such small experimental populations. Are P values systematically corrected for multiple tests? It is also important to estimate false discovery rates for a given significance threshold, which are not addressed in the manuscript. Additionally, the sample size in this study is very small, and BxD lines often exhibit non-local linkage disequilibrium due to limited recombination across few individuals. This can lead to many false-positive associations. How has this potential issue been addressed in the current study?

eLife. 2015 Mar 17;4:e04346. doi: 10.7554/eLife.04346.029

Author response


There is an overall appreciation for methods, such as the proposed GEMOT, that predict causative mutations leading to transcript variations that in turn drive phenotypes. However, to convince that the present method provides an advantage over existing approaches, three additions are required:

1) GEMOT performance should be compared to existing methods. This includes the qtlnet software (PMID 23288936, 21218138, and others) as well as similar methods (PMID 19540336, 21310061, 21242536, 25144184, 15711545, 25114278, 24013639, etc.).

We thank the reviewers for their suggestion, which we believe improves our conclusions. The specified publications are focused on three different aspects: (1) Testing causative relationships (Neto 2013, Lee 2009); (2) Network reconstruction (Neto 2010, Hageman 2011, Wang 2014); and (3) grouping traits (Chesler 2005). As suggested by the reviewers, in the revised manuscript we added simulations that relate to each of these three aspects, with a focus on comparison to the suggested methods.

(1) Testing causative relationships. We agree with the reviewers that our original manuscript did not provide assessment of our causality testing methodology. Notably, our goal was not to develop a new method for a causality P-value score. Rather, we aimed to use an existing methodology with minor modifications that were required in our particular case. In accordance, we used a methodology that is very similar to Neto et al. 2013, with two small modifications: first, we use empirical rather than theoretical evaluation of the likelihood ratio distribution; second, Neto et al. was interested in a 'classical' definition of causality, where a transcript translates between a variant and a trait. Here, we were interested in an additional objective function: identifying causal transcripts in the presence of other transcripts. This situation is common in our case of GEMOT modules harboring multiple causal transcripts translating between the same variant and a group of traits. We have therefore slightly modified the method suggested by Neto et al. to capture this 'broad' definition of causality. We apologize for omitting this important information from the original submission. We now added a detailed description of the methodology and alterations from Neto et al. to the subsection headed “GEMOT stage III: Refine module (validate drivers). We also added a new simulation study (new Figure 1–figure supplement 2, subsection “Causality testing—performance evaluation”, legend for Figure 1–figure supplement 2), which shows that in comparison to the compared methods (Neto et al. and Lee et al.), GEMOT attained improved accuracy in our relevant case of a broad-sense definition of causality. We now clarify this in the Methods section.

(2) Network reconstruction. In the third stage of the GEMOT algorithm, GEMOT aims to identify the groups of traits that share causal drivers within the given tripartite modules. Such identification can also be performed by constructing a detailed network within a module (using existing methods) and using this solution to identify the desired group of traits. Notably, our goal was not to better construct any given network. Rather, we aimed to develop scalable methodology for the identification of the shared drivers of the module's traits, which can scale to the large tripartite modules we found in real mouse data (with a typical size of over 100 candidate transcripts and 2-40 traits in the identified tripartite modules). According to our observations, the accuracy of the existing methods dropped drastically with an increased number of components, and had an unrealistic running time for our BXD modules. This led us to develop an alternative methodology in GEMOT phase III for identifying the players behind the group of traits without a full reconstruction of the network. However, the GEMOT pipeline is general and existing network reconstruction methods may be used in phase III of the pipeline, alongside or instead of our suggested phase-III approach. We now clarify these points in the Introduction, Results and Methods sections.

In the revised manuscript, we added a new simulation study that takes as input co-mapped subnetworks that mimic tripartite modules so as to identify the desired group of traits that share the same transcript players behind them. Notably, GEMOT does not construct a causal network but only predicts a subset of traits sharing the same mechanism. To avoid biases, we therefore based our systematic evaluation on the ability to identify the correct group of traits sharing a mechanism, thus ignoring irrelevant additional information produced by the compared network reconstruction methods. We now provide a detailed description of the simulated sub-networks and comparisons in the Results and Methods (subsections “Systematic evaluation of the GEMOT algorithm using synthetic data analysis” and “GEMOT Performance analysis”) and new five figures (new Figure 5 and new Figure 5–figure supplements 1, 2, 3 and 4). These simulation results support our usage of GEMOT in the range of biologically-relevant network parameters.

(3) Grouping traits. We have added a relevant simulation of a large biological network that harbors small groups of traits that share the same causal transcripts within it. We now discuss the simulation of this large network in the Methods (“GEMOT Performance analysis”) and Results (“Systematic evaluation of the GEMOT algorithm using synthetic data analysis”) sections. GEMOT's performance is compared to two alternative methods: (I) the ICE algorithm (Shi et al. 2010), which was used by Chesler et al. 2005 (as recommended by the reviewers); and (II) the INVAMOD algorithm. We find that GEMOT outperforms the alternative methods in identifying groups by causality across varying numbers of traits in a group (new Figure 5–supplementary figure 5B).

Importantly, an equivalent systematic comparison between GEMOT and the alternative network reconstruction algorithms on the entire network is not feasible due to the long running time of the network reconstruction algorithm. Therefore, the entire 3-phase GEMOT pipeline, which can be applied on thousands of components, cannot be directly compared to the network reconstruction methods. We now highlight this point and exemplify the running time of the compared methods for the case of small sub-networks (new Figure 5–figure supplement 2B).

Notably, three additional suggested publications (Westra 2013, Rutledge 2014 and Rosa 2009) have not been directly compared since they do not provide or develop any code that we could systematically apply to the synthetic data; these publications, however, discuss methodologies that are similar to the methods by Chesler 2005, Neto 2013 and Lee 2009, which were compared in our revised manuscript, as detailed above.

2) Simulations should be used to evaluate the method against a 'truth' standard.

We addressed this issue above (in our answer to comment 1). As noted, we now added simulations to evaluate GEMOT's accuracy. In particular, using a gold standard, we present the superiority of GEMOT for the goal of identifying groups of traits with causative players behind them. This is shown for synthetic networks of increasing complexity:

(I) Small network fragments including triplets of components (new Figure 1–figure supplement 2, subsection “Causality testing—performance evaluation” and Figure 1–figure supplement 2, legend).

(II) Co-mapped sub-networks (new figure 5 and Figures 5–figure supplement 1-4, and subsections “GEMOT Performance analysis” and “Systematic evaluation of the GEMOT algorithm using synthetic data analysis”).

(III) Full large networks (in the same subsections).

3) With regards to the Klf7 results: a consistent model describing the effect of each allele (B6, DBA, overexpression, knockout) should be presented to provide a clear hypothesis of how Klf7 effect is generated. (The interpretation of the Klf7 effects is unclear in the manuscript. Firstly, what is the nature of variation between the two Klf7 alleles? It has a cis-associated eQTL so, for example, which strain shows higher expression and can be linked to behavioral response to morphine? In the perturbation experiments, what is the strain background? Are the Klf7 knockout and overexpression effects on “driver genes” (up or down-regulation) consistent with corresponding Klf7 expression differences in the BxD population? That is, do Klf7 knockout effects look like a low-expressing Klf7 strain, and do Klf7 overexpression effects look like a high-expressing Klf7 strain? Figure 4 is also confusing as it appears to combine the knockout and overexpression experiments into summary scores. It would be useful to clearly see how these data support the inferred BxD model.)

Following the reviewers' suggestion, we first extended the analysis of module number 2 using the BxD data, and then demonstrated the agreement between this data and the perturbation experiment. In the revised manuscript we present a full consistency between the experimental perturbation results and the BxD-based model. Our extended analysis shows that the drivers in module number 2 can be partitioned into two types: group I is positively correlated with the Klf7 transcript and negatively correlated with the morphine-related traits; group II is negatively correlated with Klf7 and positively correlated with these traits. Consistently, strain DBA/2J shows higher levels of group I and Klf7 transcripts, whereas strain C57BL/6J shows higher levels of group II transcripts and of the morphine-related trait measurements. These results suggest that Klf7 acts as an activator of group I drivers and as a repressor of group II drivers. In accordance with this prediction, Klf7 overexpression in the C57BL/6J strain background up-regulated group I and down-regulated group II. Furthermore, Klf7 knockdown in the C57BL/6J strain background down-regulated group I, as expected. (Klf7 knockdown have no influence on group II, likely due to the low Klf7 level in the wild-type C57BL/6J, which already leads to an ineffective repression). We thank the reviewers for this suggestion, which we believe strengthened our results.

We have substantially revised the manuscript to include these new results that now support a causal role for Klf7 as an activator of drivers in group I and repressor of drivers in group II (in “Discovery of a model for a collection of tightly related traits”, in Results), and present new Figures 3B–group I and new Figure 3–figure supplements 1A, 2A-D and 3B (which replaces former Figure 4B-E and Figure 4–figure supplement 1) by providing all the relevant data about the partition of drivers into two groups, the correlation between objects and the difference in trait and expression levels between alleles, along with the corrected final model. In particular, we do not use a summary score for overexpression and knockdown, as suggested by the reviewers. Instead, Figures 3G, H now present the results of over-expression and knockdown in separate plots (replacing the unified presentation in former Figure 4D).

4) The manuscript would be more convincing if the authors identify a group of traits that show low Pearson correlation among themselves, but share the same driver genes.

We agree with the reviewers, and apologize for the missing details in our description of the modules in the original submission. The original submission already exemplified two modules with high trait-trait correlations (modules nos. 2 and 3) and one module with poor trait-trait correlations (module number 4, in formerly Figure 5B, currently Figure 4B). The revised manuscript clarifies this point in the Results and Discussion sections. We also added a new Figure 2–figure supplement 4 presenting the trait-trait correlations in each of the modules.

5) Considering the many methods already available (see above), the motivation for the present study should be better clarified.

The comment pointed out to a lack of clarity in comparison to existing methods, which we now address in full. First, we clarify the difference from existing methods. Whereas many trait-clustering methods are available, these do not rely on causative molecular players behind the grouped traits. Furthermore, whereas many causal network construction methods exist, such methods construct the entire networks and therefore cannot scale to large networks that include thousands of genes and traits (due to a lack of power and a non-realistic running time). We have revised the text in the Introduction, Results , and Discussion sections to clarify these points. Second, in the revised manuscript, we use simulations to demonstrate the advantages of GEMOT over various existing methods (network construction: Neto 2010, Hageman 2011, Wang 2014; grouping methods: Shi et al. 2010, Gat-Viks et al. 2013) for the goal of identifying groups of traits that share the same causal transcripts, comparing both running time and accuracy. In addition, we clarify the differences from existing causality testing approaches, including a comparative simulation study. We address the simulation studies in detail in response to comment 1, above. Third, we emphasize that the GEMOT pipeline is general and different existing methods may be used as subroutines within the GEMOT pipeline. For example, network construction methods may be applied to construct the sub-networks within the tripartite modules; existing causality-testing methods may be used to identify the driver genes; existing biclustering approaches may be utilized to identify the bipartite modules. We now clarify this point in the revised manuscript.

6) The manuscript can be difficult to read at times, primarily due to a reliance on its own jargon. In some cases new nomenclature might be necessary, but many of the new terms seem to be very similar (if not identical) to widely-used, existing terms. In such cases, the paper should conform to standard nomenclature or define why such standards will not suffice. Notable examples are: “variant-gene associations” instead of eQTL; “linked relationships” instead of Pearson correlation and “link potential” instead of averaged correlation; “bipartite module” instead of pleiotropy. Furthermore, “gene drivers” and the “drivers layer” can probably be more simply referred to as “transcripts” and a “transcript layer” with appropriate context. “Causality score” appears to be a P value; why not refer to it as such? In some cases the new nomenclature can be potentially misleading, such as in the Klf7 discussion where a P value is referred to as a “perturbation effect”. Effect size and significance are different concepts with different biological interpretations.

We thank the reviewers for this suggestion, and made the following changes accordingly:

1) Transcripts: The term 'gene' was replaced with ‘transcript’, and accordingly, the terms ‘driver layer’ and 'molecular layer' were replaced with ‘transcripts layer’. We modified the text throughout the manuscript accordingly.

2) Driver-related terms: We simplified the terminology by replacing the term 'candidate driver' with 'candidate transcript', and 'driver' with 'driver transcript'. Notably, we could not completely replace the term 'driver' and use the term 'transcript' instead, since we have two kinds of transcripts: the candidate transcripts in stage number II and the transcripts that were identified as drivers in stage number III.

3) Causative scores: The term 'causative score' was replaced with 'causality P value', and the term 'gene causality score' was replaced by 'transcript causality P value', as suggested.

4) The Klf7 discussion: In the results referring to module number 2 we now conformed to the standard terminology, where 'effect size' refers to difference between strains, and 'significance of effect' refers to the P value for these differences.

5) Linked relationships and link potential: In both cases, the term could not be replaced with an existing term. In the case of 'Linked relationships', we believe that replacing it with the term 'correlation' is misleading since 'correlation' refers to relations between two objects, whereas 'linked relationships' refers to the relations between three objects. In particular, 'linked relationships' refers to the combination of two existing terms: one is a simple 'correlation' between a transcript and a trait, and the second is an 'association' of a variant and a transcript. Therefore, there is no existing term referring to this combination. We now have the text slightly refined accordingly. Notably, since the term 'link potential' refers to an aggregation of 'linked relationships', we therefore could not use an existing term in this case.

6). Bipartite modules: In the GEMOT pipeline, bipartite modules do not represent any kind of pleiotropy, but only pleiotropy that is likely mediated through transcripts. The term 'Bipartite module' is therefore not equivalent to 'pleiotropy' and cannot be replaced. We now clarify this in the revised manuscript. Notably, bipartite graphs and sub-graphs conform to the standard terminology in the biclustering literature (e.g., Tanay et al., Bioinformatics 2002).

7) Variants: In some cases along the GEMOT pipeline, using the term 'eQTL' instead of 'variant' may be misleading. In particular, for the output GEMOT modules and association scores, the variant can be also terms an eQTL. However, in the remaining manuscript, the variant is not an eQTL. For example, during phase 1 of GEMOT, the variant only has linked relationships with a trait, therefore it cannot be declared as an eQTL; in bipartite and tripartite modules and also throughout the simulation studies (Figures 1A and 5A), reactive relationships may appear, and therefore the variant is not an eQTL. We believe that a consistent usage of the term 'variant' in all cases is more clear than using two different terms for the same object in different contexts.

Overall, in the revised manuscript, we changed 8 terms so as to conform to the standard terminology according to the reviewers’ suggestion.

7) High-dimensional approaches like GEMOT are prone to generating false positive associations, especially in such small experimental populations. Are P value systematically corrected for multiple tests? It is also important to estimate false discovery rates for a given significance threshold, which are not addressed in the manuscript. Additionally, the sample size in this study is very small, and BxD lines often exhibit non-local linkage disequilibrium due to limited recombination across few individuals. This can lead to many false-positive associations. How has this potential issue been addressed in the current study?

The reviewer raises two concerns regarding false positives. The first refers to the evaluation of FDR and the multiple testing problem, and the second relates to the linkage disequilibrium issue.

1) Multiple testing correction and FDR: In the revised manuscript, we now systematically correct for multiple testing and estimate FDR. First, in the case of real data, FDR is estimated based on a permutation test. We assessed the FDR for various predictions along the GEMOT pipeline, including the thresholds for identifying candidate transcripts and driver transcripts. We have substantially revised the Methods section accordingly. Second, we provide direct evaluation of FDR for the grouping of traits in synthetic networks, when the actual solution is known (new Figure 5–figure Supplement 5B and Figure 1–figure supplement 2C,D). Finally, We now systematically added multiple testing corrections for all reported P values throughout the manuscript. Notably, GEMOT provides the same results when applied with or without Boneferroni correction for the gene-variant association and link potential scores, since the construction of the bipartite modules is not sensitive to scaling of these values. We clarify this in the revised text.

2) Non-local linkage disequilibrium: We certainly agree that in some cases it would not be possible to discern between several potential genomic intervals due to non-local linkage disequilibrium. To address this issue, we tested the linkage disequilibrium (LD) between the genomic interval of each of our identified modules and the entire genome. We added a new Figure 2–figure supplement 1 presenting these results, allowing the observation of additional potential intervals for a module. We briefly discuss it in the manuscript (please see the Discussion section and the legend for Figure 2–figure supplement 1). For the particular modules presented in this study, all non-local LDs did not exceeded |r|=0.75.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1.

    (A) Traits in the GEMOT modules of the BXD mouse strains. Shown is a module identifier (column 1) and details about a trait within it, including the trait title (column 2), PubMed record of the publication from which it is taken (column 3), the trait index in the GeneNetwork database (Wang et al., 2003; columns 4), and a short title of the trait as indicated in Figure 3B (column 5). (B) Drivers in the GEMOT modules of the BXD mouse strains. Shown is a module identifier (column 1), and details about a driver within it, including the gene symbol (column 2), its entrez identifier (column 3), genomic position (column 4), the type of association in the module (column 5), and its gene causality p value score (column 6). (C) Representative variants for causality testing in the tripartite modules of the BXD mouse strains. Shown is a tripartite module identifier and its GEMOT module identifier, if relevant (columns 1 and 2, respectively), the genomic position of the module's genomic interval (column 3), and the name and position of the representative variant (columns 4 and 5, respectively).

    DOI: http://dx.doi.org/10.7554/eLife.04346.027

    elife04346s001.doc (231KB, doc)
    DOI: 10.7554/eLife.04346.027

    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES