Abstract
In a multivariate setting, we consider the task of identifying features whose correlations with the other features differ across conditions. Such correlation shifts may occur independently of mean shifts, or differences in the means of the individual features across conditions. Previous approaches for detecting correlation shifts consider features simultaneously, by computing a correlation-based test statistic for each feature. However, since correlations involve two features, such approaches do not lend themselves to identifying which feature is the culprit. In this article, we instead consider a serial testing approach, by comparing columns of the sample correlation matrix across two conditions, and removing one feature at a time. Our method provides a novel perspective and favorable empirical results compared with competing approaches.
Keywords: Correlation matrix, Differential correlation, Feature selection, Hypothesis testing, Wald test
1. Introduction
In many modern research settings, it is of interest to identify features in a multivariate dataset that differ across conditions. For instance, in functional genetics, researchers look for differences in gene expression profiles between normal and diseased tissues. Using functional MRI data, scientists search for voxels that differ before and after a stimulus.
To study differences across conditions, one typically considers the mean of each feature in each condition. Suppose
and
are multivariate random vectors of a single set of
features under two different conditions. Let
and
denote the means of
and
, respectively, and let
denote the mean of the
th feature of
. To detect mean differences, one can consider null hypotheses of the form
![]() |
However, mean differences may not capture the underlying data characteristics that differ across conditions. For example, in the context of gene expression data, co-expression, or expression of two or more genes together, may change across conditions. To address dependence between two or more genes, methods have been proposed to incorporate correlation structure to test pairs or small sets of genes for mean differences (Lai and others, 2004; Shedden and Taylor, 2004; Xiao and others, 2004; Dettling and others, 2005; Ho and others, 2007). However, differences in correlations may also occur independently of mean differences.
One way to compare correlations across conditions is to test for equality of the covariance or correlation matrices. Suppose
and
are the population correlation matrices of
and
, respectively. The null hypothesis of interest is then
![]() |
In the low-dimensional setting (i.e.
, where
are the sample size of each condition), classical approaches for testing this null hypothesis are based on the likelihood ratio (Cole, 1968) or Wald-type test statistics (Kullback, 1967; Jennrich, 1970; Layard, 1972; Larntz and Perlman, 1985; Modarres and Jernigan, 1992; Satorra and Neudecker, 1997). Testing equality of covariance matrices is also a well-studied problem (Wilks, 1932; Bartlett, 1937; Muirhead, 1982; Seber, 1984). Methods have also been proposed for the high-dimensional setting (
) (Ledoit and Wolf, 2002; Schott, 2007; Li and Qin, 2014).
In either the low- or high-dimensional setting, the aforementioned references are concerned with testing the global null hypothesis,
. In other words, the goal is to determine whether the covariance or correlation matrix of all
features differs across conditions. In the present article, we are instead interested in identifying individual features whose correlations with other features differ across two conditions.
Several recent papers have proposed approaches for identifying features whose correlations differ across conditions (e.g. Gill and others, 2010; Amar and others, 2013; Bockmayr and others, 2013). However, these methods do not employ a formal hypothesis testing framework. In the multivariate normal setting, some authors have considered identifying pairs of features whose partial correlations differ across conditions (Guo and others, 2011; Mohan and others, 2012; Danaher and others, 2014); however, these methods focus on pairwise partial correlation differences across conditions, whereas we are interested in comparing relationships of each feature with all other features across conditions.
More related to this article are recent proposals by Hu and others (2010) and Cai and others (2013), who consider tests for equality of the columns of
and
in order to identify correlation differences in individual features. However, neither proposal takes advantage of the particular correlation structure that arises when a small number of features are disrupted across conditions, leading to a situation in which the correlations of those features with many other features differ across conditions. This would be the case in a comparison of gene expression profiles between normal and disease conditions if only one gene were disrupted in some disease, where that disruption affects that gene's correlation with many other genes. In this article, we are interested in characterizing the notion of a disrupted gene in terms of correlation matrices, and developing a corresponding hypothesis testing framework to identify features of interest.
As a motivating example, suppose
and
represent gene expression profiles under two distinct conditions. Let
and
denote, respectively, the
th columns of
and
, that is, the correlations between the
th gene and the
other genes, in the two conditions. Suppose that the first gene is disrupted in one of the two conditions, but that the two conditions are otherwise identical. This would manifest as
, and more specifically,
. Note that the submatrices of
and
with the first row and column removed are identical—that is, all of the differences between
and
can be attributed to the first gene. Figure 1 illustrates this motivating example.
Fig. 1.
In the motivating example of Section 1, the two correlation matrices differ within a single row/column. Our goal is to identify only the first feature as different between the two conditions.
Our goal is to develop a testing procedure that will determine that in the set-up of Figure 1, all differences between the correlation matrices across the two conditions can be attributed to the first feature. To describe the problem more clearly, we define the terms ‘dysregulated’ and ‘minimally dysregulated’—language borrowed from the study of gene expression data—as follows.
Definition 1. —
The
th feature is dysregulated across conditions
and
if
.
Let
denote a subset of size
and
denote the subvector of the random
-vector
corresponding to all but the features in
.
Definition 2. —
Consider a set of
features,
, that satisfies the following conditions:
;
for all strictly smaller subsets
,
.
Once we remove all of the features in
from
and
, there is no difference in the correlation matrices among the remaining features across conditions—that is,
. Using this terminology, we can now see that our goal is to identify
(e.g.
in Figure 1), as opposed to the set of dysregulated features (e.g.
in Figure 1).
In Section 2, we develop Wald-type test statistics for testing null hypotheses of the form
![]() |
(1.1) |
for
. We then show that simultaneous tests of (1.1) are not appropriate, as they lead to rejection for more than just the first feature in Figure 1 (i.e. they lead to a poor estimate of
). This motivates the need for a different approach. In Section 3, we develop a method to test the series of null hypotheses of the form
![]() |
(1.2) |
for
, where
denotes the set of all
-combinations of
. This series of hypothesis tests leads naturally to an estimator of the set
, which we show to have excellent empirical performance. In Section 4, we apply our proposal to two gene expression data sets. The discussion is in Section 5.
2. A simultaneous approach
Let
and
denote
-dimensional random vectors corresponding to a single set of
features under two distinct conditions. Let
and
denote the population covariance matrices of
and
, and let
and
denote their population correlation matrices, respectively. Without loss of generality, assume that the population mean vectors of
and
equal zero. Let
and
be independent and identically distributed samples of
and
. Denote the empirical covariance matrices by
![]() |
where
and
. Denote the empirical correlation matrices by
![]() |
Let the operator
denote the “vectorization” of a
matrix
, defined by
![]() |
In other words, the vectorization of
stacks the columns of
into a
column vector. Since our goal is to develop test statistics for each of the
features, we are interested in examining columns of the correlation matrices. Since the diagonal of a correlation matrix is always one, we ignore it, and so henceforth let
and
denote the
th column of
and
with the
th element removed.
2.1. A test of
In order to develop a test for
, we extend a classical Wald-type approach for testing
, which relies on the following result (Neudecker and Wesselman, 1990).
Lemma 1. —
Suppose that
has finite fourth moments, that is,
for all
. Then
where
,
denotes the gradient with respect to
, and
. Furthermore, if
is multivariate normal,
(2.1) for
, where
denotes the
th element of
.
Using Lemma 1, it can be shown that
![]() |
and
![]() |
where
and
are the asymptotic covariance matrices corresponding to
and
, respectively. Due to the independence of the samples from each condition, it follows that
![]() |
where
, and
and
are consistent estimators of
and
, respectively. This result motivates our proposed test statistic for
,
![]() |
(2.2) |
Proposition 1. —
Suppose that
and
have finite fourth moments,
,
and
as
for some finite constant
. Then
The proof of Proposition 1 is given in the supplementary material available at Biostatistics online.
Proposition 1 implies that the type I error rate of the following test of
,
![]() |
(2.3) |
is controlled at level
asymptotically. In a simulation study in the supplementary materials (available at Biostatistics online), we investigate the finite-sample type I error rate control of
in (2.3).
Given (2.3), one can simultaneously test
, using a multiplicity correction to address the problem of multiple comparisons. However, as we will see in the next section, this approach does not achieve the goal of our paper, as described in Section 1: to identify a minimally dysregulated set of features.
2.2. Motivation for a different approach
We now argue that simultaneously testing
for
can be problematic, motivating the need for a different approach. Consider a simplified version of the motivating example in Figure 1, where the
matrices
and
take the respective forms
![]() |
(2.4) |
![]() |
(2.5) |
where
,
, and
. Note that
and
are identical except in the first half of the first row/column.
We considered
. For
and
, we sampled from
and
, and computed
in (2.2) for
. This was repeated 10 000 times. The distributions of
,
and
are shown in Figure 2. (Test statistics
for other values of
are not shown, because they are identical in distribution to either
or
.) Other values of
,
, and
yielded similar results.
Fig. 2.
The histograms illustrate the distribution of
for
in the simulation study from Section 2.2. The dashed lines represent a
distribution, the asymptotic distribution of
under
according to Proposition 1. The gray shaded regions indicate the proportion of test statistics that would result in rejecting the null hypothesis
under (2.2). (a)
, (b)
.
The left-hand panels of Figure 2 indicate that
almost always rejects
, as expected. The right-hand panels of Figure 2 indicate that
has approximately a
distribution as suggested by Proposition 1, and hence that
has well-controlled Type I error rate.
However, the center panels of Figure 2 indicate that
does not have a
distribution. This becomes particularly pronounced as the sample size increases. Indeed,
does not hold, since the
elements of
and
are unequal in (2.4) and (2.5). Consequently, in this simulation study,
will tend to be rejected when the sample size is sufficiently large, even though the differences between
and
can be fully attributed to differences in the first feature's correlations across the two conditions.
Our goal was to identify only the first feature as differing across the two conditions. This simulation study reveals that simultaneous tests of
are flawed, essentially because each correlation involves a pair of features, and thus any difference in correlation across two conditions necessarily implicates at least two features. To accomplish our goal, we must develop a set of null hypotheses such that, in this example, only a single null hypothesis corresponding to the first feature is violated. This motivates our proposed approach: a serial testing procedure.
3. A serial approach
We just saw that tests of
fail to identify only the first feature as different across conditions in the setting of Figure 1. Therefore, they are not appropriate for the goal laid out in Section 1, which is to identify
, the minimal set of features that account for the differences between
and
.
We now consider a new hypothesis testing framework. Instead of simultaneously testing the
null hypotheses,
in (1.1) for
, we propose to test the series of null hypotheses
in (1.2) for
. The null hypotheses
are closely related to the cardinality of
. If
do not hold and
does hold, then
. In Section 3.1, we propose a test of
, which we call
. Then in Section 3.2, we apply serial tests of
in order to estimate
. A comparison to competing approaches is in Section 3.3.
3.1.
: A test of
Suppose that
in (1.2) holds. Then there exists a set
of size
such that
. Hence the test statistics
for
are asymptotically
. Consider the test
![]() |
(3.1) |
Proposition 2. —
Suppose the conditions of Proposition 1 and
hold. Then
The proof of Proposition 2 is given in the supplementary material (available at Biostatistics online). Proposition 2 implies that, under
, the test
controls type I error at a level
. A simulation study of the performance of
under
is presented in the supplementary materials (available at Biostatistics online).
3.2. An estimator of
Now suppose that we perform serial tests of
for
. Suppose we reject
, but do not reject
. Then
![]() |
(3.2) |
is a natural estimator of
from Definition 2: our failure to reject
supports condition 1 of Definition 2, and our rejection of
supports condition 2 of Definition 2.
We will now show that
with high probability.
Proposition 3. —
Suppose the conditions of Proposition 1 hold. Then
The proof of Proposition 3 is given in the supplementary material (available at Biostatistics online).
3.3. Comparison to competing methods
We compared the accuracy of
(3.2) as an estimator of
(Definition 2) to the methods proposed in Cai and others (2013) and Hu and others (2010), both of which use a simultaneous (rather than a serial) approach to identify features whose covariances with other features differ across conditions. Since the methods of Cai and others (2013) and Hu and others (2010) are based on covariances rather than correlations, we employ a covariance version of our proposal (see the supplementary material, available at Biostatistics online, for details).
The method of Cai and others (2013) simultaneously tests the null hypotheses
for
. The test statistic for
is the squared, standardized maximum value of
. We conclude that a given feature is in the estimate of
by Cai and others (2013) if its test statistic exceeds some threshold.
The method of Hu and others (2010) tests for equality of the joint distribution of “covariance distances” for a given feature across the two conditions (ignoring a fixed number of covariance distances, specified by the “trim number”). We conclude that a given feature is in the estimate of
by Hu and others (2010) if the resampling-based p-value falls below some threshold. We perform the method of Hu and others (2010) with trim numbers 0 and
.
We simulated data where
,
, and
. Each sample was drawn independently from a
or a
distribution. We fixed
to be
![]() |
and considered one of four possible scenarios for
, illustrated in Figure 3.
Fig. 3.
Four scenarios for
considered in the simulation study in Section 3.3. Each subfigure represents a
matrix. Black entries correspond to values of 0.1, for which
differs from
. White off-diagonal entries correspond to values of 0.3, for which elements of
are equal to
. (a) Scenario A, (b) Scenario B, (c) Scenario C, (d) Scenario D.
Our proposed approach, the method of Cai and others (2013) and that of Hu and others (2010) were applied in order to obtain estimates of
. For each method, the cardinality of the estimate of
was varied (for instance, in our method, this corresponds to the level
; in the method of Cai and others (2013), it corresponds to a threshold for the test statistic). The results, averaged over 1000 replications, are shown in Figure 4. The results corroborate Proposition 3, in that our method yields an average value of
no greater than 5 for all values of
considered.
Fig. 4.
Results from the simulation study in Section 3.3, averaged over 1000 simulated datasets. The
-axis displays the cardinality of the estimate of
, and the
-axis displays the number of true positives, i.e. the number of features in the estimate of
that are truly in
. For reference, the bounding triangle indicates two methods: the NW boundary is an idealized method that always correctly selects features in
, and the SE boundary indicates a method that selects features at random. (a) Scenario A, (b) Scenario B, (c) Scenario C, (d) Scenario D.
While our proposed approach does at least as well as the competitors in all four scenarios displayed in Figure 4, it does particularly well (relative to competitors) in Scenarios A, C, and D. Recall from Figure 3 that in Scenarios A, C, and D, the features in
have differential correlations with many other features, including features in
. Consequently, the serial approach taken by our proposal is key to identifying the correct set of features in
, as described in Section 2.2. In contrast, in Scenario B, the features in
have differential correlations only with other features in
; consequently, in this scenario, there is no need for a serial approach—a simultaneous approach will perform just as well. In fact, we see that in Scenario B, all approaches have comparable performance.
4. Application to gene expression data
In this section, we examine two gene expression datasets. These datasets are high-dimensional, leading to challenges for our proposal: (1) Propositions 2 and 3 require
; and (2) for large
, tests of
(1.2, 3.1) become computationally intractable unless
is quite small. Hence a screening procedure to reduce dimensionality is necessary. We consider three screening approaches in Sections 4.1–4.3, respectively.
4.1. Screening based on scientific knowledge
We first consider a gene expression dataset that consists of 11 861 gene expression measurements from 220 tissue samples taken from patients with one of four subtypes of glioblastoma multiforme (GBM; Verhaak and others, 2010. We compare the two GBM subtypes with the largest sample sizes,
(proneural) and
(mesenchymal). In order to reduce dimensionality, we restrict attention to 34 genes involved in TCR signaling, as was done in Mohan and others (2014).
At level
, we estimate
. Interestingly, NFKBIA is a tumor suppressor gene. There is known to be enrichment of single-nucleotide polymorphisms and haplotypes of NFKBIA in Hodgkin's lymphoma, colorectal cancer, melanoma, hepatocellular carcinoma, breast cancer, and multiple myeloma. Furthermore, it has been reported that NFKBIA tends to be deleted in glioblastomas (Bredel and others, 2011).
4.2. Unsupervised screening
We now consider a gene expression dataset that consists of 12 600 gene expression measurements from
normal (
) and
tumor (
) prostate gland specimens from patients undergoing radical prostatectomy (Singh and others, 2002). We reduce dimensionality using an unsupervised screening approach: we restrict our analysis to the
and 35 genes with the highest marginal variance. This screening approach has substantial precedent in the gene expression literature, in which it is often assumed that high-variance genes are more scientifically interesting than low-variance genes. Because the screening does not make use of the class labels (normal versus tumor), the resulting test for
results in valid statistical inference (Bourgon and others, 2010).
Figure 5 displays
for the 35 highest-variance genes. Estimates of
for
and
are reported in Figure 5. We considered very small
in order to avoid estimating
to contain all of the features. It is possible that the cancer and normal tissue tend to have substantially different gene expression, but also likely that early microarray studies such as Singh and others (2002) tend to suffer from batch effects.
Fig. 5.
, and the set of genes in
, when we restrict analysis to the genes with the highest variance in the prostate cancer dataset. The rows/columns of the matrix are in order of complete linkage clustering, using Euclidean distance. The gene ranking (highest to lowest variance) is indicated on the right-hand side of the matrix. The two tables list the genes in
for
(left) and
(right).
As expected,
depends on the set of genes considered. For example, for
, the 13th highest-variance gene is in
when
; however, this gene is not in
when we take
. By design, as
decreases,
decreases. As an example of the interpretation of
, consider the result when
and
. We rejected
and
at level
, but failed to reject
at level
(a conservative estimate of the p-value corresponding to this hypothesis is
). Consequently,
, and
. This suggests that there are (very conservatively) at least two minimally dysregulated genes among the top 20 highest-variance genes.
Calculating
is computationally intensive. Consider the case of
and
, displayed in the last row of the left-hand table in Figure 5. In this example,
. Therefore, in order to obtain these results, we performed tests of
for
. The test of
requires computing the test statistic
for
and
. Hence, in order to test
for all
, we computed
test statistics. Had we chosen a much larger value of
, then
might have contained many more genes, which would have substantially increased the necessary computations. Fortunately, computation of these test statistics can be easily parallelized.
4.3. Supervised screening
We once again consider the prostate cancer gene expression dataset of Section 4.2; this time, however, we take a supervised screening approach. In order to implement this approach, we split the observations into a training set and a test set of equal size. We perform screening on the training set in order to obtain a small set of features, and estimate
on the test set using that small feature set. This training/test set approach is needed in order for our estimate of
to retain the asymptotic properties established in Section 3.
To perform supervised screening, we compute
for all
in the training set. We then restrict attention to the
genes for which this quantity is largest. These 30 genes are displayed in Figure 6. Next, we estimate
on the test set, using only these 30 genes. For
, we rejected
for
, and failed to reject
; therefore,
, and
. This suggests that there are five minimally dysregulated genes among the 30 genes considered.
Fig. 6.
for the 30 genes with the largest sample correlation difference across conditions in the prostate cancer dataset. The rows/columns of the matrix are in order of complete linkage clustering, using Euclidean distance. The gene ranking (highest to lowest
-norm of the difference in sample correlation vectors) is indicated on the right-hand side of the matrix.
It is not surprising that, for a given value of
and
, the supervised screening approach results in a higher value of
than with the unsupervised screening approach: this makes sense because the supervised screening approach selects genes that are dysregulated in the training set. Note that the actual genes in
cannot be directly compared across the supervised and unsupervised screening approaches, as the set of genes used in each approach is largely non-overlapping.
5. Discussion
In this paper, we consider the task of identifying features whose correlations differ across conditions, as opposed to identifying features whose means differ across conditions. Correlation differences may occur independently of mean differences, and selection methods based on correlation differences have been proposed in the statistical literature (Hu and others, 2010; Cai and others, 2013). But the previously proposed methods test each feature simultaneously, which can be problematic in a scenario where just a small number of features have differential correlations with many other features across the two conditions. Instead, we proposed a serial testing approach, which overcomes the problems associated with simultaneous testing.
In this article, we propose the estimator
of
. Our proposal builds upon classical Wald-type tests of
. We control the asymptotic type I error rate, and demonstrate desirable performance in a variety of simulation settings. Specifically, when non-zero values of
are concentrated in certain rows/columns, our approach outperforms proposals that are based on simultaneous hypothesis tests (Hu and others, 2010; Cai and others, 2013).
We restrict attention to the low-dimensional setting, in which the numbers of observations in each group,
and
, exceed the number of features,
, for two reasons:
The asymptotic results on type I error control required
. In order to move into the high-dimensional setting, we would need to consider an alternative to
(2.2) for use in defining
and
. For example, the proposal of Cai and others (2013) could possibly be extended.
For large
, tests of
in (1.2) become computationally intractable as
increases. We note that if
is small, then typically computations are not a concern, as we must only test
for
increasing until we fail to reject some null hypothesis. Furthermore, for a given value of
, the computations involved in testing
can be easily parallelized. Future work could involve developing an alternative to considering all
combinations in order to test
. For instance, to test
, we could consider the
combinations that result from serially removing the feature
that corresponds to the largest
.
In Section 4, we show how to apply our proposal to high-dimensional data by screening to reduce the number of features.
We leave the challenging task of studying the sampling variability of
to future work. In addition, further research could focus on relaxing the conservativeness of
that results from the use of the union inequality in the proof of Proposition 2.
Supplementary material
Supplementary Material is available at http://biostatistics.oxfordjournals.org.
Funding
D.W. work was supported in part by NIH DP5OD009145, NSF CAREER DMS-1252624, and a Sloan Research Fellowship. X.-H.Z. work was supported in part by U.S. Department of Veterans Affairs, Veterans Affairs Health Administration Research Career Scientist Award (RCS 05-196).
Supplementary Material
Acknowledgements
We thank Palma London and Su-In Lee for cleaning, annotating, and sharing with us the GBM gene expression dataset studied in Section 4.1.
References
- Amar D., Safer H., Shamir R. (2013). Dissection of regulatory networks that are altered in disease via differential co-expression. PLoS Computational Biology 9(3), e1002955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartlett M. (1937). Properties of sufficiency and statistical tests. Proceedings of the Royal Statistical Society Series A 160, 268–282. [Google Scholar]
- Bockmayr M., Klauschen F., Gyorffy B., Denkert C., Budezies J. (2013). New network topology approaches reveal differential correlation patterns in breast cancer. BMC Systems Biology 7, 78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bourgon R., Gentleman R., Huber W. (2010). Independent filtering increases detection power for high-throughput experiments. Proceedings of the National Academy of Sciences 107(21), 9546–9551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bredel M., Scholtens D. M., Yadav A. K., Alvarez A. A., Renfrow J. J., Chandler J. P, Yu I. L. Y., Carro M. S., Dai F., Tagge M. J.. and others (2011). NFKBIA deletion in glioblastomas. New England Journal of Medicine 364(7), 627–637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai T., Liu W., Xia Y. (2013). Two-sample covariance matrix testing and support recovery in high-dimensional and sparse settings. Journal of the American Statistical Association 108(501), 265–277. [Google Scholar]
- Cole N. (1968). The likelihood ratio test of the equality of correlation matrices. Technical Report No. 65.
- Danaher P., Wang P., Witten D. (2014). The joint graphical lasso for inverse covariance estimation across multiple classes. Journal of the Royal Statistical Society, Series B 76(2), 373–397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dettling M., Gabrielson E., Parmigiani G. (2005). Searching for differentially expressed gene combinations. Genome Biology 6, R88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gill R., Datta S., Datta S. (2010). A statistical framework for differential network analysis. BMC Bioinformatics 11, 95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo J., Levina E., Michalidis G., Zhu J. (2011). Joint estimation of multiple graphical models. Biometrika 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ho Y. Y., Cope L., Dettling M., Parmigiani G. (2007). Statistical methods for identifying differentially expressed gene combinations. Methods in Molecular Biology 408, 171–191. [DOI] [PubMed] [Google Scholar]
- Hu R., Qiu X., Glazko G. (2010). A new gene selection procedure based on the covariance distance. Bioinformatics 26(3), 348–354. [DOI] [PMC free article] [PubMed] [Google Scholar]
-
Jennrich R. (1970). An asymptotic
test for the equality of two correlation matrices. Journal of the American Statistical Association
65, 904–912. [Google Scholar] - Kullback S. (1967). On testing correlation matrices. Applied Statistics 16, 80–85. [Google Scholar]
- Lai Y. L., Wu B., Chen L., Zhao H. Y. (2004). Statistical method for identifying differential gene-gene coexpression patterns. Bioinformatics 20(17), 3146–3155. [DOI] [PubMed] [Google Scholar]
- Larntz K., Perlman M. (1985). A simple test for the equality of correlation matrices. Rapport technique, Department of Statistics, University of Washington, 141.
- Layard M. (1972). Large sample tests for the equality of two covariance matrices. Annals of Mathematical Statistics 43, 149–151. [Google Scholar]
- Ledoit O., Wolf M. (2002). Some hypothesis tests for tehe covariance matrix when the dimension is large compared to the sample size. The Annals of Statistics 30, 1081–1102. [Google Scholar]
- Li W., Qin Y. (2014). Hypothesis testing for high-dimensional covariance matrices. Journal of Multivariate Analysis 128, 108–119. [Google Scholar]
- Modarres R., Jernigan R. W. (1992). Testing the equality of correlation matrices. Communications in Statistics - Theory and Methods 21(8), 2107–2125. [Google Scholar]
- Mohan K., Chung M., Han S., Witten D., Lee S. I., Fazel M. (2012). Structured learning of Gaussian graphical models. Advances in Neural Information Processing Systems, 620–628. [PMC free article] [PubMed] [Google Scholar]
- Mohan K., London P., Fazel M., Witten D., Lee S.-I. (2014). Node-based learning of multiple gaussian graphical models. The Journal of Machine Learning Research 15(1), 445–488. [PMC free article] [PubMed] [Google Scholar]
- Muirhead R. (1982) Aspects of Multivariate Statistical Theory. New York, NY: Wiley. [Google Scholar]
- Neudecker H., Wesselman A. M. (1990). The asymptotic variance matrix of the sample correlation matrix. Linear Algebra and its Applications 127, 589–599. [Google Scholar]
- Satorra A., Neudecker H. (1997). Compact matrix expressions for generalized Wald tests of equality of moment vectors. Journal of Multivariate Analysis 63, 259–276. [Google Scholar]
- Schott J. R. (2007). A test for the equality of covariance matrices when the dimension is large relative to the sample size. Computational Statistics and Data Analysis 51, 6535–6542. [Google Scholar]
- Seber G. (1984) Multivariate Observations. USA: John Wiley & Sons. [Google Scholar]
- Shedden K., Taylor J. (2004). Differential correlation detects complex association between gene expression and clinical outcomes in lung adenocarcinomas. Methods in Microarray Data Analysis 4, 121–132. [Google Scholar]
- Singh D., Febbo P. G., Ross K., Jackson D. G., Manola J., Ladd C., Tamayo P., Renshaw A. A., D'Amico A. V., Richie J. P.. and others (2002). Gene expression correlates of clinical prostate cancer behavior. Cancer Cell 1(2), 203–209. [DOI] [PubMed] [Google Scholar]
- Verhaak R. G. W., Hoadley K. A., Purdom E., Wang V., Qi Y., Wilkerson M. D., Miller C. R., Ding L., Golub T., Mesirov J. P.. and others (2010). Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in pdgfra, idh1, egfr, and nf1. Cancer Cell 17(1), 98–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilks S. (1932). Certain generalisations in the analysis of variance. Biometrika 24, 471–494. [Google Scholar]
- Xiao Y. H., Frisina R., Gordon A., Klebanov L., Yakovlev A. (2004). Multivariate search for differentially expressed gene combinations. BMC Bioinformatics 5(164). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

























































