Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Jun 6.
Published in final edited form as: J R Stat Soc Series B Stat Methodol. 2019 Mar 18;81(2):229–231.

Discussion on Covariate-assisted ranking and screening for large-scale two-sample inference

Guo Yu 1, Jacob Bien 2, Daniela Witten 3
PMCID: PMC6553469  NIHMSID: NIHMS1019147  PMID: 31178656

In this discussion contribution, we connect the elegant proposal of Cai, Sun and Wang to multiview data, in which multiple sets of variables (or ‘views’) are measured on the same observations. Using ideas from Section 4, we show that we can exploit a secondary view to improve power for testing on the first view.

Consider independent and identically distributed observations of m random variables under two conditions. In condition l{1,2}, observation i{1,,nl} of variable j{1,,m} is given by (view 1 )

Xij(l)=μj(l)+εij(l),

Where εij(l) is zero mean, and we suppress the common intercept. The random-mean vectors μ(1) and μ(2) are sparse. Furthermore, for the same individuals, we also observe a second view of m˜ variables (view 2):

Zik(l)=μ˜k(l)+ε˜ik(l)fork{1,,m˜}.

The mean vectors μ˜(l) are sparse, ε˜ik(l)is zero mean and again we suppress the intercept. Suppose that the two views satisfy a hierarchical sparsity constraint: for j{1,,m} and l{1,2},

μ˜σ(j)(l)=0μj(l)=0, (6)

where σ(j) maps the jth entry of μ(l) to its parent in μ˜(l): Fig. 11.

Fig. 11.

Fig. 11.

Schematic diagram of constraint (6) with σ(3) = 1

Concretely, suppose that X(l) and Z(l) contain protein and gene expression measurements respectively. If transcripts that encode the jth protein are absent (i.e. μ~σ(j)(l)=0),then the jth protein cannot be present (i.e.μj(l)=0).

Suppose that(μj(1),μ˜σ(j)(1)) is independent of (μj(2),μ˜σ(j)(2)). Further assume that the random errors (εij(l),ε˜iσ(j)(l)) are bivariate normal and independent across j, l and i, and independent of μ(l) and μ˜(l).

Using the terminology of Cai, Sun and Wang the ‘primary statistic’ for testing H0j:μj(1)=μj(2) is

Tj=Cj{X¯j(1)X¯j(2)}

for some constant Cj. We consider a pair of ‘auxiliary statistics’,

Rj=Dj[X¯j(1)+n2var{εij(1)}n1var{εij(2)}X¯j(2)],
Sj=Ej[Z¯σ(j)(1)+n2cov{εij(1),ε˜iσ(j)(1)}n1cov{εij(1),ε˜iσ(j)(2)}Z¯σ(j)(2)],

for some constants Dj and Ej. The statistic Rj is the same as T2j in the paper, whereas Sj is constructed by using the second data view. A small value of |Sj| provides evidence for μ˜σ(j)(1)=μ˜σ(j)(2)=0, which by constraint (6) suggests that μj(1)=μj(2). By analogy with proposition 1 in the paper, the oracle statistic is

TOR(j)(tj,rj,sj)Pr(θ1j=0|Tj=tj,Rj=rj,Sj=sj)=f(tj,rj,sj|θ1j=0)Pr(θ1j=0)f(tj,rj,sj)=f(tj|θ1j=0)f(rj,sj|θ1j=0)Pr(θ1j=0)f(tj,rj,sj).

Moreover, TOR(j)(tj,rj,sj)enjoys the properties in theorem 3 of the paper. Detailed proofs are available from https://hugogogo.github.io/paper/cars_discussion_supplement.pdf. If there is not a one-to-one mapping between σ(j) and j then TOR(j)(tj,rj,sj) must be estimated carefully.

Supplementary Material

supplement

Fig. 9.

Fig. 9.

(a) Power comparison and (b) empirical misclassification rates for two classes Nm(μ1,lm) and Nm(μ2,Im) based on 500 replications (FDR level α = 0.05; n1 = 50; n2 = 60; m = 1000; μ1,1:k=5/30;μ1,(k+1):(2k)=4/30;μ1,(2k+1):m=0;μ2,1:k=2/30;μ2,(k+1);(2k)=4/30;μ2,(2k+1):m=0):,:Inline graphic, method (5) based on Benjamini and Hochberg (1995);Inline graphic, method (5) based on CARS;Inline graphic, Bayes rule

Fig. 10.

Fig. 10.

Empirical misclassification rates when the same amount of locations are chosen for both methods:Inline graphic, Benjamini and Hochberg (1995);Inline graphic, CARS;Inline graphic, Bayes rule

Contributor Information

Guo Yu, University of Washington, Seattle.

Jacob Bien, University of Southern California, Los Angeles.

Daniela Witten, University of Washington, Seattle.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

RESOURCES