Skip to main content
. 2021 Feb 8;10:e57116. doi: 10.7554/eLife.57116

Figure 2. Identification of essential genes based on combined dependency score.

Figure 2.

(A) Dependency scores defined with different mixing ratios are computed by projecting each point onto the corresponding lines. θ denotes the fraction of CRISPR dependency scores. PC1 is the direction of the primary principal component line. (B) The distributions of combined dependency scores for four genes shown in Figure 1B. (C) Top panel: the distribution of the combined dependency score S0.6. The essentiality threshold T0.6 is determined based on this distribution. Bottom panel: the distribution of efficacy scores G,X0.6 with various X (-th percentile). Genes that satisfy G,X0.6<T0.6 are defined as commonly or selectively essential, and the number of essential genes depends on X. (D) On the diagonal line, the numbers of commonly or selectively essential genes identified with various θ are shown. In the off-diagonal area, the numbers of essential genes identified with two distinct θ are shown. Color code indicates an overlap index, a measure of overlap between two essential gene sets. The overlap index ranges from 0 (no shared genes) to 1 (the smaller set is included in the larger set). (E) The extent to which the genes claimed essential by only CRISPR or shRNA are covered by the essential genes discovered by each mixing ratio.

Figure 2—source data 1. The combined dependency scores for 15,847 protein-coding genes in 423 cell lines for six θ.
θ = 0 and 1 corresponds to shRNA and CRISPR scores compared in Figure 1.
elife-57116-fig2-data1.xlsx (314.1MB, xlsx)