Skip to main content
. 2020 Jun 29;9:e54603. doi: 10.7554/eLife.54603

Figure 3. Simulations show that gene expression and capture efficiency influence the observed distribution of splicing.

(a) Simulations of alternative splicing and scRNA-seq under the binary-bimodal model, in which each cell produces one isoform or the other, but rarely both. As in Figure 2b, each row of the histogram shows Ψ^ for one intermediate exon across all cells. The observed Ψ^ distribution is similar to the true Ψ distribution, and its shape is largely unaffected by capture efficiency. (b) Simulations with the non-binary, unimodal model, in which most cells present a mixture of the two alternative isoforms. Exons with high expression have a unimodal distribution of true Ψ. Low capture efficiency results in an increase in binary observations (only one isoform observed), leading to a distortion of the observed distribution of Ψ^ to look bimodal. Only a handful of the highest expressed exons maintain a unimodal distribution of Ψ^. Fewer exons show unimodal splicing as the capture efficiency is reduced. (c) Under the binary-bimodal model, exons with high coverage have slightly fewer binary Ψ observations, and (d) simulated cells with a high number of total splice junction reads have slightly fewer exons with binary Ψ^. (e) Under the unimodal model, exons with intermediate splicing show a strong decrease in binary observations as coverage increases, as seen in real data (Figure 2c). (f) Similarly, simulated cells with high read coverage have a decrease of the proportion of binary Ψ^. (g) Effect of capture efficiency on the proportion of binary observations of cassette exons with underlying Ψ = 0.5. (h) Effect of the initial number of mRNA molecules and underlying Ψ on the proportion of binary Ψ^ observations.

Figure 3.

Figure 3—figure supplement 1. Theoretical calculations and simulations of the effect of biological and technical factors in splicing observations.

Figure 3—figure supplement 1.

(a) Theoretical likelihood of capturing only mRNAs representing one isoform of an alternatively spliced gene in a single cell, determined by the total number of mRNAs in the cell, the Ψ of the isoforms, and a 10% capture efficiency. (b) Probability of observing a binary Ψ^ (only one isoform observed) given the number of mRNA molecules of that gene present in the cell, when the underlying Ψ is fixed at 0.5. (c) Effect of the number of captured mRNA molecules on the uncertainty of Ψ^ observations. Ψ^ is understood as the maximum likelihood estimate of the underlying Ψ. (d) The posterior probability of having an observed Ψ^ within 0.1 of the underlying Ψ. (e) Distribution of the relative error between the average observed Ψ^, and the average true Ψ, in our simulations under different capture efficiency rates. The relative error is calculated as the difference between the average Ψ^ and the average true Ψ, scaled by the average true Ψ. (f) Distribution of the relative error between the variance in the observed Ψ^, and the variance in the true Ψ, in our simulations under different capture efficiency rates.
Figure 3—figure supplement 2. Schematic of scRNA-seq splicing simulator.

Figure 3—figure supplement 2.

Simulator of scRNA-seq splicing data. Green elements represent biological variables; blue elements represent technical processes. The underlying Ψ is drawn from a Beta distribution with either a bimodal or unimodal shape. Individual gene expression determines the total number of mRNAs, and these mRNAs are spliced stochastically according to Ψ, producing isoforms that splice in or skip the exon. mRNAs are captured with a probability drawn from a normal distribution. Sequencing produces splice junction reads, which determine the observed Ψ^.