Skip to main content
. 2022 May 10;29(5):465–482. doi: 10.1089/cmb.2021.0403
Algorithm 1: Reference data selection
Input: N×M data matrix X; parameter vector a.
 1. Filter out cells with library size greater than a1-th percentile.
 2. Remove genes with mean expression less than a2-th percentile.
 3. Remove genes with less than a3-th percentile nonzero cells.
 4. Keep cells with library size greater than the a4-th percentile.
 5. Keep genes with nonzero proportion greater than a5-th percentile.
The default parameter values area1=95,a2=25,a3=15,a4=5,a5=50.
Output: N×M reference data matrix Y.