Table 1.
Dataset | Sequencing | No. unique entries | Size (MB) |
---|---|---|---|
Simulated Ig | Simulated | 2 000 | 1.1 |
Stanford S22 (Jackson et al., 2010) | Roche 454 | 13 153 | 3.4 |
Mouse Ig-seq (Halemano et al., 2014) | Illumina MiSeq | 204 462 | 80.0 |
Human Ig-seq (Safonova et al., 2015) | Illumina MiSeq | 3 099 967 | 1 173.0 |
Simulated datasets are evaluated in a supervised manner, and real datasets are compared in an unsupervised manner.