Table 2.
RNAStralign dataset statistics.
RNA type | Length | All_Num | 512_Num | Train | Val | Test | Deredundancy |
---|---|---|---|---|---|---|---|
5S_rRNA | 104–132 | 11419 | 11419 | 9172 | 1114 | 1133 | 867 |
tRNA | 59-95 | 9245 | 9245 | 7405 | 933 | 907 | 527 |
group_1_intron | 163–615 | 2135 | 2058 | 1606 | 237 | 215 | 116 |
16S_rRNA | 54–1851 | 12608 | 973 | 765 | 101 | 107 | 84 |
tmRNA | 102–437 | 637 | 637 | 519 | 60 | 58 | 49 |
SRP | 30–553 | 601 | 591 | 480 | 45 | 66 | 48 |
RNaseP | 189–486 | 467 | 467 | 363 | 52 | 52 | 45 |
telomerase | 382–559 | 37 | 35 | 30 | 1 | 4 | 4 |
Total | 30–1851 | 37149 | 25425 | 20340 | 2543 | 2542 | 1740 |