2023 May 25;24(4):bbad186. doi: 10.1093/bib/bbad186

Table 2.

Benchmark datasets commonly used in recent machine learning- and deep learning-based methods.

Dataset           # fam.   # seq.    Length (nt)  Ref.
RNA STRAND        172      4666      4 ~ 4381     [79]
TORNADO dataset                                   [24]
 TrainSetA        11       3166      10 ~ 734
 TestSetA         11       697       10 ~ 768
 TrainSetB        22       1094      27 ~ 237
 TestSetB         22       430       27 ~ 244
Archive II        10       3975      28 ~ 2968    [80]
RNAStralign       8        30 451    30 ~ 1851    [81]
bpRNA-1m          2600     102 318   2 ~ 4381     [82]
bpRNAnew          1500     5401      33 ~ 489     [50]

# fam.: the number of families. # seq.: the number of sequences.