. 2022 Nov 7;39(1):btac715. doi: 10.1093/bioinformatics/btac715

Table 1.

Statistical information of the eight independent test datasets

Independent test datasets^a	Positive	Negative
XUAMP	1536	1536
APD3	494	494
DRAMP	1408	1408
LAMP	1054	1054
CAMP	203	203
dbAMP	522	522
YADAMP	324	324
DBAASP	178	178

The relationship among the eight independent test datasets is shown in Figure 1. We adopt CD-HIT (Huang et al., 2010) to remove the redundancy between the independent datasets and the training sets. Following (Veltri et al., 2018), the similarity between positive training samples and positive independent test samples is <90%, and the similarity between negative training samples and negative independent test samples is <40%.