Skip to main content
. Author manuscript; available in PMC: 2018 May 22.
Published in final edited form as: J Chem Inf Model. 2018 Feb 8;58(2):501–510. doi: 10.1021/acs.jcim.7b00397

Table 1.

Statistical Composition of the Training and Independent Validation Data Sets

data set no. of sequences numPa numNb ratioc
PATP-388 388 5,657 142,086 25.12
PATP-TEST 41 674 14,159 21.01
a

numP represents the number of positive samples.

b

numN represents the number of negative samples.

c

ratio = numN/numP.