2020 Nov 4;11:5575. doi: 10.1038/s41467-020-19266-y

Table 4.

Comparison of recently published methods for direct synthesis prediction on the USPTO-MIT set.

Model	Top-1 separated (%)	Top-1 mixed (%)	Top-2 separated (%)	Top-2 mixed (%)	Top-5 separated (%)	Top-5 mixed (%)	Ref.
Transformer (single model)	90.4	88.6	93.7	92.4	95.3	94.2	¹⁸
Transformer (ensemble of models)	91		94.3		95.8		¹⁸
Seq2Seq	80.3				87.5		¹¹
WLDN	79.6				89.2		³²
GTPN	83.2				86.5		⁴⁰
WLDN5	85.6				93.4		²³
AT, this work^a	91.9	90.4	95.4	94.6	97	96.5
AT trained with same training set as in ref. ²².	92	90.6	95.4	94.4	97	96.1

^aResults for models applied to the ×100 augmented dataset with beam size = 10. The model was trained on a set of 439 k reactions, which combines the 400 k training set and the 39 k validation set from ref. ²². For the last row of the table, the model was trained on the 400 k training set only, to allow a closer comparison with the performance of previous models.
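The Top-1, Top-2 and Top-5 columns count a prediction as correct when the reference product appears among the first n ranked outputs of the model. A minimal sketch of that calculation follows. It is not the authors' evaluation code: the function name `top_k_accuracy` and the toy SMILES strings are hypothetical, and strings are compared verbatim, which assumes that predictions and targets have already been canonicalized in the same way.

```python
from typing import Sequence


def top_k_accuracy(ranked_predictions: Sequence[Sequence[str]],
                   targets: Sequence[str],
                   k: int) -> float:
    """Return the percentage of targets found among the first k predictions.

    ranked_predictions[i] lists the candidates for reaction i, best first
    (for example, the hypotheses kept by a beam search).
    """
    if len(ranked_predictions) != len(targets):
        raise ValueError("need one ranked prediction list per target")
    if not targets:
        raise ValueError("empty evaluation set")
    # Assumption: exact string match on identically canonicalized SMILES.
    hits = sum(target in preds[:k]
               for preds, target in zip(ranked_predictions, targets))
    return 100.0 * hits / len(targets)


# Toy data invented for illustration; not taken from USPTO-MIT.
predictions = [
    ["CCO", "CCN", "CCC"],
    ["c1ccccc1", "CC=O", "CCO"],
    ["CCN", "CCC", "CCO"],
    ["CC(=O)O", "CCO", "CCN"],
]
targets = ["CCO", "CC=O", "CCCl", "CC(=O)O"]

top1 = top_k_accuracy(predictions, targets, 1)
top2 = top_k_accuracy(predictions, targets, 2)
print(f"Top-1: {top1:.1f}%  Top-2: {top2:.1f}%")
```

Because the first k predictions always include the first k - 1, top-n accuracy cannot decrease as n grows, which is the pattern seen across the Top-1, Top-2 and Top-5 columns above.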