Table 1:
Dataset | Train | Valid | Test | Total | Task |
---|---|---|---|---|---|
USPTO_TPL56 a | 360,545 | 40,059 | 44,511 | 445,115 | Reaction type classification |
USPTO_MIT12 | 409,035 | 30,000 | 40,000 | 479,035 | Forward prediction |
USPTO_50k29 a | 40,029 | 5,004 | 5,004 | 50,037 | Retrosynthesis |
C-N Coupling44
a, b (Random splits) |
2,767 | – | 1,188 | 3,955 | Reaction yield prediction |
C-N Coupling44
a, b (Out-of-sample test1) |
3,057 | – | 898 | 3,955 | Reaction yield prediction |
C-N Coupling44
a, b (Out-of-sample test2, 4 |
3,055 | – | 900 | 3,955 | Reaction yield prediction |
C-N Coupling44
a, b (Out-of-sample test3) |
3,058 | – | 897 | 3,955 | Reaction yield prediction |
USPTO_500_MTa | 116,360 | 12,937 | 14,238 | 143,535 | Multi-task prediction |
Contains stereochemical information
With reactants/reagents separation