2019 Nov 12;11:69. doi: 10.1186/s13321-019-0391-2

Table 3.

Selection of functional groups detected by Checkmol, with the corresponding number of molecules in the PC9 and QM9 datasets and the MAE in generalization conditions (the QM9 model evaluated on PC9 data and the PC9 model evaluated on QM9 data)

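| Selection | Functional group class | PC9 data: occurrences | PC9 data: MAE of QM9 model | QM9 data: occurrences | QM9 data: MAE of PC9 model |
| --- | --- | --- | --- | --- | --- |
| CF and NN bonds | Azide | 358 | 11.5 | 0 | |
| CF and NN bonds | Azo compound | 272 | 5.3 | 10 | 4.1 |
| CF and NN bonds | Acyl fluoride | 105 | 6.4 | 0 | |
| CF and NN bonds | Aryl fluoride | 936 | 9.4 | 1562 | 4.1 |
| CF and NN bonds | Alkyl fluoride | 4576 | 6.8 | 52 | 4.3 |
| Abundant in QM9 | Carbonitrile | 4624 | 5.4 | 10315 | 4.7 |
| Abundant in QM9 | Secondary alcohol | 6282 | 6.2 | 10668 | 4.3 |
| Abundant in QM9 | Trialkylamine | 3301 | 6.4 | 10687 | 4.2 |
| Abundant in QM9 | Alkyne | 3906 | 6.1 | 10873 | 4.5 |
| Abundant in QM9 | Tertiary amine | 3388 | 6.4 | 11057 | 4.2 |
| Abundant in QM9 | Aromatic compound | 12728 | 8.2 | 15863 | 4.3 |
| Abundant in QM9 | Dialkyl ether | 9275 | 6.2 | 24012 | 4.3 |
| Abundant in QM9 | Heterocyclic compound | 42665 | 7.0 | 61904 | 4.3 |
| QM9 model focus | Hydroperoxide | 717 | 45.4 | 0 | |
| QM9 model focus | Diaryl ether | 22 | 39.1 | 0 | |
| QM9 model focus | Peroxide | 430 | 32.7 | 0 | |
| QM9 model focus | Diarylamine | 11 | 32.6 | 0 | |
| QM9 model focus | Carbamic acid halide | 16 | 24.4 | 0 | |
| QM9 model focus | Nitrite | 1 | 22.7 | 0 | |
| QM9 model focus | Hydroxamic acid | 46 | 19.4 | 0 | |
| QM9 model focus | Nitroso compound | 48 | 17.0 | 0 | |
| QM9 model focus | Hydroxylamine | 805 | 16.2 | 13 | 3.4 |
| QM9 model focus | Hemiacetal | 819 | 2.8 | 0 | |
| PC9 model focus | Enol ether | 1977 | 4.4 | 2 | 16.0 |
| PC9 model focus | Amide acetal | 59 | 14.2 | 28 | 7.1 |
| PC9 model focus | Carboxylic acid | 4213 | 6.6 | 106 | 6.2 |
| PC9 model focus | Hemiaminal | 3034 | 6.5 | 207 | 6.0 |
| PC9 model focus | Acyl cyanide | 67 | 4.0 | 281 | 6.0 |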

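The selection covers CF and NN interactions, the most abundant groups in QM9, the groups with the largest MAE for the model trained on QM9, and the groups with the largest MAE for the model trained on PC9. No MAE is listed where a group has no occurrences in the dataset.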