Skip to main content
. 2020 Sep 24;11(47):12777–12788. doi: 10.1039/d0sc02639e

Prediction performance of the pre-trained model, average performance and standard deviation of the individual fine-tuned models that comprise the ensemble, and performance of the ensemble, for comparable output sizes. The table indicates the percentage of drugs for which at least one, at least half and all reference metabolites have been correctly identified, as well as, the total number of identified metabolites.

Model Output size At least one metabolite (%) At least half metabolites (%) All metabolites (%) Total identified metabolites Precision (%) Recall (%)
Pre-trained (beam 15) 9.1 39.3 27.4 13.1 49 6.4 22.6
Average (beam 15) 9.3 ± 0.4 78.8 ± 4.6 61.7 ± 5.7 33.1 ± 4.1 102.3 ± 8.0 13.1 ± 0.8 47.2 ± 3.7
Ensemble (beam 5) 10.2 90.5 77.4 42.9 125 14.5 57.6