Author manuscript; available in PMC: 2024 Aug 9.

Published in final edited form as: Nat Microbiol. 2024 Jan 29;9(2):537–549. doi: 10.1038/s41564-023-01584-8

Extended Data Figure 4: (a) EFAM VPFs with hits to annotated PHROG HMMs (the test set) are used to evaluate model calibration for each category. For each class, predicted probabilities across all VPFs in the test set are binned into 10 partitions, and the fraction of true positives in each bin is calculated. A perfectly calibrated model (dotted line) has a true-positive proportion equal to the mean predicted probability in each bin. Points below the dotted line indicate overconfidence, and points above it indicate underconfidence. (b) Histogram of the number of predictions across the test set in each probability bin.
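
The binning described in the caption can be illustrated with a minimal sketch. It is not the authors' code: the function name `reliability_bins`, the use of equal-width bins on [0, 1], and the toy data are all assumptions made for illustration (the caption states only that probabilities are split into 10 partitions). For each bin it returns the mean predicted probability and the fraction of true positives, as in panel (a), and the prediction count, as in panel (b).

```python
def reliability_bins(probs, labels, n_bins=10):
    """Return one (mean_predicted_prob, fraction_positive, count) tuple per bin.

    Assumes equal-width bins on [0, 1]; empty bins yield (None, None, 0).
    """
    sums = [0.0] * n_bins   # sum of predicted probabilities per bin
    hits = [0] * n_bins     # number of true positives per bin
    counts = [0] * n_bins   # number of predictions per bin (panel b)
    for p, y in zip(probs, labels):
        # A probability of exactly 1.0 is placed in the last bin.
        i = min(int(p * n_bins), n_bins - 1)
        sums[i] += p
        hits[i] += y
        counts[i] += 1
    rows = []
    for i in range(n_bins):
        if counts[i] == 0:
            rows.append((None, None, 0))
        else:
            rows.append((sums[i] / counts[i], hits[i] / counts[i], counts[i]))
    return rows


# Hypothetical toy data for a single class: predicted probability and true label.
probs = [0.05, 0.15, 0.15, 0.85, 0.85, 0.85, 0.85, 0.95]
labels = [0, 0, 1, 1, 1, 1, 0, 1]

for mean_p, frac_pos, n in reliability_bins(probs, labels):
    if n == 0:
        continue
    # frac_pos < mean_p: point lies below the diagonal (overconfident).
    # frac_pos > mean_p: point lies above the diagonal (underconfident).
    print(f"mean_p={mean_p:.2f}  frac_pos={frac_pos:.2f}  n={n}")
```

In this toy example, the bin holding the four 0.85 predictions has three true positives, so its point sits below the diagonal (0.75 < 0.85), which corresponds to overconfidence in the sense used in the caption.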