Skip to main content
. 2021 Dec 3;118(49):e2110828118. doi: 10.1073/pnas.2110828118

Fig. 5.

Fig. 5.

The source biomes predicted by MetaSource for Pfam families. (A) The receiver operating characteristic (ROC) analysis of binary-classification MetaSource model. This model was constructed to determine whether the source biome of the query Pfam family is one of the four biomes. (B) The ROC analysis of multiple-classification MetaSource model. This model was constructed to predict the source biome for Pfam families. To evaluate the overall prediction accuracy, the microaverage (obtained by aggregating the contributions of all classes to compute the average metric) and macroaverage value (calculated by the metric independently for each class and taking the average) were applied. (C) The Pfam classification result for all the Pfam families based on the prediction result of MetaSource model. (D) Average TM-score, accuracy of top-L contacts, and average MSA search time for the combined and MetaSource predicted biome datasets. (E) Case studies of modeling Pfam (PF08941 and PF00737) with MSA from different biomes. The model with the highest TM-score is shown in blue font. The model labeled with red frame is the source biome predicted by MetaSource.