2020 Jan 21;5(1):e00774-19. doi: 10.1128/mSystems.00774-19

FIG 5.

Overview of the prediction strategy. First, all isolates are clustered by genome similarity using the KMA method. The clusters, rather than the individual isolates, are then randomly divided into three groups, so that similar isolates never appear in more than one of the training, validation (Validat.), and testing sets. The neural network and random forest models, drawn schematically at the top and bottom of the right side of the figure, respectively, were applied to the resulting data sets.
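The cluster-level split described in the caption can be sketched as follows. This is only an illustration, not the authors' code: the function name `split_clusters`, the isolate and cluster labels, and the round-robin assignment of shuffled clusters to groups are all assumptions, and the KMA clustering step itself is not reproduced (the cluster labels are taken as given).

```python
import random
from collections import defaultdict


def split_clusters(cluster_of, n_groups=3, seed=0):
    """Assign whole clusters to groups so that no cluster spans two groups.

    cluster_of: dict mapping isolate ID -> cluster ID (hypothetical input,
    assumed to come from a prior genome-similarity clustering step).
    """
    # Collect the isolates belonging to each cluster.
    clusters = defaultdict(list)
    for isolate, cluster_id in cluster_of.items():
        clusters[cluster_id].append(isolate)

    # Shuffle the cluster IDs (not the isolates) with a fixed seed.
    cluster_ids = sorted(clusters)
    random.Random(seed).shuffle(cluster_ids)

    # Assumption: deal shuffled clusters round-robin into the groups.
    groups = [[] for _ in range(n_groups)]
    for i, cluster_id in enumerate(cluster_ids):
        groups[i % n_groups].extend(clusters[cluster_id])
    return groups


# Hypothetical example: six isolates in four clusters.
cluster_of = {
    "iso1": "c1", "iso2": "c1",
    "iso3": "c2",
    "iso4": "c3", "iso5": "c3",
    "iso6": "c4",
}
train, validation, test = split_clusters(cluster_of)
```

Because the shuffle operates on cluster IDs, isolates that share a cluster always land in the same group, which is the property the caption relies on to keep similar isolates from appearing in more than one set.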