Figure 2.
Supervised machine learning steps. Labeled data is split into training and testing sets. The algorithm is then trained to learn the differences between the labeled data. The test set is used to check and refine algorithm performance. Predictions can subsequently be made using new data previously unseen by the algorithm