editorial
2019 Jul 11;8:163. doi: 10.1186/s13643-019-1074-9

Fig. 1.

Classifying text using machine learning, in this example logistic regression with a ‘bag of words’ representation of the texts. The system is ‘trained’ by learning a coefficient (or weight) for each unique word in a manually labelled set of documents (typically numbering in the thousands). In use, the learned coefficients are combined with the words of a previously unseen document to predict the probability that it belongs to the class of interest.
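
The process described in the caption can be illustrated with a minimal sketch. The four toy documents, their labels, and all function names (`tokenize`, `bag_of_words`, `train`, `predict_proba`) are hypothetical and chosen only for illustration; a real training set would contain thousands of labelled records. The weights are fitted here by plain stochastic gradient descent, which is one common choice and not necessarily the method used by any particular tool.

```python
import math
from collections import Counter


def tokenize(text):
    # Lower-case and split on whitespace; real systems use richer tokenisers.
    return text.lower().split()


def bag_of_words(text, vocab):
    # Count how often each vocabulary word occurs; word order is discarded.
    counts = Counter(tokenize(text))
    return [counts.get(word, 0) for word in vocab]


def sigmoid(z):
    # Map a weighted sum onto a probability between 0 and 1.
    return 1.0 / (1.0 + math.exp(-z))


def train(docs, labels, epochs=500, lr=0.1):
    # Learn one coefficient per unique word, plus an intercept (bias).
    vocab = sorted({word for doc in docs for word in tokenize(doc)})
    features = [bag_of_words(doc, vocab) for doc in docs]
    weights = [0.0] * len(vocab)
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            p = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            error = p - y  # gradient of the log loss w.r.t. the weighted sum
            bias -= lr * error
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
    return vocab, weights, bias


def predict_proba(text, vocab, weights, bias):
    # Words not seen during training have no coefficient and are ignored.
    x = bag_of_words(text, vocab)
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Hypothetical labelled examples: 1 = relevant (e.g. a trial), 0 = not relevant.
docs = [
    "randomised controlled trial of aspirin",
    "patients were randomised to placebo",
    "case report of a rare tumour",
    "narrative review of tumour biology",
]
labels = [1, 1, 0, 0]

vocab, weights, bias = train(docs, labels)

p_trial = predict_proba("randomised trial of placebo", vocab, weights, bias)
p_other = predict_proba("case report of tumour biology", vocab, weights, bias)
print(f"trial-like document: {p_trial:.2f}")
print(f"other document:      {p_other:.2f}")
```

In this sketch, words that appear mainly in the positively labelled documents (such as "randomised") end up with positive coefficients, and words that appear mainly in the negatively labelled ones (such as "tumour") end up with negative coefficients, so an unseen document is scored by summing the coefficients of the words it contains.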