. 2023 Mar 3;9(9):eabq8123. doi: 10.1126/sciadv.abq8123

Table 1. Weighted out-of-sample performance of the TOC tool by broad crime type.

This table shows the out-of-sample classification performance of the production Text-based Offense Classification (TOC) model at the parent class (broad crime type) and at the child class [Uniform Crime Classification Standard (UCCS) code] weighted by the case count of each offense description. The model uses hierarchical classification method with multilayer perceptron (MLP) classifier trained at each parent node using 5000 4-grams selected by term frequency–inverse document frequency (TF-IDF) from preprocessed descriptions.

	Broad crime type			Full UCCS code
	Precision	Recall	F1 score	Precision	Recall	F1 score
All crime types	0.983	0.983	0.983	0.963	0.963	0.963
Broad crime types
Violent	0.997	0.994	0.995	0.993	0.989	0.991
Property	0.927	0.990	0.957	0.884	0.944	0.913
Drug	0.999	0.960	0.979	0.862	0.828	0.845
DUI	0.987	0.986	0.986	0.942	0.941	0.941
Public order	0.993	0.938	0.965	0.977	0.923	0.949
Criminal traffic	0.987	0.991	0.989	0.986	0.991	0.988