Sci Adv. 2023 Mar 3;9(9):eabq8123. doi: 10.1126/sciadv.abq8123

Table 1. Weighted out-of-sample performance of the TOC tool by broad crime type.

This table shows the out-of-sample classification performance of the production Text-based Offense Classification (TOC) model at the parent-class level (broad crime type) and at the child-class level [Uniform Crime Classification Standard (UCCS) code], weighted by the case count of each offense description. The model uses a hierarchical classification method, with a multilayer perceptron (MLP) classifier trained at each parent node using 5000 4-grams selected by term frequency–inverse document frequency (TF-IDF) from the preprocessed descriptions.
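The case-count weighting mentioned in the caption can be illustrated with a minimal sketch. It assumes one plausible reading: each unique offense description counts toward the confusion totals in proportion to the number of cases that carry it. The function name `weighted_prf` and the toy rows are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def weighted_prf(rows):
    """Case-count-weighted precision, recall, and F1 per class.

    rows: iterable of (true_label, predicted_label, case_count), one entry
    per unique offense description (an assumed input layout).
    """
    tp = defaultdict(float)  # weighted true positives per class
    fp = defaultdict(float)  # weighted false positives per class
    fn = defaultdict(float)  # weighted false negatives per class
    for true_label, pred_label, count in rows:
        if true_label == pred_label:
            tp[true_label] += count
        else:
            fp[pred_label] += count
            fn[true_label] += count

    scores = {}
    for label in set(tp) | set(fp) | set(fn):
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        pr_sum = precision + recall
        f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
        scores[label] = (precision, recall, f1)
    return scores


# Hypothetical toy data: (true class, predicted class, number of cases).
rows = [
    ("Drug", "Drug", 900),
    ("Drug", "Property", 100),
    ("Property", "Property", 500),
    ("Violent", "Violent", 300),
]
scores = weighted_prf(rows)
for label in sorted(scores):
    p, r, f1 = scores[label]
    print(f"{label}: precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```

Under this weighting, a misclassified description that appears in many cases lowers the scores more than a rare one, which is why the figures reflect performance over the caseload and not over the list of unique descriptions.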

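| Crime type | Broad crime type: Precision | Broad crime type: Recall | Broad crime type: F1 score | Full UCCS code: Precision | Full UCCS code: Recall | Full UCCS code: F1 score |
|---|---|---|---|---|---|---|
| All crime types | 0.983 | 0.983 | 0.983 | 0.963 | 0.963 | 0.963 |
| Broad crime types | | | | | | |
| Violent | 0.997 | 0.994 | 0.995 | 0.993 | 0.989 | 0.991 |
| Property | 0.927 | 0.990 | 0.957 | 0.884 | 0.944 | 0.913 |
| Drug | 0.999 | 0.960 | 0.979 | 0.862 | 0.828 | 0.845 |
| DUI | 0.987 | 0.986 | 0.986 | 0.942 | 0.941 | 0.941 |
| Public order | 0.993 | 0.938 | 0.965 | 0.977 | 0.923 | 0.949 |
| Criminal traffic | 0.987 | 0.991 | 0.989 | 0.986 | 0.991 | 0.988 |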
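As a reading aid, the per-class F1 scores in the table above can be checked against the usual definition of F1 as the harmonic mean of precision and recall. The sketch below uses the broad-crime-type columns. The helper name `f1_from` is hypothetical, and because the published precision and recall are rounded to three decimals, agreement is only expected to within about 0.001.

```python
def f1_from(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) at the broad-crime-type level,
# copied from the table.
reported = {
    "Violent": (0.997, 0.994, 0.995),
    "Property": (0.927, 0.990, 0.957),
    "Drug": (0.999, 0.960, 0.979),
    "DUI": (0.987, 0.986, 0.986),
    "Public order": (0.993, 0.938, 0.965),
    "Criminal traffic": (0.987, 0.991, 0.989),
}

# Largest absolute gap between the recomputed and the reported F1.
max_gap = max(abs(f1_from(p, r) - f1) for p, r, f1 in reported.values())

for name, (p, r, f1) in reported.items():
    print(f"{name}: recomputed={f1_from(p, r):.4f} reported={f1:.3f}")
```

The recomputed values match the reported ones up to rounding, which suggests the per-class F1 column follows directly from the precision and recall columns beside it.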