Skip to main content
. 2010 Sep-Oct;17(5):507–513. doi: 10.1136/jamia.2009.001560

Table 2.

Accuracy results from the OpenNLP ME classifier. Rows represent training data; columns are test data. 10-fold cross validation with 80/20 data split for each fold

GENIA PTB Mayo
(a) Accuracy for the openNLP sentence boundary detector
GENIA 0.986 0.646 0.821
PTB 0.967 0.944 0.940
Mayo 0.959 0.652 0.947
GENIA+PTB+Mayo 0.986 0.942 0.949
(b) Accuracy results for the openNLP POS TAGGER
GENIA 0.986 0.764 0.804
PTB 0.851 0.969 0.878
Mayo 0.812 0.844 0.940
GENIA+PTB+Mayo 0.984 0.969 0.936

ME, maximum entropy; POS, part-of-speech.