Skip to main content
. 2019 Nov 13;7(4):e14850. doi: 10.2196/14850

Table 1.

Performance of all the models on the 2010 i2b2/VA dataset.

Model F-1 (%) Precision (%) Recall (%)
Hidden semi-Markova 85.23 86.88 83.64
LSTMb by Liu et al [39]a 85.78 c c
LSTM by Wu et al [43]a 85.94 85.33 86.56
BiLSTMd + ELMo by Zhu et al [40]a 86.84 (0.16) 87.44 (0.27) 86.25 (0.26)
BiLSTM + Flair 87.01 (0.18) 87.54 (0.15) 86.49 (0.21)
BiLSTM + ELMo 87.01 (0.24) 87.64 (0.19) 86.40 (0.30)
BiLSTM + ELMo + Flair 87.30 (0.06) 87.78 (0.09) 86.85 (0.07)
BiLSTM + ELMo + Flair + semantic embedding 87.44 (0.07) 88.03 (0.14) 86.91 (0.10)

aModel is trained using the complete dataset of i2b2 2010, which contains 349 notes in the training set and 477 notes in the test set.

bLSTM: long short-term memory.

cNot reported.

dBiLSTM: bidirectional LSTM.