2019 Nov 13;7(4):e14850. doi: 10.2196/14850

Table 1. Performance of all the models on the 2010 i2b2/VA dataset.

Model	F-1 (%)	Precision (%)	Recall (%)
Hidden semi-Markov^a	85.23	86.88	83.64
LSTM^b by Liu et al [39]^a	85.78	—^c	—^c
LSTM by Wu et al [43]^a	85.94	85.33	86.56
BiLSTM^d + ELMo by Zhu et al [40]^a	86.84 (0.16)	87.44 (0.27)	86.25 (0.26)
BiLSTM + Flair	87.01 (0.18)	87.54 (0.15)	86.49 (0.21)
BiLSTM + ELMo	87.01 (0.24)	87.64 (0.19)	86.40 (0.30)
BiLSTM + ELMo + Flair	87.30 (0.06)	87.78 (0.09)	86.85 (0.07)
BiLSTM + ELMo + Flair + semantic embedding	87.44 (0.07)	88.03 (0.14)	86.91 (0.10)

^aModel was trained on the complete 2010 i2b2/VA dataset, which contains 349 notes in the training set and 477 notes in the test set.

^bLSTM: long short-term memory.

^cNot reported.

^dBiLSTM: bidirectional LSTM.
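
As a reading aid for Table 1, the F-1 column can be cross-checked against the precision and recall columns. The sketch below assumes F-1 is the standard harmonic mean of precision and recall; the table itself does not state the formula, and the helper name `f1_score` is ours, not the authors'. It uses only the two rows that report single values for all three metrics. For rows that list a second value in parentheses, recomputing from the tabulated precision and recall need not reproduce the reported F-1 exactly.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F-1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision %, recall %) copied from two rows of Table 1.
rows = {
    "Hidden semi-Markov": (86.88, 83.64),
    "LSTM by Wu et al [43]": (85.33, 86.56),
}

# Recompute F-1 to two decimals, the precision used in the table.
recomputed = {name: round(f1_score(p, r), 2) for name, (p, r) in rows.items()}

for name, value in recomputed.items():
    print(f"{name}: F-1 = {value:.2f}%")
```

Under that assumption, both recomputed values agree with the F-1 column (85.23 and 85.94).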