. Author manuscript; available in PMC: 2021 Jan 14.

Published in final edited form as: J Biomed Inform. 2020 Jun 18;108:103473. doi: 10.1016/j.jbi.2020.103473

Table 7.

Spatial role extraction results using gold spatial indicators: Average Precision (P%), Recall (R%), and F1 measures of 10-fold CV across 5 different fold variations. CI - 95% confidence intervals of the average F1 measures across 50 iterations. BLSTM-C - Bi-LSTM CRF, BERT-L - BERT_large, BERT-LM - BERT_large (MIMIC), XLNet-L - XLNet_large.

Models	trajector			landmark			diagnosis			hedge			overall
Models	P	R	F1	P	R	F1	P	R	F1	P	R	F1	P	R	F1 (CI)
BLSTM-C	88.8	87.3	88.0	94.1	89.9	91.9	76.6	75.0	75.2	78.4	76.3	77.0	89.0	86.4	87.6 (±0.55)
BERT-L	89.7	91.8	90.7	95.4	96.1	95.8	72.7	85.5	78.4	72.8	84.1	77.8	88.8	92.4	90.5 (±0.42)
BERT-LM	91.2	93.1	92.1	95.6	96.6	96.1	72.3	83.9	77.4	75.0	86.1	80.1	89.5	93.3	91.4 (±0.54)
XLNet-L	92.8	94.1	93.5	96.1	96.8	96.4	78.6	88.0	82.8	79.6	88.6	83.7	91.6	94.2	92.9 (±0.38)