Skip to main content

View full-text article in PMC

. 2021 Jan 23;28(4):839–849. doi: 10.1093/jamia/ocaa326

Table 2.

The mean P, R, and F1 scores for the 3 labels in the FluTrack dataset

	Related vs unrelated			Awareness vs infection			Self vs other
	P	R	F1	P	R	F1	P	R	F1
Linear SVM	0.766	0.823	0.793	0.821	0.816	0.818	0.766	0.823	0.793
CNN GloVe 300	0.809^c	0.850 ^b	0.827^c	0.903^c	0.906 ^c	0.905 ^c	0.809^c	0.847^b	0.827^c
CNN Twitter GloVe 50	0.813^c	0.832	0.822^c	0.850^c	0.848^c	0.849^c	0.813^c	0.832	0.823^c
CNN Twitter GloVe 100	0.816 ^c	0.850 ^b	0.832 ^c	0.919^c	0.881^c	0.900^c	0.816 ^c	0.850 ^b	0.832 ^c
CNN Twitter GloVe 200	0.800^c	0.822	0.811^c	0.866^c	0.882^c	0.874^c	0.800^c	0.822	0.811^c
CNN Word2Vec 300	0.796^c	0.839^a	0.817^c	0.902^c	0.903^c	0.903^c	0.796^c	0.839^a	0.817^c
BiLSTM GloVe 300	0.771	0.836	0.802^b	0.857^c	0.771	0.812	0.771^a	0.836	0.802^b
BiLSTM Twitter GloVe 50	0.759	0.845^a	0.799^a	0.748	0.760	0.754	0.759	0.845^a	0.799^a
BiLSTM Twitter GloVe 100	0.795^c	0.794	0.794	0.821	0.752	0.785	0.795^c	0.794	0.794
BiLSTM Twitter GloVe 200	0.767	0.837	0.800^a	0.876^c	0.737	0.800	0.767	0.837	0.800^a
BiLSTM Word2Vec 300	0.788^c	0.829	0.808^c	0.833^a	0.819	0.826	0.788^c	0.829	0.808^c

P: precision; R: recall. Bold font indicates the best result obtained in each column.

^a

P value (resulting from the Wilcoxon signed rank test) between .05 and .01.

^b

P value (resulting from the Wilcoxon signed rank test) between .01 and 0001.

^c

P value (resulting from the Wilcoxon signed rank test) that is ≤.001.