Skip to main content
. 2021 Aug 30;8:636077. doi: 10.3389/fmolb.2021.636077

TABLE 4.

The 10 most frequent tokens, excluding stop words and punctuation marks, within various window sizes around entities correctly labeled by human reviewers.

Window size = 1 Window size = 3 Window size = 5
# Token Count Token Count Token Count
1 resistance 176 Tetracycline 230 Tetracycline 230
2 treatment 9 resistance 177 resistance 178
3 mM 4 Trimethoprim 118 Trimethoprim 118
4 oral 3 treatment 11 treatment 14
5 after 3 20 ∼ 7 20 ∼ 8
6 analogue 3 Figure 5 placebo 7
7 responses 3 concentration 5 effects 6
8 antibiotics 2 compared 4 Figure 6
9 exposure 2 100 4 KLK5 6
10 pharmacokinetics 2 mM 4 Matriptase 6