TABLE 4.
The 10 most frequent tokens, excluding stop words and punctuation marks, within various window sizes around entities correctly labeled by human reviewers.
| Window size = 1 | Window size = 3 | Window size = 5 | ||||
|---|---|---|---|---|---|---|
| # | Token | Count | Token | Count | Token | Count |
| 1 | resistance | 176 | Tetracycline | 230 | Tetracycline | 230 |
| 2 | treatment | 9 | resistance | 177 | resistance | 178 |
| 3 | mM | 4 | Trimethoprim | 118 | Trimethoprim | 118 |
| 4 | oral | 3 | treatment | 11 | treatment | 14 |
| 5 | after | 3 | 20 ∼ | 7 | 20 ∼ | 8 |
| 6 | analogue | 3 | Figure | 5 | placebo | 7 |
| 7 | responses | 3 | concentration | 5 | effects | 6 |
| 8 | antibiotics | 2 | compared | 4 | Figure | 6 |
| 9 | exposure | 2 | 100 | 4 | KLK5 | 6 |
| 10 | pharmacokinetics | 2 | mM | 4 | Matriptase | 6 |