Skip to main content
. 2022 Jul 15;6(3):344–374. doi: 10.1007/s41666-022-00118-x

Table 3.

Accuracy of the suggested responses for different models

Method Precision@1 (%) Precision@3 (%) Precision@5 (%) MRR
BERT 59.32 ± 1.64 85.42 ± 0.82 87.56 ± 0.63 0.79 ± 0.00
BiLSTM 58.98 ± 0.88 83.28 ± 0.75 85.37 ± 0.62 0.75 ± 0.00
Seq2Seq 53.21 ± 0.61 61.48 ± 4.46 68.63 ± 5.54 0.60 ± 0.02
XGBoost†* 51.33 ± 0.31 79.59 ± 0.52 82.83 ± 0.42 0.69 ± 0.01
SVM†* 47.62 ± 0.42 78.97 ± 0.49 82.25 ± 0.51 0.68 ± 0.00
Weighted TF-IDF 34.41 ± 2.57 46.46 ± 2.97 53.53 ± 1.2 0.42 ± 0.03
TF-IDF 32.32 ± 2.11 44.35 ± 1.28 51.01 ± 0.68 0.42 ± 0.03
Frequency 16.71 ± 0.25 32.20 ± 0.67 42.80 ± 0.74 0.35 ± 0.00

Bold entries are the best performance values given each metric

PubMedBERT embedding; Wikipedia-PubMed embedding; *TF-IDF