Skip to main content
. 2019 Dec 12;19(Suppl 9):251. doi: 10.1186/s12911-019-0986-6

Table 5.

Comparison with baseline PRF retrieval models in certain metrics

BM25+Rocchio RM3 HRoc1 HRoc2 HRoc3
MAP 0.0490 0.0540 0.0651 ∗+ (32.8%,20.5%) 0.0647 ∗+ (32.0%,19.8%) 0.0642 ∗+ (31.0%,18.9%)
P@5 0.2733 0.2600 0.2933 ∗+ (7.32%,12.8%) 0.2733 + (0.00%,5.11%) 0.2733 + (0.00%,5.11%)
P@10 0.2533 0.2467 0.2733 ∗+ (7.89%,10.8%) 0.2667 ∗+ (5.29%,8.11%) 0.2567 ∗+ (1.34%,4.25%)
P@20 0.2167 0.2233 0.2350 ∗+ (8.44%,5.24%) 0.2317 ∗+ (6.92%, 3.76%) 0.2317 ∗+ (6.92%, 3.76%)
F1 0.0853 0.0932 0.1112 ∗+ (30.3%,19.3%) 0.1108 ∗+ (29.9%,18.9%) 0.1096 ∗+ (28.5%, 17.6%)

The values in parentheses represent the improvements over BM25+Rocchio and RM3 respectively. The best result obtained is shown in bold, and the superscripts “*” and “+” denote statistically significant improvements over BM25+Rocchio and RM3, respectively (Wilcoxon signed-rank test with p<0.05)