Table 6.
Comparing F1-score of traditional models (TREC-6 dataset) with focusing POS tags (N+Adj+Det: NAD).
Rang of N-gram |
|||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Micro -score |
Macro -score |
||||||||||||||||||
Dataset/Model | Unigram |
|
|
Bigram | Trigram | Unigram |
|
|
Bigram | Trigram | |||||||||
Original | MNB | 0.9600 | 0.9660 | 0.9660 | 0.9640 | 0.9200 | 0.9538 | 0.9598 | 0.9599 | 0.9676 | 0.9249 | ||||||||
SVM | 0.9640 | 0.9680 | 0.9720 | 0.9580 | 0.9200 | 0.9672 | 0.9709 | 0.9745 | 0.9619 | 0.9239 | |||||||||
LR | 0.9600 | 0.9700 | 0.9680 | 0.9460 | 0.8980 | 0.9637 | 0.9728 | 0.9711 | 0.9512 | 0.9044 | |||||||||
+all POS Tag | MNB | 0.9560 | 0.9480 | 0.9380 | 0.9280 | 0.8960 | 0.9159 | 0.9099 | 0.9154 | 0.9058 | 0.8781 | ||||||||
SVM | 0.9800 | 0.9780 | 0.9760 | 0.9640 | 0.9360 | 0.9821 | 0.9801 | 0.9780 | 0.9674 | 0.9397 | |||||||||
LR | 0.9680 | 0.9740 | 0.9720 | 0.9560 | 0.8920 | 0.9711 | 0.9769 | 0.9741 | 0.9602 | 0.9028 | |||||||||
+focusing Tags | MNB | 0.9560 | 0.9680 | 0.9600 | 0.9560 | 0.9240 | 0.9395 | 0.9624 | 0.9453 | 0.9414 | 0.9105 | ||||||||
SVM | 0.9680 | 0.9780 | 0.9720 | 0.9640 | 0.9340 | 0.9708 | 0.9799 | 0.9747 | 0.9669 | 0.9377 | |||||||||
LR | 0.9620 | 0.9680 | 0.9640 | 0.9520 | 0.8760 | 0.9656 | 0.9712 | 0.9681 | 0.9562 | 0.8841 | |||||||||
+focusing Words | MNB | 0.9600 | 0.9740 | 0.9740 | 0.9540 | 0.9180 | 0.9544 | 0.9674 | 0.9672 | 0.9574 | 0.9147 | ||||||||
SVM | 0.9680 | 0.9740 | 0.9720 | 0.9680 | 0.9300 | 0.9710 | 0.9766 | 0.9749 | 0.9709 | 0.9317 | |||||||||
LR | 0.9520 | 0.9620 | 0.9620 | 0.9460 | 0.8900 | 0.9564 | 0.9658 | 0.9658 | 0.9526 | 0.8929 |