Skip to main content
. 2025 Feb 28;11:e2723. doi: 10.7717/peerj-cs.2723

Table 6. The impact of text vectorization on the performance of AICDR.

The best results are highlighted in bold.

Datasets Topic models Number of words
5 10 15
BBCsport LDA 5 5 5
NMF 4 4 4
K-Means 5 5 5
BBCNews LDA 6 6 6
NMF 5 5 5
K-Means 6 6 6
Reuters LDA 3 3 3
NMF 3 3 3
K-Means 3 3 3
AGNews GSDMM 4 4 4
SeaNMF 5 5 5
Snippets-1 GSDMM 4 4 4
SeaNMF 4 4 4
Snippets-2 GSDMM 3 3 3
SeaNMF 3 3 3