Skip to main content
. 2025 Feb 28;11:e2723. doi: 10.7717/peerj-cs.2723

Table 3. The clustering accuracy of different models on the correct number of topics.

The best results are highlighted in bold (The higher the better).

Datasets Topic models Metrics
NMI ARI ACC
BBCsport LDA 0.709 0.662 0.851
NMF 0.818 0.856 0.872
K-Means 0.894 0.896 0.963
BBCNews LDA 0.727 0.701 0.862
NMF 0.812 0.842 0.932
K-Means 0.751 0.726 0.868
Reuters LDA 0.432 0.443 0.669
NMF 0.550 0.597 0.776
K-Means 0.632 0.671 0.832
AGNews GSDMM 0.585 0.622 0.833
SeaNMF 0.563 0.600 0.822
Snippets-1 GSDMM 0.565 0.592 0.830
SeaNMF 0.580 0.634 0.850
Snippets-2 GSDMM 0.764 0.823 0.919
SeaNMF 0.787 0.850 0.939