2025 Feb 28;11:e2723. doi: 10.7717/peerj-cs.2723

Table 3. The clustering accuracy of different models on the correct number of topics.

The best results are highlighted in bold (higher is better).

Datasets	Topic models	NMI	ARI	ACC
BBCsport	LDA	0.709	0.662	0.851
BBCsport	NMF	0.818	0.856	0.872
BBCsport	K-Means	0.894	0.896	0.963
BBCNews	LDA	0.727	0.701	0.862
BBCNews	NMF	0.812	0.842	0.932
BBCNews	K-Means	0.751	0.726	0.868
Reuters	LDA	0.432	0.443	0.669
Reuters	NMF	0.550	0.597	0.776
Reuters	K-Means	0.632	0.671	0.832
AGNews	GSDMM	0.585	0.622	0.833
AGNews	SeaNMF	0.563	0.600	0.822
Snippets-1	GSDMM	0.565	0.592	0.830
Snippets-1	SeaNMF	0.580	0.634	0.850
Snippets-2	GSDMM	0.764	0.823	0.919
Snippets-2	SeaNMF	0.787	0.850	0.939
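
For readers unfamiliar with the three metrics in the table, the following is a minimal sketch of how NMI, ARI, and ACC are commonly computed from ground-truth labels and predicted cluster assignments. It is not the authors' evaluation code. The function names (`nmi`, `ari`, `acc`) and the toy labels are illustrative assumptions, and the NMI normalization used here (arithmetic mean of the two entropies) is one common convention that may differ from the one used in the article. ACC is taken to be the accuracy under the best one-to-one mapping between clusters and classes, found here by brute force, which is only practical for a small number of topics.

```python
from collections import Counter
from itertools import permutations
from math import comb, log


def _entropy(counts, n):
    # Shannon entropy (natural log) of a label distribution given its counts.
    return -sum(c / n * log(c / n) for c in counts.values())


def nmi(true, pred):
    """Normalized mutual information, normalized by the mean entropy (assumed convention)."""
    n = len(true)
    joint = Counter(zip(true, pred))
    row, col = Counter(true), Counter(pred)
    mi = sum(c / n * log(c * n / (row[t] * col[p])) for (t, p), c in joint.items())
    mean_entropy = (_entropy(row, n) + _entropy(col, n)) / 2
    return mi / mean_entropy if mean_entropy > 0 else 1.0


def ari(true, pred):
    """Adjusted Rand index; assumes at least two documents."""
    n = len(true)
    sum_ij = sum(comb(c, 2) for c in Counter(zip(true, pred)).values())
    sum_a = sum(comb(c, 2) for c in Counter(true).values())
    sum_b = sum(comb(c, 2) for c in Counter(pred).values())
    expected = sum_a * sum_b / comb(n, 2)
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        return 1.0
    return (sum_ij - expected) / (max_index - expected)


def acc(true, pred):
    """Clustering accuracy under the best one-to-one cluster-to-class mapping."""
    labels = list(set(true))
    clusters = list(set(pred))
    # Pad with a dummy label so every cluster can be assigned to something.
    labels += [object()] * max(0, len(clusters) - len(labels))
    joint = Counter(zip(pred, true))
    best = max(
        sum(joint[(c, l)] for c, l in zip(clusters, perm))
        for perm in permutations(labels, len(clusters))
    )
    return best / len(true)


if __name__ == "__main__":
    # Toy example (hypothetical labels, not data from the article).
    y_true = [0, 0, 0, 1, 1, 1]
    y_pred = [1, 1, 1, 0, 0, 1]
    print(nmi(y_true, y_pred), ari(y_true, y_pred), acc(y_true, y_pred))
```

All three scores are invariant to how the cluster IDs are numbered, which is why ACC needs the matching step. For larger numbers of topics, the brute-force search would normally be replaced by the Hungarian algorithm (for example `scipy.optimize.linear_sum_assignment`).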