. 2024 May 24;10:e1993. doi: 10.7717/peerj-cs.1993

Table 1. The characteristics of the data sets analysed.

The number of views, number of clusters, the largest number of features amongst the data views, and the number of samples for both the real and synthetic data sets analysed are presented. Real data are taken as heterogeneous, whereas the synthetic data are regarded as homogeneous. High-dimensional data contain more features than samples ( $p ≫ N$ ).

Data description
		Views (M)	Clusters ( $k$ )	Features ( $p_{l a r g e s t}$ )	Samples (N)	Hetero-geneous	High dimensional
	Data set	Views (M)	Clusters ( $k$ )	Features ( $p_{l a r g e s t}$ )	Samples (N)	Hetero-geneous	High dimensional
Real	Cancer types	3	3	22,503	253	✓	✓
	Caltech7	6	7	1,984	1,474	✓	✓
	Handwritten digits	6	10	240	2,000	✓	✗
−−−−−−−
Synthetic	MMDS	3	3	300	300	✗	✗
	NDS	4	3	400	300	✗	✓
	MCS	3	5	300	500	✗	✗