. 2022 Nov 7;9:907150. doi: 10.3389/fmolb.2022.907150

TABLE 5.

Evaluation Parameters for analysis of microarray gene expression data.

Evaluation metric	Specifics	References
Prediction performance evaluation parameters
Root Mean Squared Error (RMSE)	RMSE is a square root of mean of the difference between predicted values and actual values for each sample	Vihinen, 2012, Parikh et al., 2008a, Parikh et al., 2008b, Goffinet and Wallach, 1989
Root Relative Squared Error (RRSE)	RRSE is a normalized RMSE which enables the comparison between datasets or models with different scales. Standard deviation is used for normalization
Accuracy	The accuracy of a test is its ability to differentiate the cases and controls correctly
Precision/Positive Prediction Value	The Precision of a test is its ability to determine cases that are true cases
Sensitivity/Recall/True Positive Rate	The sensitivity of a test is its ability to determine the cases (positive for disease) correctly
Specificity/True negative Rate	The specificity of a test is its ability to determine the healthy cases correctly
F1-score	F1-score of a test is its ability to determine harmonic mean of precision and recall
MCC	MCC of a test is a correlation coefficient between the true and predicted values	Chicco and Jurman, 2020, Matthews, 1975
ROC curve	ROC curve is a graph where each point on a curve represents a sensitivity/specificity pair corresponding to a particular decision threshold. Area Under the ROC curve is a measure of how well a parameter can distinguish between cases and controls. ROC curves should be used when there are roughly equal numbers of instances for each class	Fawcett, 2006, Davis and Goadrich, 2006
Precision-Recall Curve	A precision-recall (PR) curve is a graph where each point on a curve represents a precision/sensitivity pair corresponding to a particular threshold. PR curves should be used when there is moderate to high class imbalance	Buckland and Gey, (1994)
Clustering performance evaluation parameters
Dunn’s Index	Dunn’s index is a ratio between the minimum distance between two clusters and the size of largest cluster. Larger the index better the clustering	Dunn, 1974, Dalton, Ballarin and Brun, 2009
Silhouette Index	Silhouette Index of a cluster is a defined as the average Silhouette width of its points. Silhouette width of a given point defines its proximity to its own cluster relative to its proximity to other clusters	Rousseeuw, 1987, Dalton, Ballarin and Brun, 2009
Figure of Merit Index	The FOM of a feature gene is computed by clustering the samples after removing that feature and by measuring the average distance between all samples and their cluster’s centroids. The FOM for a clustering technique is the sum of FOM over each feature gene at a time	Smith and Snyder, 1979, Dalton, Ballarin and Brun, 2009
Instability Index	Instability index is disagreement between labels obtained over data points to parts of a dataset, averaged over repeated random partitions of the data points. Clustering method is applied to a part of dataset, and the labels obtained on that part of the dataset are utilized to train a classifier that partitions the whole space	Guruprasad, Reddy and Pandit, 1990, Dalton, Ballarin and Brun, 2009
Hubert’s Correlation, Rand Statistics, Jaccard Coefficient, Folke’s and Mallow’s index	All these measures analyse the relationship between pairs of points using the co-occurrence matrices for the expected partition and the one generated by the clustering algorithm	Dalton, Ballarin and Brun, 2009, Brun et al., 2007