2020 Oct 14;15(10):e0238835. doi: 10.1371/journal.pone.0238835

Table 1. Summary of basic properties of empirical distributions that are interesting for data mining.

| Interesting basic properties | Exemplary data mining applications | Statistical test used | Descriptive statistic |
| --- | --- | --- | --- |
| Uniformity versus multimodality | Biomedical data [22], Water vapor [23] | Hartigan’s dip test [12] | Difference between mean and median can indicate multimodality, several coefficients [23] |
| Data clipping versus heavy-tailedness | Flood data [24], Upper Income [25] | Not required here, but we can refer to [24, 26] | Range of data is sufficient for the task. “There is no easy characteristic for heavy-tailedness” [27] |
| Skewness versus normality | Biomedical data [28], Strength of Glass Fibers & Market Value Growth [29] | D'Agostino test [14] | Third order statistics, for example [28] |
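
As an illustration of the "Descriptive statistic" column, the following minimal Python sketch computes two of the quantities named in Table 1: the difference between mean and median, and a third-order statistic (sample skewness). It is not the implementation used in the paper. The function names `mean_median_gap` and `sample_skewness`, the scaling of the gap by the standard deviation, and the toy data are all assumptions made for this example. The statistical tests in the table (Hartigan’s dip test, D'Agostino test) are not reproduced here.

```python
import statistics


def mean_median_gap(xs):
    """Mean minus median, divided by the population standard deviation.

    The scaling by the standard deviation is an assumption of this sketch;
    Table 1 only mentions the difference between mean and median.
    """
    sd = statistics.pstdev(xs)
    if sd == 0:
        return 0.0  # constant data: no spread, so no gap to report
    return (statistics.fmean(xs) - statistics.median(xs)) / sd


def sample_skewness(xs):
    """Third standardized central moment, g1 = m3 / m2**1.5."""
    n = len(xs)
    mu = statistics.fmean(xs)
    m2 = sum((x - mu) ** 2 for x in xs) / n  # second central moment
    m3 = sum((x - mu) ** 3 for x in xs) / n  # third central moment
    if m2 == 0:
        return 0.0  # constant data: skewness is undefined, report 0
    return m3 / m2 ** 1.5


# Hypothetical toy samples, not data from the paper.
symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 1, 1, 2, 2, 3, 10]

print(mean_median_gap(symmetric), sample_skewness(symmetric))
print(mean_median_gap(right_skewed), sample_skewness(right_skewed))
```

For the symmetric sample both statistics are zero, while the right-skewed sample yields positive values for both. Such descriptive values only hint at a property; the table lists the formal tests that the paper relies on.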