Skip to main content
. 2013 Dec 31;8(12):e84133. doi: 10.1371/journal.pone.0084133

Table 3. Quality measures of the reconstructed hierarchies for the “hard” synthetic data set.

Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
algorithm A 31% 5% 18% 47% 0% 18% 66%
algorithm B 89% 91% 6% 3% 0% 83% 97%
P. Heymann & H. Garcia-Molina 48% 54% 29% 17% 0% 29% 76%
P. Schmitz 1% 2% 1% 3% 94% 1% 5%

In this case the frequency of the initial tags was independent of their position in the exact hierarchy during the benchmark generation, and the frequency distribution followed a power-law. This change compared to the data set used in Table 2. results in significant decrease in the quality measures for most of the involved methods, as shown by the ratio of acceptable links, Inline graphic, the ratio of inverted links, Inline graphic, the ratio of unrelated links, Inline graphic, the ratio of missing links, Inline graphic, the normalized mutual information between the exact- and the reconstructed hierarchies, Inline graphic, and the linearized mutual information, Inline graphic. The different rows correspond to results obtained from algorithm A, (1Inline graphic row), algorithm B, (2Inline graphic row), the method by P. Heymann & H. Garcia-Molina (3Inline graphic row), and the algorithm by P. Schmitz (4Inline graphic row).