Table 1.
Dataset | The Time Machine | Moby Dick | Pride and Prejudice | |||
---|---|---|---|---|---|---|
Words | 36,128 | 219,986 | 127,368 | |||
Baseline A | 0.09% | 0.07% | 0.06% | |||
Baseline B | 0.11% | 0.08% | 0.08% | |||
Δ | §6.2.2 | §4.4 | §6.2.2 | §4.4 | §6.2.2 | §4.4 |
α = δ = 0 | 6,416 (17.8) | 5,885 (16.3) | 36,105 (16.4) | 31,601 (14.4) | 24,459 (16.6) | 20,171 (14.3) |
α = δ = 1 | 4,377 (12.1) | 4,133 (11.7) | 29,938 (13.6) | 25,821 (11.7) | 20,003 (13.6) | 18,497 (13.1) |
α = δ = 2 | 1,018 (0.1) | 1,804 (5.3) | 8,149 (3.7) | 15,348 (7.0) | 5,243 (3.6) | 8,947 (6.3) |
α = δ = 3 | 44 (0.0) | 526 (1.6) | 336 (0.2) | 4,539 (2.1) | 270 (0.2) | 2,079 (1.5) |
α = δ = 4 | 0 (0.0) | 52 (0.1) | 4 (0.0) | 556 (0.3) | 1 (0.0) | 401 (0.0) |
α = δ = 5 | 0 (0.0) | 4 (0.0) | 0 (0.0) | 45 (0.0) | 0 (0.0) | 27 (0.0) |
α = δ = 6 | 0 (0.0) | 0 (0.0) | 0 (0.0) | 1 (0.0) | 0 (0.0) | 1 (0.0) |
Total | 11,855 (32.8) | 12,652 (35.0) | 74,532 (33.9) | 77,911 (35.4) | 49,976 (39.2) | 50,123 (39.6) |