Table 1.
Book | Length | $m_{thr}$ | $d_{thr}$ | P (%) | $d_{conv}$ | Exponent |
---|---|---|---|---|---|---|
MT | 22,375 | 4 | 377 | 17.6 | 25 | 0.45 (0.05) |
HM | 32,564 | 5 | 446 | 16.4 | 30 | 0.95 (0.07) |
NK | 62,190 | 8 | 762 | 20.6 | 60 | 0.80 (0.05) |
TS | 73,291 | 8 | 669 | 17.5 | 40 | 0.47 (0.04) |
DC | 77,728 | 8 | 816 | 20.5 | 80 | 0.45 (0.08) |
IL | 152,400 | 12 | 830 | 22.7 | 70 | 0.38 (0.04) |
MD | 213,682 | 14 | 1,177 | 20.2 | 70 | 0.44 (0.05) |
QJ | 402,870 | 20 | 1,293 | 19.6 | 75 | 0.36 (0.03) |
WP | 529,547 | 23 | 1,576 | 24.3 | 200 | 0.45 (0.05) |
EI | 30,715 | 5 | 474 | 26.4 | 50 | 0.85 (0.10) |
RP | 118,661 | 11 | 628 | 15.6 | 70 | 0.57 (0.05) |
KT | 197,802 | 14 | 704 | 27.9 | 50 | 0.30 (0.03) |
$m_{thr}$ is the threshold for the number of occurrences and $d_{thr}$ is the number of words kept after thresholding. $P$ is the percentage of the words in the book that pass the threshold, $P = \sum_{i=1}^{d_{thr}} m_i / L$, where $m_i$ is the number of occurrences of the $i$-th kept word and $L$ is the length of the book. $d_{conv}$ is the dimension at which the power law is fit. The absolute values of the negative exponents of the fit are given in the last column, with their errors in parentheses.
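The thresholding quantities in the caption can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `threshold_stats`, the whitespace tokenization in the usage lines, the choice of "at least" (`>=`) for passing the threshold, and the factor of 100 that turns the caption's ratio into a percentage are all assumptions made here for illustration.

```python
from collections import Counter


def threshold_stats(words, m_thr):
    """Return (d_thr, P) for a tokenized text.

    words : list of word tokens; its length plays the role of L.
    m_thr : occurrence threshold (assumed here to mean count >= m_thr).
    """
    counts = Counter(words)
    # Keep only the words whose occurrence count passes the threshold.
    kept = {w: m for w, m in counts.items() if m >= m_thr}
    d_thr = len(kept)  # number of distinct words kept
    length = len(words)  # L, the total number of tokens
    # Share of all tokens that belong to kept words, as a percentage.
    p = 100.0 * sum(kept.values()) / length
    return d_thr, p


# Hypothetical toy text, not taken from any book in the table.
tokens = "a a a b b c".split()
d_thr, p = threshold_stats(tokens, m_thr=2)
print(d_thr, round(p, 1))
```

For the toy text, the words `a` and `b` pass a threshold of 2, so two words are kept and they account for five of the six tokens.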