Table 5. Coverage of the vocabulary by the dictionary in each language, both at the word-type and at the token level.
Title | Tokens | Types |
---|---|---|
Clarissa | 96.9% | 68.0% |
Moby-Dick | 94.7% | 70.8% |
Ulysses | 90.4% | 58.6% |
Don Quijote | 97.0% | 81.3% |
La Regenta | 97.9% | 89.5% |
Artamène | 83.6% | 43.6% |
Bragelonne | 97.5% | 89.8% |
Seitsemän v. | 95.4% | 89.8% |
Kevät ja t. | 98.3% | 96.2% |
Vanhempieni r. | 98.5% | 96.5% |
average | 95.0% | 78.4% |