Skip to main content
. 2010 Mar 9;5(3):e9411. doi: 10.1371/journal.pone.0009411

Figure 1. The rank histograms of English texts versus that of random texts (Inline graphic).

Figure 1

A comparison of the real rank histogram (thin black line) and two control curves with the Inline graphic upper and lower bounds of the expected histogram of a random text of the same length in words (dashed lines) involving four English texts. Inline graphic is the frequency of the word of rank Inline graphic. For the random text we use the model Inline graphic with alphabet size Inline graphic. The expected histogram of the random text is estimated averaging over the rank histograms of Inline graphic random texts. For ease of presentation, the expected histogram is cut off at expected frequencies below Inline graphic. AAW: Alice's adventures in wonderland. H: Hamlet. DC: David Crockett. OS: The origin of species.