2015 Aug 11;5:12209. doi: 10.1038/srep12209

Figure 3. Random partitioning distributions for the four large corpora:

(A) Wikipedia (2010); (B) The New York Times (1987–2007); (C) Twitter (2009); and (D) Music Lyrics (1960–2007). The top-right insets show the long tails of the random partitioning distributions, and the colors represent phrase length as indicated by the color bar. The gray curves are standard Zipf distributions for words (q = 1); they exhibit limited scaling, with clear scaling breaks. See the main text and Tables S1–S4 for example phrases.