2021 Mar 31;10(1):15. doi: 10.1140/epjds/s13688-021-00271-0

Figure 2.


Overall dataset statistics. Number of messages per language captured in our dataset between 2009-01-01 and 2019-12-31, as classified by the FastText-LID algorithm. These total approximately 118 billion messages over that period (languages are sorted by popularity). This collection represents roughly 10% of all messages ever posted