Skip to main content
. 2016 May 5;11(5):e0155036. doi: 10.1371/journal.pone.0155036

Table 1. The number and distribution of sentiment annotated posts, and the time period of the posts.

The top part of the table refers to the 13 language datasets, and the bottom to the four application datasets.

Dataset Negative Neutral Positive Total Time period
Albanian 8,106 18,768 26,131 53,005 June—Sep. 2013
Bulgarian 15,140 31,214 20,815 67,169 Apr. 2013—Oct. 2014
English 26,674 46,972 29,388 103,034 Sep. 2014
German 20,617 60,061 28,452 109,130 Feb.—June 2014
Hungarian 10,770 22,359 35,376 68,505 June—Aug. 2014
Polish 67,083 60,486 96,005 223,574 July—Sep. 2014
Portuguese 58,592 53,820 44,981 157,393 Oct.—Dec. 2013
Russian 34,252 44,044 29,477 107,773 Sep.—Dec. 2013
Ser/Cro/Bos 64,235 68,631 82,791 215,657 Oct. 2013—Aug. 2014
Slovak 18,716 14,917 36,792 70,425 Sep.—Nov. 2014
Slovenian 38,975 60,679 34,281 133,935 Jan. 2014—Feb. 2015
Spanish 33,978 107,467 134,143 275,588 May 2013—Dec. 2014
Swedish 25,319 17,857 15,371 58,547 Sep.—Oct. 2014
Facebook(it) 8,750 7,898 2,994 19,642 Apr. 2011—Apr. 2014
DJIA30 12,325 70,460 20,477 103,262 June 2013—Sep. 2014
Environment 6,246 14,217 3,145 23,608 Jan. 2014—Dec. 2014
Emojis 12,156 19,938 37,579 69,673 Apr. 2013—Feb. 2015