2022 Jul 19;8:e1039. doi: 10.7717/peerj-cs.1039

Table 3. Most common unigrams, bigrams, and emojis without stop words, punctuation, and numbers.

Stop words were removed using NLTK (Bird, Klein & Loper, 2009). Most unigrams and bigrams can have several English translations depending on the context. The table provides only one translation option.

| Unigram (Russian) | Unigram (English) | Count | Bigram (Russian) | Bigram (English) | Count | Emoji | Count |
|---|---|---|---|---|---|---|---|
| это | it | 1,117 | доброе утро | good morning | 39 | [image: peerj-cs-08-1039-i002.jpg] | 443 |
| просто | simply | 355 | спокойной ночи | good night | 26 | [image: peerj-cs-08-1039-i003.jpg] | 313 |
| спасибо | thanks | 306 | спасибо большое | thanks a lot | 24 | [image: peerj-cs-08-1039-i004.jpg] | 246 |
| хочу | want | 249 | самом деле | actually | 23 | [image: peerj-cs-08-1039-i005.jpg] | 240 |
| ещё | yet | 223 | это просто | it’s simple | 23 | [image: peerj-cs-08-1039-i006.jpg] | 120 |
| почему | why | 209 | опубликовано фото | published photo | 18 | [image: peerj-cs-08-1039-i007.jpg] | 119 |
| очень | very | 205 | сих пор | so far | 17 | [image: peerj-cs-08-1039-i008.jpg] | 118 |
| всё | all | 204 | руб г | rub g | 16 | [image: peerj-cs-08-1039-i009.jpg] | 113 |
| блять | fuck | 184 | днем рождения | birthday | 15 | [image: peerj-cs-08-1039-i010.jpg] | 104 |
| вообще | generally | 174 | все ещё | still | 13 | [image: peerj-cs-08-1039-i011.jpg] | 100 |
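
The counting procedure described in the table note (stop words, punctuation, and numbers removed before counting unigrams and bigrams) might look roughly like the following. This is a minimal sketch, not the authors' code: the tiny `STOP_WORDS` set, the regular expression, the sample texts, and the function names `tokenize` and `count_ngrams` are all illustrative assumptions. The paper used NLTK's stop-word list; a call such as `nltk.corpus.stopwords.words("russian")` could be substituted for the hypothetical set here. The sketch also assumes that bigrams are formed after stop-word removal, which seems consistent with entries such as "самом деле" in the table.

```python
import re
from collections import Counter

# Hypothetical, tiny stop-word set for illustration only.
# The paper used NLTK's stop-word list instead.
STOP_WORDS = {"и", "в", "не", "на", "с", "до"}

# Runs of letters only: punctuation, digits, underscores, and emojis are dropped.
TOKEN_RE = re.compile(r"[^\W\d_]+")


def tokenize(text):
    """Lowercase the text, keep letter-only tokens, and drop stop words."""
    tokens = TOKEN_RE.findall(text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def count_ngrams(texts, n):
    """Count n-grams of adjacent tokens (after filtering) across all texts."""
    counts = Counter()
    for text in texts:
        tokens = tokenize(text)
        counts.update(
            " ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
        )
    return counts


# Made-up sample texts, not taken from the dataset.
texts = [
    "Доброе утро!",
    "Спасибо большое, доброе утро 123",
    "На самом деле спасибо",
]

unigrams = count_ngrams(texts, 1)
bigrams = count_ngrams(texts, 2)

print(unigrams.most_common(3))
print(bigrams.most_common(3))
```

Because stop words are removed before adjacent tokens are paired, a phrase such as "на самом деле" is counted as the bigram "самом деле". Pairing tokens before filtering would give different counts, so the order of these two steps is a design choice worth stating explicitly.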