2019 Nov 11;27(2):225–235. doi: 10.1093/jamia/ocz191

Table 1.

The 3 HPV-related Twitter datasets, their date ranges, keywords used for data collection, and total number of tweets

| Data source | Date range | Keywords^a | Tweets^b, n (%) (N = 2 598 033) |
| --- | --- | --- | --- |
| Present study | January 2016 to April 2018 | cervarix, gardasil, hpv, human papillomavirus | 2 238 433 (86.16) |
| Dunn et al^11 | January 2014 to December 2016 | gardasil, cervarix, hpv + vaccin∗, cervical + vaccin | 423 594 (16.30) |
| Du et al^15 | November 2015 to March 2016 | cervarix, gardasil, hpv, human papillomavirus | 184 468 (7.10) |

^a "hpv + vaccin∗" means a tweet must contain both "hpv" and a word that starts with "vaccin".

^b The 3 datasets overlap. Each percentage is the number of tweets in that dataset divided by the total number of unique tweets across the 3 datasets combined, so the percentages sum to more than 100.
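The two footnote rules can be illustrated with a short Python sketch. It is not the authors' collection code: the function names `matches_hpv_vaccin` and `share_of_unique` are hypothetical, and case-insensitive substring matching for "hpv" is an assumption. Only the tweet counts come from Table 1.

```python
import re

# Assumption: a "word that starts with vaccin" is matched case-insensitively
# at a word boundary (vaccine, vaccination, Vaccines, ...).
VACCIN_WORD = re.compile(r"\bvaccin\w*", re.IGNORECASE)


def matches_hpv_vaccin(text: str) -> bool:
    """Return True if the text contains 'hpv' and a word starting with 'vaccin'."""
    return "hpv" in text.lower() and VACCIN_WORD.search(text) is not None


def share_of_unique(dataset_count: int, unique_total: int) -> float:
    """Percentage of the combined unique tweets covered by one dataset."""
    return round(100 * dataset_count / unique_total, 2)


# Counts copied from Table 1; UNIQUE_TOTAL is the N in the table header.
UNIQUE_TOTAL = 2_598_033
COUNTS = {
    "Present study": 2_238_433,
    "Dunn et al": 423_594,
    "Du et al": 184_468,
}
SHARES = {name: share_of_unique(n, UNIQUE_TOTAL) for name, n in COUNTS.items()}
```

With the Table 1 counts, `SHARES` reproduces the reported 86.16, 16.30, and 7.10. Their sum is about 109.56 rather than 100, because a tweet that appears in more than one dataset is counted once in the denominator but once per dataset in the numerators.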