Skip to main content
. 2018 Mar 21;6:90. doi: 10.3389/fpubh.2018.00090

Table 2.

Characteristics of studies included in the current systematic review.

Reference Data source Studied disease Study period Location searched Purpose of the study Used keywords Type of analysis Main findings
Roche et al. (14) Twitter (423 tweets) Chikungunya The first 9 months of the 2014 outbreak Martinique To determine the predictive power of Chikungunya-related tweets Chick*, Chik* Correlational and regression analysis with epidemiological and environmental variables Models integrating information from Twitter well explain epidemiological dynamics over time

Marques-Toledo et al. (15) Twitter, Wikipedia access logs Dengue September, 2012–October, 2016 Brazil To explore the predictive power of tweets in forecasting dengue cases Dengue Mathematical model Tweets can be used to predict and forecast dengue cases

Nsoesie et al. (16) Twitter Dengue Not applicable Brazil To understand the determinants of sharing tweets related to dengue Dengue Machine learning techniques Sociodemographic variables play a major role in sharing dengue-related tweets

Ghosh et al. (17) Websites reporting news Dengue 2013–2014 India, China To explore the predictive power of models incorporating news Dengue Mathematical models and time series-regression techniques News-based models well correlated with epidemiological cases

Gomide et al. (18) Twitter Dengue 2006–April 2011 Brazil To explore whether social media can be effectively integrated into disease surveillance practice Dengue Content analysis, correlational analysis and spatiotemporal analysis Excellent correlation between tweets production and epidemiological cases (R2 = 0.9578)

Guo et al. (19) Baidu Dengue January 2011–December 2014 China To explore the feasibility of Baidu in real-time monitoring of Dengue Dengue Correlational analysis with epidemiological cases A strong correlation was found

Li et al. (20) Baidu Dengue Not applicable China To explore the predictive power of Baidu for forecasting Dengue cases Dengue Mathematical model Baidu-based forecasting with one-week lag well correlated with epidemiological cases

Nagpal et al. (21) YouTube Ebola Not applicable Not applicable To characterize the content of Ebola popular YouTube videos Ebola Content analysis The most relevant YouTube videos were those presenting clinical symptoms

Strekalova (22) Official centers for disease control and prevention (CDC) Facebook page Ebola 18 March 2014–31 October 2014 Not applicable To characterize the usage of new media from the CDC Ebola Content analysis Audience engagement with Ebola posts was significantly higher compared to other non-Ebola topics, submitted by CDC

Odlum and Yoon (23) Twitter (42,236 tweets—16,499 unique and 25,737 retweets) Ebola 24 July 2014–1 August 2014 Not applicable (tweets in English) To exploit Twitter as a real-time method of Ebola outbreak surveillance to monitor information spread Ebola, #Ebola, #EbolaOutbreak, #EbolaVirus, and #EbolaFacts Content analysis using NLP (Notepad++ and Weka) and correlation with epidemiological cases Tweets started to rise in Nigeria 3–7 days prior to the official announcement of the first probable Ebola case

Pathak et al. (24) YouTube (118 videos out of 198 videos) Ebola From inception–1 November 2014 Not applicable To characterize Ebola-related YouTube videos Ebola outbreak Content analysis The majority of the internet videos were characterized as useful, even though some videos were misleading

Roberts et al. (25) English language websites and Twitter Ebola 1 July 2014–17 November 2014 Not applicable To qualitatively analyze the Ebola-related narrative Ebola Content analysis and sentiment analysis Public engagement was directed toward stories about risks of U.S. domestic infections than toward stories focused infections in West Africa

Sastry and Lovari (26) Official CDC and World Health Organization pages Ebola 1 July 2014–15 October 2014 Not applicable To understand the development of an ontological Ebola narrative Ebola Narrative analysis framework Three themes: (a) consulting and containment, (b) international concern, (c) possibility of an epidemic in the United States

Liu et al. (27) Baidu, Sina Micro Ebola 20 July 2014–4 September in 2014 China To understand the public reaction to the Ebola outbreak Ebola Mathematical model Monitoring of social media enables to capture the spreading of fears related to epidemics outbreaks

Househ (28) Twitter (2,592,5152 tweets) and Google News Trend Ebola 30 September 2014–29 October 2014 Not applicable To understand the role of the media coverage on public reaction to the Ebola outbreak in terms of digital activities Ebola Correlational analysis A significant correlation between media coverage and tweets production was found

Jin et al. (29) Twitter Ebola Late September 2014–late October 2014 Not applicable To understand the public reaction to misinformation related to Ebola outbreak Ebola or #ebola, #EbolaVirus, #EbolaOutbreak, #EbolaWatch, #EbolaEthics, #EbolaChat, #nursesfightebola, #ebolafacts, #StopEbola, #FightingEbola, and #UHCRevolution Geo-coded analysis, coding, and mathematical model Some rumors were more popular than others

Lazard et al. (30) Twitter (2,155 tweets) Ebola 2 October 2014 United States To understand the public reaction to the Ebola outbreak Ebola, #CDCcha Content analysis using SAS Text Miner 12.1 Public concerned was about symptoms and lifespan of the virus, disease transfer and contraction, safe travel, and protection of one’s body

Alicino et al. (31) Google Trends Ebola 29 December 2013–14 June 2015 Worldwide Real-time monitoring and tracking of Ebola virus outbreaks Ebola, virus Ebola, Ebola virus, Ebola 2014, 2014 West Africa Ebola outbreak Correlational and regression analysis with epidemiological cases Correlation was stronger at a global level, but weaker at nation/country level

Basch et al. (32) YouTube (100 most viewed videos viewed more than 73 million times) Ebola Not applicable Not applicable To analyze the most viewed Ebola-related videos Ebola Content analysis YouTube could on the one hand enhance education and on the other hand spread misinformation

Fung et al. (33) Twitter, Google Trends Ebola September 2014–November 2014 Worldwide To understand the public reaction to the Ebola outbreak and the first US case Ebola Qualitative Worldwide traffic on Twitter and Google increased as news spread about the first US case

Fung et al. (34) Sina Weibo, Twitter Ebola 8–9 August 2014 with a follow-up 7 days later Not applicable To capture the reaction to misinformation related to Ebola emergency Ebola Content analysis (manual coding) Misinformation about Ebola was circulated at a very low level globally in social media

Wong et al. (35) Twitter (1,648 tweets) Ebola September 2014–2 November 2014 United States To understand the determinants of tweeting from local health departments Ebola Content analysis (manual coding from 2 independent authors) and regression analysis Approximately 60% of local health departments sent tweets

Wong et al. (35) Twitter via ArcGIS 10.2.2 and Google Trends Ebola September 2014–November 2014 United States To understand the determinants of tweeting from local health departments Ebola Geospatial analysis Weak, negative, non-significant correlation between online search activity, and per capita number of local health department Ebola tweets by state

Towers et al. (36) Twitter (250,723 tweets), web searches Ebola 29 September 2014–31 October 2014 United States To understand the impact of the media coverage on the public reaction to Ebola outbreak Ebola Mathematical model 65–76% of the variance in samples was described by the news media contagion model

van Lent et al. (37) Twitter (4,500 tweets from a corpus of 185,253 tweets) Ebola 22 March 2014–31 October 2014 The Netherlands To understand the predictors of Ebola-related tweet production Ebola, #Ebola Content analysis Significant positive relation between proximity and fear

Strekalova (22) Official CDC Facebook page via a Microsoft Excel add-on, Power Query Ebola 25 March 2014–31 October 2014 Not applicable To understand the usage of social media by the CDC Ebola Content analysis Differences in audience information behaviors in response to an emerging pandemic, and health promotion posts

Fung et al. (38, 39) Twitter (3,640 tweets on malaria) Malaria Not applicable Not applicable To characterize malaria-related tweets #GlobalHealth, #malaria Content analysis (with unsupervised machine learning techniques) The main topics were prevention, control, treatment, followed by advocacy, epidemiology, and social impact

Ocampo et al. (40) Google Trends Malaria 2005–2009 Thailand To exploit the predictive power of Google Trends in forecasting malaria cases Malaria and malaria-related terms Correlational analysis Google Trends-based model well correlated with epidemiological cases

Adawi et al. (41) Google Trends Mayaro Virus From inception (1 January 2004 on) Worldwide Real-time monitoring and tracking of Mayaro virus outbreaks Virus Mayaro, Mayaro virus, virus de Mayaro, virus del Mayaro Correlational and regression analysis Web searches were driven by media coverage rather than reflecting real epidemiological cases

Bragazzi et al. (42) Google Trends West Nile virus From inception (2004 on) Italy To exploit the predictive power of Google Trends West Nile virus Correlational analysis with epidemiological cases A positive significant correlation between web searches and cases was found

Watad et al. (43) Google Trends West Nile virus From inception (from 2004 on) United States To explore the predictive power of Google Trends West Nile virus Correlational and regression analyses and mathematical model Good correlation between web searches and real-world epidemiological figures. Using data 2004–2015 it was possible to predict data for 2016

Basch et al. (44) YouTube (100 most popular videos) Zika Not applicable Not applicable To analyze the most viewed Zika-related videos Zika Content analysis Majority of YouTube videos concerned babies, cases in Latin American and in Africa

Bragazzi et al. (45) Google Trends, Google News, Twitter, YouTube, and Wikipedia Zika 1 January 2004–31 October 2016 Not applicable To capture the public reaction to the Zika outbreak Zika Correlational and regression analyses Public interest was constantly increasing, with public alert on teratogenicity of the Zika virus

Dredze et al. (46) Twitter (138,513 tweets) Zika 1 January 2016–29April 2016 Not applicable To characterize Zika vaccine-related tweets Zika vaccine Content analysis (supervised machine learning techniques) Most tweets contained misleading information

Fu et al. (47) Twitter (1,076,477,185 tweets collected with Twitris 2.0 via API) Zika 1 May 2015–2 April 2016 Worldwide Content analysis of Zika-related Twitters data Zika Topic modeling was used to group bags of words. The 20-topic model was found to fit the data best, them were grouped in 5 themes 5 themes: (1) private/public response to the outbreak; (2) transmission routes; (3) societal impacts of the outbreak; (4) case reports; (5) pregnancy and microcephaly

Fung et al. (38, 39) Pinterest (616 posts), Instagram (616 photos) Zika Not applicable Not applicable To characterize the Zika-related only material shared via Pinterest and Instagram Zika virus, #zikavirus Content analysis (manual coding) Main languages were Spanish or Portuguese. Most popular topics were: prevention, pregnancy, and Zia-related deaths

Glowacki et al. (48) Twitter 1,174 tweets collected Zika During an hour-long live CDC Twitter chat on February 12, 2016 CDC-generated tweets Content analysis of Zika-related Twitters data Zika Text analytics to identify topics and extract meanings, using SAS Text Miner version 12.1 10 topics: virology, spread, infants’ sequelaes, how to participate to the chat, prevention, zika test, pregnants’ concerns, sexual transmission, encouraging to engage the chat, symptoms

Lehnert et al. (49) 913 obstetric practice websites randomly selected, Twitter and Facebook Zika January 2016–August 2016 Not applicable To understand the determinants of social media usage from obstetric community Zika Regression analysis 25–35% of websites reported Zika-related information. Information via social decreased throughout time

Majumder et al. (50) HealthMap and Google Trends Zika 31 May 2015–16 April 2016 Colombia To develop near real-time estimates for R0 and Robs associated with Zika Zika Incidence Decay and Exponential Adjustment (IDEA) model to estimate R0 and the discount factor (d) associated with the ongoing outbreak Robs estimated with digital data is comparable with the number calculated with the traditional method

McGough et al. (51) Google Trends, Twitter, HealthMap Zika May 2015–January 2016 Colombia, Venezuela, Martinique, Honduras, El Salvador To explore the predictive power of non conventional surveillance techniques Zika Mathematical model Integrating different non conventional surveillance techniques can improve prediction of Zika cases

Miller et al. (52) Twitter (1,234,605 tweets collected with Twitris 2.0 via API) Zika 24 February 2016–27 April 2016 Not applicable To determine the relevancy of the tweets regarding: symptoms, transmission, prevention, and treatment Zika, Zika virus, Zika treatment, Zika virus treatment Content analysis with a combination of NLP and ML—annotation performed by three microbiologists and immunologists, supervised classification techniques, including J48, MNB, Bayes Net, SMO, SVM, Adaboost, Bagging, and topical analysis with LDA The majority of the tweets were related to transmission and prevention, and were characterized by a negative polarity

Seltzer et al. (53) Instagram (342 pictures out of 500 tagged images) Zika May 2016–August 2016 Not applicable To characterize Zika-related images #zika Content analysis Most images conveyed negative feelings (such as fear and concerns) and majority of shared pictures contained misleading information

Sharma et al. (54) Facebook (top 200 posts) Zika For a week starting from 21 June 2016 Not applicable To characterize the content of Zika-related Facebook posts Zika Content analysis The misleading posts were far more popular than the accurate posts

Southwell et al. (55) Twitter Zika 1 January 2016–29 February 2016 United States, Guatemala, and Brazil To determine the role of the media coverage on tweets production Zika Correlational analysis A significant relationship between media coverage and digital behaviors was found

Stefanidis et al. (56) Twitter (6,249,626 tweets) Zika December 2015–March 2016 Not applicable To characterize Zika-related tweets in terms of temporal variations of locations, actors, and concepts Zika Spatiotemporal analysis The spatiotemporal analysis of Twitter contributions reflected the spread of interest in Zika from South to North America and then across the globe, with a prominent role played by the CDC and WHO

Teng et al. (57) Google Trends Zika 12 February 2016–20 October 2016 Not applicable To explore the predictive power of Google Trends Zika Mathematical model and correlation with epidemiological cases The best predictive model was autoregressive integrated moving average (0,1,3)

Vijaykumar et al. (58) Facebook pages of the Ministry of Health and National Environmental Agency (NEA) pages (1057 posts of which 33 were Zika-related) Zika 1 March 2015–19 September 2016 Singapore To understand the differences in outreach patterns between the preparedness and response stages of an outbreak Zika Thematic analysis Prevention-related posts as garnering the most likes, while update-related posts were most shared and commented upon