Table 1.
Publicly available COVID-19-related social media datasets.
Data Provider | Dataset Name | Geolocation Included | ID Only | Text | Advanced Analysis/Secondary Data | Geographic Coverage | Temporal Coverage | Publication |
---|---|---|---|---|---|---|---|---|
Twitter | COVID-19 Twitter Dataset with Latent Topics, Sentiments and Emotions | No | No | No | Sentiment and emotion | Global/country | 01/28/2020–09/01/2021 | (Gupta et al., 2020) |
Twitter | COVID-19-TweetIDs | No | Yes | No | No | Global | 01/21/2020–02/11/2022 | (Chen et al., 2020a) |
Twitter | Coronavirus (COVID-19) tweets dataset | No | No | No | Sentiment | Global | 03/20/2020–02/12/2022 | (Lamsal, 2021) |
Twitter | Coronavirus geo-tagged tweets datasets | Yes | No | No | Sentiment | Global | 03/20/2020–02/12/2022 | (Lamsal, 2021) |
Twitter | Covid-19 Twitter chatter dataset for scientific use | No | Yes | No | No | Global/country | 03/22/2020–02/12/2022 | (Banda et al., 2021) |
Twitter | CoronaVis: A Real-time COVID-19 Tweets Analyzer | No | Yes | No | No | Global | 03/05/2020–12/31/2020 | (Kabir and Madria, 2020) |
Twitter | An Augmented Multilingual Twitter dataset for studying the COVID-19 infodemic | Yes | No | No | Sentiment, entity recognition, mentions, and hashtags | Global/country | 01/01/2020–12/31/2021 | (Lopez and Gallemore, 2021) |
Twitter | GeoCoV19: A Dataset of Hundreds of Millions of Multilingual COVID-19 Tweets with Location Information | Yes | No | No | Sentiment and entity recognition | Global/location | 02/01/2020–03/31/2022 | (Qazi et al., 2020) |
Twitter | COVID-19 Twitter Dataset | No | Yes | No | No | Global | 04/01/2020–09/31/2020 | (Gruzd and Mai, 2020) |
Twitter | Preliminary Extraction from Geotweet Archive v2.0 for COVID-19 Tweets | Yes | Yes | No | No | Global/location | 03/01/2020–04/30/2020 | |
Sina Weibo | Weibo COVID dataset | No | No | Yes | No | China | 12/07/2020–04/04/2020 | (Leng et al., 2020) |
Sina Weibo | COVID-19 related Weibo Data | No | No | Yes | No | China | 12/01/2019–02/27/2020 | (Fu and Zhu, 2020) |
Reddit | Reddit Mental Health Dataset | No | No | Yes | Sentiment and emotion | 28 mental health and non-mental health subreddits | 01/01/2018–01/01/2020 | |
Reddit | Coronavirus subreddit | No | No | Yes | Sentiment and topic modeling | r/Coronavirus subreddit | 01/20/2020–01/31/2021 | |
Reddit | The Reddit COVID dataset | No | | Yes | Sentiment | Posts and comments mentioning COVID in their title and body text | N/A–25/10/2021 | (Tan, 2021) |
YouTube | YouTube's Pseudoscientific Video Recommendations | No | No | Yes | No | Search terms: 'covid-19', 'coronavirus', 'anti-vaccination', 'anti-vaxx', 'anti-mask', or 'flat earth' | | (Papadamou et al., 2020) |
YouTube | Covid-related misinformation videos | No | Yes | No | No | Global | 11/01/2019–06/30/2020 | |
Instagram | COVID19 Instagram Post IDs | No | Yes | No | No | Global | 01/05/2020–03/30/2020 | (Zarei et al., 2020) |