Skip to main content
Digital Health logoLink to Digital Health
. 2022 Apr 12;8:20552076221092539. doi: 10.1177/20552076221092539

Ethical issues of collecting, storing, and analyzing geo-referenced tweets for mental health research

Marta Fadda 1,, Martin Sykora 2, Suzanne Elayan 2, Milo A Puhan 3, John A Naslund 4, Stephen J Mooney 5, Emiliano Albanese 1, Rosalba Morese 1, Oliver Gruebner 3,6
PMCID: PMC9008807  PMID: 35433020

Abstract

Spatial approaches to epidemiological research with big social media data provide tremendous opportunities to study the relationship between the socio-ecological context where these data are generated and health indicators of interest. Such research poses a number of ethical challenges, particularly in relation to issues such as privacy, informed consent, data security, and storage. While these issues have received considerable attention by researchers in relation to research for physical health purposes in the past 10 years, there have been few efforts to consider the ethical challenges of conducting mental health research, particularly with geo-referenced social media data. The aim of this article is to identify strengths and limitations of current recommendations to address the specific ethical issues of geo-referenced tweets for mental health research. We contribute to the ongoing debate on the ethical implications of big data research and also provide recommendations to researchers and stakeholders alike on how to tackle them, with a specific focus on the use of geo-referenced data for mental health research purposes. With increasing awareness of data privacy and confidentiality issues (even for non-spatial social media data) it becomes crucial to establish professional standards of conduct so that compliance with ethical standards of conducting research with health-related social media data can be prioritized and easily assessed.

Keywords: Ethics, mental health, digital epidemiology, spatial epidemiology, digital health geography, geovisual analytics, big data, sentiment analysis, social media

Introduction

Digital epidemiology for mental health research

To date, most epidemiological studies that have involved mining large social media data have focused on the prediction of infectious disease outbreaks, 1 but there is great potential to provide information on emerging trends in a wide range of both communicable and non-communicable conditions. More recently, researchers have started harvesting social media platforms to retrieve and analyze data related to mental health.2,3 Social media research aimed at identifying mental health disorders is based on the analysis of naturally occurring exchanges among online users. In this sense, a specific area of data science has focused on Natural Language Processing (NLP) and Machine Learning (ML) methodologies to analyze, detect, and predict symptoms of mental disorders through social media text, 4 images, and emoticons, 5 as well as social interactions (e.g. through mapping friendship connections). 6 In text mining, sentiment analysis (i.e. the classification of a given text as having positive, neutral, or negative emotional valence) is the most popular tool for understanding emotion expression. For example, Coppersmith and colleagues have measured four different mental health disorders (depression, bipolar disorder, post-traumatic stress disorder, and seasonal affective disorder) by applying the Linguistic Inquiry Word Count (LIWC) tool and open-vocabulary analysis in automatically derived samples of Twitter data. 7 The study was based on previously observed differences between depressed and control groups via LIWC on Twitter. Depressed users tended to use first person pronouns, negative emotion words, and words expressing anger more frequently. 8 Other NLP tools that have been used for research include OpinionFinder, 9 SentiStrength,10,11 and Affective Norms for English Words. 12 In addition, EMOTIVE and Stresscapes go beyond sentiments and identify basic emotions and stress, respectively.13,14 Studies showed that more in-depth and fine-grained affective measures/“signals” have some likely predictive value. In particular, research has showed how extending the LIWC variables with emotion variables (using EMOTIVE) improves the predictive models’ performance (on the same kind of Twitter datasets). One study showed how emotions over time have predictive value in classifying likelihood of depression (building on Coppersmith’s approach), 15 while another closely replicated Coppersmith’s study on predicting the four mental health conditions, showing how emotion variables can improve predictive performance across these. 16 Such methods have been developed both for monitoring population mental health trends and patterns, and identifying risk factors for individuals. 17

While more than 50% of the global population now use social media, with highest relative growth of social media usage occurring in low- and middle-income countries in recent years,18,19 much of research to date using social media for the purpose of mental health has leveraged data from Twitter. Twitter data feature several characteristics that make them particularly appropriate for mental health research. First, the Twitter API is more openly accessible, and tweets typically contain valuable meta-data (such as geo-referenced data) that can provide insight into public sentiments, opinion, and behaviour. 20 Second, Twitter has a strong hashtag culture that makes it easy to find, sort, and collect data. 2 Third, with more than 500 million tweets per day and 126 million daily active users, of which 80% use mobile devices, Twitter research can benefit from a large volume of public real-time, sometimes geo-tagged contents.21,22 Finally, communication on Twitter appears to mirror real world offline trends, for example, performance of movies at the box office, 23 disaster events, 24 political events/elections, 25 which suggests reliability/validity of the data relative to actual human behaviors, interactions, and events. Specifically, in relation to mental health, studies suggest that this platform is a feasible avenue for investigating mental health-related phenomena. Users seem to have a shared understanding that Twitter allows them to communicate freely about their mental health issues and to receive appropriate support from others with similar experiences. 26 However, studies have also found a substantial portion of communication on Twitter about mental illness is negative or stigmatizing,27,28 while others have found that mental health tweets can be used to challenge stigma or share information. 29

Adding the spatial dimension to digital epidemiology

Although person, time, and space represent the three core elements of epidemiology, 30 most digital epidemiological studies targeting mental health are location agnostic, missing the opportunity to integrate the spatial dimension into the analysis of mental health phenomena through social media data analysis. 31 Geo-social or geo-referenced social media data, that is, social media data that include geographic information, have the potential to foster our understanding of the relationship between the socio-ecological context where data are generated (i.e. place-based social determinants of health) and the given mental health indicators. 32 Furthermore, collecting geographic information on the concentration of specific emotions holds the promise of identifying areas of possible mental health distress to inform timely and targeted prevention and intervention strategies. 24 Only few studies analyzed emotional responses of Twitter users in the context of human-made or large-scale natural disasters over space and time. For example, Gruebner and colleagues investigated the feasibility of a space-time syndromic surveillance system for the emotional consequences of mass traumatic events based on Twitter data generated in the context of the terrorist attacks in Paris in 2015. 33 They also used geo-referenced Twitter data to extract negative emotions indicating discomfort in New York City (NYC) before, during, and after Superstorm Sandy in 2012, and to assess geographic variation in emotions over time,3436 also in the COVID-19 context. 37 Others investigated emotional acute and chronic stress as expressed in geo-referenced, in-situ social media language and their relationship with urban environments. 14 Han and Wang classified Weibo (a popular social media platform in China that functions in a similar way to Twitter) texts into six topics and nine sentiments related to the 2018 Shouguang flood and analyzed their spatial-temporal characteristics. 38

The collection, storage, and analysis of large datasets of social media data that involve tracking the location of individuals over time and assessing health-related outcomes in relation to shifting socio-ecological settings raise not only significant technical and methodological challenges but also ethical questions, particularly in relation to issues of privacy, informed consent, and data protection. 39 Yet studies using social media data rarely mention ethical concerns and frequently assume that, since the content they are interested in is public, its collection and use does not require ethical review. 40 In addition, recommendations aimed at providing practical advice on how to ethically collect, store, and analyze social media data for health-related research purposes are scant.

Therefore, the aim of this article is to review recommendations on how to address the ethical issues of collecting, storing, and analyzing social media data, and highlight their strengths and limitations to address the specific ethical issues of geo-referenced tweets for mental health research.

Strengths and weaknesses of current recommendations addressing the ethical issues of social media research

To identify ways to address the ethical implications of conducting research with (geo-referenced) social media data,41,42 we purposely identified ethical recommendations for conducting research with social media data following the approach defined in “Internet Research Ethics” of the Stanford Encyclopedia of Philosophy. 43 In the past two decades, an increasing number of research scholars, various professional associations, and non-profit scholarly and scientific agencies have explored ethical concerns of conducting research with Internet and social media data, and drafted statements or guidelines, or addendum to their extant professional standards. For the purposes of this article, we focused on ethical guidelines for Internet research drafted by scholarly associations, as they are likely to have wider recognition among researchers involved in collecting, storing, and analyzing geo-referenced tweets for mental health research. The Stanford Encyclopedia of Philosophy identifies four main associations that drafted guidelines of such type: the Board of Scientific Affairs of the American Psychological Association, which established an Advisory Group on Conducting Research on the Internet in 2001; the American Counseling Association, which revised its Code of Ethics in 2005; the Association of Internet Researchers (AoIR) Ethics Working Group Guidelines; and the National Committee for Research Ethics in the Social Sciences and the Humanities (NESH Norway). From this list, we excluded the guidelines developed by the Norwegian National Committee for Research Ethics as they may have only limited, local recognition, and those proposed by the American Counseling Association, as they constituted a minor addition to their more general Code of Ethics, which does not address the ethics of Internet or social media research as its main focus. Therefore, we selected the “Ethics Recommendations for Internet-mediated Research,” developed by the British Psychological Society (BPS), 44 and “Internet Research: Ethical Recommendations 3.0,” developed by the AoIR. 45 However, the two documents do not make specific reference to location tracking in social media research. The BPS suggests that the main ethics principles outlined in the Society’s Code of Human Research Ethics, namely respect for autonomy, privacy, and dignity of research participants, scientific integrity, social responsibility, and maximizing benefits and minimizing harm, should also apply to internet-mediated research. 44 Furthermore, for each principle, the recommendations highlight issues that may need special consideration in the context of internet-mediated research, including: the public-private domain distinction online; confidentiality and security of online data; procedures for obtaining valid consent; procedures for ensuring withdrawal rights and debriefing; levels of researcher control; and implications for scientific value and potential harm.42,43

The ethical concerns addressed by the two documents may be summarized into four main issues: (1) users’ privacy expectations and informed consent, (2) data security, (3) data confidentiality, and (4) social responsibility.

Privacy expectations

The recommendations state that collecting data without informed consent from a public online domain is justifiable when people could reasonably expect to be observed by others. 44 However, research shows that participants are not always aware that their contributions on social media websites are potentially accessible and that they may have context-specific intentions and expectations regarding how their content is used and who uses it. 46 In the case of Twitter, considering that approximately half of all tweets contain personal information about the user generating the content, 47 retrieving, analyzing, and storing such data can pose potential threats to users’ privacy and represent a source of rights violation, especially for those who are unaware of the public nature of their information and its harvesting for research purposes. Accessing such information may expose users to different sources of harm (e.g. posting tweets disclosing information about vacations, consuming drugs or alcohol, engaging in reckless behaviors, or having a stigmatizing disease that can put users at risk of harm from their family or other personal relationships, their employers, or insurance companies). 48 However, as the chance of violations of anonymity and confidentiality increases, arguments that valid consent is not necessary are weakened. 44 Furthermore, the recommendations recognize that participants may provide their consent without having been properly informed of the relevant issues. According to the AoIR, a way of trying to mitigate risks for participants is by reserving the acquisition of informed consent to the dissemination stage of a project, that is, by asking for informed consent from specific subjects for the publication of a quote or other data that might make them and their personal information identifiable. Finally, the guidelines do not recognize that users can lose control over their ability to remove their posts from social media (e.g. by closing an account, or deleting old posts) if this information is retained in a research dataset. For example, a user may withdraw her consent for a piece of information to be public, but cannot remove it from a dataset where this information has already been included, simply because the user is unaware the dataset exists and cannot contact the data owner. This could potentially have major implications for informed consent and for users’ withdrawal from a research study when their preferences over the extent to which “to be public” have changed. To address this issue, it would be ideal to establish a system to regularly assess whether the Twitter users involved in the dataset have changed their privacy settings or limited audience to which the data were made available, particularly before analyses are conducted to answer a new research question. However, this solution may be challenging (especially with regards to resourcing such a continuous task) and careful anonymization might be the only option to mitigate privacy concerns. For example, while a process of running a light-weight rehydration 1 to establish whether Twitter still returns content for specific tweet IDs could be implemented, it can be time consuming with large datasets. If a new research question/data sharing request is made to a repository, a fairly common practice is that it would be up to the researcher team interested in the dataset to rehydrate the data. Platform companies should also provide more efficient ways for researchers to be able to re-query validity of consent (especially for large datasets).

Data security

To maximize data security, the AoIR recommends applying technological solutions such as encryption, grounding such recommendation in the general principle that online activities should be mediated by an IT Security team with appropriate special responsibility and training. 45

Data confidentiality

The BPS recognizes risks relating to the levels of researcher control over confidentiality of data, particularly during the data collection and storage process, highlighting the need for researchers to recognize that it is not possible to uphold complete confidentiality of participants’ personal information gathered online because the networks are not in the control of the researcher.44,45 Researchers need to think about practices to properly inform participants about the ways their data are stored and transferred, particularly where risks are higher. This could be done by providing information through the platform where data are generated by the users. Furthermore, participants should be informed about the possibilities for breaches of confidentiality through the use of search engines (e.g. by copying and pasting the text portion of a social media post in a web search engine such as Google, it is possible to trace the user who generated this content online) and the accumulation of data from multiple sources.44,45 The recommendations highlight, that issues of confidentiality breaches may increase the risk of harm for participants, but do not mention how data sharing should occur and what the possible consequences of a sub-optimal handling of the data from a methodological point of view could be. To further reduce the risk of causing privacy harm or social disruptions by third parties who might be granted access to the data for research purposes, we recommend that researchers interested in using the data seek ethical approval for any secondary use of the data. In addition, we recommend that researchers share only agreed portions of the original, full dataset with third parties. For instance, researchers can be provided with a dataset containing the geo-location of tweets and the timestamp and a limited number of variables, so that identification of the users is further limited. When deciding which variables to share, special consideration should be given to different types of variables: for example, content-based emotion or sentiment variables are unlikely to be very revealing with regards to identity, while other tweet/account specific metadata (such as number of followers/followees, account creation date) might be much more dangerous. This would ensure that the project’s intrusion on users’ privacy by accessing their information is justified, and respect for their privacy expectations is maximized and balanced against the progress of the methodology and the promotion of public good interests. 49

Social responsibility

The BPS raises several key points in relation to the issue of social responsibility. It recommends considering the extent to which proposed research study procedures and dissemination practices might disrupt/harm specific social groups, and maintaining respect for social structures, as intrusions from researchers into spaces that their users consider private may be perceived as “invasive, unwelcome, and socially irresponsible.” 44 Since data extracted and enriched with sentiment analysis might identify users, locate them, or link information that can become a potential source of stigmatization (by, for instance, identifying them as having a mental condition), we recommend researchers to carefully assess whether the Twitter users involved in the dataset intended to make their information public in the light of their privacy settings or limited audience to which the data were made available. Therefore, we recommend researchers to only “rehydrate” those tweets that are available via the Twitter API. This procedure effectively provides for a recent assessment of whether consent had been withdrawn by the user.

Discussion

In this article, we mapped two widely recognized sets of recommendations in the context of analyzing big social media data for public health research. We found that three additional ethical considerations may apply to studies using geo-referenced data for mental health research that the recommendations do not specifically address: (1) an increased risk of identification; (2) increased risk of stigma; and (3) consequences of suboptimal quality of the analysis.

Increased risk of identification

Geo-referenced data offer the possibility to localize and therefore more accurately monitor people over space and time. Studies have shown that, while people generally geotag in uncommon places, consciously and intentionally, during or soon after being in a given place, many Twitter users do so inadvertently, or they geotag more precisely than they think. 50 Studies suggest that participants might identify researchers as intrusive in online communities, 51 and indicate a preference that researchers only use information marked as “public” by the author.7,52,53 Other studies found that only few users were aware that their tweets could be extracted and analyzed by researchers, with most feeling that this should not be done without their consent. 54 A very similar finding is from an exploratory survey with over 100 UK-based (mostly student) respondents. 52 Specifically, participants claimed to be generally aware of some analytics occurring, but were unsure as to what data about them was being analyzed and unclear about the actual reach of information they reveal online—its visibility to third parties. 55 To address this issue, we recommend following Rivers and Lewis six “TACTICs” for the ethical use of Twitter data that highlight transparency of the research process, protection of the anonymity of authors, respect for authors’ control efforts, focused collection, respect for the context where data are generated and collaboration with an IRB (Institutional Review Board). 56 In addition, to ensure transparency on what data is being collected and for which purposes, a registry should be made publicly available that summarizes all research studies using Twitter geo-reference data, and that offers contact information so that questions can be forwarded to PIs and their research teams. This could also be important for clearly stating the research objectives of different studies.

Increased risk of stigma

Due to the stigma associated with mental illness, special considerations may apply to the collection and analysis of geo-referenced big data for mental health research.57,58 There are ethical implications to using social media data posted by people who may have shared their information while being exposed, for example, during a large-scale disaster or disease outbreak. Therefore, they may not necessarily recognize that they share potentially sensitive data that can be collected and analyzed, either for scientific activities, commercial purposes, or first aid assistance. 46 In addition, when mental health issues have been identified through geo-referenced social media data and prompt intervention is warranted, those targeted by the intervention may incur in stigma-related harm for being identified as mentally ill. 59 Finally, while Twitter users may have (knowingly or unknowingly) consented to their information being public, they are unlikely to have consented for their information to be analyzed for mental health purposes. It may be a source of distress for some users to know that their posts were collected and analyzed to explore potential presence of mental health symptoms, or more alarmingly, to “label” them with a specific mental illness.

Consequences of suboptimal quality of the analysis

A third issue is the aftermath of the scientific analysis of the geo-referenced data, when its quality is suboptimal. For example, a major issue in social media big data research is that few studies are driven by a priori hypothesis testing, and instead are exploratory, which raises concerns about data dredging. If the scientific work is done poorly (e.g. the study is not rigorously designed, or the analytic techniques are not adequately described for the given purpose), wrong beliefs may come out of the research and may have direct impacts on public health measures (e.g. use of social media for diagnosis of mental health conditions risks labeling individuals who might not actually seek clinical consultation), or the stigmas associated to given populations. Since analyzing big data may lead to the creation of new data, 60 this should occur in a context that meets the highest methodological and data protection standards. A data sharing and confidentiality agreement is a formal contract that documents what data are being shared between the project principal investigators and third parties and how the data can be accessed and analyzed. It specifies the intended use of the data, any constraints on the use of the data, and how to ensure data confidentiality and security. However, issues of methodological rigor are not contemplated in such an agreement. Therefore, we recommend including considerations of methodological robustness in the data sharing and confidentiality agreement.

Limitations and conclusions

Some limitations of this perspective article are worth noting. We excluded the regulations and policies and the terms of service (including privacy statements) of the social media used in the research (i.e. Twitter) as well as the scientific literature available on the topic of the ethical, legal, and social implications of big data and social media research. This is because the scope of this article was to target professional, practical recommendations, rather than theoretical frameworks. Therefore, our search is not exhaustive, and may not capture all practical recommendations from professional societies. We also excluded the legal aspects of collecting, storing, and analyzing geo-referenced social media data, which is also an important area that requires increasing attention and emphasis for informing recommendations.

As the field continues to evolve rapidly, with new digital platforms emerging and new applications of digital data, there is a need to revisit existing guidelines on a regular basis to ensure that the concerns highlighted here are adequately addressed and modified accordingly to new risks that are identified. Furthermore, as most of the world’s social media users are located in low- and middle-income countries, scholars involved in the analysis of social media data for mental health purposes in these settings need to reflect on the implications of their work in light of the paucity of regulations, limited role of professional societies governing how social media data is used, relatively fewer efforts aimed at exploring social media communication in different languages and diverse cultural contexts, and significant deficits in providing even basic mental health services. 61 Here we provided a selection of practical recommendations to researchers interested in the use of geo-referenced social media data for analyzing mental health outcomes, by highlighting strengths and weaknesses of current recommendations.

Acknowledgments

We thank Christine Clavien for her comments on an earlier version of this manuscript. We also thank the Department of Data Protection at the University of Zurich for their tremendous support with the legal aspects in our M-HEALTH project, which served as the foundation for this manuscript.

1.

Rehydration is a standard process (in line with Twitter’s terms of use for sharing tweets) involving sending requests to the official Twitter API /statuses/lookup endpoint with tweet-IDs uniquely identifying a tweet, and Twitter responding with the tweet and available metadata. This process ensures that in case a user has set their account to private or have deleted their tweet, effectively withdrawing consent, that information would no longer be available for analysis.

Footnotes

Conflict of interest: The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Contributorship: MF, EA, and OG contributed to conception of the article. MF wrote the first draft of the manuscript. OG wrote sections of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.

Ethical approval: Not Applicable.

Funding: The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Swiss School of Public Health (SSPH+; Social Media Mental Health Surveillance M-HEALTH to OG, EA, and MAP).

Guarantor: Not Applicable.

References

  • 1.Salathé M, Khandelwal S. Assessing vaccination sentiments with online social media: implications for infectious disease dynamics and control. PLOS Comput Biol 2011; 7: e1002199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Berry N, Lobban F, Belousov Met al. et al. #WhyWeTweetMH: understanding why people use Twitter to discuss mental health problems. J Med Internet Res 2017; 19: 1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Coppersmith G, Dredze M, Harman C. Quantifying mental health signals in Twitter. In Proceedings of the ACL Workshop on Computational Linguistics and Clinical Psychology, 2015. pp. 51–60. [Google Scholar]
  • 4.Moreno MA, Jelenchick LA, Egan KGet al. et al. Feeling bad on Facebook: depression disclosures by college students on a social networking site. Depress Anxiety 2011; 28: 447–455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Tan L, Toh HJ, Sim LKet al. et al. Sensitivity and specificity analysis: use of emoticon for screening of depression in elderly in Singapore. J Am Psychiatr Nurses Assoc 2018; 24: 452–456. [DOI] [PubMed] [Google Scholar]
  • 6.Seabrook EM, Kern ML, Rickard NS. Social networking sites, depression, and anxiety: a systematic review. JMIR Ment Health 2016; 3: 50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Coppersmith G, Harman C, Dredze M. Measuring post traumatic stress disorder in Twitter. Proc 8th Int Conf Weblogs Soc Media ICWSM 2014; 2014: 579–582. [Google Scholar]
  • 8.Chung C, Pennebaker J. The psychological functions of function words. In: Social communication frontiers of social psychology. New York, NY, US: Psychology Press, 2007 343–359. [Google Scholar]
  • 9.Bollen J, Gonçalves B, Ruan Get al. et al. Happiness is assortative in online social networks. Artif Life 2011; 17: 237–251. [DOI] [PubMed] [Google Scholar]
  • 10.Kang K, Yoon C, Kim EY. Identifying depressive users in Twitter using multimodal analysis. International Conference on Big Data and Smart Computing (BigComp), 2016: pp. 231–238. doi: 10.1109/BIGCOMP.2016.7425918 [DOI] [Google Scholar]
  • 11.Durahim AO, Coşkun M. #Iamhappybecause: gross national happiness through Twitter analysis and big data. Technol Forecast Soc Change 2015; 99: 92–105. [Google Scholar]
  • 12.Choudhury MD, Gamon M, Counts Set al. et al. Predicting depression via social Media. ICWSM 2013; 2: 128–137. (AAAI, 2013). [Google Scholar]
  • 13.Sykora MD, Jackson T, O’Brien Aet al. et al. Emotive ontology: extracting fine-grained emotions from terse, informal messages. Int J Comput Sci Inf Syst 2013; 2: 19–26. [Google Scholar]
  • 14.Elayan S, Sykora M, Shankardass Ket al. The Stresscapes Ontology System: Detecting and Measuring Stress on Social Media. In 7th European Conference on Social Media ECSM 2020 (2020). p. 74-82.
  • 15.Chen X, Sykora M, Elayan Set al. et al. What about mood swings: Identifying depression on Twitter with temporal measures of emotions. In Companion Proceedings of the The Word Wide Web Conference 2018. (2018) pp. 1653–1660. doi.org/10.1145/3184558.3191624.
  • 16.Chen X, Sykora M, Elayan Set al. et al. Tweeting your mental health: An exploration of different classifiers and features with emotional signals in identifying mental health conditions. In Proceedings of the 51st Hawaii International Conference on System Sciences, (2018) pp. 3320-3328. doi:10.24251/HICSS.2018.421.
  • 17.De Choudhury M, Counts S, Horvitz E. Predicting postpartum changes in emotion and behavior via social media. In Conference on Human Factors in Computing Systems - Proceedings, 3267–3276. doi:10.1145/2470654.2466447.
  • 18.Digital 2021: Global Overview Report – DataReportal – Global Digital Insights. Available at: https://datareportal.com/reports/digital-2021-global-overview-report [Accessed February 8, 2021].
  • 19.Digital 2021: the latest insights into the ‘state of digital’. We Are Soc UK - Glob Socially-Led Creat Agency (2021) Available at: https://wearesocial.com/uk/blog/2021/01/digital-2021-the-latest-insights-into-the-state-of-digital [Accessed February 8, 2021].
  • 20.Chew C, Eysenbach G. Pandemics in the age of Twitter: content analysis of tweets during the 2009 H1N1 outbreak. PLOS ONE 2010; 5: e14118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Sinnenberg L, Buttenheim AM, Padrez Ket al. et al. Twitter as a tool for health research: a systematic review. Am J Public Health 2017; 107: e1–e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Williams SA, Terras MM, Warwick C. What do people study when they study Twitter? Classifying Twitter related academic papers. J Doc 2013; 69: 384–410. [Google Scholar]
  • 23.Gaikar DD, Marakarkandy B, Dasgupta C. Using Twitter data to predict the performance of Bollywood movies. Ind Manag Data Syst 2015; 115: 1604–1621. [Google Scholar]
  • 24.Gruebner O, Lowe SR, Sykora Met al. et al. Spatio-temporal distribution of negative emotions in New York city after a natural disaster as seen in social media. Int J Environ Res Public Health 2018; 15: 2275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Gaurav M, Srivastava A, Kumar Aet al. et al. Leveraging candidate popularity on Twitter to predict election outcome. In: Proceedings of the 7th workshop on social network mining and analysis - SNAKDD ’13. Chicago, Illinois: ACM Press, 2013, 1–8. [Google Scholar]
  • 26.Betton V, Borschmann R, Docherty Met al. et al. The role of social media in reducing stigma and discrimination. Br J Psychiatry J Ment Sci 2015; 206: 443–444. [DOI] [PubMed] [Google Scholar]
  • 27.Joseph AJ, Tandon N, Yang LHet al. et al. #Schizophrenia: use and misuse on Twitter. Schizophr Res 2015; 165: 111–115. [DOI] [PubMed] [Google Scholar]
  • 28.Passerello GL, Hazelwood JE, Lawrie S. Using Twitter to assess attitudes to schizophrenia and psychosis. BJPsych Bull 2019; 43: 158–166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Reavley NJ, Pilkington PD. Use of Twitter to monitor attitudes toward depression and schizophrenia: an exploratory study. PeerJ 2014; 2: e647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Moore DA, Carpenter TE. Spatial analytical methods and geographic information systems: use in health research and epidemiology. Epidemiol Rev 1999; 21: 143–161. [DOI] [PubMed] [Google Scholar]
  • 31.Schwartz R, Halegoua GR. The spatial self: location-based identity performance on social media. New Media & Society 2015; 17: 1643–1660. [Google Scholar]
  • 32.Gruebner O, Sykora M, Lowe SRet al. et al. Big data opportunities for social behavioral and mental health research. Soc Sci Med 2017; 189: 167–169. [DOI] [PubMed] [Google Scholar]
  • 33.Gruebner O, Sykora M, Lowe SRet al. Mental health surveillance after the terrorist attacks in Paris. Lancet 2016; 387: 2195–2196. [DOI] [PubMed] [Google Scholar]
  • 34.Lowe SR, Sampson L, Gruebner Oet al. et al. Community unemployment and disaster-related stressors shape risk for posttraumatic stress in the longer-term aftermath of hurricane sandy. J Trauma Stress 2016; 29: 440–447. [DOI] [PubMed] [Google Scholar]
  • 35.Sampson L, Lowe SR, Gruebner Oet al. et al. Perceived service need after hurricane sandy in a representative sample of survivors: the roles of community-level damage and individual-level stressors. Disaster Med Public Health Prep 2016; 10: 428–435. [DOI] [PubMed] [Google Scholar]
  • 36.Garske SI, Elayan S, Sykora Met al. Space-time dependence of emotions on Twitter after a natural disaster. Int J Environ Res Public Health 2021; 18: 5292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Edry T, Maani N, Sykora Met al. Real-time geospatial surveillance of localized emotional stress responses to COVID-19: a proof of concept analysis. Health Place 2021; 70: 102598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Han X, Wang J. Using social Media to mine and analyze public sentiment during a disaster: a case study of the 2018 Shouguang city flood in China. Int J Geo-Inf 2019; 8: 85. [Google Scholar]
  • 39.Salathé M, Bengtsson L, Bodnar TJ, et al. Digital epidemiology. PLoS Comput Biol 2012; 8:1–5. doi: 10.1371/journal.pcbi.1002616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zimmer M, Proferes NJ. A topology of Twitter research: disciplines, methods, and ethics. Aslib J Inf Manag 2014; 66: 250–261. [Google Scholar]
  • 41.Kinder-Kurlanda K, Weller K, Zenk-Möltgen Wet al. et al. Archiving information from geotagged tweets to promote reproducibility and comparability in social media research. Big Data Soc 2017; 4: 2053951717736336. [Google Scholar]
  • 42.Sloan L, Morgan J. Who tweets with their location? understanding the relationship between demographic characteristics and the use of geoservices and geotagging on Twitter. PLoS ONE 2015; 10: e0142209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Buchanan EA, Zimmer M. Internet Research Ethics. The Stanford Encyclopedia of Philosophy (Spring 2021 Edition), Edward N. Zalta (ed.), forthcoming URL = <https://plato.stanford.edu/archives/spr2021/entries/ethics-internet-research/>.
  • 44.British Psychological Society. Ethics Guidelines for Internet-mediated Research. INF206/04.2017. Leicester: Author. Available from: www.bps.org.uk/publications/policy-and-guidelines/ research-guidelines-policy-documents/researchguidelines-poli. 2017.
  • 45.Franzke AS, Bechmann A, Zimmer M. Ess C and the Association of Internet Researchers (2020). Internet Research: Ethical Guidelines 3.0. https://aoir.org/reports/ethics3.pdf.
  • 46.Golder S, Ahmed S, Norman Get al. et al. Attitudes toward the ethics of research using social Media: a systematic review. J Med Internet Res 2017; 19: e195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Naaman M, Boase J, Lai CH. Is it really about me? Message content in social awareness streams. Proceedings of the ACM Conference on Computer Supported Collabortive Work (CSCW2010), Savannah, GA.
  • 48.Mao H, Shuai X, Kapadia A. Loose tweets: an analysis of privacy leaks on Twitter. In Proceedings of the 10th annual ACM workshop on Privacy in the electronic society (WPES '11). Association for Computing Machinery, New York, NY, USA, 1–12. (2011) DOI: 10.1145/2046556.2046558. [DOI]
  • 49.Beauchamp TL, Childress JF. Principles of biomedical ethics. 7th ed. New York: Oxford University Press, 2013. [Google Scholar]
  • 50.Tasse D, Liu Z, Sciuto Aet al. et al. State of the geotags: motivations and recent changes. Proc Int AAAI Conf Web Social Media 2017; 11: 250–259. [Google Scholar]
  • 51.Eysenbach G, Till JE. Ethical issues in qualitative research on internet communities. Br Med J 2001; 323: 1103–1105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Shepherd A, Sanders C, Doyle Met al. et al. Using social media for support and feedback by mental health service users: thematic analysis of a Twitter conversation. BMC Psychiatry 2015; 15: 29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Coppersmith G, Dredze M, Harman C. Quantifying Mental Health Signals in Twitter. in Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality (Baltimore, Maryland, USA: Association for Computational Linguistics), 51–60. doi: 10.3115/v1/W14-3207. [DOI]
  • 54.Fiesler C, Proferes N. “Participant” perceptions of Twitter research ethics. Soc Media + Society 2018; 4: 205630511876336. [Google Scholar]
  • 55.Sykora M, Elayan S, Barbour Net al. et al. A Survey of the Ethics of Social Media Analytics. In 7th European Conference on Social Media ECSM 2020, pp. 298-305.
  • 56.Rivers CM, Lewis BL. Ethical research standards in a world of big data. F1000Res 2014; 3: 38. [Google Scholar]
  • 57.Naslund JA, Aschbrenner KA. Risks to privacy with use of social Media: understanding the views of social Media users with serious mental illness. Psychiatr Serv Wash DC 2019; 70: 561–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Naslund JA, Bondre A, Torous Jet al. et al. Social Media and mental health: benefits, risks, and opportunities for research and practice. J Technol Behav Sci 2020; 5: 245–257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Mak WWS, Poon CYM, Pun LYKet al. et al. Meta-analysis of stigma and mental health. Soc Sci Med 1982 2007; 65: 245–261. [DOI] [PubMed] [Google Scholar]
  • 60.Sax M. Big data: finders keepers, losers weepers? Ethics Inf Technol 2016; 18: 25–31. [Google Scholar]
  • 61.Naslund JA, Gonsalves PP, Gruebner Oet al. et al. Digital innovations for global mental health: opportunities for data science, task sharing, and early intervention. Curr Treat Opt Psychiatry 2019; 6: 337–351. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Digital Health are provided here courtesy of SAGE Publications

RESOURCES