Information propagation on cyber, relational and physical spaces about covid-19 vaccine: Using social media and splatial framework

Fuzhen Yin; Andrew Crooks; Li Yin

doi:10.1016/j.compenvurbsys.2022.101887

. 2022 Sep 14;98:101887. doi: 10.1016/j.compenvurbsys.2022.101887

Information propagation on cyber, relational and physical spaces about covid-19 vaccine: Using social media and splatial framework

Fuzhen Yin ^a,^⁎, Andrew Crooks ^b, Li Yin ^a

PMCID: PMC9472797 PMID: 36124092

Abstract

With the advent of social media, human dynamics studied in purely physical space have been extended to that of a cyber and relational context. However, connections and interactions between these hybrid spaces have not been sufficiently investigated. The “space-place (Splatial)” framework proposed in recent years allows capturing human activities in the hybrid of spaces. This study applies the Splatial framework to examine the information propagation between cyber, relational, and physical spaces through a case study of Covid-19 vaccine debates in New York State (NYS). Whereby the physical space represents the regional boundaries and locations of social media (i.e., Twitter) users in NYS, the relational space indicates the social networks of these NYS users, and the cyber space captures the larger conversational context of the vaccination debate. Our results suggest that the Covid-19 vaccine debate is not polarized across all three spaces as compared to that of other vaccines. However, the rate of users with a pro-vaccine stance decreases from physical to relational and cyber spaces. We also found that while users from different spaces interact with each other, they also engage in local communications with users from the same region or same space, and distance-based and boundary-confined clusters exist in cyber and relational space communities. These results based on the Splatial framework not only shed light on the vaccination debates but also help to define and elucidate the relationships between the three spaces. The intense interactions between spaces suggest incorporating people’s relational network and cyber presence in physical place-making.

Keywords: Covid-19, Vaccination, Social media, Social network analysis, Community detection, Urban informatics

1. Introduction

With the advent of social media, human dynamics studied in purely physical space have been extended to that of a cyber and relational context. This shift has been further intensified by the Covid-19 pandemic. During the early stages of the pandemic, while preventative measures such as social distancing and lockdowns attempted to reduce physical activities and interactions (Li, Zhao, He, Mansourian, & Axhausen, 2021), they also increased people’s use of social media to obtain health information and communicate with their friends and family while being physically apart (González-Padilla and Tortolero-Blanco, 2020, Saud et al., 2020). For example, during the pandemic, the hashtags “#covid” and “#coronavirus” were heavily mentioned on social media platforms (Chen, Lerman, & Ferrara, 2020). Furthermore, healthcare agencies and government officials have also leveraged social media platforms to mitigate the spread of misinformation about Covid-19, especially that related to the vaccines (Lovari, 2020); for example, the New York State’s #GetTheVaxFacts campaign (New York State, 2021b) and Connecticut’s Long Term Care Ombudsman Program (CDC, 2021).

Despite the benefits of social media including its ability to efficiently reach a large population during a crisis (Saud et al., 2020), there have also been considerable public health concerns raised by the spread of online anti-vaccine messaging (Puri, Coomes, Haghbayan, & Gunaratne, 2020). Studies have found polarized vaccination debates on social media (Yuan et al., 2019, Schmidt et al., 2018), suggesting the lack of interactions between pro and anti-vaccine users. Such polarization has the potential to strengthen the ideological isolation of anti-vaccine users, fuel vaccine hesitancy, and potentially lead to a dramatic increase in disease outbreak probabilities (Salathé & Bonhoeffer, 2008). However, the polarized vaccination debates that were found before the pandemic might not apply to the current Covid-19 vaccine due to their new characteristics including the rising trend in positive sentiment towards them (Hu et al., 2021) and health organizations’ active intervention to debunk vaccine misinformation on social media (New York State, 2021b, CDC, 2021). Therefore, it is important to gain a better understanding of the communication patterns of pro and anti-vaccine users in light of the current Covid-19 pandemic and specifically on social media in terms of whether they are polarized or not.

Coinciding with the lack of studies on Covid-19 vaccine polarization is that of studies on how information or different opinions (i.e., pro and anti-Covid-19 vaccine) are propagated across spaces. While some studies have attempted to establish connections between cyber and physical spaces. For example, through analyzing the spread of civil unrest during the Arab Spring across North Africa and the Middle East, AlSayyad and Guvenc (2015) found the reciprocal interactions between the social movements in the physical world and the media coverage on digital platforms. In another study, Zhao, Huang, Huang, Liu, and Lai (2014) found a correlation between online surfing behaviors and offline mobility patterns which suggested that human dynamics happening in one space could be used to predict those in another. While Gunaratne, Coomes, and Haghbayan (2019) and Ahmed, Quinn, Hancock, Freimuth, and Jamison (2018) argued that the pro or anti-vaccine messages trending in cyber space have the potential to link with the vaccine uptake or disease outbreak in the physical space. However, rarely have studies investigated how information and opinions are propagated from offline to online and from local to global.

To fill the gaps mentioned above, this study employs the recently proposed “space-place (Splatial)” framework to investigate the Covid-19 vaccine discourse on social media. The novelty of this Splatial framework lies in the fact that it provides a new way to capture and analyze human dynamics and interactions across multiple spaces (Shaw and Sui, 2020, Sui and Shaw, 2021). In addition to the cyber and physical spaces, this Splatial framework also adds the concept of relational space that infers networks between objects based on their relations and has the potential to bridge the cyber and physical gap (Croitoru, Wayant, Crooks, Radzikowski, & Stefanidis, 2015). However, this concept of relational space was relatively new and theoretical, and this concept needs further elaboration based on real-world events.

Using the Splatial framework, this paper investigates the propagation of different opinions (i.e., pro and anti-vaccine) between three spaces: cyber, relational, and physical spaces through a case study on the Covid-19 vaccine discussion in New York State (NYS) on Twitter. We aim to answer the following two research questions: (1) Are the pro and anti-Covid-19 vaccine social media users polarized in the three spaces? (2) How different opinions of pro and anti-vaccine were propagated between cyber, relational, and physical spaces? We employed several approaches including snowball sampling (Biernacki & Waldorf, 1981) to represent three spaces, sentiment analysis of Twitter data (Rathi, Malik, Varshney, Sharma, & Mendiratta, 2018), community detection in tweet-reply networks (Blondel, Guillaume, Lambiotte, & Lefebvre, 2008), and geosocial network analysis of the hybrid space systems, i.e., the cyber-relational system and the relational-physical system (Croitoru et al., 2015).

In the remainder of this paper, Section 2 discusses the “Spatial” framework along with our rationale for using it and how the rise of social media as a communication avenue can impact our understanding of place and space. In particular, we discuss the role of social media in emancipating human dynamics from physical to relational and cyber spaces. To showcase the role of social media, we concentrate on a particular case study, the Covid-19 vaccine debates. Section 3 introduces the study area and data collection process. Section 4 demonstrates the analysis of a case study before presenting our results in Section 5. Finally, Section 6 presents a summary and discussion of findings, limitations, and areas of further work.

2. Background

The definitions of space and place are constantly evolving and are the center of focus for many research fields such as Urban Planning and Geography (Dodge & Kitchin, 2003). However over the last two decades, with the growing sophistication of information and communication technologies (ICTs), location-aware devices combined with machine learning and data science have greatly changed how we study and understand space and place (Shaw & Sui, 2020). The significant growth of social media has given rise to the expansion of cyber space, and has stimulated research in exploring the connections between cyber and physical spaces along with online-to-offline interaction (Sui and Shaw, 2018, Batty, 1997).

Despite its great dependency on the software and hardware in the physical space (Batty, 1997), cyber space is often considered as the opposite of physical space or a great “no place” (Herrera, 2016, Lessig, 1995). Cyber space shows metaphysical differences (i.e., place, size, distance, route) from physical space (Bryant, 2001). Furthermore, cyberspace is gradually replacing the role of physical space by altering the traditional human dynamic patterns in physical space (Sui & Shaw, 2018). Specifically, by erasing the barrier of physical distance, cyberspace enhances our ability to conduct activities in a more flexible and timely manner (Yu & Shaw, 2008) and allows us to undertake a wide range of daily activities online (Kwan, 2000). For example, with internet access, people can work and study remotely, and socialize with friends through social networking platforms along with carrying out other activities such as shopping for goods online (Line, Jain, & Lyons, 2011). Meanwhile, it is often suggested that there is a correlation between human behaviors and patterns in cyber and physical spaces (i.e., homophily) which suggests that the dynamics witnessed in one space could be used to predict the dynamics in another space (Zhao et al., 2014).

However, our current knowledge about the relationships between cyber and physical spaces and their implications for human dynamics is limited (Sui and Shaw, 2018, Shaw and Sui, 2020, Porter, 2004). With much of the research being devoted to describing human mobility patterns in physical space (e.g., Gonzalez et al., 2008, Brockmann et al., 2006), surfing and communication behaviors in cyber space (e.g., Huberman et al., 1998, Chmiel et al., 2009, Sousa et al., 2010), using big data such as social media or smart card data to evaluate places in the physical world (e.g., Sulis et al., 2018, Long and Huang, 2019, Hamstead et al., 2018). There are only a few studies that explore and attempt to explain the mechanisms behind the cyber-physical system. For example, Croitoru et al. (2015) found that the interplay between cyber and physical spaces was transmitted through geosocial networks, networks tied to both cyber and physical spaces, and without such linkages, communities would not form and information would not propagate. Adding to this area of research, a new space-place (Splatial) framework has been recently proposed by Shaw and Sui (2020) which aims at developing a human-centered synergistic view of the space and place in the digital age. The Splatial framework allows for examining human dynamics across the cyber-physical system in a comprehensive way.

The Splatial framework includes not only the cyber and physical spaces but a hybrid of spaces including relational, relative, and mental spaces. Relational space highlights the relations between people and objects and provides a lens to view space as a network that promotes the flow of information, objects and materials (Castells, 2010). Relative space describes the corresponding locations of a moving or static object in the physical space (e.g., an autonomous vehicle detects the locations of nearby objects in its surroundings). Similar to Tuan (1977)’s definition of place, mental space is an area in space to which people have given meanings. Mental space works with the cognitive and mental aspects of space and highlights the observers’ feelings, emotions and perceptions of space. Among these spaces, the relational space inferring networks between objects based on their relations, interactions and links has the potential to bridge the cyber and physical spaces (Croitoru et al., 2015). Therefore, this study incorporated the relational space into the cyber-physical system (1) to analyze the information propagation mechanism between three spaces, cyber, relational and physical spaces; (2) to enrich the Splatial framework by providing concrete definitions of the three spaces based on the real-world event of the Covid-19 vaccine debates. This was at the expense of excluding mental and relative spaces whose focus is on the meaning of space or describing the physical world which we will revisit in our future work (see Section 6). Fig. 1 shows the schematic representation of the three spaces: cyber, relational and physical spaces by using NYS as a case study.

Fig. 1 — Schematic representation of the three spaces: cyber, relational and physical spaces.

It has long been noted that traditional GIS methods based solely on notions of physical distance and proximity are inadequate for measuring human activities in cyber space (Kwan, 2000). With advances in ICTs and the proliferation of social media, a huge amount of data about human activities in a hybrid of spaces is now emerging (Sui and Shaw, 2018, Lazer et al., 2009). For example, researchers have explored e-mails (Eckmann, Moses, & Sergi, 2004), mobile phones (Grauwin et al., 2017, Vanhoof et al., 2017), smart cards (Sulis et al., 2018) and social media (Croitoru et al., 2015, Cvetojevic and Hochmair, 2021) data to study the digital traces of diverse human activities such as communication, mobility and information propagation. Social media data in particular has demonstrated several characteristics. First, social media enables efficient information transmission at a global scale (Stefanidis, Crooks, & Radzikowski, 2013). By removing physical (or geographical) boundaries, social media allows individuals and organizations to disseminate and obtain information from anywhere and at any time (Kaplan & Haenlein, 2010). For example, during the Covid-19 pandemic, social media has been at the forefront of disseminating new scientific findings, publishing new protocols, and connecting the general public with their friends and family to reduce isolation and anxiety (González-Padilla & Tortolero-Blanco, 2020). In addition, social media data provides a rich geographic context that can be used to characterize human dynamics in physical space (Stock, 2018). For example, Long and Huang (2019) used social media data as a proxy of economic activity and found the positive impacts of design variables on urban vitality. Yin, Soliman, Yin, and Wang (2017) found that the mobility networks inferred from the geo-located tweets could yield geographically cohesive urban boundaries in the UK. In other studies, it has been shown how social media check-in data allows researchers to measure the physical co-presence in urban spaces and help to predict the socio-economic performance (Shen and Karimi, 2016, Shen et al., 2019). Furthermore, people’s activities on social media are part of a networking process whereby individuals share, reply and react selectively with other users based on their social ties or common interests (Sousa et al., 2010, Stefanidis et al., 2013). Users’ interaction on social media makes it possible to infer a relational space between users and analyze how information propagates. For example researchers have generated social networks based on users’ retweet activities (e.g., Yuan et al., 2019) and or the use of hashtags (e.g., Gunaratne et al., 2019). These characteristics of social media make it an ideal data source for investigating the interactions between the cyber, relational and physical spaces because not only does such data potentially bridge cyber and physical spaces, but also provides information about users’ interactions when it comes to generating relational spaces (Croitoru et al., 2015).

Turning to the current Covid-19 pandemic and the role of social media, it has had both a positive and negative impact on our society (González-Padilla & Tortolero-Blanco, 2020). The implementations of preventative measures such as social distancing and isolation intensified the use of social media as individuals try to stay connected while being physically apart. An example is how the number of tweets containing the keyword “coronavirus” spiked on the day when the U.S. reported the first Covid-19 related death (Chen et al., 2020). Moreover, Ahmed et al. (2018) demonstrated that the use of social media, specifically Twitter and Facebook as sources of health information, may promote vaccine uptake. Despite the benefits of social media including its ability to inexpensively reach a large population during a crisis (Saud et al., 2020), there are considerable public health concerns raised by the spread of harmful misinformation. This is especially the case for anti-vaccine messaging on social media platforms (Puri et al., 2020). In contrast to traditional media, social media allows anyone to generate rumors since the content posted need not undergo editorial scrutiny or scientific proof and thus can spread very rapidly (Massey et al., 2018). For example, amongst the top-ranked YouTube videos related to Covid-19, 27.5% of them contained non-factual information and currently have over 60 million views (Li, Bailey, Huynh, & Chan, 2020). Meanwhile, the dissemination of anti-vaccination messaging on social media may generate more user engagements and has the potential to fuel vaccine hesitancy which could result in greater outbreaks in the physical world (Puri et al., 2020). For instance, Basch and MacLean, 2019 analyzed 150 HPV-related posts on Instagram and found that anti-vaccine posts generated significantly more than average likes than other posts. Additionally, through analyzing the temporal patterns of pro and anti-vaccine discourse on Twitter from 2010 to 2019, Gunaratne et al. (2019) found a significant surge in anti-vaccine discussion between 2015–2016 that coincided with the 2014–2015 measles outbreak in the real world. Furthermore, scholars have also expressed concerns about the “bubble filters” algorithms (Pariser, 2011) used by many social media platforms. Such algorithms work by pushing content to viewers based on their past click behavior and search history. Thus, these algorithms may result in the ideological isolation of anti-vaccine users and limit public health penetration to promote vaccination within social media (Puri et al., 2020, González-Padilla and Tortolero-Blanco, 2020). For example, through analyzing tweets related to the MMR vaccine, Yuan et al. (2019) found that anti-vaccine users mainly resided in their enclosed communities and were highly segregated from pro-vaccine users. However, such studies were largely based on the pre-Covid-19 events. We would argue that as Covid-19 has now become a focus of intense social media discourse, there is a need to update our understanding of online vaccine discourse by analyzing the pro and anti-vaccination debates of Covid-19 vaccine.

As one of the most important measures for preventing communicable infectious diseases (Andre et al., 2008), vaccination plays an important role to protect people against diseases and has been shown to save lives (Polack et al., 2020). As the vaccine discourse continues to evolve on social media with trends often tied to real-world events (Gunaratne et al., 2019), analyzing the pro and anti-vaccine discourse and the propagation of vaccine-related information on social media provides a lens to study the interactions between the cyber, relational and physical spaces. The next section explains the methodology in detail.

3. Study area and data collection

3.1. Study area

As discussed above, this study consists of three spaces: physical, relational, and cyber spaces. To ground this study in a physical space we chose the State of New York (NYS) for our case study. Although the level of vaccine sentiments (e.g., pro-vaccine rates and vaccine confidence) may vary in different states or even across different countries (e.g., Lyu et al., 2022, Larson et al., 2016), since the purpose of our study was to investigate the interactions between hybrid spaces (i.e., physical, relational and cyber) rather than the spatial disparities of vaccine sentiments, NYS demonstrates a relevant case study to fulfill our research purpose. We write this for several reasons, first, NYS is among the top-tier states that actively engage in online vaccine discussions (Chen & Crooks, 2022). In addition to this, NYS was the original epicenter of the pandemic in the U.S. (McMinn & Carlsen, 2022). For example, NYS as of December 15, 2020, lead the U.S. in Covid-19 deaths, at 35,427 (Dong, Du, & Gardner, 2020). Furthermore, NYS has played a proactive role in promoting the Covid-19 vaccine through both traditional media and digital platforms such as the State’s #GetTheVaxFacts campaigns (New York State, 2021b). To encourage vaccination in hard-hit communities, NYS governor allocated $15 million to promote vaccines (New York State, 2021a). Fig. 2 shows the map of the study area (NYS).

Fig. 2 — Map of study area (NYS) with the primary road system. Red dots denote collected vaccine-related tweets in NYS.

NYS has clear physical boundaries and we use these to screen out tweets sent from NYS. We then trace back to the users who posted these tweets and label them as NYS users. The regional boundaries in NYS and the locations of NYS users create the physical space of our study. The relational space describes the social networks of NYS users inferred from their tweet-reply activities. The relational space not only includes NYS users but also a small portion of cyber users who directly interact with NYS users. The relational space captures the tweet-reply activities of NYS users. Cyber space provides a larger conversational context to relational space. Cyber space includes NYS users and also cyber users in Covid-19 vaccine debates. Cyber users are those people who are either outside of NYS or without locational information. Unlike the relational space that only captures the tweet-reply activities of NYS users, the cyber space also includes the tweet-reply activities of cyber users. Fig. 1 illustrates the three spaces and their connections.

3.2. Data collection

The physical space consists of the boundaries of NYS and its nine regions collected from NYS GIS Clearinghouse (2020). Turning to the tweets, tweets were collected between December 1, 2020 until August 31, 2021,via repeated calls to the Twitter application programming interface version 2 (API v2). While discussions surrounding Covid-19 date back to the initial period of the pandemics, such discourse has continued to evolve (Tyson, Johnson, & Funk, 2020), even at the time of writing new discussions are emerging. Thus our study, like any study, may not capture the entire picture of vaccine debates. However, our selected time period (i.e., December 1, 2020 until August 31, 2021) does capture the upsurge in vaccine discussions on Twitter as seen in other longitudinal analysis studies (e.g., Chen & Crooks, 2022). We identified tweets that are related to Covid-19 vaccine by using a combination of keywords (i.e., vaccine, vax, Moderna, Pfizer, Johnson & Johnson, AstraZeneca). Our rationale for choosing these terms was that previous studies have used the term of vaccine and its variations, such as “vaccine, vax, vaccination” (e.g., Gunaratne et al., 2019, Yuan et al., 2019). We also included other keywords such as “Moderna, Pfizer, Johnson & Johnson” because these were the Covid-19 vaccine that were approved by the FDA for emergency use in the US at the time of this study (FDA, 2020a, FDA, 2020b, FDA, 2021), and “AstraZeneca” in Canada and the U.K. (GOV.UK, 2022).

The NYS tweet dataset contained 39,204 tweets created by 11,443 distinct users. Each tweet has four types of information associated with it: identifier, content, location, and reference. Fig. 2 maps the locations of these tweets. The potential bias of this dataset is further discussed in Section 6. Table 1 shows the returned variables associated with of each tweet. The variable of “conversation_id” was used as a key to snowball sample the entire conversation about Covid-19 vaccine. The variable of “in_reply_to_user_id” allowed to construct tweet-reply networks between users. For example, if user A (“author_id”=“A”) has replied to user B (“in_reply_to_user_id”=“B”)’s tweets twice, then an edge of weight two is created from A to B. Different from other studies that built retweet networks (Yuan et al., 2019, Croitoru et al., 2015, Bello-Orgaz et al., 2017) or hashtag networks (Gunaratne et al., 2019), this study built tweet-reply networks because tweet-reply activities are more driven by social motivation (Sousa et al., 2010) and require more user engagements (i.e., users do not just press the retweet icon).

Table 1.

Variables associated with each tweet.

Categories	Variables	Descriptions
Identifier	tweet_id	Unique id of the tweet
	conversation_id	Id of the original tweet that this tweet is directly or indirectly replied to
	author_id	Unique id of the user who posted the tweet

Content	text	Posted text
	lang	Language of the tweet
	created_at	Creation time of the tweet

Location	place_id	Id of the place where the tweet is posted. The place could be a city, a neighborhood, or an amenity (e.g., restaurant or shopping center).
Location	place_geometry	Geographical information of the place in the form of coordinates or bounding box.

Reference	in_reply_to_user_id	Id of the user that this tweet replies to. N/A if none.

Open in a new tab

To create the cyber and relational spaces, we collected a second round of tweets to capture the larger ongoing conversation about the Covid-19 vaccine. If a tweet is in reply to another tweet, then the two tweets are in the same conversation. By searching tweets based on their “conversation_id” as shown in Table 1, we could snowball sample all the other tweets that were in the same vaccine conversation as the NYS tweets. We first extracted a list of unique “conversation_id” from the NYS tweet dataset. Then we called the Twitter API v2 again and used the list of “conversation_id” as a key to search Covid-19 vaccine conversations. After obtaining the tweets in all conversations, we filtered out the tweets that are irrelevant to vaccines (i.e., using the same keywords used in the search above to filter out the tweets). The remaining tweets then constituted the final dataset for constructing cyber space. This cyber space dataset involved not only NYS users but also cyber users who do not share their locational information or live outside of NYS. The cyber space dataset contains 448,958 tweets created by 239,716 distinct users. We created the relational space based on the tweets that are directly related to NYS users where the NYS users are either the author or the receiver of a tweet. The relational space dataset has 55,484 tweets posted by 21,876 distinct users. The variables associated with each tweet are shown in Table 1.

4. Methodology

To understand the interactions between cyber, relational, and physical spaces, this study used a combination of techniques which are outlined in Fig. 3 . After data collection, this study first conducted sentiment analysis of tweets (Section 4.1) through data processing, manual annotation, training classification model, and tweet labeling. By doing so, we classified tweets and users into three categories: pro, anti and neutral. Then, we constructed tweet-reply networks and performed community detection algorithms to reveal large communities in different spaces and examine users’ profiles (Section 4.2). Last, we built hybrid space networks to map the interactions between the three spaces (Section 4.3). For interested readers, we provide the code of our methodology and results at:https://osf.io/mxq3k/?view_only=da5e78a3e0a54789b46067fac54d548e. We do this to allow for replication and for readers to extend our work as they see fit.

Fig. 3 — Research workflow to investigate the propagation of different opinions between three spaces: cyber, relational and physical spaces.

4.1. Sentiment analysis

One of the key aspects of implementing the Splatial framework is to understand the transformation and linkage between cyber, relational and physical spaces (Shaw & Sui, 2020). In our study, this relates to the spread of pro and anti-vaccine opinions between the three spaces. Therefore, the first step is to analyze the sentiments of both the vaccine tweets and Twitter users. In the sentiment analysis of tweets, opinions are often grouped into three categories, positive, negative and neutral (Rathi et al., 2018, Yuan et al., 2019, Pak and Paroubek, 2010). In our case, these three categories correspond to pro, anti and neutral. Pro-vaccine relates to tweets that demonstrate support to the Covid-19 vaccine. While anti-vaccine tweets can be considered those that tend to delay in acceptance or refusal of vaccines. Lastly, neutral tweets are those that report certain facts about the Covid-19 vaccine without showing any inclination of their opinion. For example, such as tweets might cite an academic study, or announce a new policy, or come from news agencies. Table 2 shows a selection of tweets from these three categories.

Table 2.

Examples of pro, anti and neutral vaccine tweets.

Sentiment	Tweet
Pro-vaccine	“Vaccine appointment tomorrow because #science not stupidity”
Pro-vaccine	“Getting my vaccine”

Anti-vaccine	“Everyone who is taking the vaccine has an IQ of the average preschooler.”
Anti-vaccine	“I’m in NO rush to get the vaccine.”

Neutral	“NEW: More than 40 percent of New Yorkers have received at least one dose of the COVID-19 vaccine. Check out the full story here”
Neutral	“NYC to open COVID vaccine site in Queens via @nypmetro”

Open in a new tab

To automatically classify tweets into the three categories, we built a sentiment classifier using a Support Vector Machine (SVM). Our rationale for choosing a SVM classifier is that they have been widely used in text-based classifications (e.g., Joachims, 1999) including those looking at the sentiment of tweets (e.g., Rathi et al., 2018, Yuan et al., 2019), and it has been noted that they are appropriate when a dataset is unbalanced, in the sense that the number of entities in each category is not equal (e.g., Tang et al., 2008, Zhou et al., 2015). We also tested other classifiers including logistic regression, non-linear SVM and random forests. However, the linear SVM classifier outperformed the other classifiers in producing higher prediction accuracy and shorter execution time which we will come back to later. To train the linear SVM classifier, we first hand-labeled 2,065 unique tweets (Section 4.1.1). Then, we pre-processed all tweets to remove unnecessary information (Section 4.1.2), next to build an SVM classifier (Section 4.1.3), and to label all tweets and Twitter users into three categories, pro-vaccine, anti-vaccine or neutral (Section 4.1.4).

4.1.1. Manual annotation

The labeling process was done by using a labeling questionnaire approach. Specifically, we first randomly selected 5,000 tweets from the whole data corpus to create a sample pool. We then generated 100 questionnaires from the sample pool that each contained 100 tweets. As such, each tweet will appear in two questionnaires and creates some overlapping across different participants. We then randomly distributed these questionnaires to 100 participants on campus to label the tweets and received 62 responses. Simultaneously, one researcher with domain knowledge also randomly gathered 15 tweets (15%) from each questionnaire, marked them independently and used these labels as a reference to measure participants’ reliability. We used both the percentage agreement and Cohen’s kappa statistic to calculate the inter-rater reliability between participants and the researcher (Cohen, 1960). To identify reliable participants, we only retained participants who has over 60% of agreement with the researcher and the kappa’s value higher than 0.4 (i.e., moderate level of agreement proposed by Landis & Koch, 1977). Through this process, we were able to identify 25 reliable participants and 2,065 unique tweets after text processing. Among the 2,065 unique tweets, 435 tweets had been labeled twice by participants, while 1,630 tweets had only been labeled once. To increase the robustness of the annotation, the researchers of this paper then labeled these 1,630 tweets manually to ensure each tweet had two annotators. The conflicting labeling results between participants and the researchers were resolved through discussions amongst all the research team. The final hand-labeled dataset has achieved an 86% of agreement between the researchers and the participants and the kappa value of 66% which is in a similar range to other studies (e.g., Chen et al., 2021, Yuan et al., 2019) and suggests a substantial agreement between annotators (Landis & Koch, 1977). Therefore, we derived 2,065 unique hand-labeled tweets that accounted for 0.5% of the whole data corpus.

4.1.2. Data processing

Once the data has been collected, the first step before any classification task is data pre-processing (Ahmad, Aftab, & Ali, 2017). As we are dealing with text data, they need to be cleaned and lemmatized to eliminate noise (Kannan et al., 2014). To clean the tweets, we converted all of them to be lowercase, then removed emojis, mentions (i.e., @users), urls, punctuation, special characters (e.g., double space and newline), and stopwords before carry out lemmatization (Müller & Guido, 2016). However, as suggested by Rathi et al. (2018) and Yuan et al. (2019) we kept hashtags because they are important indicators of threaded discussions and can also contain useful information about the contents of the posts. For example, by observing our data, we noticed that some hashtags, for example, “#VaccineSaveLives” and “#GetVaccinated” suggest pro-vaccine opinions, and other hashtags, such as “#mybodymychoice”, “#NoVaccineForMe” indicate anti-vaccine opinions. However, some tweets may use hashtags as sarcasm to express the opposite sentiment. For example, some authors expressed an anti-vaccine stance by using a pro-vaccine hashtag (e.g, “Pretty sure I will not #GetVaccinated”), and vice versa (e.g., “#NoVaccineForMe. Anti-vaccine is dangerous, cry about it”). Therefore, we deleted the “#” symbol in the hashtags and kept the contents as part of the tweets for classification.

4.1.3. Classification model

After pre-processing, the tweets then needed to be vectorized and re-scaled in order to be learned by the machine. Previous studies, such as Yuan et al. (2019) and Rathi et al. (2018) have used Term Frequency-Inverse Document Frequency (tf-idf) for this purpose. The logic behind tf-idf is that it gives a high weight to any term that appears often in a particular text (in our case a tweet) but not in many other texts (Müller & Guido, 2016). By doing so, the terms that distinguish one text from others would receive a higher weight, but the terms that frequently appear in all texts (e.g., “vaccine”) will receive a lower weight. After rescaling the terms, tf-idf normalizes the representation of each text to have Euclidean norm of 1 so that the length of texts will not produce any bias. To do this, two parameters need to be tuned to construct a tf-idf vectorizer, N-gram and minimum document frequency (Min-df). N-gram refers to using a contiguous sequence of n words as a feature. Unigram means using a single word as a feature, bigram means using a two-word sequence as a feature and so on (Ahuja, Chug, Kohli, Gupta, & Ahuja, 2019). When building the vocabulary, Min-df is used to remove features that have a document frequency lower than a given threshold (Pedregosa et al., 2011).

Once we have vectorized and re-scaled the data, to train the tf-idf vectorizer and the SVM classifier, we split the labeled data into training (80%) and test (20%) datasets and perform the ten-fold cross-validation. The proportion of three classes (i.e., pro, anti and neutral) in the training and test datasets remained the same. Standard performance measures (i.e., precision, recall, accuracy, and F1-score) were used to compare the models of different combinations of parameters (Zhou et al., 2015). Table 3 shows the parameters that generated the best performance. The model constructed using these parameters achieved a cross-validation accuracy of 76.3% in the training dataset and the accuracy of 75.5% in the test dataset (see Table 4 ) which is in a similar range to other studies of sentiment analysis of vaccine-related tweets (e.g., Yuan et al., 2019, Du et al., 2017, Piedrahita-Valdés et al., 2021). Table 5 summarizes the precision, recall and F1-score for each class on the test dataset.

Table 3.

Parameter values which generate the best performance.

	Parameters	Values
Tf-idf vectorizer	N-gram	Unigram and bigram
Tf-idf vectorizer	Min-df	2

SVM classifier	C	0.001

Open in a new tab

Table 4.

Performance of the trained SVM classifier on the test dataset.

Metric	Scores
Accuracy	0.755
Precision	0.769
Recall	0.688
F1-score	0.720

Open in a new tab

Table 5.

Summary of the precision, recall and F1-scores for each class on the test dataset.

Class	Precision	Recall	F1-score
Anti-vaccine	0.663	0.532	0.590
Neutral	0.878	0.655	0.750
Pro-vaccine	0.767	0.879	0.819
Macro average	0.769	0.688	0.720
Weighted average	0.754	0.755	0.748

Open in a new tab

4.1.4. Tweets and user labeling

Once the linear SVM classifier was trained and tuned we then automatically classified all tweets in the data corpus into one of three classes: pro, anti and neutral. Table 6 shows the tweet labeling results, i.e. the number of tweets falling into the three classes in the physical, relational and cyber space datasets. Next, the labeled tweets with the same “author_id” were aggregated using the simple majority voting rule to decide users’ opinions. A simple majority voting rule is a decision rule that selects one from many alternatives based on the predicted classes that have the most votes (Lam & Suen, 1997) and it has been applied in other studies (e.g., Yuan et al., 2019, Gunaratne et al., 2019) to aggregate tweets or hashtags to derive users’ opinions. Specifically, if a user has the majority of their tweets labeled as one class, the user is also classified into that class. However, if the user has an equal amount of pro and anti-vaccine tweets, then this user is labeled as neutral.

Table 6.

Tweets labeling results. Number of tweets with different opinions in cyber, relational and physical spaces.

Class	Hand Labeled	Machine Labeled
		Physical Space	Relational Space	Cyber Space
Pro-vaccine	59.9%	80.0%	77.3%	73.4%
Anti-vaccine	26.8%	16.2%	20.3%	25.3%
Neutral	13.3%	3.8%	2.4%	1.3%
Total	2,065	39,204	55,484	448,958

Open in a new tab

4.2. Social network analysis

4.2.1. Reply network construction

After sentiment analysis of tweets and users, we then constructed two tweet-reply networks between Twitter users to represent relational and cyber spaces. In the tweet-reply network, nodes represent Twitter users, and edges indicate reply activities between Twitter users. For instance, if user A has replied to user B two times on Twitter, then an edge with a value of two is drawn from user A to user B. User A has an out-degree of two and user B has an in-degree of two. To sift out influential users, to make the networks denser and to concentrate on the main conversation camps, past researchers have removed inactive users from their analysis, for example, Yuan et al. (2019) removed isolate users while Bello-Orgaz et al. (2017) removed users with a low degree. In our study we used a threshold of total degree less than two to eliminate inactive users. Table 7 shows the attributes of the tweet-reply networks in relational and cyber spaces after removing inactive users. The size of the relational space network is smaller than the cyber space network in terms of nodes (active users) and edges (number of replies).

Table 7.

Attributes of the tweet-reply networks in the relational and cyber spaces after removing the inactive users whose total degree is less than two.

Networks	Nodes		Edges
	NYS users	Cyber users	Type	Count	Total Weights
Relational Space	3,245	4,333	Directed	12,692	21,459
Cyber Space	3,542	90,658	Directed	193,310	251,376

Open in a new tab

4.2.2. Community detection in reply networks

After constructing the tweet-reply networks, we then turn to community detection to group closely connected Twitter users based on their connections. Community detection is a widely used method to reveal the underlying structures in social networks (Bello-Orgaz et al., 2017). Communities in a social network indicate groups of users whose connections with each other are stronger than with users outside of their communities (Papadopoulos, Kompatsiaris, Vakali, & Spyridonos, 2012). In our case, communities represent groups of Twitter users who frequently reply to each other’s tweets to discuss the Covid-19 vaccine. Our interests in community detection stem from the community’s central role in information propagation and diffusion in social networks (Murata, 2010). We used the widely applied Louvain algorithm to detect communities and partition Twitter users because this algorithm outperforms other methods in producing higher modularity scores and shorter execution times (Blondel et al., 2008). The Louvain algorithm includes two iterative phases. The first phase is to calculate the modularity gain by adding and removing nodes to a new community. Once the modularity score cannot be increased by removing individual nodes, next, the algorithm aggregates the current communities into nodes and then repeat the first phase. We used the Louvain algorithm to detect communities only based on the edges (i.e., the number of tweet replies) between users and eliminate other information such as users’ locations and opinions. Therefore, the detected communities could be purely considered as well-connected online discussion communities or groups where users in the same community frequently reply to each other’s tweets to discuss the Covid-19 vaccine. By doing so, we could investigate how users with different opinions and from different locations (i.e., cyber, relational and physical spaces) interact with each other, and how the physical boundaries (i.e., regional boundaries in the NYS) impact the formation of online communities.

The software Gephi was used to perform the Louvain algorithm for community detection. When implementing the algorithm, the parameter of resolution controls the size of the smallest community (Bastian, Heymann, & Jacomy, 2009). Previous work has shown how setting the resolution to three resulted in robust findings (e.g., Yuan et al., 2019), and in our work, this resulted in 4,081 communities in the cyber space network and 1,328 communities in the relational space network. Fig. 4 shows the size (i.e., number of users) of all detected communities in cyber (Fig. 4A) and relational spaces (Fig. 4B) with large communities highlighted (i.e., containing over 1 % of total nodes). As is the norm, researchers often eliminate the small communities and focus mainly on the large communities (e.g., Croitoru et al., 2015, Yuan et al., 2019). Following this norm, we concentrate on the top large communities (i.e., containing over 1 % of total nodes) in both the cyber communities I-VI and communities (a)-(h) in the relational space to further investigate the users’ profiles of these communities. In the cyber space, the six top large communities I-VI all together have more than 85% of total users, while in the relational space, the eight top large communities (a)-(h) contain more than 66% of total users.

Fig. 4 — The size of all communities detected in cyber space (A) and relational space (B). Large communities are those that contain more than 1% of total nodes and are highlighted in yellow.

4.3. Hybrid space networks

In addition to the community detection in the tweet-reply networks, we also created hybrid space networks (Croitoru et al., 2015) between Twitter users to investigate the interactions between cyber, relational and physical spaces. By aggregating Twitter users based on their location, we created two hybrid space networks to investigate the information propagation in the cyber–relational and the physical–relational systems. The cyber-relational network is bipartite and depicts the interactions between cyber and relational space users, while the physical–relational network shows the communications between relational space users and users living in the nine regions in NYS. After constructing the hybrid networks, we investigated how the online vaccination debates, in the form of pro and anti-vaccine tweets, are propagated between physical, relational and cyber spaces.

5. Results

As mentioned in Section 4.2.2, the detected communities in our analysis represent groups of people who frequently reply to each other on Twitter to discuss the Covid-19 vaccine. This section analyzes the attributes of the communities in the cyber (Section 5.1), relational (Section 5.2) and physical spaces (Section 5.3). Next, Sections 5.4, 5.5 show how different opinions (i.e., pro and anti-vaccine opinions) spread from physical to relational, and from relational to cyber spaces.

5.1. Cyber space communities

Fig. 5 visualizes the tweet-reply network in the cyber space. In this figure, a point represents a user. The users of the six top large communities I-VI are highlighted using colors. As mentioned in Section 4.2.2, large communities in cyber space are those that contain over 1% of total users. The community I colored in red is the largest community that contains 33.13% of the total users. Community II colored in green is the second large community that covers 26.72% of the total users. Then, communities III, IV, V and VI have 10.15%, 8.23, 5.8% and 1.0% of the total users respectively. Communities I and II are the two main camps of vaccination debate in the cyber space.

Fig. 6 shows the users’ profiles in the six large communities. Fig. 6A illustrates the percentage of pro-vaccine, anti-vaccine and neutral users. In each community, more than a half ( $>$ 55%) of users are proponents of the Covid-19 vaccine, while there are still around 10–20% of anti-vaccine users. Community II and V have the largest proportion (17%) of anti-vaccine users. Users with neutral opinions constitute 10%–15% of total users in each community. Fig. 6B shows the distribution of users’ locations in each community. The top graph in Fig. 6B shows the percentage of cyber space and relational space users in each community. This graph suggests that the cyber space users dominate the online vaccination debates because they have reached over 64% of total users in all six large communities. Relational space users constitute 11–36% of total users in the six large communities. Among them, community VI has the largest proportion (35.2%) of relational space users. The bottom graph in Fig. 6B presents the proportion of physical space users (NYS users) in each community and depicts their locations at the regional level. Despite its small proportion (2.2%–6.5%), NYS users could be found in all six communities. Among them, communities III and VI have the largest proportion (6.5%) of NYS users. Meanwhile, the NYS users in the six large communities tend to anchor in a particular region. For example, while communities I, II, III, IV have the majority of NYS users coming from New York City, communities V, VI have the most NYS users coming from Western New York and Western Finger Lakes respectively.

5.2. Relational space communities

After examining the large communities in cyber space, this section analyzes the eight large communities in relational space. Relational space represents the social networks of NYS users that are inferred from their tweet-reply activities (Section 3.1). As shown in Fig. 1, the social network in relational space includes NYS users and also the cyber users who are directly connected with NYS users to discuss the Covid-19 vaccine. In Fig. 7 , nodes represent users and edges represent their interactions (i.e. tweet-reply activities). Fig. 7 illustrates the community structure in the relational space by highlighting its eight large communities (a)-(h). Large communities are communities with over 1% of total users (Section 4.2.2). The eight large communities all together cover more than 66% of users in the relational space. The size of communities decreases from (a)-(h). The largest community, community (a), covers more than 24% of total users in relational space. The community (h), which is the smallest, contains around 1.3% of total users in relational space.

Fig. 7 — Network visualization of the eight top large communities in relational space. (A) Visualization of communities using ForceAtlas layout. (B) Project communities into physical space. Nodes without location information are placed outside of NYS.

Fig. 8A shows the distribution of users’ opinions in the eight large communities in relational space. Each community has more than 65% of users supporting the Covid-19 vaccine. Among these communities, the community (g) has the largest proportion (82.2%) of Covid-19 vaccine proponents, and the community (e) has the smallest proportion (65.9%) of vaccine proponents. Compared to the pro-vaccine users, the anti-vaccine users in each community constitute a smaller proportion ranging from 6.7% to 14.8%. Community (c) has the largest proportion (14.8%) of anti-vaccine users and community (g) has the smallest proportion (6.7%). The users with neutral opinions constitute the smallest group (4.4%–8.9%) in each community compared to the users with pro or anti-vaccine opinions. The grey bars represent users with no opinion because they only received tweet replies from other users but never posted tweets about Covid-19 vaccine.

Fig. 8B analyzes the spatial traces of the eight large communities (a)-(h) in relational space by showing the distribution of users’ locations in these communities. The length of color bars represents the percentage of users posting tweets from a particular region of NYS. Cyber users are labeled as “no location” and colored in white. Fig. 8B indicates that all eight communities have a group of users whose locations are anchored in a particular region in NYS. For example, in communities (a), (b), (d), (e), (g), (h), most NYS users are posting from New York City (i.e., yellow bar). Among them, the community (g) has the largest proportion (47.8%) of New York City users. In addition to New York City, Fig. 8B also shows the existence of another two local clusters in Western New York (i.e., grey bar) and Western Finger Lakes (i.e., pink bar). Community (c) has a majority of NYS users posting from Western New York, and community (f) has a majority of NYS users tweeting from Western Finger Lakes. Fig. 9 maps users’ locations of the eight large communities (a)–(h) into physical space. The red dots in Fig. 9 represent NYS users in the eight communities. Fig. 9 shows the similar results as Fig. 8B that while the communities (a), (b), (d), (e), (g), (h) have most NYS tweeting from New York City, communities (c), (f) shows a cluster of local users whose locations were anchored in Western New York and Western Finger Lakes. Fig. 8B and Fig. 9 together suggest the existence of distance-based and boundary-confined local clusters in the large communities in relational space.

Fig. 9 — Mapping users’ locations of the eight large communities *(a)-(h)* from relational space into physical space. Red dots represent NYS social media users.

5.3. Physical space communities

After examining the communities in cyber and relational spaces (Section 5.1, 5.2), this section analyzes the communities in physical space. While the communities in cyber and relational spaces are detected based on the strengths of users’ interactions, the communities in the physical spaces are defined based on the regional boundaries. Physical space has nine communities that correspond to the nine regions in NYS. Fig. 10 shows the distribution of users’ opinions towards the Covid-19 vaccine in the nine regions. All regions, including both rural (e.g., Western Adirondacks and Eastern Adirondacks) and urban (e.g., New York City, Western New York) in NYS (Schultz, 2019), demonstrate a similar pattern in vaccine stance. Specifically, all nine regions have a majority ( $>$ 80%) of users supporting the Covid-19 vaccine, and this size of pro-vaccine users far outweighs that of anti-vaccine or neutral users. Anti-vaccine users constitute 6–12% of users in these nine regions. Among these regions, Western New York has the largest proportion (11.6%) of anti-vaccine users. Neutral users constitute the smallest proportion ranging from 5.3% to 8.5%.

Fig. 10 — Distribution of users’ opinions towards Covid-19 vaccine in communities in physical space.

Communities in the three spaces (i.e., cyber, relational and physical) demonstrate similarities in terms of users’ opinions as shown in Fig. 6, Fig. 8, Fig. 10. First, pro-vaccine users dominate the vaccine debates in the large communities in all three spaces. Large communities (i.e., communities with over 1% of total nodes) in cyber and relational spaces and communities in physical space have more than half of users supporting the Covid-19 vaccine. Next, anti-vaccine users are fragmented in the communities in three spaces. The proportions of anti-vaccine users range from 10% to 18% in cyber space, and range from 7% to 15% in relational space and range from 6% to 12% in physical space. We did not observe highly clustered and segregated anti-vaccine users as found in other vaccination studies (e.g., Yuan et al., 2019, Schmidt et al., 2018). However, the communities in the three spaces also demonstrate differences. Communities in the physical space have larger proportions of vaccine proponents and smaller proportions of vaccine opponents than those in the relational and cyber spaces. The three spaces’ different rates of pro or anti-vaccine stances are consistent with the tweet-labeling results as shown in Table 6.

5.4. Interactions between physical and relational spaces

After examining the large communities in three spaces, we then investigate how different opinions (i.e., pro and anti-vaccine opinions) spread between physical, relational and cyber spaces. Fig. 11 A–C show how all tweets, pro-vaccine tweets and anti-vaccine tweets spread between physical and relational spaces. In this figure, nodes R1–R9 represent users from the nine regions of NYS. The node of relational space indicates users in the relational space who are without locational information or posting outside of NYS. The size of nodes is proportional to the number of users. Edges are undirected and their thickness is proportional to the number of replied tweets between two nodes. Self-loop edges indicate replied tweets occurred between users of the same region. The number shows the percentage of replied tweets that happened between two nodes. The map in the bottom right of the Fig. 11 indicates the boundaries of the nine regions of NYS.

By comparing nodes’ sizes, Fig. 11 indicates that the region of New York City (R2) has the largest number of users participating in the vaccination debates and this number far outweigh the other regions. The region of Western New York (R9) has the second largest number of users. Moreover, the comparison of nodes’ sizes between Fig. 11B and C suggests that more users are involved in pro-vaccine than anti-vaccine discussions.

Fig. 11 also indicates that while the users from the nine regions intensively communicate with relational space users, they also engage in local conversations with users from the same region. In Fig. 11A, the thick edges linking the nine regions (R1-R9) and the relational space indicate a strong connection between physical and relational spaces. The self-loop edges suggest that the users of the nine regions also engage in local conversations. However, users’ connections with relational space users are stronger than with users from the same region. For example, New York City users (R2)’s interactions with relational space users constitute 64% of total tweets, but their interactions with New York City users only constitute 7%. Compared to the self-loop edges, the cross-region edges (i.e., communications between two different regions) are even fewer and weaker. Meanwhile, the comparison of the edges’ thickness between Fig. 11B and C indicates that NYS (i.e., physical space) users propagate more pro-vaccine than anti-vaccine opinions to relational space.

5.5. Interactions between relational and cyber spaces

Fig. 12 shows the spread of vaccine-related tweets between relational and cyber spaces. Nodes represent users in relational or cyber spaces and their size is proportional to the number of users in each space. Edges indicate tweet-reply activities. Edges are directed and drawn from authors to receivers of replied tweets. The thickness of edges is proportional to the number of replied tweets. Fig. 12 shows that the relational space has a smaller user size compared to cyber space. In both cyber and relational spaces, there are fewer users in anti-vaccine than pro-vaccine discussions.

Fig. 12A shows that the relational space users have strong interactions with cyber space users because in total more than 45% of replied tweets happen between the two spaces. For example, 4.3% of replied tweets are sent from relational to cyber spaces and 42.4% of replied tweets are from cyber to relational spaces. The self-loop edges indicate that intense communications are happening within each space. For example, 38.8% of replied tweets are within cyber space users and 14.5% of replied tweets are within relational space users. This suggests that while relational space users frequently communicate with relational space users, they also actively interact with cyber space users through sending or receiving replies from cyber space users. Fig. 12B and C indicate that the pro and anti-vaccine discussions have a similar trend.

The comparison between Fig. 12B and C indicates that the self-loop edges of pro-vaccine tweets in cyber space is 35.9% while that of anti-vaccine tweets in cyber space is 47.3%. This suggests that the cyber space users are experiencing stronger echo chamber effects in anti-vaccine than pro-vaccine debates.

6. Discussion and conclusion

By utilizing the Splatial framework and applying it to the study of the Covid-19 pandemic we present a novel way to study vaccination debates. Through analyzing users’ opinions in cyber, relational and physical spaces, our study did not observe the phenomenon of polarization which is commonly found in other political and vaccination debates (e.g., Yuan et al., 2019, Schmidt et al., 2018). Our results suggest that pro-vaccine users dominated the conversations in large communities ( $>$ 1% of total users), while anti-vaccine users were fragmented across these communities in all of the three spaces (i.e., cyber, relational and physical spaces). A possible explanation for this was that as the Covid-19 vaccine became available in December 2020, an increasing trend in positive sentiment toward vaccines took place on social media (Hu et al., 2021). For example, there have been many efforts to debunk vaccine misinformation on social media (New York State, 2021b, CDC, 2021) and there has been a growing trend for general users to actively engage in debates with anti-vaccine users to refute anti-vaccine arguments (e.g., Jamison et al., 2020).

In addition to the non-polarized vaccination debates, our study also provided a more nuanced view of the information propagation mechanism across various spaces (i.e., cyber, relational and physical spaces). Although social media was often considered as a great “no place” that can erase the barrier of physical distance in human communication (Herrera, 2016), we found that people’s opinions towards a particular topic (e.g., Covid-19 vaccine) varied across spaces. In our case, physical space (NYS) users demonstrated a higher rate of pro-vaccine stance and a lower rate of anti-vaccine stance than relational space and cyber space users. Meanwhile, physical space users have propagated more pro-vaccine than anti-vaccine content to the relational space. One reason for this could relate to the on-ground efforts to promote the Covid-19 vaccine in physical spaces, such as allocating funds to hard-hit physical communities and employing both traditional and digital media to promote vaccines (e.g., New York State, 2021a).

Our findings also suggest the co-existence of cross-space and intra-space interactions. For example, while intense communications happened between users from different spaces (e.g., physical-relational or relational-cyber interactions), users were also engaged in local communications with users from the same region or the same space (as shown in the self-loop edges in Fig. 11, Fig. 12). Meanwhile, large communities (i.e., that with $>$ 1% of total users) in both cyber and relational spaces contained a group of physical space users whose locations were anchored in a particular region of NYS, indicating the existence of distance-based and boundary-confined clusters within social media. This suggests that social media can help people to communicate across physical boundaries with a larger relational and cyber audience or within physical boundaries with local friends which has been argued by others (e.g. Bingham-Hall and Law, 2015, Arthur and Williams, 2019, Wellman, 2002).

The co-existence of cross-space and intra-space communications highlights the need to incorporate people’s relational networks and their cyber presence in the physical place-making since one objective of public place-making is to stimulate greater interactions among people and foster vitalized communities (Abdel-Aziz, Abdel-Salam, & El-Sayad, 2016). Meanwhile, the intense information flows that happen between cyber, relational and physical spaces indicate the inseparable nature of online-and-offline activities and strengthens the need to develop a hybrid-space perspective (i.e., the Splatial framework) for studying human dynamics and interactions (Shaw & Sui, 2020). As an early attempt to apply the Splatial framework for studying a real-world event (i.e., Covid-19 vaccine debate), our analysis approach can be applied in various domains for analyzing the interactions across a hybrid of spaces. Moreover, our definitions of the cyber, relational and physical spaces added realistic meanings to the three spaces and our snowball sampling methods provides a new way to construct three spaces using social media data. Next, our hybrid space networks help to visualize social interactions across different spaces and capture the spatial traces of online communities.

With respect to future works, while our focus here was on a proof of concept utilizing NYS as a case study, based on the findings presented here, it would be interesting to scale up to the whole of the U.S. or globally. However, this would require more data and computational resources than we currently have. But we believe our findings and the methodology presented here are generalizable to the other study areas. To this end, we have provided the code used in our methodology and results at OSF. However, as with all works, there are limitations related to our work. For example, one of our limitations is that our analysis only includes cyber, relational and physical spaces while excluding mental and relative spaces. As noted above (Section 2) this was done because relational space inferring networks between objects has more potential to bridge the cyber and physical spaces but a logical next step would be to extend this study to include mental and relative spaces thus allowing one to further clarify the relationships between hybrid spaces and give rise to a human-centered Splatial framework (Shaw and Sui, 2020, Croitoru et al., 2015). Furthermore, as with all studies that involve spatial boundaries, our study also suffers from the modifiable areal unit problem (Openshaw, 1981). We used regional boundaries to delineate physical communities because the state’s (NYS) Covid-19 vaccination administration plan was tailored and established at the regional level (New York State, 2020). However, researchers can further develop our study by testing how different spatial units (e.g., counties) can affect the information propagation mechanism. Another limitation is related to the population biases of social media data. Social media platforms such as Twitter may over-represent urban and suburban populations (Vogels, 2021, Olteanu et al., 2019) while losing the relevant discussions among rural populations. To overcome this limitation, future studies can combine social media data with survey data to better capture people’s opinions towards vaccines in physical space. Even with these areas of further work and limitations, this paper lays the foundation for elucidating the complex relationships between physically and digitally connected spaces and understanding the online-to-offline interactions. Furthermore, we believe our methods could also be used to grasp the public opinions about issues other than vaccine such as for using social media to generate social interactions and foster communities across different spaces and promote place-making in the digital age.

CRediT authorship contribution statement

Fuzhen Yin: Conceptualization, Methodology, Data-curation, Software, Formal-analysis, Investigation, Writing-original-draft, Writing-review-editing. Andrew Crooks: Conceptualization, Methodology, Investigation, Writing-original-draft, Writing-review-editing. Li Yin: Conceptualization, Investigation, Writing-review-editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

^{Appendix A}

Supplementary data to this article can be found online at https://doi.org/10.1016/j.compenvurbsys.2022.101887.

Appendix A. Supplementary data

Supplementary material

mmc1.pdf^{(54.2KB, pdf)}

Data availability

We have shared the link to our code/data at the Attached File step.

References

Abdel-Aziz A.A., Abdel-Salam H., El-Sayad Z. The role of ICTs in creating the new social public place of the digital era. Alexandria Engineering Journal. 2016;55(1):487–493. [Google Scholar]
Ahmad M., Aftab S., Ali I. Sentiment analysis of tweets using SVM. International Journal of Computer Applications. 2017;177(5):25–29. [Google Scholar]
Ahmed N., Quinn S.C., Hancock G.R., Freimuth V.S., Jamison A. Social media use and influenza vaccine uptake among White and African American adults. Vaccine. 2018;36(49):7556–7561. doi: 10.1016/j.vaccine.2018.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ahuja R., Chug A., Kohli S., Gupta S., Ahuja P. The impact of features extraction on the sentiment analysis. Procedia Computer Science. 2019;152:341–348. [Google Scholar]
AlSayyad N., Guvenc M. Virtual uprisings: On the interaction of new social media, traditional media coverage and urban space during the ‘arab spring’. Urban Studies. 2015;52(11):2018–2034. [Google Scholar]
Andre F.E., Booy R., Bock H.L., Clemens J., Datta S.K., John T.J., et al. Vaccination greatly reduces disease, disability, death and inequity worldwide. Bulletin of the World Health Organization. 2008;86:140–146. doi: 10.2471/BLT.07.040089. [DOI] [PMC free article] [PubMed] [Google Scholar]
Arthur R., Williams H.T. The human geography of Twitter: Quantifying regional identity and inter-region communication in England and Wales. PloS One. 2019;14(4) doi: 10.1371/journal.pone.0214466. [DOI] [PMC free article] [PubMed] [Google Scholar]
Basch C.H., MacLean S.A. A content analysis of HPV related posts on Instagram. Human Vaccines & Immunotherapeutics. 2019;15(7–8):1476–1478. doi: 10.1080/21645515.2018.1560774. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bastian M., Heymann S., Jacomy M. Gephi: An open source software for exploring and manipulating networks. Proceedings of the International AAAI Conference on Web and Social Media. 2009;3:361–362. [Google Scholar]
Batty M. Virtual geography. Futures. 1997;29(4–5):337–352. [Google Scholar]
Bello-Orgaz G., Hernandez-Castro J., Camacho D. Detecting discussion communities on vaccination in Twitter. Future Generation Computer Systems. 2017;66:125–136. [Google Scholar]
Biernacki P., Waldorf D. Snowball sampling: Problems and techniques of chain referral sampling. Sociological Methods & Research. 1981;10(2):141–163. [Google Scholar]
Bingham-Hall J., Law S. Connected or informed?: Local Twitter networking in a London neighbourhood. Big Data & Society. 2015;2(2) [Google Scholar]
Blondel V.D., Guillaume J.-L., Lambiotte R., Lefebvre E. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment. 2008;2008(10):P10008. [Google Scholar]
Brockmann D., Hufnagel L., Geisel T. The scaling laws of human travel. Nature. 2006;439(7075):462–465. doi: 10.1038/nature04292. [DOI] [PubMed] [Google Scholar]
Bryant R. What kind of space is cyberspace. Minerva-An Internet Journal of Philosophy. 2001;5(2001) 138–1. [Google Scholar]
Castells M. Globalisation, networking, urbanisation: Reflections on the spatial dynamics of the information age. Urban Studies. 2010;47(13):2737–2745. [Google Scholar]
CDC . Covid-19 Vaccine Community Features; 2021. Connecticut uses social media to engage long-term care residents. Retrieved 2022-02-20, from https://www.cdc.gov/vaccines/covid-19/health-departments/features/connecticut-ltcop.html. [Google Scholar]
Chen C.-F., Shi W., Yang J., Fu H.-H. Social bots’ role in climate change discussion on twitter: Measuring standpoints, topics, and interaction strategies. Advances in Climate Change Research. 2021;12(6):913–923. [Google Scholar]
Chen E., Lerman K., Ferrara E. Tracking social media discourse about the Covid-19 pandemic: Development of a public coronavirus Twitter data set. JMIR Public Health and Surveillance. 2020;6(2) doi: 10.2196/19273. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chen Q., Crooks A. Analyzing the vaccination debate in social media data pre-and post-covid-19 pandemic. International Journal of Applied Earth Observation and Geoinformation. 2022;110 doi: 10.1016/j.jag.2022.102783. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chmiel A., Kowalska K., Hołyst J.A. Scaling of human behavior during portal browsing. Physical Review E. 2009;80(6) doi: 10.1103/PhysRevE.80.066122. [DOI] [PubMed] [Google Scholar]
Cohen J. A coefficient of agreement for nominal scales. Educational and Psychological Measurement. 1960;20(1):37–46. [Google Scholar]
Croitoru A., Wayant N., Crooks A., Radzikowski J., Stefanidis A. Linking cyber and physical spaces through community detection and clustering in social media feeds. Computers, Environment and Urban Systems. 2015;53:47–64. [Google Scholar]
Cvetojevic S., Hochmair H.H. Modeling interurban mentioning relationships in the U.S. Twitter network using geo-hashtags. Computers, Environment and Urban Systems. 2021;87 [Google Scholar]
Dodge M., Kitchin R. Routledge; 2003. Mapping cyberspace. [Google Scholar]
Dong E., Du H., Gardner L. An interactive web-based dashboard to track Covid-19 in real time. The Lancet Infectious Diseases. 2020;20(5):533–534. doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Du J., Xu J., Song H.-Y., Tao C. Leveraging machine learning-based approaches to assess human papillomavirus vaccination sentiment trends with Twitter data. BMC Medical Informatics and Decision Making. 2017;17(2):63–70. doi: 10.1186/s12911-017-0469-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Eckmann J.-P., Moses E., Sergi D. Entropy of dialogues creates coherent structures in e-mail traffic. Proceedings of the National Academy of Sciences. 2004;101(40):14333–14337. doi: 10.1073/pnas.0405728101. [DOI] [PMC free article] [PubMed] [Google Scholar]
FDA (2020a). Coronavirus (Covid-19) update: December 22, 2020. U.S. Food & Drug Administration Newsroom. Retrieved 2022-01-17, fromhttps://www.fda.gov/news-events/press-announcements/coronavirus-covid-19-update-december-22-2020.
FDA (2020b). FDA takes key action in fight against Covid-19 by issuing emergency use authorization for first Covid-19 vaccine. U.S. Food & Drug Administration Newsroom. Retrieved 2022-01-17, fromhttps://www.fda.gov/news-events/press-announcements/fda-takes-key-action-fight-against-covid-19-issuing-emergency-use-authorization-first-covid-19.
FDA (2021). Coronavirus (Covid-19) update: April 27, 2021. U.S. Food & Drug Administration Newsroom. Retrieved 2022-01-17, fromhttps://www.fda.gov/news-events/press-announcements/coronavirus-covid-19-update-april-27-2021.
Gonzalez M.C., Hidalgo C.A., Barabasi A.-L. Understanding individual human mobility patterns. Nature. 2008;453(7196):779–782. doi: 10.1038/nature06958. [DOI] [PubMed] [Google Scholar]
González-Padilla D.A., Tortolero-Blanco L. Social media influence in the Covid-19 pandemic. International Brazilian Journal of Urology. 2020;46:120–124. doi: 10.1590/S1677-5538.IBJU.2020.S121. [DOI] [PMC free article] [PubMed] [Google Scholar]
GOV.UK (2022). Decision: Information for U.K. recipients on Covid-19 vaccine Astrazeneca (Regulation 174). Medicines & Healthcare Products Regulatory Agency. Retrieved 2022-01-17, fromhttps://www.gov.uk/government/publications/regulatory-approval-of-covid-19-vaccine-astrazeneca/information-for-uk-recipients-on-covid-19-vaccine-astrazeneca.
Grauwin S., Szell M., Sobolevsky S., Hövel P., Simini F., Vanhoof M., Smoreda Z., Barabási A.-L., Ratti C. Identifying and modeling the structural discontinuities of human interactions. Scientific Reports. 2017;7(1):1–11. doi: 10.1038/srep46677. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gunaratne K., Coomes E.A., Haghbayan H. Temporal trends in anti-vaccine discourse on Twitter. Vaccine. 2019;37(35):4867–4871. doi: 10.1016/j.vaccine.2019.06.086. [DOI] [PubMed] [Google Scholar]
Hamstead Z.A., Fisher D., Ilieva R.T., Wood S.A., McPhearson T., Kremer P. Geolocated social media as a rapid indicator of park visitation and equitable park access. Computers, Environment and Urban Systems. 2018;72:38–50. [Google Scholar]
Herrera G.L. Power and Security in the Information Age. Routledge; 2016. Cyberspace and sovereignty: Thoughts on physical space and digital space; pp. 81–108. [Google Scholar]
Hu T., Wang S., Luo W., Zhang M., Huang X., Yan Y., et al. Revealing public opinion towards Covid-19 vaccines with Twitter data in the United States: Spatiotemporal perspective. Journal of Medical Internet Research. 2021;23(9) doi: 10.2196/30854. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huberman B.A., Pirolli P.L., Pitkow J.E., Lukose R.M. Strong regularities in world wide web surfing. Science. 1998;280(5360):95–97. doi: 10.1126/science.280.5360.95. [DOI] [PubMed] [Google Scholar]
Jamison A.M., Broniatowski D.A., Dredze M., Sangraula A., Smith M.C., Quinn S.C. Vol. 1. Harvard Kennedy School Misinformation Review; 2020. (Not just conspiracy theories: Vaccine opponents and proponents add to the Covid-19 ‘infodemic’ on Twitter). [DOI] [PMC free article] [PubMed] [Google Scholar]
Joachims T. Proceedings of the Sixteenth International Conference on Machine Learning. Morgan Kaufmann Publishers Inc.; San Francisco, CA, USA: 1999. Transductive inference for text classification using Support Vector Machines; pp. 200–209. [Google Scholar]
Kannan S., Gurusamy V., Vijayarani S., Ilamathi J., Nithya M., Kannan S., et al. Preprocessing techniques for text mining. International Journal of Computer Science & Communication Networks. 2014;5(1):7–16. [Google Scholar]
Kaplan A.M., Haenlein M. Users of the world, unite! The challenges and opportunities of social media. Business Horizons. 2010;53(1):59–68. [Google Scholar]
Kwan M.-P. In: Information, Place, and Cyberspace: Issues in Accessibility. Janelle D.G., Hodge D.C., editors. Springer, Berlin Heidelberg; Berlin, Heidelberg: 2000. Human extensibility and individual hybrid-accessibility in space-time: A multi-scale representation using GIS; pp. 241–256. [Google Scholar]
Lam L., Suen S. Application of majority voting to pattern recognition: An analysis of its behavior and performance. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans. 1997;27(5):553–568. [Google Scholar]
Landis J.R., Koch G.G. The measurement of observer agreement for categorical data. Biometrics. 1977:159–174. [PubMed] [Google Scholar]
Larson H.J., De Figueiredo A., Xiahong Z., Schulz W.S., Verger P., Johnston I.G., et al. The state of vaccine confidence 2016: global insights through a 67-country survey. EBioMedicine. 2016;12:295–301. doi: 10.1016/j.ebiom.2016.08.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lazer D., Pentland A., Adamic L., Aral S., Barabasi A.-L., Brewer D., et al. Social science. Computational social science. Science (New York, NY) 2009;323(5915):721–723. doi: 10.1126/science.1167742. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lessig L. The zones of cyberspace. The Stanford Law Review. 1995;48:1403. [Google Scholar]
Li A., Zhao P., He H., Mansourian A., Axhausen K.W. How did micro-mobility change in response to Covid-19 pandemic? A case study based on spatial-temporal-semantic analytics. Computers, Environment and Urban Systems. 2021;90 doi: 10.1016/j.compenvurbsys.2021.101703. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li H.O.-Y., Bailey A., Huynh D., Chan J. YouTube as a source of information on Covid-19: A pandemic of misinformation? BMJ Global Health. 2020;5(5) doi: 10.1136/bmjgh-2020-002604. [DOI] [PMC free article] [PubMed] [Google Scholar]
Line T., Jain J., Lyons G. The role of ICTs in everyday mobile lives. Journal of Transport Geography. 2011;19(6):1490–1499. [Google Scholar]
Long Y., Huang C. Does block size matter? The impact of urban design on economic vitality for Chinese cities. Environment and Planning B: Urban Analytics and City Science. 2019;46(3):406–422. [Google Scholar]
Lovari A. Spreading (dis) trust: Covid-19 misinformation and government intervention in Italy. Media and Communication. 2020;8(2):458–461. [Google Scholar]
Lyu H., Wang J., Wu W., Duong V., Zhang X., Dye T.D., et al. Social media study of public opinions on potential covid-19 vaccines: informing dissent, disparities, and dissemination. Intelligent Medicine. 2022;2(01):1–12. doi: 10.1016/j.imed.2021.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
Massey P.M., Budenz A., Leader A., Fisher K., Klassen A.C., Yom-Tov E. What drives health professionals to Tweet about #HPVvaccine? Identifying strategies for effective communication. Preventing Chronic Disease. 2018;15 doi: 10.5888/pcd15.170320. [DOI] [PMC free article] [PubMed] [Google Scholar]
McMinn S., Carlsen A. National Public Radio; 2022. Tracking the coronavirus around the U.S.: See how your state is doing. Retrieved 2022-01-17, from https://www.npr.org/sections/health-shots/2020/09/01/816707182/map-tracking-the-spread-of-the-coronavirus-in-the-u-s. [Google Scholar]
Müller A.C., Guido S. O’Reilly; 2016. Introduction to machine learning with python: A guide for data scientists. [Google Scholar]
Murata T. In: Handbook of Social Network Technologies and Applications. Furht B., editor. Springer, US; Boston, MA: 2010. Detecting communities in social networks; pp. 269–280. [Google Scholar]
New York State . NYS Governor’s Press Office; 2020. Governor Cuomo Updates New Yorkers on the State’s Vaccination Administration Plan. Retrieved 2022-02-20, from https://www.governor.ny.gov/news/governor-cuomo-updates-new-yorkers-states-vaccination-administration-plan. [Google Scholar]
New York State . NYS Governor’s Press Office; 2021. Governor Cuomo Announces Allocation of $15 Million to Promote Vaccination in Communities Disproportionately Affected by COVID-19 Pandemic. Retrieved 2022-02-20, from https://www.governor.ny.gov/news/governor-cuomo-announces-allocation-15-million-promote-vaccination-communities. [Google Scholar]
New York State . NYS Governor’s Press Office; 2021. Governor Hochul announces #GetTheVaxFacts campaign to combat Covid-19 vaccine misinformation. Retrieved 2022-02-20, from https://www.governor.ny.gov/news/governor-hochul-announces-getthevaxfacts-campaign-combat-covid-19-vaccine-misinformation. [Google Scholar]
NYS GIS Clearinghouse . NYS ITS GIS Program Office; 2020. NYSDEC Regional Boundaries to Shorelines and NYSDEC Offices. Retrieved 2022-01, from https://gis.ny.gov/gisdata/inventories/details.cfm?DSID=1270. [Google Scholar]
Olteanu A., Castillo C., Diaz F., Kıcıman E. Social data: Biases, methodological pitfalls, and ethical boundaries. Frontiers in Big Data. 2019;2:13. doi: 10.3389/fdata.2019.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]
Openshaw S. The modifiable areal unit problem. Quantitative Geography: A British View. 1981:60–69. [Google Scholar]
Pak A., Paroubek P. Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10) European Language Resources Association (ELRA); Valletta, Malta: 2010. Twitter as a corpus for sentiment analysis and opinion mining. [Google Scholar]
Papadopoulos S., Kompatsiaris Y., Vakali A., Spyridonos P. Community detection in social media. Data Mining and Knowledge Discovery. 2012;24(3):515–554. [Google Scholar]
Pariser E. Penguin UK; 2011. The filter bubble: What the internet is hiding from you. [Google Scholar]
Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research. 2011;12:2825–2830. [Google Scholar]
Piedrahita-Valdés H., Piedrahita-Castillo D., Bermejo-Higuera J., Guillem-Saiz P., Bermejo-Higuera J.R., Guillem-Saiz J., et al. Vaccine hesitancy on social media: Sentiment analysis from June 2011 to April 2019. Vaccines. 2021;9(1):28. doi: 10.3390/vaccines9010028. [DOI] [PMC free article] [PubMed] [Google Scholar]
Polack F.P., Thomas S.J., Kitchin N., Absalon J., Gurtman A., Lockhart S., et al. Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine. New England Journal of Medicine. 2020 doi: 10.1056/NEJMoa2034577. [DOI] [PMC free article] [PubMed] [Google Scholar]
Porter C.E. A typology of virtual communities: A multi-disciplinary foundation for future research. Journal of Computer-Mediated Communication. 2004;10(1):JCMC1011. [Google Scholar]
Puri N., Coomes E.A., Haghbayan H., Gunaratne K. Social media and vaccine hesitancy: New updates for the era of Covid-19 and globalized infectious diseases. Human Vaccines & Immunotherapeutics. 2020;16(11):2586–2593. doi: 10.1080/21645515.2020.1780846. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rathi, M., Malik, A., Varshney, D., Sharma, R., & Mendiratta, S. (2018). Sentiment analysis of tweets using machine learning approach. In: 2018 Eleventh International Conference on Contemporary Computing (IC3) (p. 1–3). doi: 10.1109/IC3.2018.8530517.
Salathé M., Bonhoeffer S. The effect of opinion clustering on disease outbreaks. Journal of The Royal Society Interface. 2008;5(29):1505–1508. doi: 10.1098/rsif.2008.0271. [DOI] [PMC free article] [PubMed] [Google Scholar]
Saud M., Mashud M., Ida R. Usage of social media during the pandemic: Seeking support and awareness about Covid-19 through social media platforms. Journal of Public Affairs. 2020;20(4) [Google Scholar]
Schmidt A.L., Zollo F., Scala A., Betsch C., Quattrociocchi W. Polarization of the vaccination debate on Facebook. Vaccine. 2018;36(25):3606–3612. doi: 10.1016/j.vaccine.2018.05.040. [DOI] [PubMed] [Google Scholar]
Schultz L. Vol. 11. Rockefeller Institute of Government Blog; 2019. (Introducing New York’s rural economics). Retrieved from https://rockinst.org/blog/introducing-new-yorks-rural-economies/ [Google Scholar]
Shaw S.-L., Sui D. Understanding the new human dynamics in smart spaces and places: Toward a splatial framework. Annals of the American Association of Geographers. 2020;110(2):339–348. [Google Scholar]
Shen Y., Karimi K. Urban function connectivity: Characterisation of functional urban streets with social media check-in data. Cities. 2016;55:9–21. [Google Scholar]
Shen Y., Karimi K., Law S., Zhong C. Physical co-presence intensity: Measuring dynamic face-to-face interaction potential in public space using social media check-in records. PloS One. 2019;14(2) doi: 10.1371/journal.pone.0212004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sousa, D., Sarmento, L., & Mendes Rodrigues, E. (2010). Characterization of the Twitter @replies network: Are user ties social or topical? In Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents (p. 63–70). New York, NY, USA.
Stefanidis A., Crooks A., Radzikowski J. Harvesting ambient geospatial information from social media feeds. GeoJournal. 2013;78(2):319–338. [Google Scholar]
Stock K. Mining location from social media: A systematic review. Computers, Environment and Urban Systems. 2018;71:209–240. [Google Scholar]
Sui D., Shaw S.-L. Human dynamics in smart and connected communities. Computers, Environment and Urban Systems. 2018;72:1–3. [Google Scholar]
Sui D., Shaw S.-L. Mapping covid-19 in space and time. Springer; 2021. Outlook and next steps: Understanding human dynamics in a post-pandemic world—beyond mapping Covid-19 in space and time; pp. 347–358. [Google Scholar]
Sulis P., Manley E., Zhong C., Batty M. Using mobility data as proxy for measuring urban vitality. Journal of Spatial Information Science. 2018;16:137–162. [Google Scholar]
Tang Y., Zhang Y.-Q., Chawla N.V., Krasser S. SVMs modeling for highly imbalanced classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 2008;39(1):281–288. doi: 10.1109/TSMCB.2008.2002909. [DOI] [PubMed] [Google Scholar]
Tuan Y.-F. University of Minnesota Press; 1977. Space and place: The perspective of experience. [Google Scholar]
Tyson A., Johnson C., Funk C. Pew Research Center Science & Society; 2020. U.S. public now divided over whether to get Covid-19 vaccine. Retrieved 2022-07-14, from https://www.pewresearch.org/science/2020/09/17/u-s-public-now-divided-over-whether-to-get-covid-19-vaccine/ [Google Scholar]
Vanhoof M., Hendrickx L., Puussaar A., Verstraeten G., Ploetz T., Smoreda Z. Exploring the use of mobile phone data for domestic tourism trip analysis. Netcom. Réseaux, Communication et Territoires. 2017;31–3/4:335–372. [Google Scholar]
Vogels E.A. Pew Research Center Science & Society; 2021. Some digital divides persist between rural, urban and suburban America. Retrieved 2022-07-14, from https://www.pewresearch.org/fact-tank/2021/08/19/some-digital-divides-persist-between-rural-urban-and-suburban-america/ [Google Scholar]
Wellman B. In: Digital Cities II: Computational and Sociological Approaches. Tanabe M., Van Den Besselaar P., Ishida T., editors. Springer, Berlin Heidelberg; Berlin, Heidelberg: 2002. Little boxes, glocalization, and networked individualism; pp. 10–25. [Google Scholar]
Yin J., Soliman A., Yin D., Wang S. Depicting urban boundaries from a mobility network of spatial interactions: A case study of Great Britain with geo-located Twitter data. International Journal of Geographical Information Science. 2017;31(7):1293–1313. [Google Scholar]
Yu H., Shaw S.-L. Exploring potential human activities in physical and virtual spaces: A spatio-temporal GIS approach. International Journal of Geographical Information Science. 2008;22(4):409–430. [Google Scholar]
Yuan X., Schuchard R.J., Crooks A.T. Examining emergent communities and social bots within the polarized online vaccination debate in Twitter. Social Media + Society. 2019;5(3) [Google Scholar]
Zhao Z.-D., Huang Z.-G., Huang L., Liu H., Lai Y.-C. Scaling and correlation of human movements in cyberspace and physical space. Physical Review E. 2014;90(5) doi: 10.1103/PhysRevE.90.050802. [DOI] [PubMed] [Google Scholar]
Zhou X., Coiera E., Tsafnat G., Arachi D., Ong M.-S., Dunn A.G. Using social connection information to improve opinion mining: Identifying negative sentiment about HPV vaccines on Twitter. Studies in Health Technology and Informatics. 2015;216:761–765. [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.pdf^{(54.2KB, pdf)}

Data Availability Statement

We have shared the link to our code/data at the Attached File step.

[b0005] Abdel-Aziz A.A., Abdel-Salam H., El-Sayad Z. The role of ICTs in creating the new social public place of the digital era. Alexandria Engineering Journal. 2016;55(1):487–493. [Google Scholar]

[b0010] Ahmad M., Aftab S., Ali I. Sentiment analysis of tweets using SVM. International Journal of Computer Applications. 2017;177(5):25–29. [Google Scholar]

[b0015] Ahmed N., Quinn S.C., Hancock G.R., Freimuth V.S., Jamison A. Social media use and influenza vaccine uptake among White and African American adults. Vaccine. 2018;36(49):7556–7561. doi: 10.1016/j.vaccine.2018.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0020] Ahuja R., Chug A., Kohli S., Gupta S., Ahuja P. The impact of features extraction on the sentiment analysis. Procedia Computer Science. 2019;152:341–348. [Google Scholar]

[b0025] AlSayyad N., Guvenc M. Virtual uprisings: On the interaction of new social media, traditional media coverage and urban space during the ‘arab spring’. Urban Studies. 2015;52(11):2018–2034. [Google Scholar]

[b0030] Andre F.E., Booy R., Bock H.L., Clemens J., Datta S.K., John T.J., et al. Vaccination greatly reduces disease, disability, death and inequity worldwide. Bulletin of the World Health Organization. 2008;86:140–146. doi: 10.2471/BLT.07.040089. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0035] Arthur R., Williams H.T. The human geography of Twitter: Quantifying regional identity and inter-region communication in England and Wales. PloS One. 2019;14(4) doi: 10.1371/journal.pone.0214466. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0040] Basch C.H., MacLean S.A. A content analysis of HPV related posts on Instagram. Human Vaccines & Immunotherapeutics. 2019;15(7–8):1476–1478. doi: 10.1080/21645515.2018.1560774. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0045] Bastian M., Heymann S., Jacomy M. Gephi: An open source software for exploring and manipulating networks. Proceedings of the International AAAI Conference on Web and Social Media. 2009;3:361–362. [Google Scholar]

[b0050] Batty M. Virtual geography. Futures. 1997;29(4–5):337–352. [Google Scholar]

[b0055] Bello-Orgaz G., Hernandez-Castro J., Camacho D. Detecting discussion communities on vaccination in Twitter. Future Generation Computer Systems. 2017;66:125–136. [Google Scholar]

[b0060] Biernacki P., Waldorf D. Snowball sampling: Problems and techniques of chain referral sampling. Sociological Methods & Research. 1981;10(2):141–163. [Google Scholar]

[b0065] Bingham-Hall J., Law S. Connected or informed?: Local Twitter networking in a London neighbourhood. Big Data & Society. 2015;2(2) [Google Scholar]

[b0070] Blondel V.D., Guillaume J.-L., Lambiotte R., Lefebvre E. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment. 2008;2008(10):P10008. [Google Scholar]

[b0075] Brockmann D., Hufnagel L., Geisel T. The scaling laws of human travel. Nature. 2006;439(7075):462–465. doi: 10.1038/nature04292. [DOI] [PubMed] [Google Scholar]

[b0080] Bryant R. What kind of space is cyberspace. Minerva-An Internet Journal of Philosophy. 2001;5(2001) 138–1. [Google Scholar]

[b0085] Castells M. Globalisation, networking, urbanisation: Reflections on the spatial dynamics of the information age. Urban Studies. 2010;47(13):2737–2745. [Google Scholar]

[b0090] CDC . Covid-19 Vaccine Community Features; 2021. Connecticut uses social media to engage long-term care residents. Retrieved 2022-02-20, from https://www.cdc.gov/vaccines/covid-19/health-departments/features/connecticut-ltcop.html. [Google Scholar]

[b0095] Chen C.-F., Shi W., Yang J., Fu H.-H. Social bots’ role in climate change discussion on twitter: Measuring standpoints, topics, and interaction strategies. Advances in Climate Change Research. 2021;12(6):913–923. [Google Scholar]

[b0100] Chen E., Lerman K., Ferrara E. Tracking social media discourse about the Covid-19 pandemic: Development of a public coronavirus Twitter data set. JMIR Public Health and Surveillance. 2020;6(2) doi: 10.2196/19273. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0105] Chen Q., Crooks A. Analyzing the vaccination debate in social media data pre-and post-covid-19 pandemic. International Journal of Applied Earth Observation and Geoinformation. 2022;110 doi: 10.1016/j.jag.2022.102783. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0110] Chmiel A., Kowalska K., Hołyst J.A. Scaling of human behavior during portal browsing. Physical Review E. 2009;80(6) doi: 10.1103/PhysRevE.80.066122. [DOI] [PubMed] [Google Scholar]

[b0115] Cohen J. A coefficient of agreement for nominal scales. Educational and Psychological Measurement. 1960;20(1):37–46. [Google Scholar]

[b0120] Croitoru A., Wayant N., Crooks A., Radzikowski J., Stefanidis A. Linking cyber and physical spaces through community detection and clustering in social media feeds. Computers, Environment and Urban Systems. 2015;53:47–64. [Google Scholar]

[b0125] Cvetojevic S., Hochmair H.H. Modeling interurban mentioning relationships in the U.S. Twitter network using geo-hashtags. Computers, Environment and Urban Systems. 2021;87 [Google Scholar]

[b0130] Dodge M., Kitchin R. Routledge; 2003. Mapping cyberspace. [Google Scholar]

[b0135] Dong E., Du H., Gardner L. An interactive web-based dashboard to track Covid-19 in real time. The Lancet Infectious Diseases. 2020;20(5):533–534. doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0140] Du J., Xu J., Song H.-Y., Tao C. Leveraging machine learning-based approaches to assess human papillomavirus vaccination sentiment trends with Twitter data. BMC Medical Informatics and Decision Making. 2017;17(2):63–70. doi: 10.1186/s12911-017-0469-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0145] Eckmann J.-P., Moses E., Sergi D. Entropy of dialogues creates coherent structures in e-mail traffic. Proceedings of the National Academy of Sciences. 2004;101(40):14333–14337. doi: 10.1073/pnas.0405728101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0150] FDA (2020a). Coronavirus (Covid-19) update: December 22, 2020. U.S. Food & Drug Administration Newsroom. Retrieved 2022-01-17, fromhttps://www.fda.gov/news-events/press-announcements/coronavirus-covid-19-update-december-22-2020.

[b0155] FDA (2020b). FDA takes key action in fight against Covid-19 by issuing emergency use authorization for first Covid-19 vaccine. U.S. Food & Drug Administration Newsroom. Retrieved 2022-01-17, fromhttps://www.fda.gov/news-events/press-announcements/fda-takes-key-action-fight-against-covid-19-issuing-emergency-use-authorization-first-covid-19.

[b0160] FDA (2021). Coronavirus (Covid-19) update: April 27, 2021. U.S. Food & Drug Administration Newsroom. Retrieved 2022-01-17, fromhttps://www.fda.gov/news-events/press-announcements/coronavirus-covid-19-update-april-27-2021.

[b0165] Gonzalez M.C., Hidalgo C.A., Barabasi A.-L. Understanding individual human mobility patterns. Nature. 2008;453(7196):779–782. doi: 10.1038/nature06958. [DOI] [PubMed] [Google Scholar]

[b0170] González-Padilla D.A., Tortolero-Blanco L. Social media influence in the Covid-19 pandemic. International Brazilian Journal of Urology. 2020;46:120–124. doi: 10.1590/S1677-5538.IBJU.2020.S121. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0175] GOV.UK (2022). Decision: Information for U.K. recipients on Covid-19 vaccine Astrazeneca (Regulation 174). Medicines & Healthcare Products Regulatory Agency. Retrieved 2022-01-17, fromhttps://www.gov.uk/government/publications/regulatory-approval-of-covid-19-vaccine-astrazeneca/information-for-uk-recipients-on-covid-19-vaccine-astrazeneca.

[b0180] Grauwin S., Szell M., Sobolevsky S., Hövel P., Simini F., Vanhoof M., Smoreda Z., Barabási A.-L., Ratti C. Identifying and modeling the structural discontinuities of human interactions. Scientific Reports. 2017;7(1):1–11. doi: 10.1038/srep46677. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0185] Gunaratne K., Coomes E.A., Haghbayan H. Temporal trends in anti-vaccine discourse on Twitter. Vaccine. 2019;37(35):4867–4871. doi: 10.1016/j.vaccine.2019.06.086. [DOI] [PubMed] [Google Scholar]

[b0190] Hamstead Z.A., Fisher D., Ilieva R.T., Wood S.A., McPhearson T., Kremer P. Geolocated social media as a rapid indicator of park visitation and equitable park access. Computers, Environment and Urban Systems. 2018;72:38–50. [Google Scholar]

[b0195] Herrera G.L. Power and Security in the Information Age. Routledge; 2016. Cyberspace and sovereignty: Thoughts on physical space and digital space; pp. 81–108. [Google Scholar]

[b0200] Hu T., Wang S., Luo W., Zhang M., Huang X., Yan Y., et al. Revealing public opinion towards Covid-19 vaccines with Twitter data in the United States: Spatiotemporal perspective. Journal of Medical Internet Research. 2021;23(9) doi: 10.2196/30854. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0205] Huberman B.A., Pirolli P.L., Pitkow J.E., Lukose R.M. Strong regularities in world wide web surfing. Science. 1998;280(5360):95–97. doi: 10.1126/science.280.5360.95. [DOI] [PubMed] [Google Scholar]

[b0210] Jamison A.M., Broniatowski D.A., Dredze M., Sangraula A., Smith M.C., Quinn S.C. Vol. 1. Harvard Kennedy School Misinformation Review; 2020. (Not just conspiracy theories: Vaccine opponents and proponents add to the Covid-19 ‘infodemic’ on Twitter). [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0215] Joachims T. Proceedings of the Sixteenth International Conference on Machine Learning. Morgan Kaufmann Publishers Inc.; San Francisco, CA, USA: 1999. Transductive inference for text classification using Support Vector Machines; pp. 200–209. [Google Scholar]

[b0220] Kannan S., Gurusamy V., Vijayarani S., Ilamathi J., Nithya M., Kannan S., et al. Preprocessing techniques for text mining. International Journal of Computer Science & Communication Networks. 2014;5(1):7–16. [Google Scholar]

[b0225] Kaplan A.M., Haenlein M. Users of the world, unite! The challenges and opportunities of social media. Business Horizons. 2010;53(1):59–68. [Google Scholar]

[b0230] Kwan M.-P. In: Information, Place, and Cyberspace: Issues in Accessibility. Janelle D.G., Hodge D.C., editors. Springer, Berlin Heidelberg; Berlin, Heidelberg: 2000. Human extensibility and individual hybrid-accessibility in space-time: A multi-scale representation using GIS; pp. 241–256. [Google Scholar]

[b0235] Lam L., Suen S. Application of majority voting to pattern recognition: An analysis of its behavior and performance. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans. 1997;27(5):553–568. [Google Scholar]

[b0240] Landis J.R., Koch G.G. The measurement of observer agreement for categorical data. Biometrics. 1977:159–174. [PubMed] [Google Scholar]

[b0245] Larson H.J., De Figueiredo A., Xiahong Z., Schulz W.S., Verger P., Johnston I.G., et al. The state of vaccine confidence 2016: global insights through a 67-country survey. EBioMedicine. 2016;12:295–301. doi: 10.1016/j.ebiom.2016.08.042. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0250] Lazer D., Pentland A., Adamic L., Aral S., Barabasi A.-L., Brewer D., et al. Social science. Computational social science. Science (New York, NY) 2009;323(5915):721–723. doi: 10.1126/science.1167742. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0255] Lessig L. The zones of cyberspace. The Stanford Law Review. 1995;48:1403. [Google Scholar]

[b0260] Li A., Zhao P., He H., Mansourian A., Axhausen K.W. How did micro-mobility change in response to Covid-19 pandemic? A case study based on spatial-temporal-semantic analytics. Computers, Environment and Urban Systems. 2021;90 doi: 10.1016/j.compenvurbsys.2021.101703. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0265] Li H.O.-Y., Bailey A., Huynh D., Chan J. YouTube as a source of information on Covid-19: A pandemic of misinformation? BMJ Global Health. 2020;5(5) doi: 10.1136/bmjgh-2020-002604. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0270] Line T., Jain J., Lyons G. The role of ICTs in everyday mobile lives. Journal of Transport Geography. 2011;19(6):1490–1499. [Google Scholar]

[b0275] Long Y., Huang C. Does block size matter? The impact of urban design on economic vitality for Chinese cities. Environment and Planning B: Urban Analytics and City Science. 2019;46(3):406–422. [Google Scholar]

[b0280] Lovari A. Spreading (dis) trust: Covid-19 misinformation and government intervention in Italy. Media and Communication. 2020;8(2):458–461. [Google Scholar]

[b0285] Lyu H., Wang J., Wu W., Duong V., Zhang X., Dye T.D., et al. Social media study of public opinions on potential covid-19 vaccines: informing dissent, disparities, and dissemination. Intelligent Medicine. 2022;2(01):1–12. doi: 10.1016/j.imed.2021.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0290] Massey P.M., Budenz A., Leader A., Fisher K., Klassen A.C., Yom-Tov E. What drives health professionals to Tweet about #HPVvaccine? Identifying strategies for effective communication. Preventing Chronic Disease. 2018;15 doi: 10.5888/pcd15.170320. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0295] McMinn S., Carlsen A. National Public Radio; 2022. Tracking the coronavirus around the U.S.: See how your state is doing. Retrieved 2022-01-17, from https://www.npr.org/sections/health-shots/2020/09/01/816707182/map-tracking-the-spread-of-the-coronavirus-in-the-u-s. [Google Scholar]

[b0300] Müller A.C., Guido S. O’Reilly; 2016. Introduction to machine learning with python: A guide for data scientists. [Google Scholar]

[b0305] Murata T. In: Handbook of Social Network Technologies and Applications. Furht B., editor. Springer, US; Boston, MA: 2010. Detecting communities in social networks; pp. 269–280. [Google Scholar]

[b0310] New York State . NYS Governor’s Press Office; 2020. Governor Cuomo Updates New Yorkers on the State’s Vaccination Administration Plan. Retrieved 2022-02-20, from https://www.governor.ny.gov/news/governor-cuomo-updates-new-yorkers-states-vaccination-administration-plan. [Google Scholar]

[b0315] New York State . NYS Governor’s Press Office; 2021. Governor Cuomo Announces Allocation of $15 Million to Promote Vaccination in Communities Disproportionately Affected by COVID-19 Pandemic. Retrieved 2022-02-20, from https://www.governor.ny.gov/news/governor-cuomo-announces-allocation-15-million-promote-vaccination-communities. [Google Scholar]

[b0320] New York State . NYS Governor’s Press Office; 2021. Governor Hochul announces #GetTheVaxFacts campaign to combat Covid-19 vaccine misinformation. Retrieved 2022-02-20, from https://www.governor.ny.gov/news/governor-hochul-announces-getthevaxfacts-campaign-combat-covid-19-vaccine-misinformation. [Google Scholar]

[b0325] NYS GIS Clearinghouse . NYS ITS GIS Program Office; 2020. NYSDEC Regional Boundaries to Shorelines and NYSDEC Offices. Retrieved 2022-01, from https://gis.ny.gov/gisdata/inventories/details.cfm?DSID=1270. [Google Scholar]

[b0330] Olteanu A., Castillo C., Diaz F., Kıcıman E. Social data: Biases, methodological pitfalls, and ethical boundaries. Frontiers in Big Data. 2019;2:13. doi: 10.3389/fdata.2019.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0335] Openshaw S. The modifiable areal unit problem. Quantitative Geography: A British View. 1981:60–69. [Google Scholar]

[b0340] Pak A., Paroubek P. Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10) European Language Resources Association (ELRA); Valletta, Malta: 2010. Twitter as a corpus for sentiment analysis and opinion mining. [Google Scholar]

[b0345] Papadopoulos S., Kompatsiaris Y., Vakali A., Spyridonos P. Community detection in social media. Data Mining and Knowledge Discovery. 2012;24(3):515–554. [Google Scholar]

[b0350] Pariser E. Penguin UK; 2011. The filter bubble: What the internet is hiding from you. [Google Scholar]

[b0355] Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research. 2011;12:2825–2830. [Google Scholar]

[b0360] Piedrahita-Valdés H., Piedrahita-Castillo D., Bermejo-Higuera J., Guillem-Saiz P., Bermejo-Higuera J.R., Guillem-Saiz J., et al. Vaccine hesitancy on social media: Sentiment analysis from June 2011 to April 2019. Vaccines. 2021;9(1):28. doi: 10.3390/vaccines9010028. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0365] Polack F.P., Thomas S.J., Kitchin N., Absalon J., Gurtman A., Lockhart S., et al. Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine. New England Journal of Medicine. 2020 doi: 10.1056/NEJMoa2034577. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0370] Porter C.E. A typology of virtual communities: A multi-disciplinary foundation for future research. Journal of Computer-Mediated Communication. 2004;10(1):JCMC1011. [Google Scholar]

[b0375] Puri N., Coomes E.A., Haghbayan H., Gunaratne K. Social media and vaccine hesitancy: New updates for the era of Covid-19 and globalized infectious diseases. Human Vaccines & Immunotherapeutics. 2020;16(11):2586–2593. doi: 10.1080/21645515.2020.1780846. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0380] Rathi, M., Malik, A., Varshney, D., Sharma, R., & Mendiratta, S. (2018). Sentiment analysis of tweets using machine learning approach. In: 2018 Eleventh International Conference on Contemporary Computing (IC3) (p. 1–3). doi: 10.1109/IC3.2018.8530517.

[b0385] Salathé M., Bonhoeffer S. The effect of opinion clustering on disease outbreaks. Journal of The Royal Society Interface. 2008;5(29):1505–1508. doi: 10.1098/rsif.2008.0271. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0390] Saud M., Mashud M., Ida R. Usage of social media during the pandemic: Seeking support and awareness about Covid-19 through social media platforms. Journal of Public Affairs. 2020;20(4) [Google Scholar]

[b0395] Schmidt A.L., Zollo F., Scala A., Betsch C., Quattrociocchi W. Polarization of the vaccination debate on Facebook. Vaccine. 2018;36(25):3606–3612. doi: 10.1016/j.vaccine.2018.05.040. [DOI] [PubMed] [Google Scholar]

[b0400] Schultz L. Vol. 11. Rockefeller Institute of Government Blog; 2019. (Introducing New York’s rural economics). Retrieved from https://rockinst.org/blog/introducing-new-yorks-rural-economies/ [Google Scholar]

[b0405] Shaw S.-L., Sui D. Understanding the new human dynamics in smart spaces and places: Toward a splatial framework. Annals of the American Association of Geographers. 2020;110(2):339–348. [Google Scholar]

[b0410] Shen Y., Karimi K. Urban function connectivity: Characterisation of functional urban streets with social media check-in data. Cities. 2016;55:9–21. [Google Scholar]

[b0415] Shen Y., Karimi K., Law S., Zhong C. Physical co-presence intensity: Measuring dynamic face-to-face interaction potential in public space using social media check-in records. PloS One. 2019;14(2) doi: 10.1371/journal.pone.0212004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b0420] Sousa, D., Sarmento, L., & Mendes Rodrigues, E. (2010). Characterization of the Twitter @replies network: Are user ties social or topical? In Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents (p. 63–70). New York, NY, USA.

[b0425] Stefanidis A., Crooks A., Radzikowski J. Harvesting ambient geospatial information from social media feeds. GeoJournal. 2013;78(2):319–338. [Google Scholar]

[b0430] Stock K. Mining location from social media: A systematic review. Computers, Environment and Urban Systems. 2018;71:209–240. [Google Scholar]

[b0435] Sui D., Shaw S.-L. Human dynamics in smart and connected communities. Computers, Environment and Urban Systems. 2018;72:1–3. [Google Scholar]

[b0440] Sui D., Shaw S.-L. Mapping covid-19 in space and time. Springer; 2021. Outlook and next steps: Understanding human dynamics in a post-pandemic world—beyond mapping Covid-19 in space and time; pp. 347–358. [Google Scholar]

[b0445] Sulis P., Manley E., Zhong C., Batty M. Using mobility data as proxy for measuring urban vitality. Journal of Spatial Information Science. 2018;16:137–162. [Google Scholar]

[b0450] Tang Y., Zhang Y.-Q., Chawla N.V., Krasser S. SVMs modeling for highly imbalanced classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 2008;39(1):281–288. doi: 10.1109/TSMCB.2008.2002909. [DOI] [PubMed] [Google Scholar]

[b0455] Tuan Y.-F. University of Minnesota Press; 1977. Space and place: The perspective of experience. [Google Scholar]

[b0460] Tyson A., Johnson C., Funk C. Pew Research Center Science & Society; 2020. U.S. public now divided over whether to get Covid-19 vaccine. Retrieved 2022-07-14, from https://www.pewresearch.org/science/2020/09/17/u-s-public-now-divided-over-whether-to-get-covid-19-vaccine/ [Google Scholar]

[b0465] Vanhoof M., Hendrickx L., Puussaar A., Verstraeten G., Ploetz T., Smoreda Z. Exploring the use of mobile phone data for domestic tourism trip analysis. Netcom. Réseaux, Communication et Territoires. 2017;31–3/4:335–372. [Google Scholar]

[b0470] Vogels E.A. Pew Research Center Science & Society; 2021. Some digital divides persist between rural, urban and suburban America. Retrieved 2022-07-14, from https://www.pewresearch.org/fact-tank/2021/08/19/some-digital-divides-persist-between-rural-urban-and-suburban-america/ [Google Scholar]

[b0475] Wellman B. In: Digital Cities II: Computational and Sociological Approaches. Tanabe M., Van Den Besselaar P., Ishida T., editors. Springer, Berlin Heidelberg; Berlin, Heidelberg: 2002. Little boxes, glocalization, and networked individualism; pp. 10–25. [Google Scholar]

[b0480] Yin J., Soliman A., Yin D., Wang S. Depicting urban boundaries from a mobility network of spatial interactions: A case study of Great Britain with geo-located Twitter data. International Journal of Geographical Information Science. 2017;31(7):1293–1313. [Google Scholar]

[b0485] Yu H., Shaw S.-L. Exploring potential human activities in physical and virtual spaces: A spatio-temporal GIS approach. International Journal of Geographical Information Science. 2008;22(4):409–430. [Google Scholar]

[b0490] Yuan X., Schuchard R.J., Crooks A.T. Examining emergent communities and social bots within the polarized online vaccination debate in Twitter. Social Media + Society. 2019;5(3) [Google Scholar]

[b0495] Zhao Z.-D., Huang Z.-G., Huang L., Liu H., Lai Y.-C. Scaling and correlation of human movements in cyberspace and physical space. Physical Review E. 2014;90(5) doi: 10.1103/PhysRevE.90.050802. [DOI] [PubMed] [Google Scholar]

[b0500] Zhou X., Coiera E., Tsafnat G., Arachi D., Ong M.-S., Dunn A.G. Using social connection information to improve opinion mining: Identifying negative sentiment about HPV vaccines on Twitter. Studies in Health Technology and Informatics. 2015;216:761–765. [PubMed] [Google Scholar]

PERMALINK

Information propagation on cyber, relational and physical spaces about covid-19 vaccine: Using social media and splatial framework

Fuzhen Yin

Andrew Crooks

Li Yin

Abstract

1. Introduction

2. Background

Fig. 1.

3. Study area and data collection

3.1. Study area

Fig. 2.

3.2. Data collection

Table 1.

4. Methodology

Fig. 3.

4.1. Sentiment analysis

Table 2.

4.1.1. Manual annotation

4.1.2. Data processing

4.1.3. Classification model

Table 3.

Table 4.

Table 5.

4.1.4. Tweets and user labeling

Table 6.

4.2. Social network analysis

4.2.1. Reply network construction

Table 7.

4.2.2. Community detection in reply networks

Fig. 4.

4.3. Hybrid space networks

5. Results

5.1. Cyber space communities

Fig. 5.

Fig. 6.

5.2. Relational space communities

Fig. 7.

Fig. 8.

Fig. 9.

5.3. Physical space communities

Fig. 10.

5.4. Interactions between physical and relational spaces

Fig. 11.

5.5. Interactions between relational and cyber spaces

Fig. 12.

6. Discussion and conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Footnotes

Appendix A. Supplementary data

Data availability

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases