Skip to main content
IEEE - PMC COVID-19 Collection logoLink to IEEE - PMC COVID-19 Collection
. 2018 Jan 11;5(1):265–276. doi: 10.1109/TCSS.2017.2786545

Information Diffusion on Social Media During Natural Disasters

Rongsheng Dong 1, Libing Li 2, Qingpeng Zhang 3,4,, Guoyong Cai 1
PMCID: PMC7176041  PMID: 32391405

Abstract

Social media analytics has drawn new quantitative insights of human activity patterns. Many applications of social media analytics, from pandemic prediction to earthquake response, require an in-depth understanding of how these patterns change when human encounter unfamiliar conditions. In this paper, we select two earthquakes in China as the social context in Sina-Weibo (or Weibo for short), the largest Chinese microblog site. After proposing a formalized Weibo information flow model to represent the information spread on Weibo, we study the information spread from three main perspectives: individual characteristics, the types of social relationships between interactive participants, and the topology of real interaction networks. The quantitative analyses draw the following conclusions. First, the shadow of Dunbar’s number is evident in the “declared friends/followers” distributions, and the number of each participant’s friends/followers who also participated in the earthquake information dissemination show the typical power-law distribution, indicating a rich-gets-richer phenomenon. Second, an individual’s number of followers is the most critical factor in user influence. Strangers are very important forces for disseminating real-time news after an earthquake. Third, two types of real interaction networks share the scale-free and small-world property, but with a looser organizational structure. In addition, correlations between different influence groups indicate that when compared with other online social media, the discussion on Weibo is mainly dominated and influenced by verified users.

Keywords: Emergency response, online collective behavior, Sina-Weibo, social network analysis

I. Introduction

The 2009 DARPA Red Balloon Challenge offered the Internet and social networking a chance to demonstrate their vast potential to solve a distributed, time-critical, highly distributed public problem [1], [2]. And the modern social network study has inferred much new quantitative knowledge about human activity patterns, such as influencer’s identification [2][5], the network topology measurement [6], [7], trust analysis [8][10], social hot spot-tracing [11][13], and the dynamics of information spread [14][16]. However, many applications, from pandemic prediction to earthquake response, require an understanding of how these patterns change when human encounter unfamiliar conditions [17], [18]. Especially for China, who suffered from frequent natural disasters, the understanding of how the behaviors of hundreds of millions of Web users change is very important. The empirical study of the human flesh search (HFS) [19], [20], for one, provided quantitative insights into these collective responses of Web users in China. Inspired by previous research on Web users collective responses, we choose two empirical cases in China-Yi’liang (2012) and Ya’an earthquakes (2013). What makes it additionally useful is that many densely populated areas in mainland China, such as Sichuan, Fujian, and so on, are frequent earthquake areas and often suffered severe damage from earthquakes.

At present for earthquake topics, there are two main types of studies: detecting seismic waves and enhancing rescue efforts. The former focuses on how to improve the accuracy of magnitude of earthquake forecasting or issue warnings as early as possible [21][24], such as the Did You Feel It system. And the later studies [25][27] try to explore ideas to cope with earthquake relief, postearthquake reconstruction, and to improve the mental health status of rescuers. This paper focuses on the latter effort from a social network perspective, with a particular focus on information diffusion and social networking behaviors.

In social network study, the above-mentioned focus [25][27] can be seen as a nonlinear superposition of a multitude of social interaction networks, where nodes represent individuals and edges capture a variety of different social relations. However, after further study, a group of researchers represented by Huberman et al. [28] found that social interactions within Twitter cannot be inferred directly from a declared relationship set of friends and followers: many users interact with few other people in their declared relationship network [28][31]. The key problem is that the structure of the underlying interaction network is not visible and must be inferred from the flow of information between individuals, which poses a serious challenge to our efforts to understand how the structure of the network affects social dynamics and the spread of information [32].

Take the tweet reposting connection shown in Fig. 1(a) and (b) as examples. There are four features in the underlying interaction network: 1) user-D has four followers, B, C, F, and E, but just C and F repost the message created by D; 2) user-F reposts the tweet created by user-D through the intermediary-C; 3) user-C participates more than once; and 4) the tweet created by user-D is also reposted by user-H who does not have the relationship of friend or follower with user-D.

Fig. 1.

Fig. 1.

Typical multirelational participant network on Twitter/Weibo. (a) Declared social network, where nodes represent users and directed edges represent relationships of followers or friends. (b) Information cascade of one tweet, where nodes represent participating users and directed edges indicate tweets citing relationships. (c) Typical multirelational participant network, where the blue node is the original poster, pink nodes are the reposters, and edges still indicate user relationships.

There are two ways of describing these features in previous studies. The conventional method distinguishes whether users interact with each other by adding extra attributes for nodes or edges of declared relationship networks [17], [18], [33]. Fig. 1(c) is the typical application of this method, which attempts to describe Fig. 1(a) and (b) together. However, it cannot depict the features-III and IV. The other solution is to describe all kinds of user relationships or interactions using a multilayer/multirelational network [34], [35]. Likewise, the method cannot express the features-II and III explicitly, and the types of models lack the parallel development of specific analysis methods to exploit the information hidden between the layers.

In response to the above-mentioned problem, we propose a formalized definition of the Weibo information flow (WIF) and have applied it to the empirical analysis of a Sina-Weibo data set on the topic of earthquakes. The goal of our work is to provide a way to extract the underlying hidden social network that more clearly represents the information cascade and actual interactions among users after an earthquake. We will then address the problem in understanding of hidden social networks behind the online information flow.

  • 1)

    What about the languages used by participants? How is the demographics of participant users? Are there regional features? What are the differences before and after the earthquake?

  • 2)

    What are the common features of influential users? What are the key factors in user influence?

  • 3)

    When a user reposts, will he/she evenly repost tweets that attract him/her, or just those tweets posted by his/her friends? Also for users whose message are reposted, does the reposting all take place solely among their followers? Are there any differences in information diffusion patterns before and after the earthquake?

  • 4)

    Compared with early Web2.0 online communications, does the Sina-Weibo platform have unique features that include social and information dissemination functions?

  • 5)

    Are there any correlations between the type of interacted user relationships and the topological feature of user networks?

The organization of this paper is as follows. Section III presents the main body of this paper. We first introduce the data set and give a formal definition of the WIF model in Section II. Section III consists of three subsections. Section III-A includes the empirical results of eight individual attributes: the configured language of a personal page, gender, location, the number of followers, the number of friends, the number of posted tweets, the number being reposted, and the correlation among three user rankings. Section III-B analyzes the proportion of each type of user relationship between interacted users and discusses the differences in reposting patterns of each interacted user group by the user relationships. Section III-C uses social network analysis to unveil the topological properties of two types of underlying interaction networks, which are extracted by the WIF model, and analyze the correlation between influential user groups of two networks. Section IV closes this paper with remarks for future work.

II. Data Set and Methodologies

A. Data Set

Yi’liang earthquake erupted on September 7, 2012. Ya’an earthquake erupted on April 20, 2013. Because happening consecutively in China, these two natural disasters triggered sparked discussions of Sina-Weibo users. We collected these tweet repost data related two disasters by the Sina-Weibo public application programming interface (API). The API’s registered users can obtain relevant permits as long as that their identities are verified by the Sina-Weibo user authentication. During the course of empirical analyses, we used the MySQL database management system for data extraction and cleansing, and used the Cytoscape toolkit to analyze the network topology [36]. All of the figures are plotted with MATLAB.

As presented in Table I, the data set consists of two parts, demonstrating the contrast between before and after earthquake. Yi’liang county in China was hit by a 5.7-magnitude earthquake on September 7, 2012; the section titled After Yi’liang Earthquake is the seismic information generated by Sina-Weibo for seven months after the earthquake. In addition, the section Before Ya’an Earthquake is the daily information exchanged on Sina-Weibo regarding Ya’an within a week before April 20, 2013, a 7.0-magnitude earthquake occurred. The object of comparison should be the “before Yi’liang earthquake,” but the amount of relevant data is too small for empirical analysis.

TABLE I. Sina-Weibo Data Set.

Social context of the Dataset Inline graphic Inline graphic Inline graphic Inline graphic Time span
After Yi’liang Earthquake 9396 407584 330466 3096 Sep 7, 2012 – Apr 30, 2013
Before Ya’an Earthquake 1918 49416 46682 811 Apr 13, 2013 – Apr 19, 2013

Inline graphic: The count of original tweets; Inline graphic: The count of retweets; Inline graphic: The count of Sina-Weibo users; Inline graphic: The count of tweet report trees (A tweet repost tree consists of one unique original tweet, it’s retweets and repost relationships among them.).

Each original tweet record consists of five parts: the original tweet, the original user, repost tweets and their corresponding users, the repost tree, and the relationship between users. Fig. 2 shows the variation trend over time of the total number of these tweets and retweets and five high peaks in the after Yi’liang earthquake data set. These peaks may be related to some sensitive events taking place during those periods. The following social events might offer some available clues.

  • 1)

    Inline graphic-September 9, 2012: A 5.7-magnitude earthquake occurred in Yiliang county of China.

  • 2)

    Inline graphic-September 19, 2012: The Yiliang county published the latest donations list for Yiliang earthquake.

  • 3)

    Inline graphic-December 30, 2012: Called the kindest substitute teacher, Zhu Yinquan accidentally fell from a building, which resulted in severe brain injury, and he had saved seven students by hand in Yiliang earthquake.

  • 4)

    Inline graphic-March 24, 2013: The news of relief fund was not distributed in time flashed on the Internet.

  • 5)

    Inline graphic-April 20, 2013: A 7.0-magnitude earthquake occurred in Yaan of Sichuan province, China.

Fig. 2.

Fig. 2.

Change in the total number of related original tweets and their retweets over time. Num_Tweets_Yi’liang: number of tweets created after the Yi’liang earthquake, Num_Tweets_Ya’an: number of tweets created before the Ya’an earthquake. Five vertical red lines titled Inline graphic, Inline graphic, Inline graphic, Inline graphic, and Inline graphic represent the five dates when important events took place.

B. Weibo Information Flow (WIF) Model

To describe and retrieve a Weibo-spread information precisely, a formal definition of WIF is proposed as the following quadruple:

B.

where WUS is the Weibo user set consisting of participant Weibo users, TS is the tweet set consisting of original tweets and retweets, RRS is the repost relationship set consisting of the citing relationships between tweets, and TRTSis the tweet reposts tree set consisting of repost trees of each original tweet.

1). Structure of Inline graphic:

1).

where gender = 0/1 is the “0” to male and “1” to female. pNum Inline graphic ProvincesNumList, PNum denotes the city where the current user lives (see the Sina-Weibo city code list). lang Inline graphic {zh-cn, zh-tw, zh-hk, en} represents the language that the current user has configured on his Weibo page, where “zh-cn” denotes the simplified Chinese characters used in Mainland China, “zh-tw” and “zh-hk” denotes the complex Chinese characters used in Taiwan and Hong Kong, respectively, and “en” denotes English. followerset is the set consisting of Weibo users who follow the current user. For the followerSet/friendSet of each participant, the actual data set mentioned in Table I just records a number, i.e., the number of his/her followers/friends. Of course, for the part of their followers/friends who also participated in the earthquake information dissemination by posting/reposting Sina-Weibo messages, there are detailed records in the data set. friendSet is the set consists of Weibo users whom the current user follows. sCount is the number of tweets posted by the current user.

2). Structure of Inline graphic :

2).

where flag = 0/1, “0” indicates that the current tweet is an original tweet and “1” indicates responding to the retweet, user Inline graphic WUS is the user who posted the current tweet, repostCount Inline graphic N is the number of times of the current tweet has been reposted, time is the time when the current tweet is posted, text is the content body of the current tweet, and ot Inline graphic TS, when flag = 1, it denotes the original tweet of the current tweet; otherwise, it is null.

3). Structure of Inline graphic:

3).

where Inline graphic denotes the tweet that was reposted. Regular occurrences of the multilayer reposting phenomenon exist: for example, user-A posts the ot first, and then user-B posts the Inline graphic by reposting ot, and Inline graphic are reposted by others, too. Therefore, the st is either an original tweet or a retweet. rt Inline graphic is the retweet generated by reposting st. depth Inline graphic N+, indicates the distance between rt and rt.ot in the repost tree where N+ = {1, 2, 3, Inline graphic}, and if RR.st.flag = 0, then RR.depth = 1. type Inline graphic {I, II, III, IV, V}, denotes five repost-types by relationship types between the st.user and the rt.user. Fig. 3(a) shows three types of user relationships among Weibo users where the directed edge indicates that the starting user follows the target user. Here, we refer to the pair of users linked by bidirectional edges as bi-friend if they both follow each other, and refer those users as strangers if there are not any directed edges. In this paper, it is assumed that if user-B reposts one tweet posted by user-A, then the tweet information flows from user-A to user-B. After the combination of the information direction between two users and their relationship type, there will be five types of RRs [see Fig. 3(b)–(f)]. The following equations are the symbolic descriptions of five types of RRs:

3).
Fig. 3.

Fig. 3.

Three types of user relationships and five types of information cascade patterns in WIF. (a) Three types of Weibo_user relationships. (b) RR where type = V. (c) RR where type = I. (d) RR where type = II. (e) RR where type = III. (f) RR where type = IV.

4). Structure of Inline graphic:

4).

where OT Inline graphic indicates the original tweet represented by the root node of the current tweet repost tree. RTS Inline graphic indicates the node set of the current TRT, representing RTs generated by reposting OT. Inline graphic indicates the edge set of the current TRT, representing RRs generated by reposting OT. size = (OT.repostCount +1) indicates the number of nodes of the current TRT.

III. Results and Discussion

A. Analysis of Users’ Characteristics

This subsection focuses on the following questions. What about the distribution of users’ language? How is demographics of participant users? Are there some regional features? What are the differences before and after the earthquake? What are the typical features of influential users?

1). Basic Analysis:

The demographics, language, and geographic distribution of online users are important measurement indicators of application fields in disaster relief and disease surveillance and control [21], [37], [38]. On Sina-Weibo, the growth spurt of seismic topic tweets emerged as soon as the earthquake occurred. Where did these active users come from? What are their social backgrounds? Our measurement of WU.gender, WU.lang, and WU.pNum shows: 1) there is a roughly equal proportion of males and females among participant users; 2) users living in the Chinese Mainland, Hong Kong, Macao, and Taiwan are the force that made the tweet peak instantaneously, and the reason of the phenomena mainly come from the locality of Sina-Weibo; and 3) there are a few foreign users participating to post or repost related tweets, but nearly all of them live in China.

Fig. 4 shows the geographic distribution of participant users before and after the earthquake. During the aftermath in Yi’liang which was hit by a 5.7-magnitude earthquake on September 7, 2012, Fig. 4(b) shows that Beijing, Shanghai, and Guangdong topped the rank in the number of participant users and the proportion of active users located in these three areas is 27.83%, these three cities are the political center, the financial center, and the largest province in the economy, respectively. Only 3.5% of active users are located in the local province (Yunnan). In cases prior to the earthquake, there is too little data to analyze unfortunately, probably because Yi’liang is a very remote country in Yunnan province. Therefore, we replaced it with Ya’an which is a popular tourist destination (in the same way later). Then, we found in Fig. 4(a) hat local province (Sichuan) ranked first in the number of active users posting Ya’an common topics, with a share of 26.33%, at the same time the sum of shares in Beijing, Shanghai, and Guangdong still reaches 29.25%.

Fig. 4.

Fig. 4.

Geographic distributions of Sina-Weibo participants before and after earthquakes in China by province. Geographic distribution of participants (a) before Ya’an earthquake and (b) after Yi’liang earthquake.

For participants after the Yi’liang earthquake, Fig. 5 shows the distributions of their four individual attributes the number of friends (WU.foCount), the number of followers (WU.frCount), and the number of followers/friends who also participated in the earthquake information dissemination by posting/reposting Sina-Weibo (WU.sCount-P and WU.repostsCount-P), which are counted by the following equations:

1).
Fig. 5.

Fig. 5.

Distributions of four individual attributes of participant users after the Yi’liang earthquake. (a) Number of each participant’s friends, and the fitting interval is located [130, 1300] between Inline graphic and Inline graphic. (b) Number of each participant’s followers, and the fitting interval is located [130,Inline graphic) on the right side of Inline graphic. (c) and (d) Number of each participant’s friends who also participated in the earthquake information dissemination by posting/reposting Sina-Weibo messages. In addition, “Inline graphic” is the slope of fitting line, “Inline graphic” is the coefficient of determination of the fitting procedure.

The fitting function of red fitting lines is the general power model– frequency Inline graphic with 95% confidence bounds, where c, Inline graphic, and Inline graphic.

Fig. 5 shows the distributions of four individual attributes of participant users. Sudden spikes in Fig. 5(a)–(c) are very obvious. However, we thought that these phenomena have nothing to do with social networks, but are led by the constraints of users privileges on Sina-Weibo. Just with the upper limit of WU.frCount, for example, there are three levels: 2000, 2500, and 3000 for nonmembers, ordinary members, and VIP members, respectively (see http://vip.weibo.com/privilege).

Another common feature found in Fig. 5 is segment characteristics. This is true in Fig. 5(a) and (b), where the sudden changes near 130 verify Dunbar’s number. That is, human communities are much larger than those of other primates and hence require more time to be devoted to social maintenance activities. However, there is an upper limit on the amount of time that can be dedicated to social demands, so this sets an upper limit on social group size [41], [42]. It is known that humans have the cognitive capacity to maintain about 150 stable social relationships. With the advent of different types of super social networking services developing one after another, people have once again picked up the topic for discussion [43], [44]. Many researchers have investigated how tools, such as Facebook and Twitter, have changed our capacity to handle social connections using the empirical study [45][47]. Here, Fig. 5(a) and (b) also shows the shadow of Dunbar’s number on Sina-Weibo.

In addition, compared with the distribution of broadly defined “friends/followers” which are influenced by the Sina-Weibo user privilege in Fig. 5(a) and (b), the number of each participant’s “actual friends/followers” in Fig. 5(c) and (d) shows the typical power-law distribution; from this, we could know that the unevenness is a universal phenomenon of social networks, even when there are some limiting conditions in social network services, such as the constraint of users’ privileges on Sina-Weibo as described earlier.

2). User Influence:

The number of relevant tweets is the most immediate indication of the popularity of posts on the Weibo space. The number of being reposted can offer us an intuitive view of the original poster’s influence on public opinion. For the popularity, the engagement/productivity, and the influence [5], [40], [48], three types of user rankings are made by the following Inline graphic, Inline graphic, and Inline graphic

2).

For the top 20 users of each ranking, the comparison shows that entertainers belong to the most popular group and most users who post tweets actively come from the grassroots, but nearly all users whose tweets regularly receive widespread reposting are news/charity organizations or entertainers. The rest of this subsection quantifies the correlations between the three rankings by the generalizedInline graphic model [49]:

2).

where Inline graphic and Inline graphic are two ordered rankings with the equal length, Inline graphic; and Inline graphic, if 1) Inline graphic and Inline graphic is only in one ranking and Inline graphic is in the other ranking; 2) Inline graphic is ranked higher than Inline graphic and only Inline graphic appears in the other ranking; or 3) Inline graphic and Inline graphic are in both rankings in the opposite order, otherwise, Inline graphic. In particular, Inline graphic. In addition, the normalized distance-Inline graphic is used, computed as follows [50]:

2).

The range of Inline graphic is from 0 to 1. Inline graphic means complete disagreement, and Inline graphic means complete agreement.

Fig. 6 shows that with the increase of Inline graphic, the distance between two arbitrary rankings will get to a relatively stable level, but three stable levels vary widely. 1) The top association between Inline graphic and Inline graphic allow us to infer that a huge fan base may indeed be transformed into a real influence on the public. 2) The weak but stable association between Inline graphic and Inline graphic let us deduce that active participation may indeed help to enhance user influence, but it is very limited. 3) There is little or no association between Inline graphic and Inline graphic, so it can be inferred that active grassroots participators seldom intersect with celebrities. Therefore, it can be seen that Weibo is a place shared by the general public and celebrities, which makes Weibo an excellent example of grassroots media. The seldom intersection makes the metamorphosis from grassroots activist to celebrity difficult, but it is a truth that intimate contact can make overnight success possible, though we have not found the hidden principles among this [51].

Fig. 6.

Fig. 6.

Comparison between three types of user rankings for all users by Inline graphic, Inline graphic, and Inline graphic.

B. User Relationship Structure Measurement

From a sociological perspective, friendship is a trust relationship built by repeated games, and now the quality of online social relationships has been receiving increased attention [30], [31], [52]. In our earthquake discussion case, Fig. 7 lists the three main information push services in Sina-Weibo. This section focuses on the following questions. Who actually repost tweets that the original user has posted? When a user reposts, does he/she repost tweets that he/she is interested in, or just those tweets posted by his/her friends? Also for users who get reposted, do the reposting all take place just among their followers? Are there any differences in information diffusion patterns before and after the earthquake?

Fig. 7.

Fig. 7.

Three main tweet push services on Sina-Weibo. (a) For any two users, User1 and User2, if User2 follows User1, then User1 is the friend of User2 and User2 is the follower of User1, in addition, tweets posted by User1 would be shown for User2 in real time. (b) All tweets beginning with the string of “@User2” will be shown for User2. (c) Every user can read “hot messages” that Sina-Weibo picked out from all tweets of a certain period by their popularity.

1). Structure of Weibo Information Flow:

For the five types of RR in Fig. 3, the butterfly shaped cartoon structure in Fig. 8 analyzed their share in the total amount of RR before and after earthquake, respectively, computed as follows:

1).
Fig. 8.

Fig. 8.

Structure of Sina-Weibo information spread before and after the earthquake, respectively. Notes: five parts marked with Inline graphicIV correspond to the five types of RRs in Fig. 3, and the percentages are their shares in the total amount of all RRs. Structure of Sina-Weibo information spread during (a) week before the Ya’an earthquake and (b) seven months after Yi’liang earthquake.

where Inline graphic, Inline graphic.

For the WIF structure before the Ya’an earthquake, Fig. 8(a) shows as follows. First, the Inline graphic from WU0 to WU3 shares the largest percentage of all RRs, 43.04%, which benefits from the push service in Fig. 7(a). Second, there is no declared relationship between WU0 and WU4, but the share of Inline graphic between them reaches up to 37.23%, which may be largely due to the push service in Fig. 7(c). Also, it can be seen that compared with the user popularity, the content of tweets is equally important for their great popularity. Third, the strongest declared friendship exists between WU0 and WU2, and the share of Inline graphic between them is 17.85%, which can be considered the mirror of the contacting pattern among friends in real life. Fourth, the share of Inline graphic is 1.438%. Unfortunately, we found no logical explanation for these behaviors. Fifth, the Inline graphic from WU0 to WU1 shares the smallest percentage of 0.4420%, less than 1%. Most users did not pay attention to tweets posted by their followers at all. It can be noticed that there is not an information push service from followers to friends, but in the real world, there is also a limited number of efficient ways to attract celebrities. So, this is what we can see in the hedged real world just through the lens of online media.

Fig. 8(b) shows the WIF structure after the Yi’liang earthquake, and in Table II, we have listed the comparison result of these two butterfly maps. We can see the proportion of Inline graphic from WU0 to WU4 rises remarkably, the growth is more than half.

TABLE II. Share of Each Type of RRs in WIF Before and After Earthquake.
Social context of the DataSet Multi-Column
Type=I Type=II Type=III Type=IV Type=V
Before Ya’an Earthquake 0.4420% 17.85% 43.04% 37.23% 1.438%
After Yi’liang Earthquake 0.1959% 8.255% 38.05% 52.82% 0.4891%

In this subsection, the following three conclusions can be drawn. First, “3.1 Analysis of user’ characteristics” has already shown that very few Weibo-big-Vs are able to attract wide attention; in other words, an overwhelming majority of Weibo users play the role of followers. From the dramatic contrast between proportions of Inline graphic and Inline graphic, Sina-Weibo seems like a natural platform to worship idols. Second, given the dramatic contrast among the proportions of Inline graphic and Inline graphic or Inline graphic, Weibo users pay more attention to receiving interesting information than making friends. Maybe this is what makes Weibo different from other online social platforms. Third, followers are indeed the major driving force for the Weibo information spread, but at the same time, the reports from strangers are equally crucial. Particularly in some public emergencies, strangers are likely to be in the vanguard of discussing and distributing news in real time. So, from the point of view of information distribution, in addition to the followers’ count of a user, it is also important to examine whether the tweet is selected as a hot message used to push to all other users (see http://hot.weibo.com/).

2). Distributions of Individual Being Reposted Times:

This subsection presents the distributions of individual reposting times in each type of RRS for the WIF structure after the Yi’liang earthquake [Fig. 8(b)]. In Fig. 9(f), the statistic sample set is the sum of all RRs. If there is a point (Inline graphic) in the scatter plot, it means that those users whose tweets have been reposted Inline graphic times by others makeup Inline graphic percentage of participating users total, and there is a following mapping between Inline graphic and Inline graphic:

2).

where if the following equation is true, then count(wu) = 1, otherwise count(wu) = 0:

2).
Fig. 9.

Fig. 9.

Distributions of the number of being reposted of each WU in RRS grouped by RR.type. (a) RRS of Weibo users’ own reposts. (b) RRS of reposts from strangers. (c) RRS of reposts from followers. (d) RRS of reposts from bifriends (mutually following each other). (e) RRS of reposts from friends. (f) RRS of all reposts.

The five statistic sample sets in Fig. 9(a) and (e) are Inline graphic, Inline graphic, Inline graphic, Inline graphic, and Inline graphic, respectively. So, the point (Inline graphic, Inline graphic) in these five scatter plots means that there is a following mapping between Inline graphic and Inline graphic

2).

where i Inline graphic, and if the following equation is true, then count(wu) = 1, otherwise count(wu) Inline graphic

2).

All six distributions in Fig. 9 follow a power-law distribution. This is consistent with the distributions of the number of being cited as well as citing others in an empirical study of HFS6. However, the three similar power-law slope values (Inline graphic, 1.75, and 1.84) in the HFS study are equivalent to one distribution (Inline graphic) of Inline graphic, far below those (Inline graphic, 2.225, 2.677, 3.882, and 2.138) other five distributions. The nature underlying power-law distribution is uneven, and the higher a slope value is, the more severe the imbalance gets. Corresponding to the situation here, it means that these users participating Inline graphic maintain the same interaction pattern with users of ordinary online social forums. However, there might be a serious structural imbalance in the interactions of Inline graphic, Inline graphic, Inline graphic, and Inline graphic, which may be the reason why Weibo is so different, churning out a string of “Weibo-big-Vs”-Weibo users with mass followings and whose identities have been verified by Sina. These features have to do with social role partitioning [Fig. 3(a)] and the role-based push services (Fig. 7) on Sina-Weibo.

C. Topology Measurement of Real Interaction Networks

Research by Huberman et al. [28] found that the driver of Twitter usage is a sparse and hidden network of connections underlying the “declared” set of friends and followers [53], that is to say in terms of the network density, there is a gap between the real interaction network and the declared user network. Additionally, some other researchers have also found that social interaction existed among various types of users, far more than among acquaintances [54]. From the perspective of network topology, beyond the network density, are there any differences in other topological properties? In addition, what are the key factors behind user influence? To answer these questions, first, a method is introduced to extract the hidden interaction network from the friends and followers network on the basis of the WIF model. Afterward, we present measurement results and provide the corresponding explanation.

The TRT structure is the key to extracting real interaction networks. For 3096 TRTS in the WIF after the Yi’liang earthquake, Fig. 10(a) shows their size distribution where the largest TRT consists of 102 526 retweets, and Fig. 10(b) shows the depth distribution of all RRs, where the maximum value is 85.

Fig. 10.

Fig. 10.

Distribution of TRT.size and RR.depth in the WIF related to the Yi’liang earthquake. (a) TRT.size distribution. (b) RR.depth distribution.

1). Extraction Process of Two Types of Real Interaction Networks:

According to the definition of WIF presented in Section II-B, the symbolic representation of the WIF instance in Fig. 11 is listed as follows:

1).

where WUS Inline graphic, WU3, WU4, WU5, WU6, WU7, WU8, WU9, WU10, WU11, Inline graphic. TS Inline graphic, ot2, ot3, ot4, ot5, ot6, ot7, ot8, ot9, ot10, ot11, ot12, Inline graphic, and Inline graphic, WU5, 0, -, -, null Inline graphic, Inline graphic, WU9, 3, -, -, null Inline graphic, Inline graphic, WU1, 10, -, -, null Inline graphic, Inline graphic, WU10, 1, -, -, Inline graphic, Inline graphic, WU4, 0, -, -, Inline graphic, Inline graphic, WU11, 0, -, -, Inline graphic, Inline graphic, WU2, 2, -, -, Inline graphic, Inline graphic, WU2, 5, -, -, Inline graphic, Inline graphic, WU12, 0, -, -, Inline graphic, Inline graphic, WU3, 1, -, -, Inline graphic, Inline graphic, WU6, 1, -, -, Inline graphic, Inline graphic, WU7, 2, -, -, Inline graphic, Inline graphic, WU7, 0, -, -, Inline graphic, Inline graphic, WU6, 0, -, -, Inline graphic, Inline graphic, WU8, 0, -, -, Inline graphic, Inline graphic, WU3, 0, -, -, Inline graphic. Inline graphic, rr2, rr3, rr4, rr5, rr6, rr7, rr8, rr9, rr10, rr11, rr12, Inline graphic, and Inline graphic ot2, rt1, 1, Inline graphic, Inline graphic ot2, rt2, 1, Inline graphic, Inline graphic rt1, rt3, 2, Inline graphic, Inline graphic ot3, rt4, 1, Inline graphic, Inline graphic ot3, rt5, 1, Inline graphic, Inline graphic ot3, rt6, 1, Inline graphic, Inline graphic rt4, rt7, 2, Inline graphic, Inline graphic rt5, rt8, 2, Inline graphic, Inline graphic rt5, rt9, 2, Inline graphic, Inline graphic rt7, rt10, 3, Inline graphic, Inline graphic rt8, rt11, 3, Inline graphic, Inline graphic rt9, rt12, 3, Inline graphic, Inline graphic rt9, rt13, 3, Inline graphic. TRTS Inline graphic, trt2, Inline graphic, and Inline graphic ot1, null, null, Inline graphic, Inline graphic ot2, Inline graphic, rt2, Inline graphic, Inline graphic, rr2, Inline graphic, Inline graphic, Inline graphic ot3, Inline graphic, rt5, rt6, rt7, rt8, rt9, rt10, rt11, rt12, Inline graphic, Inline graphic, rr5, rr6, rr7, rr8, rr9, rr10, rr11, rr12, Inline graphic, Inline graphic.

Fig. 11.

Fig. 11.

Example of the WIF included 16 Weibo messages and 12 Weibo users. (a) Structure of the Tweet Repost Tree Set, where each node is corresponding to a unique Weibo message, and directed edges between pairs of nodes indicate the presence of posting citations between them. (b) Structure of the following-based Weibo user declared social network, where each cartoon guy is corresponding to a unique Weibo user, and directed edges between pairs of guys indicate the presence of the following relationship between them.

The “declared” online social network is still the common empirical object among existing empirical studies, where nodes represent users, and the edge between nodes indicates the declared social relationship [6][8], [39], [48]. Given the prevalence of the Internet water army, this paper is only interested in how participants collaborated with each other and their relationship types. In addition, we found

1).

that is, more than a third of retweets in TRTS link only to the original tweet without any citations relating to other retweets, such as rt2 and rt6, in Fig. 11(a). We denoted these types of retweets as casual nodes and the corresponding participants as casual participants. Although casual nodes help spread information (the total number of reposted tweets is an important factor for tweet rank), those nodes did not contribute to the actual collaboration activities. Therefore, we excluded casual nodes and analyzed the remaining repost behavior, which involved a total of 249 237 retweets.

Fig. 12(a) shows one type of real interaction network extracted from the WIF instance in Fig. 11, which was named the friendship-based reposting cooperation network (FRCN). The cartoon figures represent distinct participants, and directed edges indicate the type of social relationships between them [corresponding to the black directed edges Fig. 3(a)]. The extraction process needs to visit all directed edges in Fig. 11(b), retaining only the edges and the participants linked by edges that match the following conditions. That is, if there is a pair of users (Inline graphic, Inline graphic) linked by edges in FRCN, then in Fig. 11(a) there is at least one RR that makes the following formula true:

1).
Fig. 12.

Fig. 12.

Two types of Weibo message spread participant network. (a) FRCN the FRCN. (b) SRCN the strange reposting cooperation network.

Another type of real interaction network is shown in Fig. 12(b); named stranger reposting cooperation network (SRCN) where the cartoon figures represent distinct participants, and directed edges indicate the information flow between them (corresponding the blue directed edges in Fig. 3). The extraction process needs to visit all Inline graphic, where type = IV in Fig. 11(b). If there is at least one Inline graphic matching the following condition, then we add the corresponding pair of users (Inline graphic, Inline graphic) and one directed edge from Inline graphic to Inline graphic into SRCN:

1).

Finally, on the basis of the WIF after Yi’liang earthquake, the FRCN consisted of 128 865 nodes and 159 379 edges and the SRCN consisted of 85 978 nodes and 104 410 edges. Therefore, it can be seen again that Sina-Weibo is an excellent synthesis of the traditional society of acquaintances and the strangers’ one.

2). Topology Measurement of FRCN and SRCN:

Table III lists a comparison of topological properties between FRCN, SRCN, and HFS7; some corresponding distributions are presented in Fig. 13.

TABLE III. Topological Properties Comparison of the HFS, the FRCN, and the SRCN.
Networks Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic
HFS7 20 813 29 798 2 821 11 556(55.5%) 10−4 2.1 2.4 0.027 2.650 8.679 28
FRCN 128 865 159 379 5 422 112 617(87.4%) 10−5 2.813 2.318 0.003 1.998 10.54 47
SRCN 85 978 104 410 104 410 6 241 10−5 2.848 1.887 10−4 2.401 1.745 9

Inline graphic: number of nodes; Inline graphic: number of edges; Inline graphic: number of components; Inline graphic: number of nodes in the giant component; Inline graphic: network density; Inline graphic: power of in-degree distribution; Inline graphic: power of out-degree distribution; Inline graphic: average clustering coefficient; Inline graphic: average degree; Inline graphic: average shortest path length; Inline graphic: network diameter.

Fig. 13.

Fig. 13.

Distributions of three topological properties in FRCN and SRCN. (a) Number of nodes in each component of FRCN. (b) Number of nodes in each component of SRCN. (c) In-degree of each node in FRCN. (d) In-degree of each node in SRCN. (e) Out-degree of each node in FRCN. (f) Out-degree of each node in SRCN. (g) Length of each shortest path in FRCN. (h) Length of each shortest path in SRCN.

The values of Inline graphic and Inline graphic in Table III show that there is a largest weekly connected component (giant component) both in FRCN and SRCN, and Figs. 13(a) and 14(b) show that the component size distributions of FRCN and SRCN both follow the power-law trait. Connectivity is an important part of the social network structure, and our analysis results indicate that doing things in groups is a strong human instinct; that is, everything we do or say tends to ripple through our network and have an impact on our friends. For the context of our study, if interest and friendship created the emergence of super components in the HFS groups and FRCN, then what is it that made the emergence of super components in SRCN? Though this question is very interesting, it goes beyond the scope of this paper.

Fig. 14.

Fig. 14.

Comparison of user rankings between FRCN and SRCN. (a) Comparison between Inline graphic, SRCN_Inline graphic, and FRCN_Inline graphic. (b) Comparison between Inline graphic, SRCN_Inline graphic, and FRCN_Inline graphic.

The comparison of Inline graphic in Table III shows that values of the network densities of FRCN and SRCN are both much less than the HFS groups; the interaction groups in Sina-Weibo are looser and more disorganized than traditional online social forums, such as HFS communities. This allows Weibo users to group together quickly and leave quickly.

All of the degree distributions in Fig. 13(c) and (f) are the power-law type, meaning that FRCN and SRCN both are scale-free networks. This also means that there are users whose tweets always receive the overwhelming public response (i.e., huge hubs), but the appeal of most of others is severely limited. For the scale-free property in a social network, the research team of Albert-László Barabás holds that it is a consequence of a decision-based queuing process: when individuals execute tasks based on some perceived priority, there will be heavy tail phenomenon [55]. We do not rule out the possibility, because the number being reposted is indeed an important factor for a tweet in a real-time-based push service shown in Fig. 7(c). However, one thing is certain: without the strong appeal of hub users in either FRCN or SRCN, few messages could get widespread.

The comparison of Inline graphic, avg.D, Inline graphic, and Inline graphic in Table III shows that FRCN and SRCN both are small-world networks. The avg.D values of FRCN and SRCN are both below the avg.D of HFS, and their Inline graphic values are at least an order of magnitude lower than HFS, indicating that most users are not friends of one another. However, the values of Inline graphic and Inline graphic in Table III and the result in Fig. 13(g) and (h) both shows that the avg.D has not affected the social distance, i.e., Weibo information can always flow from one user to another through a small number of hops. With the SRCN, for example, each user has less than three adjacent users on average, but the typical distance between randomly chosen users still remains fewer than nine hops. Another finding about FRCN and SRCN is that the shortest paths of FRCN behave according to the negative binomial distribution.

Finally, on the basis of analysis in Fig. 6, the rest of this section performed further analysis on how much the influential group may converge in FRCN and SRCN with the generalized Kendal Inline graphic model again. FRCN and SRCN are two real interaction networks emerging in entirely different ways, so we wondered in particular whether the appeal of these users on the top of Inline graphic and Inline graphic maintain the same powerful influence both in FRCN and SRCN. The two formulas in the following are for ranking users of FRCN and SRCN, respectively. And the results are shown in Fig. 14

2).

The total number of times a tweet is reposted is the most concrete indication of user influence. The comparison between the two curves of K(Inline graphic, FRCN_Inline graphic) and K(Inline graphic, SRCN_Inline graphic) in Fig. 14. A indicates that the influential users in FRCN cater to popular taste better than those in SRCN. The curve of K(SRCN_Inline graphic, FRCN_Inline graphic) in Fig. 14(a) shows that an influential group does converge in FRCN and SRCN, but with limited overlap especially so for the large-scale and widespread reposting. In addition, the two curves in Fig. 14(b) have a similar level, it can be observed that the two influential groups in FRCN and SRCN have at least one thing in common: both have a massive following of fans. Therefore, the determinant of a tweet gets large-scale notice would likely be related to the number of followers the publisher has.

IV. Conclusion

Weibo has been ubiquitously integrated into people’s everyday lives in China. Both Sina-Weibo and Tencent-Weibo had more than five million users in early May 2013, and their open platforms (including the data API service) have been improved constantly. Although the very existence of user records raises huge concerns of privacy, these user records also create a historic opportunity, that is offering for the first time unbiased data of unparalleled detail on the behavior of not one, but millions of individuals.

In this paper, we performed a comprehensive analysis of Weibo information diffusion during earthquakes. We found that symbolic representation applied to the WIF model is indeed a feasible choice for the empirical study of human behavior based on online social media data sets. In retrospect, our primary inspiration came from the description mechanism of concepts and relationships in ontology theory. The main feature of this idea is that it can give a formal expression for the data structure and the analysis process (such as the extraction of FRCN and SRCN).

However, the structure of social networks is only a starting point. When people talk about the “connectedness” of a social network, in general, they are really talking about two related issues. One is who is linked to whom; and the other is the fact that each individual’s actions have implicit consequences for the outcomes of everyone in the system [56]. In fact, Fig. 8 has given us some intuition that there is probably a serious structural imbalance between the “declared” relationship network and the real interaction network [57][59]. In addition, to measure public perceptions in emergencies, many researchers have worked extensively on the evolution of public opinion during information dissemination based on Twitter [60], which allows for many interesting directions for future work.

Biographies

graphic file with name dong-2786545.gif

Rongsheng Dong received the B.S. degree from the China University of Geosciences, Wuhan, China, in 1989.

He is currently a Professor with the School of Computer Science and Security, Guilin University of Electronic Technology. He is also currently a member of the Advisory Commission on Teaching Computer Curriculum in Colleges and Universities under the Ministry of Education, the Education Commission of China Computer Federation. He has received research grants from National Natural Science Foundation of China and Natural Science Foundation of Guangxi Province. His current research interests include graph data, social computing, and protocol engineering.

graphic file with name li-2786545.gif

Libing Li received the B.S. degree in computer science and technology from the Department of Computer Science, Changzhi University, Changzhi, China, in 2011, and the M.S. degree in computer application technology from the School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China, in 2014.

He is currently a Data and Application Architect of Mintaian Insurance Surveyors & Loss Adjusters Group Company Ltd., Shenzhen, China. His current research interests include public sentiment discovery, catastrophic risks modeling, and risk control and loss adjust.

graphic file with name zhang-2786545.gif

Qingpeng Zhang (M’10) received the B.S. degree in automation from the Huazhong University of Science and Technology, Wuhan, China, and the M.S. degree in industrial engineering and the Ph.D. degree in systems and industrial engineering from The University of Arizona, Tucson, AZ, USA.

He was a Post-Doctoral Research Associate with the Tetherless World Constellation, Rensselaer Polytechnic Institute, Troy, NY, USA. He is currently an Assistant Professor with the Department of Systems Engineering and Engineering Management, City University of Hong Kong, Hong Kong. His current research interests include social computing, complex networks, healthcare data analytics, semantic social networks, and Web science.

Dr. Zhang is an Associate Editor of the IEEE Transactions on Intelligent Transportation Systems and the IEEE Transactions on Computational Social Systems.

graphic file with name cai-2786545.gif

Guoyong Cai received the B.S. degree in applied physics from Hefei Polytechnic University, Hefei, China, the M.S. degree in computer science from the University of Shanghai for Science and Technology, Shanghai, China, and the Ph.D. degree in computer science from Zhejiang University, Hangzhou, China.

He is currently a Professor with the School of Computer Science and Security, Guilin University of Electronic Technology, Guilin, China. His current research interests include social media mining, social computing, and software engineering.

Funding Statement

This work was supported in part by the National Natural Science Foundation of China under 61762024, Grant 61763007, Grant 71402157, Grant 71672163, Grant 61363070, Grant 61063039, and Grant 61262074, the Natural Science Foundation of Guangxi Province under Grant 2017GXNSFDA198050, and the Natural Science Foundation of Guangdong Province under Grant 2014A030313753.

Contributor Information

Rongsheng Dong, Email: ccrsdong@guet.edu.cn.

Libing Li, Email: lilibing1@163.com.

Qingpeng Zhang, Email: qingpeng.zhang@cityu.edu.hk.

Guoyong Cai, Email: ccgycai@guet.edu.cn.

References

  • [1].Defense Advanced Research Projects Agency. (2010). DARPA Network Challenge Project Report. [Online]. Available: http://www.eecs.harvard.edu/cs286r/courses/fall10/papers/ProjectReport.pdf
  • [2].Pickard G.et al. , “Time-critical social mobilization,” Science, vol. 334, no. 6055, pp. 509–512, Oct. 2011. [DOI] [PubMed] [Google Scholar]
  • [3].Kitsak M.et al. , “Identification of influential spreaders in complex networks,” Nature Phys., vol. 6, no. 11, pp. 888–893, Aug. 2010. [Google Scholar]
  • [4].Bakshy E., Hofman J. M., Watts D. J., and Mason W. A., “Everyone’s an influencer: Quantifying influence on twitter,” in Proc. WSDM, Hong Kong, 2011, pp. 65–74. [Google Scholar]
  • [5].Fu K.-W. and Chau M., “Reality check for the Chinese microblog space: A random sampling approach,” PLoS ONE, vol. 8, no. 3, p. e58356, Mar. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Fan P., Wang H., Jiang Z., and Li P., “Measurement of microblogging network,” (in Chinese), J. Comput. Res. Develop., vol. 49, no. 4, pp. 691–699, 2012. [Google Scholar]
  • [7].Lei K., Zhang K., and Xu K., “Understanding Sina Weibo online social network: A community approach,” in Proc. IEEE GLOBECOM, Dec. 2013, pp. 3114–3119. [Google Scholar]
  • [8].Zhan Z., Chen Y., and Fu Y., “Analyzing user relationships in Weibo networks: A Bayesian network approach,” in Proc. CloudCom-Asia, Fuzhou, China, 2014, pp. 502–508. [Google Scholar]
  • [9].Lü L., Medo M., Yeung C. H., Zhang Y.-C., Zhang Z.-K., and Zhou T., “Recommender systems,” Phys. Rep., vol. 519, no. 1, pp. 1–49, 2012. [Google Scholar]
  • [10].Fu F., Liu L., and Wang L., “Empirical analysis of online social networks in the age of Web 2.0,” Phys. A, Statist. Mech. Appl., vol. 387, nos. 2–3, pp. 675–684, Jan. 2008. [Google Scholar]
  • [11].Yu L. L., Asur S., and Huberman B. A., “Artificial inflation: The real story of trends and trend-setters in Sina Weibo,” in Proc. Int. Conf. Privacy, Secur., Risk Trust (PASSAT), Int. Conf. Social Comput. (SocialCom), Sep. 2012, pp. 514–519. [Google Scholar]
  • [12].Bastos M. T., Travitzki R., and Raimundo R. (2012). Tweeting political dissent: Retweets as pamphlets in FreeIran, FreeVenezuela, Jan25, SpanishRevolution and OccupyWallSt. Integrated Performance Primitives. [Online]. Available: http://ipp.oii.ox.ac.uk/sites/ipp/files/documents/BASTOS-TRAVITZKI-RAIMUNDO_-_Tweeting_Political_Dissent.pdf [Google Scholar]
  • [13].Overbey L. A., Paribello C., and Jackson T., “Identifying influential twitter users in the 2011 egyptian revolution,” in Proc. SBP, 2013, pp. 377–385. [Google Scholar]
  • [14].Lerman K., Ghosh R., and Surachawala T. (2013). “Social contagion: An empirical study of information spread on digg and twitter follower graphs.” [Online]. Available: http://arxiv.org/abs/1202.3162 [Google Scholar]
  • [15].Steeg G. V., Ghosh R., and Lerman K. (2011). “What stops social epidemics?” [Online]. Available: http://arxiv.org/abs/1102.1985 [Google Scholar]
  • [16].Guille A., Hacid H., and Favre C. (2013). “Predicting the temporal dynamics of information diffusion in social networks.” [Online]. Available: http://arxiv.org/abs/1302.5235?utm_source=twitterfeed&utm_medium=twitter [Google Scholar]
  • [17].Bagrow J. P., Wang D., and Barabási A.-László, “Collective response of human populations to large-scale emergencies,” PLoS ONE, vol. 6, no. 3, p. e17680, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Allen R. M., “Transforming earthquake detection?” Science, vol. 335, no. 6066, pp. 297–298, Jan. 2012. [DOI] [PubMed] [Google Scholar]
  • [19].Wang F.-Y.et al. , “A study of the human flesh search engine: Crowd-powered expansion of online knowledge,” Computer, vol. 43, no. 8, pp. 45–53, Aug. 2010.21379365 [Google Scholar]
  • [20].Zhang Q.et al. , “Understanding crowd-powered search groups: A social network perspective,” PLoS ONE, vol. 7, no. 6, pp. 1–16, Jun. 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Sakaki T., Okazaki M., and Matsuo Y., “Earthquake shakes Twitter users: Real-time event detection by social sensors,” in Proc. WWW, Raleigh, NC, USA, 2010, pp. 851–860. [Google Scholar]
  • [22].Robinson B., Power R., and Cameron M., “A sensitive twitter earthquake detector,” in Proc. Companion WWW, Rio de Janeiro, Brazil, 2013, pp. 999–1002. [Google Scholar]
  • [23].Crooks A., Croitoru A., Stefanidis A., and Radzikowski J., “Earthquake: Twitter as a distributed sensor system,” Trans. GIS, vol. 17, no. 1, pp. 124–147, Feb. 2013. [Google Scholar]
  • [24].Sakaki T., Okazaki M., and Matsuo Y., “Tweet analysis for real-time event detection and earthquake reporting system development,” IEEE Trans. Knowl. Data Eng., vol. 25, no. 4, pp. 919–931, Apr. 2013. [Google Scholar]
  • [25].Moriya M. and Ryoke M., “Information balance between transmitters and receivers based on the twitter after great East Japan earthquake,” Int. J. Knowl. Syst. Sci., vol. 4, no. 2, pp. 66–76, Apr. 2013. [Google Scholar]
  • [26].Jung J. Y. and Moro M., “Multi-level functionality of social media in the aftermath of the great East Japan earthquake,” Disasters, vol. 38, no. 2, pp. S123–S143, Jul. 2014. [DOI] [PubMed] [Google Scholar]
  • [27].Toriumi F., Sakaki T., Kazama K., Kurihara S., Noda I., and Shinoda K., “Information sharing on Twitter during the 2011 catastrophic earthquake,” in Proc. WWW Companion, Rio de Janeiro, Brazil, 2013, pp. 1025–1028. [Google Scholar]
  • [28].Huberman B., Romero D. M., and Wu F. (Dec. 2008). Social networks that matter: Twitter under the microscope. First Monday, [S.l.], accessed: Jan. 2, 2018. [Online]. Available: http://firstmonday.org/ojs/index.php/fm/article/view/2317/2063, doi: 10.5210/fm.v14i1.2317 [DOI] [Google Scholar]
  • [29].Tavakolifard M., Almeroth K. C., and Gulla J. A., “Does social contact matter?: Modelling the hidden Web of trust underlying Twitter,” in Proc. WWW Companion, Rio de Janeiro, Brazil, 2013, pp. 981–988. [Google Scholar]
  • [30].Lam S. K. and Riedl J., “Are our online ‘Friends,” Really Friends?” Comput., vol. 45, no. 1, pp. 91–93, Jan. 2012. [Google Scholar]
  • [31].Wu S., Hofman J. M., and Mason W. A., “Who says what to whom on twitter,” in Proc. Int. Conf. World Wide Web (WWW), Hyderabad, India, 2011, pp. 705–714. [Google Scholar]
  • [32].Gomez-Rodriguez M., Leskovec J., and Krause A., “Inferring networks of diffusion and influence,” ACM TKDD, vol. 5, no. 4, pp. 1–37, Feb. 2010. [Google Scholar]
  • [33].Zhang J.et al. , “Social influence locality for modeling retweeting behaviors,” in Proc. IJCAI, Beijing, China, 2013, pp. 2761–2767. [Google Scholar]
  • [34].Mucha P. J.et al. , “Community structure in time-dependent, multiscale, and multiplex networks,” Science, vol. 328, no. 5980, pp. 876–878, May 2010. [DOI] [PubMed] [Google Scholar]
  • [35].Magnani M., Micenkova B., and Rossi L. (2013). “Combinatorial analysis of multiple networks.” [Online]. Available: http://arxiv.org/abs/1303.4986 [Google Scholar]
  • [36].Shannon P.et al. , “Cytoscape: A software environment for integrated models of biomolecular interaction networks,” Genome Res., vol. 13, no. 11, pp. 2498–2504, 2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Grundmann H.et al. , “Geographic distribution of, staphylococcus aureus, causing invasive infections in Europe: A molecular-epidemiological analysis,” PLoS Med., vol. 7, no. 1, p. e1000215, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Takhteyev Y., Gruzd A., and Wellman B., “Geography of Twitter networks,” Social Netw., vol. 34, no. 1, pp. 73–81, Jan. 2012. [Google Scholar]
  • [39].Mislove A., Marcon M., Gummadi K. P., Druschel P., and Bhattacharjee B., “Measurement and analysis of online social networks,” in Proc. ACM SIGCOMM, San Diego, CA, USA, Oct. 2007, pp. 29–42. [Google Scholar]
  • [40].Vogiatzis D., “Influential users in social networks,” in Semantic Hyper/Multimedia Adaptation. Berlin, Germany: Springer, 2013, pp. 271–295. [Google Scholar]
  • [41].Dunbar R. I. M., “Neocortex size as a constraint on group size in primates,” J. Human Evol., vol. 22, no. 6, pp. 469–493, 1992. [Google Scholar]
  • [42].Dunbar R., “Social networks: Human social networks,” New Scientist, vol. 214, no. 2859, pp. 4–5, 2012. [Google Scholar]
  • [43].The Economist. (Jan. 2011). Too Many Friends?. [Online]. Available: http://www.economist.com/blogs/theworldin2011/2011/01/sudan_facebook_and_dunbars_number
  • [44].MIT Technology Review. (Jul. 2012). Three Questions for Robin Dunbar. [Online]. Available: http://www.technologyreview.com/news/428478/three-questions-for-robin-dunbar/
  • [45].Gonçalves B., Perra N., and Vespignani A., “Modeling users’ activity on twitter networks: Validation of Dunbar’s number,” PLoS ONE, vol. 6, no. 8, p. e22656, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Dezecache G. and Dunbar R. I. M., “Sharing the joke: The size of natural laughter groups,” Evol. Human Behav., vol. 33, no. 6, pp. 775–779, 2012. [Google Scholar]
  • [47].Wellman B., “Is Dunbar’s number up?” Brit. J. Psychol., vol. 103, no. 2, pp. 174–176, May 2012. [DOI] [PubMed] [Google Scholar]
  • [48].Kwak H., Lee C., Park H., and Moon S., “What is Twitter, a social network or a news media?” in Proc. WWW, Raleigh, NC, USA, 2010, pp. 591–600. [Google Scholar]
  • [49].Ronald F., Ravi K., and Sivakumar D., “Comparing top k lists,” SIAM J. Discrete Math., vol. 17, no. 1, pp. 134–160, 2006. [Google Scholar]
  • [50].Mccown F. and Nelson M. L., “Agreeing to disagree: Search engines and their public interfaces,” in Proc. JCDL, Vancouver, BC, Canada, 2007, pp. 309–318. [Google Scholar]
  • [51].Salganik M. J., Dodds P. S., and Watts D. J., “Experimental study of inequality and unpredictability in an artificial cultural market,” Science, vol. 311, no. 5762, pp. 854–856, 2006. [DOI] [PubMed] [Google Scholar]
  • [52].Mendoza M., Poblete B., and Castillo C., “Twitter under crisis: Can we trust what we RT?” in Proc. SOMA, Washington, DC, USA, 2010, pp. 71–79. [Google Scholar]
  • [53].Lerman K. and Ghosh R., “Information contagion: An empirical study of the spread of news on digg and Twitter social networks,” Comput. Sci., vol. 52, pp. 166–176, Mar. 2010. [Google Scholar]
  • [54].Cummings J. N., Butler B., and Kraut R., “The quality of online social relationships,” Commun. ACM, vol. 45, no. 7, pp. 103–108, Jul. 2002, doi: 10.1145/514236.514242. [DOI] [Google Scholar]
  • [55].Barabási A.-L., “The origin of bursts and heavy tails in human dynamics,” Nature, vol. 435, no. 7039, pp. 207–211, May 2005. [DOI] [PubMed] [Google Scholar]
  • [56].David E. and Jon K., Networks, Crowds, and Markets: Reasoning About a Highly Connected World. New York, NY, USA: Cambridge Univ. Press, 2010. [Google Scholar]
  • [57].Antal T., Krapivsky P. L., and Redner S., “Social balance on networks: The dynamics of friendship and hatred,” Phys. D Nonlinear Phenom., vol. 224, nos. 1–2, pp. 130–136, Mar. 2006. [Google Scholar]
  • [58].Kunegis J., Lommatzsch A., and Bauckhage C., “The slashdot zoo: Mining a social network with negative edges,” in Proc. Int. WWW Conf., Madrid, Spain, 2009, pp. 741–750. [Google Scholar]
  • [59].Leskovec J., Huttenlocher D., and Kleinberg J., “Signed networks in social media,” in Proc. CHI, ACM, Atlanta, GA, USA, 2010, pp. 1361–1370. [Google Scholar]
  • [60].Chew C. and Eysenbach G., “Pandemics in the age of Twitter: Content analysis of tweets during the 2009 H1N1 outbreak,” PLoS ONE, vol. 5, no. 11, p. e14118, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Ieee Transactions on Computational Social Systems are provided here courtesy of Institute of Electrical and Electronics Engineers

RESOURCES