Abstract
As scientists, we are proud of our role in developing the current digital age that enables billions of people to communicate rapidly with others via social media. However, when things go wrong, we are also responsible for taking an ethical stand and trying to solve problems, and this work aims to take a step in this direction. Our goal is to set the foundation for a mathematically formal study of how we might regulate social media and, in particular, address the problem of the echo chamber effect. An echo chamber is a closed system where other voices are excluded by omission, causing your beliefs to become amplified or reinforced. In turn, these bubbles can boost social polarization and extreme political views, and, unfortunately, there is strong evidence that echo chambers exist in social media. The fundamental question we try to answer is: how and can a regulation “break” or reduce the echo chamber effect in social media? Sadly, the paper’s main result is an impossibility result: a general regulation function that achieves this goal (on our social media model) while obeying the core values of democratic societies (freedom of expression and user privacy) does not exist. This result leaves us with hard future choices to make.
Subject terms: Computational science, Computer science, Information technology
Introduction
Social media (SM) websites and mobile applications have become a significant force driving opinion forming, news consumption, and information sharing among the citizens of our world1. As scientists, we are proud of this progress, but when things go wrong2, we are also responsible for taking an ethical stand and trying to fix problems3,4. This work aims to take a step in this direction. Our goal is to set the foundation for a rigorous study of how we might regulate social media and, in particular, address the problem of the echo chamber effect.
On a basic level, an echo chamber is “an environment in which somebody encounters only opinions and beliefs similar to their own, and does not have to consider alternatives5”. By communicating and repeating beliefs within a closed system or network insulated from rebuttal, echo chambers amplify or reinforce beliefs6,7. Unfortunately, in recent years, echo chambers have been observed on different social networks, for example, during the 2016 US presidential election8 and more recently on controversial topics (e.g., gun control, vaccination, abortion)9. In turn, echo chambers are believed to increase social polarization10,11, which is getting extreme by opinion amplification12. On social media, it was shown that echo chambers limit exposure to diverse perspectives and favor and reinforce presupposed narratives and ideologies9,13. However, it is important to note that this view is still under ongoing (academic) debate14–16 and we elaborate on this point in the Discussion section.
The crucial problem with echo chambers is that they deprive people (social media users) of a reality check, leaving them in a virtual reality. In particular, isolation or omission of opinions can hide the truth. However, what is the truth? We leave this out of context for this paper and consider “fact” (following Wittgenstein’s Tractatus17) or, in our case, people’s opinions to be the truth. Clearly, avoiding other voices will also prevent exposure to truth or reality. Furthermore, blocking information and voices in democratic states goes against fundamental human rights as expressed in The Universal Declaration of Human Rights18. In particular, Article 19 describes the freedom of expression, freedom of information, and the public’s (and individual) right to know others’ public opinions (among other things).
Motivated by recent debates about social media19, specifically, Moshe Vardi’s talk about computer science and ethics4, we take the moral viewpoint that echo chambers in social media (and other social interactions) need to be moderated. A critical first observation is that echo chambers have a spectrum that can be quantified. They do not just exist or not exist. We formalize this by considering both the echo (bias, reinforcement) and the chamber (size) as metrics of an opinion-spreading process. To specify and quatify an echo chamber effect, we must, in turn, define a geometry and a frame of reference, as we explain later.
We claim that to reduce the problem of echo chambers in social media, a state regulation is in place. Such regulation should not be left to private social media companies, which may have goals and priorities that differ from the well-being of society. Typically, these goals are driven by financial and other interests, and companies may, for example, as been claimed, benefit economically from echo chambers and extreme opinions. In this paper, we consider the social media’s spreading policy (e.g., what posts will appear on a user’s wall or feed) as an “opaque box” that the regulator does not and can not control. We model the regulation as an additional step in the spreading process.
Paper contribution We see this paper as a conceptual paper that encourages the community to address the echo chamber effect and other social media-related problems and propose a methodology and initial results.
We first present a general information-spreading model that captures the essence of a social media, friends-based, information-spreading process on a network of users. Our Simple Regulated Social-media Spreading process, denoted as , has several main components: (i) a social network of user and their opinions, (ii) a social media with its spreading policy, (iii) a regulation authority, and (iv) a user reaction function that determines if a user agree, disagree, or ignore a post. A spreading process starts with a user that shares a post and ends with a set of users that received the post and their reaction to it.
In turn, we propose a novel perspective to quantify the echo chamber of a spreading process. We represent an echo chamber with a pair where is the echo that captures the bias, and is the chamber that describes the size of the spreading. To compare different spreading processes, we define an echo chamber metric based on hyperbolic geometry and claim that it obeys two fundamental axioms. We then define the echo chamber effect using a frame of reference (i.e., a baseline) for each spreading process.
Next, we discuss and formally define the goals and principles of social media regulation. We set the goal of the regulation to “break” the echo-chamber effect and then consider the two most basic principles or values that we want our society and regulation to follow: freedom of expression and user privacy. Following this formulation, the main result of the paper is, then, an impossibility result. For the process, we prove that any regulation that reduces the echo chamber effect for any network and social media must violate either freedom of expression or user privacy for some users.
We complete the technical part by proposing a random linear regulation (RLR) function that preserves the freedom of expression and user privacy and is simple to implement. We conjecture that RLR reduces the echo chamber effect for real-world social media and initially explore it via simulations. We conclude the paper with a discussion that encourages the community to work on these important topics but also presents important objections, limitations, and alternatives to our model and results.
Background In their collaborative review16, Hadit and Chris studied different aspects of the political dysfunction in social media. In particular, they comprehensively discuss the echo chamber phenomenon and claims of its existence or non-existence20 in social networks. Cinelli et al.9 studied echo chambers on various social media (Gab, Facebook, Reddit, and Twitter) by measuring network homophily and biased information diffusion. They found dominant homophily interactions and social media segregation. However, the paper didn’t provide a formal definition of the echo-chamber effect, didn’t analyze its size dependence, nor offered a specific metric to measure it. Although Morales et al.21 found a platform (Reddit) in which the Echo-Chamber phenomenon is not emphasized in political interactions, they also confirm the existence of similar preferences in other fields.
Filter bubbles22 are linked to echo chambers and refer to intellectual isolation caused by algorithms predicting and showing users information they would likely agree with. Examples include Google Personalized Search and Facebook’s personalized feed. Though not in the same formal approach as our paper, efforts to avoid these bubbles using personalized software were addressed in previous works23,24. Nguyen25 also provided insights on differentiating echo chambers from bubbles. However, the existence of bubbles is under academic discussion, and there are works26,27 indicating that the phenomenon is minor or insignificant in terms of news consumption28,29, while most people regularly consume content from diverse media30,31.
A large-scale recent study of Facebook users in the USA activity from 202032–35 shows that content from ‘like-minded’ sources constitutes most of what people see on the platform. Moreover, reduced exposure to content from like-minded sources increased their exposure to content from cross-cutting sources but did not correspondingly reduce polarization in beliefs or attitudes. Manipulating the users’ feed by omitting reshared posts33 or presenting the content in reverse chronological order34 did not significantly affect political polarization or attitudes.
While there exists an informal regulation of online content through content moderation policies, community guidelines, and user reporting mechanisms36, there is a growing consensus on the need for formal regulation of social media platforms37. Still, partisan differences exist regarding the specific issues that regulation should address37. The potential benefits of regulation include promoting innovation, increasing competition, and encouraging social media companies to take responsibility for removing harmful content like hate speech and disinformation38. However, drawbacks include regulatory burdens that may hinder innovation, challenges in content moderation, and concerns about infringing on free speech values39.
In this paper, we present a general information-spreading model that captures the essence of the social media information-spreading process on a network of users. Our model takes inspiration from the fundamental paper of Kempe, Kleinberg & Tardos40, which presents and analyses three different, single opinion, spreading models (linear threshold, independent cascade, and weighted cascade) where their goal was to find the subset that maximizes the spread of influence. Subsequent works41–43 studied and expanded the spreading model while others44,45 examined strategies to contain the misinformation spread by identifying the set of highly influential nodes. In contrast, our model leaves the spreading process of social media as an opaque box and adds (an optional) layer of regulation separately, as described hereafter.
Social media regulation: model and definitions
In this section, we formally describe the social network model and the spreading process of a message (e.g., a post or a tweet) generated by the social media on it. We also introduce into this process the possibility of a regulator that can influence the spreading process of social media. We start by defining the network model.
Definition 1
(Social Network) A social network is a pair of:
A graph such that V is a set of n nodes (users of the network) and is a set of undirected edges , represents the connections between the users by which they can share information. For a given network N, we denote by G(N) and V(N) the graph and the nodes of N, respectively.
A coloring function over the nodes . The different colors represent the variety of opinions of network users. We denote by the fraction of red users in the network, i.e., , where is the set of red users. For simplicity, we consider two colors, but the model can support more colors.
We consider the spread of an idea or opinion, represented by a message , in a network N and users’ reactions to it. In our model, messages express opinions and are colored with one of the users’ colors. W.l.o.g, we assume that the initial message is written by a user v; therefore, the message color is also red. Initially, all the users are inactive. Formally, each user u has a reaction , which is set initially to .
The spreading process evolves in discrete rounds starting at time . Each user has an inbox (i.e., Feed) in which the message can appear while the user is inactive (and overall only once). In turn, each individual u who receives a message at time in his inbox becomes active at time t and must update his reaction to the message to either , , or . For a time t let and denote the set of users which reacted with , , or until time t, respectively. Let the active set of users at time t be . The set of inactive users at time t is denoted . The spreading process is non-reversible, namely when a node switches from being inactive to being active, it cannot change its status again during the process; formally, if then , if then and if then . We can now formally define a spreading sequence which describe the temporal evolution of the active set ,
Definition 2
(Spreading sequence) Given a network N and an initial active set such that and , a spreading sequence is a sequence of active sets over time for N (describe the users reactions over time). Let denote the spreading sequence up to time t.
To generate a social media spreading process, we must define the evolution of the active set from to . To do so, we first define what we call spreading functions, functions that, given the spreading process history, decide to which users the message will be spread next. We consider two types of spreading functions. The first function is the social media spreading function, that is the algorithm by which the Social media service (i.e., Meta, Twitter (now X), etc.) spread a message that a user wants to “share” or “tweet”. We assume that when an inactive user becomes active and agree an incoming message, the social media service activates its spreading function and recommends candidates users to receive the message in the next round (in their inbox). We note that in our model, reacting with agree means also to share the message, whereas, in some Social media services, a separation exists between agree or ‘like’ and ‘share’ reactions.
The second type of spreading function is the regulation spreading function, . The regulation function can overwrite the recommendation of the social media spreading function, and it is the final authority that decides in which inboxes of users the message landed at the next time step. We will discuss the principles of regulation functions later. The last component in the spreading process is a user reaction function, , which decides the reaction of users to incoming messages (i.e., agree, disagree or ignore); we will discuss user reaction in more detail shortly. We can now formally describe the evolution of a generic social media spreading process.
Definition 3
(Social media spreading process) A social media spreading process evolution is a 5-tuple where a single message starts at v (with v’s color) in a network N. The process evolves (and generates a spreading sequence ) using the social media function , the regulation function and the user reaction function , in the following order:
Message sharing: A message m is shared by a user at the end of time t. A message is shared after a user becomes active and reacts with agree to the message. At the time , a single user, v, shares its own message.
Social media candidates: Based on the network N and the current process phase , the Social Media selects the candidtaes set, using the social media function . is the set candidates: inactive users for which the message m will potentially (but not necessary) appear on their inbox at time . Formally, .
Regulation: Based on the network N, and the candidtaes set, , the regulator determines the inbox set of users, , using the regulation function . is the set of users for which the message will appear in their inbox at time . Formally, . For the special case of passive regulation (i.e., ) we denote by the identity function, .
User reaction: To finalize the time step, each inactive user u that is exposed to the message m in his inbox (feed) at the beginning of the time step , reacts to the message with or , using the user reaction function and update his status correspondingly. Formally, we activate for all users in the inbox set, , , and , such that , , .
Repeat or stop: For every user that updated its reaction to agree, the message is shared at the end of the time step (and we go back to 1). If no new user reacted with agree (i.e., ) the spreading process stops.
Figure 1 visually illustrates steps 1–4 in a social media spreading process evolution as defined in Definition 3.
Note that in the spreading process of Definition 3, the main three functions: the social media function , the regulation function , and the user reaction function are not defined explicitly, leaving much room for future proposals and research. To realize a concrete spreading process, we first discuss specific models for the social media service spreading policy and for the user reaction behavior. While both processes are complex and not transparent, we next present basic fundamental models for each function.
Modeling social media spreading function,
Each social media service (e.g., Meta, Twitter) has its internal complex algorithms for deciding how to distribute a message (e.g., post, tweet) that its users share. In general, such a spreading function could depend on the message’s content, each user’s history, and many other factors. Since we study the echo chamber effect (formally defined later) here, we must make some simplifying assumptions. In our model, the spreading is based only on the network’s topology and the users’ coloring. We assume that once shared by user v, a message can spread only to neighbors of v. Moreover, each node can see a message at most once, namely shared messages at time t can appear only in the inbox of inactive users at time (concerning the specific message). We also assume that messages are spread to each neighbor of v independently of the other neighbors of v. We propose the following homophily-based social media spreading function, .
Given a network N and the current spreading process at time t, , such that , let the set of user who changed their status to agree at time t denoted by . Note that is the set of users that do ‘share’ at time t. Let denote two probabilities. The probability that an inactive node which is a neighbor of a node will be chosen in the spreading function to receive the message, is defined independently by the function as follows:
1 |
Formally the social media candidates set is selected such for each , u is added to , independently, with probability . In our analysis and simulations, we consider three different settings for , defined as follows:
All neighbors : The message spreads to all inactive neighbors.
Strong homophily : The message spreads to all inactive neighbors with the same color.
p-homophily : The message spreading is biased toward inactive neighboring users with the same color but can also reach different color neighbors.
By adjusting p and q the social media service controls the spreading process. For example, intuitively, Strong homophily will lead to an echo chamber effect since the message will not reach blue users at all. It is suspected that the policies of social media services are homophilic to some extent, where users see more posts similar to their own, even when they have neighbors with other opinions.
Modeling user reaction function,
When a user receives a new message in his inbox (i.e., feed), how will she react to it? What will cause her to agree or disagree the message? Obviously, this is a non-transparent, complex process that is hard to model exactly. Therefore, we assume for simplicity that the reaction is only based on the number of users who agree or disagree to the message until time t, and the color of the user himself when we model the user reaction function .
More concretely, in this work, we assume that the probability that a user chooses to agree or disagree the message is proportional to the ratio of agree to the total number of reactions: . In particular, users with different colors react symmetrically but with opposite reactions. Recall that the message color is red. Formally, for each user (the inbox set at time ), we use the following reaction function:
2 |
We say that a user who reacts with agree or disagree actively reacted to the message, so a user either actively reacted to a message or ignored it.
The user reaction we consider aims to model the behavior that a red user will more likely support (agree and share) a post with a higher percentage of red votes. When the fraction of blue votes increases, he may be more reluctant and deiced to ignore the message (not share it). On the contrary, a blue user that sees a red post with many red votes will more likely oppose the message by a disagree. Recall that the blue users do not share a red message. Many interesting user reaction functions are left for future study, e.g., functions that depend on the number of votes or threshold functions.
Following the previous definitions, in the rest of the paper, we consider a specific model for social media spreading and regulation denoted .
Definition 4
[Simple Regulated Social-media Spreading process ()] A simple regulated social-media spreading process () is a social media spreading process as defined in Definition 3, with a p-homophily spreading function defined by Eq. (1) and a user reaction function as defined by Eq. (2).
Before continuing with our results, we present several examples of the spreading process.
Examples of social media spreading process (with passive regulation)
For illustration, we next provide a few examples of the social media spreading process, for now, with passive regulation. Figure 2 presents an example network, N, with 160 users and 577 edges, for which 50% of the users have red opinion, and 50% have blue opinion (and we use the legend of Figure 1). We will use N to demonstrate different social media spreading processes along the paper. We note that N is a sampled sub-network from a known political blogosphere network46 where each node is associated with a political opinion. As in the original networks, N is polarized where most of the edges are within each political group.
In turn, Figure 3, shows the results of three different social media spreading processes on N that start from v (denoted in dark red on the figure). The spreading uses passive regulation, and the user reaction function as defined in Eq. (2). The setup difference between the processes is that each uses a different social media spreading function . We consider three options: (a) ‘All neighbors’, and two versions of the process: (b) ‘Strong Homophily’, and (c) 0.7-homophily. Each figure presents the states of all users in N at the end of the spreading. Beneath it, we show a bar chart with the number and fraction of users that actively reacted to the original red message, with agree or disagree.
The first case, with ‘All neighbors’, is shown in Figure 3 (a). In this scenario, all the neighbors of a node who agree to the message (and therefore share it) receive it. As a result, we can observe a diverse spreading. The original message was spread to 96 users, from which 70 actively reacted to the message, 57 red users agree with it and 13 blue users disagree. Additionally, 12 red users and one blue user ignored it. In contrast, the second case, ‘Strong Homophily’, is shown in Figure 3b. Here all the 76 red users belonging to the connected component of v received the message. All those users reacted with agree because no blue node received the message. Therefore, only red users actively reacted to the message creating an unbalanced spreading. No active user ignored it. The third case, where the social media spreading function is 0.7-homophily, is shown in Figure 3c. This case yields mixed results from the previous two cases. A total of 78 users actively reacted to the message, from which 69 red user agree with it, and 9 blue users disagree. In addition, 3 red users and 2 blue users ignored the message.
Overall we observe that different social media policies can lead to different sets of nodes that actively react to the message. Informally, such policies can clearly lead to echo chambers where users are only exposed to similar opinions. In the next section, we discuss how to define and quantity the echo chamber effect for social media spreading process.
The echo chamber effect: a definition
In this section, we propose a methodology to capture the echo chamber effect in social media. Echo chambers, informally, limit the exposure of a message to diverse perspectives and favor or reinforce similar narratives9,32. It is believed that Echo chambers in social media may increase social and political polarization and extremism, although there are disagreements about this16.
Our main observation is that to study the echo chamber effect, it must first be formally quantified in social media networks. To the best of our knowledge, there is no such definition. We propose two qualitative measures to do so.
(i) Metrics. An echo chamber has two parts, the echo of a message spreading, namely the amount by which the spreading is reinforced by similar narratives, and the chamber of the spreading, namely the size of the spreading, measured as the number of users who actively reacted to the message.
(ii) A frame of reference by which the echo chamber of message spreading can be quantified, and a geometry (i.e., distance) to do so. We address these measures in turn.
Definition 5
(Echo-chamber of a spreading sequence ) Let be a spreading sequence of a message starting at a user v in a network N. Let and be the number of users that reacted with agree and disagree, respectively, to the message by the end of the spreading sequence , at time T. The echo-chamber of , denoted as , is the vector such that and .
We denote as the “echo” of the message spreading, i.e., the fraction of user who actively reacted to the message with agree, and as the “chamber”, i.e., the size of the message spreading. Next, we consider the frame of reference, which will enable us to compare the echo chambers of two spreadings. First, we define the “distance” between two echo chambers of message-spreading sequences. We would like our distance metric to have the following two axioms, one about echo change and the second about chamber change. We will explain them next.
Axiom 1
(Echo change) The distance between and is constant.
The first axiom states that convincing one user (i.e., ) to change its reaction (from disagree to agree or vice versa) requires a constant effort for every and . Hence, the above distance is constant (for every and d).
Axiom 2
(Chamber change) The distance between and is constant.
The second axiom states that keeping the same “echo” () but doubling the population requires a constant effort from each user; namely, every user adds a single new user with an opinion (reaction) identical to his. Therefore, the distance should be constant (for every and d).
To enable these axioms, we propose to use hyperbolic geometry. Recall that the hyperbolic distance ()47 between two vectors and , is defined as In turn, we use the to define the echo-chamber distance between different spreading.
Definition 6
(Echo-chamber distance) Consider two message spreading processes and , and their echo chambers and . The echo-chamber distance between them is the following hyperbolic distance
We can prove the following about the echo-chamber distance (see supplementary material).
Theorem 1
The echo-chamber distance, as defined in Definition 6, satisfy Axioms 1 and 2.
Until now, we considered a single spreading (i.e., a single starting user and a single message). When we want to quantify the echo-chamber of a network N for specific spreading functions and user reaction models, we take the expected spreading of a message. Formally,
Definition 7
(Echo-chamber of a node and a social media spreading process) For a well-defined social media spreading process S (Definition 3), the expected echo-chamber of a node v in S is defined as the expected echo-chamber of a spreading sequence starting at v. The average echo-chamber of S is defined as the expected spread of a red message. Formally,
3 |
where and are the expected echo and chamber of the spreading process starting at v, respectively, and and are the expected echo and chamber of the spreading process S, respectively.
We finalized our approach by providing for each node, network, and social media spreading process their points of reference. In an unbiased world, we would expect that the message reaction (echo) will be similar to the opinions of the general public , and we would also like to keep the expected number of active users for a spreading (chamber) the same as it would have been with a passive regulation (i.e., no regulation). For a social media spreading process we denote by a similar process but with passive regulation, i.e., . We can now formally define the reference points of a node and the spreading process.
Definition 8
(The reference points for a node and a social media spreading process) For a well defined social media spreading process S (Definition 3), the reference point for a node v in S and the reference point of S are defined as,
4 |
where is the echo of the network N, namely, the ratio of red users to red or blue users in the coloring of N, and and are the expected size of active users in the spreading process with passive regulation in and , respectively.
We define the echo chamber effect of a node v and a social media process s, denoted as , as the echo-chamber distance between their average echo-chamber and the corresponding reference point. Formally:
Definition 9
(The Echo-chamber effect of a node and a social media spreading process) The echo-chamber effect of a node v in a well defined social media spreading process S (Definition 3) is the echo-chamber distance between and its point of reference . The echo-chamber effect of S is the echo-chamber distance between and its point of reference . Formally,
5 |
Figure 4 presents the echo-chamber effect of the three social media scenarios we considered in Fig. 3. For each of the three processes, ‘All neighbors’, ‘Strong Homophily’, and ‘0.7-homophily’, we present their echo-chamber (), their reference point () and their echo-chamber effect () for the social media.
First, note the (expected) chamber of each process. For ‘All neighbors’, it is 81.90, while for ‘Strong Homophily’ and ‘0.7-homophily’, it is 74.21 and 67.79 nodes, respectively. The echo of the ‘Strong Homophily’ is the strongest. It is 100% since the message reaches only red nodes. Next is ‘0.7-homophily’ with an echo of 89%, and ‘All neighbors’ have the slightest echo of 82%. The reference point for each process (as defined in Eq. (4)) has an echo of 50% (since 50% of the nodes are red and 50% are blue) and the same echo as of the original process (since in this case the process has passive regulation and ). We can now discuss the echo chamber effect of each process. Recall that the echo chamber effect is defined as the distance between the process’s echo chamber and its reference point. We can observe that the ‘Strong homophily’ has the most significant echo chamber effect, then the 0.7-homophily, and then the ‘All neighbors’ scenario with the smallest echo chamber effect. Recall that all of these scenarios did not include a regulation. In the next section, we discuss desired principles for regulation functions, where our goal is to reduce the echo-chamber effect of a given network.
It is important to note that the geometry selection and the reference point are key components in our methodology. The concrete selection of both determines the echo chamber effect for a given social media spreading process. Therefore it raises the question, what are the correct choices for the reference point and the geometry? The goal of the reference point is to capture a baseline, namely the echo chamber in social media in a “perfect” world. Here, we consider a reference point defined by a hypothetical spreading in a world with passive regulation (i.e., without regulation) but with the same topology and coloring. Future studies can consider cases where the reference point also depends on a modified network topology or an ideal coloring of the users (e.g., random vs. clustered). The goal of the geometry is to quantify the effect. We leave the study of other alternatives of reference points for future work.
Goals and principles of social media regulation
In the following section, we discuss and formally define which basic properties we would like our regulation functions to have. We start with the goal of the regulation: to “break” the echo-chamber effect. Armed with our formal definitions of a social media spreading process, the echo-chamber effect, and regulation, we can now define what a regulation with a mitigate outcome is. Informally, this regulation function always (if possible) reduces the echo-chamber effect and never increases it. Formally,
Definition 10
(Mitigate outcome) Consider a well-defined social media spreading process and its version with passive regulation . The regulation has a mitigate outcome in S if for every node v such that its echo-chamber effect with passive regulation, is positive, then its echo-chamber effect with the regulation is strictly smaller. Formally,
6 |
What can regulation do to eliminate the echo-chamber effect? For example, a trivial regulation could block all posts, empty all feeds, and by that, completely eliminate any echo-chamber effect. This is, of course, an extreme action that gives the regulation an extreme power. Although, by definition, the regulator can intervene in the social media’s spreading process, we may want this intervention to be minimal. Similarly, we need to consider additional ethical issues like privacy (does a regulator need to protect privacy? to what level?) and censorship (to what extent do we allow censorship?)
In this work, we consider the two most basic principles or values that we want our society, and in particular, its regulation system, to follow. These principles, in turn, limit the regulation system’s power. The first principle we consider is freedom of expression: An individual’s right to voice their opinions without fear of censorship or legal sanction. We assume that the regulation is not allowed to censor any message a user sends; it can only add additional recipients to sent messages. Formally,
Definition 11
(Freedom of expression) A Regulation has the freedom of expression principle when the following holds: the regulation cannot block any recipient to whom the social media indented to deliver the message. Formally, for each time t and each user v, if the social media decided to add the message to the candidtaes set, i.e., , then the regulation function must include it in the inbox set of time t, :
The second principle we consider is user privacy: the regulation does not have access to the user’s opinions (colors), i.e., it cannot use a user’s opinion in making decisions. Formally,
Definition 12
(User privacy) Let N and denote two networks having the same topology (i.e., graph) but different user coloring. A regulation has the user privacy property when the following holds: For each time t, if the candidtaes set, , is equal in both networks, then for each node u the probability that the regulation adds u to the inbox set, , is the same, i.e., independent of the users’ coloring of each network. Formally: if then, ,
Note that if two networks have the same graph, i.e., , then it implies that they have the same set of users, , but not necessarily that their opinions are the same, i.e., . This means that the regulation can be topology-dependent, but not coloring-dependent (a stricter version can also assume that the regulation cannot know even the network’s topology). In the next section, we go back to the echo-chamber effect and study the ability of a regulation that has the freedom of expression and user privacy properties to reduce it.
Breaking the echo-chamber effect: an impossibility result for
In this section, we finally reach the core question of this study: can we use regulation to “break”, or more correctly, to reduce the echo-chamber effect of social media? The paper’s main result is the following impossibility result: regulation cannot have both the freedom of expression and user privacy properties, and have a mitigate outcome for the model (Definition 4). Formally,
Theorem 2
(Impossibility) It is impossible for regulation to have both the freedom of expression and user privacy properties while having a mitigate outcome for all simple regulated social-media spreading process, (Definition 4).
Proof
The proof is by example. Consider a network N of size n. The network has n/k connected components denoted as islands. Each island is of size users, it is highly connected, and all users are of the same color. Half of the islands are red, and half are blue; therefore, the network has red users and blue users.
The social media processes S employs the . Let v be a red node in a small component that starts a spreading process. The expected echo chamber of v with passive regulation, , will be since the message will not leave the connected component and, with high probability, will reach all the nodes in its connected components (formally, this depends on the value of p in the spreading function, the “connectivity level” inside the connected component and n. We leave the technical details for the full report). By Eq.(4), the reference point for a node v in is therefore . So the is in turn,
7 |
Recall that we assume that the regulation holds both the freedom of expression and user privacy properties. Now assume by contradiction that the regulation also has mitigate outcome. This means that has to spread the message to some blue users outside the connected component of v, otherwise the will not decrease. However, since preserves user privacy, namely, it is “color-blinded” if messages arrive at islands of blue nodes, some messages must also arrive at islands of red nodes. In turn, since preserves freedom of expression and has no censorship, the message will spread to all nodes in each red component it arrives at. But, then, by symmetry, messages will spread again (using the regulation) to new blue and red islands. The end result of this birth process will be with a high probability that a linear fraction of red nodes will receive the message and some blue blue nodes will receive it as well (at most a linear fraction). Therefore the expected echo chamber of v, will be, in the best case, (1/2, cn), for a constant c (independent of n). The reference point is, in fact equal (by definition) to , and therefore equal . The is, in turn, then
8 |
For each constant c, since is a monotone unbounded function, there exists then a large enough for which for any we have contradiction to the mitigate outcome of .
Discussion of main result. Theorem 2 raises some major ethical questions. In particular, what should be the objectives of the regulation function, and at what cost? What are the main tradeoffs (within our context), for example, between freedom of information (to reach outside your bubble) and privacy? Or is censorship unavoidable to reach some goals? Even though some of these questions are philosophical, our community is responsible for modeling and studying them to understand better what we can and cannot do. Avoiding these issues is also taking a stand and, therefore, not an option.
The network topology of the counter-example in the proof of Theorem 2 is far from real-life networks. Therefore, it may be the case that the impossibility result does not hold for networks with some connectivity properties. In the next section, we provide evidence for that and we propose a simple regulation function that may work well in practice.
Case study: random linear regulation
In this section, we propose Random Linear Regulation (RLR), a frame for regulation based on randomness, which complies with the regulation principles mentioned above and manages to reduce the echo chamber for real-world media networks.
Definition 13
(Random Linear Regulation) Given the network N and the candidtaes set, , the Random Linear Regulation function with parameter is defined as follows: , where is a set of users chosen uniformly at random s.t . Note that when , the regulation is passive, i.e., the set is empty.
The first observation about RLR is that it satisfies the two of the regulations’ principles and a third important property, local proportionality, which we define next.
Observation 1
The RLR function satisfies the following principles for each social media it regulates: freedom of expression, privacy-preserving, and local proportionality.
Where the local proportionality property is defined as follows.
Definition 14
(Local Proportionality:) has the local proportionality property if for each social media, , at every time t, the number of the additional users that were chosen by the regulation to get the message, (i.e., ), is no more than the number of users that were chosen by the social media to to the candidate set. Formally: .
The second observation is that RLR is simple to implement, assuming the social media implements it following the regulator’s demand. All it needs is the ability to receive a random user in the system, which is a reasonable requirement. It should also be possible for the regulator to check that RLR was implemented or for social media to provide proof of such implementation.
We know from Theorem 2 that theoretically, the RLR cannot preserve freedom of expression and user privacy while having a mitigate outcome. However, in real-world social media (which is extremely different from the isolated example of Theorem 2 proof), we can observe a monotonic reduction in the echo chamber effect as a function of , as we following described. Figure 5 demonstrates the use of RLR in the example network we presented in Figure 3b with values of , 0.25 and 0.5. The networks in the figure depict the end of the spreading process for particular run examples, while the bars and numbers in the figure presents the average values for 1000 runs of spreading simulations. We can observe in the bars of that despite the strong-homophily behavior of social media, thanks to the RLR spreading, the message has now reached blue users. Furthermore, the network figures demonstrate that, with RLR, there exist red users who have chosen to ignore the message. The chamber of the process with is 74.75, while for it is 75.82 nodes, both are slightly larger than the unregulated value (74.23), albeit not by a significant margin. The echo of RLR with is , which is smaller than the echo of RLR with () since the message reaches more blue nodes. The reference point for the processes (as defined in Eq. (4)) has an echo and chamber as of the original process (see Figure 3(b)). Finally, the decrease form 7.23 (for the case without regulation) to 6.78 (for ) and 6.44 (for ) (with regulation). Examining the RLR on larger real networks was also done, as described below.
Case Study - Experimental Results on a Real Network
Simulations were conducted to study the impact of RLR regulation on a social media platform. The simulations were performed on the graph, a network with 1222 users and 16717 edges. This graph represents the largest connected component of a weblog network focused on US politics in 200546. The users are categorized based on their political opinion, with 636 (52%) being conservative (red users) and 586 (48%) being liberal (blue users). The network consists of 7841 red edges, 7301 blue edges, and 1575 green edges, representing links within the respective groups and cross-links (See Figure 1 in supplementary material).
Starting with a randomly chosen red user, we update his status to and initiate the spreading process (see Definition 3) using the following parameters: (i) Spreading function, with sets to ‘Strong homophily’ or ‘0.7-homophily’. (ii) RLR regulation with sets to: 0, 0.25, 0.5, 0.75 and 1. After the process is stopped, we count the different reactions to the message for users who were exposed to it. Simulating 1000 times (for each parameter set), we compute the average number of users that reacted with agree, disagree, or ignore as presented in Figure 6.
Figure 6a shows the ‘Strong homophily’ spreading function. When (i.e., passive regulation), a red user spreads the message only to other red users. Hence, we can see that only red users got the message, which led to only agree response in the population. As expected, increasing the regulation intervention by , allows blue users to receive the message and to disagree with it. This, in turn, caused an increase in the amount of the red ignore (light red), meaning that part of the red users ignored the message and did not share it. Figure 6b for 0.7-homophily presents similar results, but in which unanimity of opinion in the red group does not exist in advance (due to the relaxation of the homophily spreading condition). However, in this spreading process, the regulation also has a mitigate outcome on the exposure of information to other parts of the network. Figure 6c demonstrates the changes in for both processes. The regulation had mitigate outcome, which is clearly expressed in the graph behavior: the is monotonically decreasing with the level of regulation (set by ) during the spreading process.
Discussion
This article proposes a methodology to study the echo chamber effect within social media and, particularly, how to mitigate it via regulation. We present a formalization of the social media’s spreading process to quantify the effect and, in turn, its mitigation potency. The complexities of modeling the spreading mechanisms within the network present multifaceted challenges, primarily due to the myriad of diverse modalities of social media actions and user responses. We examined a simple spreading model presenting a message only to neighboring users, allowing agree/disagree/ignore responses, but rich enough to capture the essence of the phenomenon. In future work, we would like to consider dynamic networks and the case where users’ feeds do not rely on their friends but are based on a recommendation system (e.g., TikTok’s “For You” and Instagram’s “Discover”). In our view, social media regulation is an important research area, currently, in its initial stages. Considerable endeavors are imperative and are required to make progress in solving the variety of problems these platforms present to our society.
Disclaimer: objections, limitations, and alternatives to our model and results. There are works16 claiming that the very existence of bubbles and the echo chamber is in doubt26, where the personalized social media algorithms have a weak impact on content diversity27. When tracking individual behaviors, the actual data show that most people take their information primarily from mainstream media and regularly consume content from diverse media sources28,29. At the same time, fake news website consumption is concentrated among a small group only30. Nevertheless, to our understanding, even if the news consumption is diverse in each bubble31, the opinion formation about them (due to comments, likes, emotions, etc.) is mainly influenced by the members of your own bubble, leading to an echo chamber of opinions as we modeled in our work.
We note that our model assumes a somewhat outdated version of social media since it captures only information received via friends, while modern social media is based on recommendation systems and search engines to receive information. As we demonstrated, echo chambers (e.g., of opinions) can be formed even in our simpler model. It will be interesting to study the problem on more complex models that extend ours, but the impossibility results already hold in a simpler one. Assuming more power to the social media spreading will only make the challenge harder.
Another important concept of our model, which can be criticized, is that the model assumes that social media spreading is an opaque box that cannot be directly regulated. In our model, the regulator can only modify the spreading of the social media platform. Moreover, we require that the regulator action preserve privacy and freedom of speech regardless of the social media actions. An alternative approach to reduce the echo chamber by the regulator could be to demand transparency or direct oversight of the spreading algorithms of the platform. For example, to require them to eliminate echo chambers by using private information they have. While this may be a legitimate approach, it is more complicated to implement and supervise and does not contradict the fundamental tradeoffs that our work presents.
Supplementary Information
Author contributions
All authors contributed to conceiving and designing the study, developed the theoretical framework and mathematical model presented in this paper. C.A. and H.D. collaborated on refining the model, discussing the implications of the findings, and revising the manuscript. H.D. conducted the simulations. All authors read and approved the final manuscript for submission.
Data availability
The dataset analyzed during the current study is based on the blog network form46 and imported using the Mathematica software. The code simulations and further details are available from the corresponding author upon reasonable request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Chen Avin, Email: avin@bgu.ac.il.
Hadassa Daltrophe, Email: hadasda1@sce.ac.il.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-023-50850-6.
References
- 1.Walker M, Matsa KE. News Consumption Across Social Media in 2021. Pew Research Center; 2021. [Google Scholar]
- 2.Allcott H, Braghieri L, Eichmeyer S, Gentzkow M. The welfare effects of social media. Am. Econ. Rev. 2020;110:629–676. doi: 10.1257/aer.20190658. [DOI] [Google Scholar]
- 3.Vardi MY. Acm, ethics, and corporate behavior. Commun. ACM. 2022;65:5–5. [Google Scholar]
- 4.Vardi, M. Y. How to be an Ethical Computer Scientist. Moshe Vardi’s VardiFest Lecture (2022).
- 5.Oxford Advanced Learner’s Dictionary. echo chamber (noun) (2022).
- 6.Del Vicario M, et al. The spreading of misinformation online. Proc. Natl. Acad. Sci. 2016;113:554–559. doi: 10.1073/pnas.1517441113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Baumann F, Lorenz-Spreen P, Sokolov IM, Starnini M. Modeling echo chambers and polarization dynamics in social networks. Phys. Rev. Lett. 2020;124:048301. doi: 10.1103/PhysRevLett.124.048301. [DOI] [PubMed] [Google Scholar]
- 8.Hayat T, Samuel-Azran T. “you too, second screeners?” second screeners’ echo chambers during the 2016 us elections primaries. J. Broadcast. Electr. Media. 2017;61:291–308. doi: 10.1080/08838151.2017.1309417. [DOI] [Google Scholar]
- 9.Cinelli M, Morales GDF, Galeazzi A, Quattrociocchi W, Starnini M. The echo chamber effect on social media. Proc. Natl. Acad. Sci. 2021;118:56655. doi: 10.1073/pnas.2023301118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Barberá P, Jost JT, Nagler J, Tucker JA, Bonneau R. Tweeting from left to right: Is online political communication more than an echo chamber? Psychol. Sci. 2015;26:1531–1542. doi: 10.1177/0956797615594620. [DOI] [PubMed] [Google Scholar]
- 11.Settle JE. Frenemies: How Social Media Polarizes America. Cambridge: Cambridge University Press; 2018. [Google Scholar]
- 12.Lim SL, Bentley PJ. Opinion amplification leads to extreme polarization in social networks. Sci. Rep. 2022;12:18131. doi: 10.1038/s41598-022-22856-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sasahara K, et al. Social influence and unfollowing accelerate the emergence of echo chambers. J. Comput. Soc. Sci. 2021;4:381–402. doi: 10.1007/s42001-020-00084-7. [DOI] [Google Scholar]
- 14.Mutz DC. Hearing the Other Side: Deliberative Versus Participatory Democracy. Cambridge University Press; 2006. [Google Scholar]
- 15.Bail CA, et al. Exposure to opposing views on social media can increase political polarization. Proc. Natl. Acad. Sci. 2018;115:9216–9221. doi: 10.1073/pnas.1804840115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Haidt J, Bail C. Social Media and Political Dysfunction: A Collaborative Review. New York: New York University; 2022. [Google Scholar]
- 17.Wittgenstein L. Tractatus Logico-philosophicus. London: Routledge; 1922. [Google Scholar]
- 18.Universal declaration of human rights. UN General Assembly302, 14–25 (1948).
- 19.Disinformation nation. Social Media’s Role in Promoting Extremism and Misinformation. House Committee Hearing. 117th Congress, (2022).
- 20.Bruns A. Are Filter Bubbles Real? John Wiley & Sons; 2019. [Google Scholar]
- 21.De Francisci Morales G, Monti C, Starnini M. No echo in the chambers of political interactions on reddit. Sci. Rep. 2021;11:1–12. doi: 10.1038/s41598-021-81531-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Pariser E. The Filter Bubble: What the Internet is Hiding From You. Penguin; 2011. [Google Scholar]
- 23.Bozdag E, Van Den Hoven J. Breaking the filter bubble: Democracy and design. Ethics Inf. Technol. 2015;17:249–265. doi: 10.1007/s10676-015-9380-y. [DOI] [Google Scholar]
- 24.Gillani, N., Yuan, A., Saveski, M., Vosoughi, S. & Roy, D. Me, my echo chamber, and i: introspection on social media polarization. In Proc. 2018 World Wide Web Conf., 823–831 (2018).
- 25.Nguyen CT. Echo chambers and epistemic bubbles. Episteme. 2020;17:141–161. doi: 10.1017/epi.2018.32. [DOI] [Google Scholar]
- 26.Haim M, Graefe A, Brosius H-B. Burst of the filter bubble? effects of personalization on the diversity of google news. Dig. Journalism. 2018;6:330–343. doi: 10.1080/21670811.2017.1338145. [DOI] [Google Scholar]
- 27.Puschmann C. Beyond the bubble: Assessing the diversity of political search results. Dig. Journalism. 2019;7:824–843. doi: 10.1080/21670811.2018.1539626. [DOI] [Google Scholar]
- 28.Nelson JL, Webster JG. The myth of partisan selective exposure: A portrait of the online political news audience. Soc. Media Soc. 2017;3:2056305117729314. [Google Scholar]
- 29.Fletcher R, Nielsen RK. Are news audiences increasingly fragmented? a cross-national comparative analysis of cross-platform news audience fragmentation and duplication. J. Commun. 2017;67:476–498. doi: 10.1111/jcom.12315. [DOI] [Google Scholar]
- 30.Guess, A., Nyhan, B. & Reifler, J. Selective Exposure to Misinformation: Evidence from the Consumption of Fake News During the 2016 us Presidential Campaign (2018).
- 31.Gargiulo F, Cafiero F, Guille-Escuret P, Seror V, Ward JK. Asymmetric participation of defenders and critics of vaccines to debates on french-speaking twitter. Sci. Rep. 2020;10:6599. doi: 10.1038/s41598-020-62880-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nyhan B, et al. Like-minded sources on facebook are prevalent but not polarizing. Nature. 2023;12:1–8. doi: 10.1038/s41586-023-06297-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Guess AM, et al. Social media and elections: The effects of reshared content on political news exposure, attitudes, and polarization. Science. 2023;381:404–408. doi: 10.1126/science.add8424. [DOI] [PubMed] [Google Scholar]
- 34.Guess AM, et al. How do social media feed algorithms affect attitudes and behavior in an election campaign? Science. 2023;381:398–404. doi: 10.1126/science.abp9364. [DOI] [PubMed] [Google Scholar]
- 35.González-Bailón S, et al. Asymmetric ideological segregation in exposure to political news on facebook. Science. 2023;381:392–398. doi: 10.1126/science.ade7138. [DOI] [PubMed] [Google Scholar]
- 36.Gorwa R. The platform governance triangle: Conceptualising the informal regulation of online content. Internet Policy Rev. 2019;8:1–22. doi: 10.14763/2019.2.1407. [DOI] [Google Scholar]
- 37.Fukuyama, F. & Grotto, A. Comparative Media Regulation in the United States and Europe, 199-219. SSRC Anxieties of Democracy (Cambridge University Press, 2020).
- 38.Bhadani S, et al. Political audience diversity and news reliability in algorithmic ranking. Nat. Hum. Behav. 2022;6:495–505. doi: 10.1038/s41562-021-01276-5. [DOI] [PubMed] [Google Scholar]
- 39.Balkin JM. How to regulate (and not regulate) social media. J. Free Speech L. 2021;1:71. [Google Scholar]
- 40.Kempe, D., Kleinberg, J. & Tardos, É. Maximizing the spread of influence through a social network. In Proc. of the Ninth ACM SIGKDD International Conf. on Knowledge Discovery and Data Mining, 137–146 (2003).
- 41.Li Y, Fan J, Wang Y, Tan K-L. Influence maximization on social graphs: A survey. IEEE Trans. Knowl. Data Eng. 2018;30:1852–1872. doi: 10.1109/TKDE.2018.2807843. [DOI] [Google Scholar]
- 42.Zhang H, Mishra S, Thai MT, Wu J, Wang Y. Recent advances in information diffusion and influence maximization in complex social networks. Oppor. Mobile Social Netw. 2014;37:37. doi: 10.1201/b17231-3. [DOI] [Google Scholar]
- 43.Mossel E, Roch S. Submodularity of influence in social networks: From local to global. SIAM J. Comput. 2010;39:2176–2188. doi: 10.1137/080714452. [DOI] [Google Scholar]
- 44.Nguyen, N. P., Yan, G., Thai, M. T. & Eidenbenz, S. Containment of misinformation spread in online social networks. In Proc. of the 4th Annual ACM Web Science Conf., 213–222 (2012).
- 45.Wen S, et al. To shut them up or to clarify: Restraining the spread of rumors in online social networks. IEEE Trans. Parall Distrib. Syst. 2014;25:3306–3316. doi: 10.1109/TPDS.2013.2297115. [DOI] [Google Scholar]
- 46.Adamic, L. A. & Glance, N. The political blogosphere and the 2004 us election: divided they blog. In Proc. of the 3rd International Workshop on Link Discovery, 36–43 (2005).
- 47.Cannon JW, Floyd WJ, Kenyon R, Parry WR, et al. Hyperbolic geometry. Flavors Geom. 1997;31:2. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The dataset analyzed during the current study is based on the blog network form46 and imported using the Mathematica software. The code simulations and further details are available from the corresponding author upon reasonable request.