How do the effects of toxicity in competitive online video games vary by source and match outcome?

Jacob Morrier; Amine Mahmassani; R Michael Alvarez

doi:10.1371/journal.pone.0325462

2025 Jun 11;20(6):e0325462. doi: 10.1371/journal.pone.0325462


Jacob Morrier ¹,*, Amine Mahmassani ², R Michael Alvarez ¹

Editor: Bernard Fong³

PMCID: PMC12157061 PMID: 40498691

Abstract

This article seeks to estimate variations in the effects of toxicity in competitive online video games by source and match outcome. To this end, we analyze proprietary data from the first-person action video game Call of Duty®: Modern Warfare® III, published by Activision®. To overcome causal identification issues, we implement an instrumental variable estimation strategy. Our findings confirm that exposure to toxicity has statistically significant causal effects on short-term player engagement and the probability that players engage in similar behavior in the current match. Further, we show that these effects vary significantly depending on whether toxicity originates from opponents or teammates, whether it originates from teammates in the same or a different party, and the match’s outcome. These findings have meaningful implications regarding the allocation of resources for combating toxicity and the nature of toxicity across various contexts.

Introduction

Competitive online video games are a popular form of entertainment, with approximately 190.6 million players in the United States and 3.4 billion globally [1,2]. While they provide a positive experience to many players, they can also expose them to undesirable behavior, such as bullying, cheating, trolling, and toxicity. According to a 2023 survey, 76% of adult players report having experienced harassment in online multiplayer video games [1]. The incidence of toxic behavior in online multiplayer video games is generally attributable to their competitive nature and the anonymity conferred by online interactions [3–5]. Research indicates that toxicity in competitive online video games has become normalized, with some players perceiving it as an inherent and acceptable aspect of gaming culture, much like in competitive sports [4,6–8]. To put this issue into perspective, video games’ massive player bases mean that even a low incidence of toxicity results in thousands of daily incidents, affecting an even higher number of players.

The adverse effects of toxicity are widely acknowledged and well-documented. Two stand out as particularly noteworthy for the video game industry. First, toxicity drives player churn and dissuades new players from joining [7,9–11]. This effect provides a compelling business case for combating toxicity. Indeed, while video game service operators may seek to mitigate toxicity for ethical reasons, such as protecting players from psychological harm and promoting an inclusive and positive gaming environment, the negative effect of toxicity on player engagement highlights their vested interest in combating toxicity since it can ultimately impede the commercial success of their products.

Second, toxicity tends to spread, with exposure to it causing other players to engage in similar behavior [10,12–15]. As the adage goes, humans are, by nature, social beings. Accordingly, their peers heavily influence their actions. A wealth of empirical research, both experimental and observational, has exposed strong correlations and causal relationships between an individual’s behavior and outcomes and those of their environment [16–22]. This influence extends to virtuous and objectionable behavior, including academic dishonesty, bullying, and crime. In competitive online video games, the propagation of toxicity amplifies the consequences of a single player’s misconduct, increasing the industry’s incentives to address the issue before it becomes entrenched.

This article seeks to estimate the magnitude of these effects across different contexts. These estimates carry meaningful implications for the allocation of resources for combating toxicity. With limited available resources, we must direct them where they can have the most impact. In particular, we should target resources to contexts where the undesirable effects of toxicity on player engagement or its proliferation are most pronounced, ensuring that each prevented instance of toxicity brings the highest returns. In contrast, we should redirect resources away from contexts where players find satisfaction in behavior otherwise considered toxic. This issue is especially relevant in competitive online video games, where the boundary between acceptable and unacceptable behavior can often be blurred [8]. In this context, an overall negative effect may conceal positive consequences in some contexts and negative ones in others. By assessing how toxicity affects player engagement in various contexts, we can more effectively distinguish between toxic and acceptable behavior, allowing us to focus resources on combating the former.

We analyze differences in the effects of exposure to toxicity across three dimensions: (i) whether it originates from teammates or opponents, (ii) whether it comes from teammates in the same party, with whom players voluntarily choose to team up, or a different party, and (iii) whether the exposed player’s team wins or loses the match. For reference, parties are groups of one or more players who voluntarily choose to play together. The matchmaking algorithm typically keeps these parties together when forming teams. The literature has yet to explore how the effects of exposure to toxicity interact with these factors. These variables are readily observable and can therefore be used directly to guide the allocation of resources. We expect the nature and effects of toxicity to differ significantly based on these factors.

To achieve our goal, we analyze proprietary data from Call of Duty®, a popular first-person action video game franchise published by Activision®. We focus on one of the series’ recent installments, Call of Duty: Modern Warfare® III, particularly its most popular multiplayer mode, Team Deathmatch. In this mode, players are divided into two equally sized teams and compete to achieve the highest number of eliminations. After a brief pause, eliminated players reappear at a different location on the map. A team wins by reaching a predetermined elimination limit first or accumulating the most eliminations by the end of the match.

Since 2023, Activision has partnered with Modulate, a startup developing intelligent voice technology to identify online toxicity, and incorporated its proprietary voice chat moderation technology, ToxMod, into its gaming platforms [23]. ToxMod is a voice moderation technology that analyzes in-game voice chat interactions based on features such as transcribed content, volume, emotion, and intention [24]. These features are fed into machine learning models to detect six types of toxic content: adult language, audio assaults, cultural hate speech, sexual hate speech, sexual vulgarity, and violent speech. This technology’s beta rollout began in North America on August 30, 2023, within Call of Duty: Modern Warfare II and Call of Duty: Warzone, followed by a global release (excluding the Asia-Pacific region) coinciding with the launch of Call of Duty: Modern Warfare III on November 10, 2023. ToxMod only supported English during our observation period.

ToxMod provides unique data on toxicity and players’ exposure to it, serving as the basis of our analysis. Our dataset consists of data from a subset of matches in Team Deathmatch mode monitored by ToxMod during the first month after the game’s release. We classify a player as having engaged in toxicity if ToxMod flagged at least one of their voice chat interactions as toxic during a match.

We perform two regression analyses. The first considers the effect of exposure to toxicity from opponents and teammates depending on whether the exposed player’s team wins or loses the match. The second considers the effect of exposure to toxicity from teammates in a different party and the same party—teammates assigned algorithmically or those with whom players voluntarily teamed up, respectively—depending on whether the exposed player’s team wins or loses the match. In both analyses, we estimate the effect of exposure to toxicity on two outcome variables: (i) the time players take to enter their next match as a measure of short-term player engagement, and (ii) the likelihood that exposed players use toxic language in the current match as a measure of the contemporaneous propagation of toxicity.

Even with a large volume of high-quality data, analysts seeking to estimate the causal effect of exposure to toxicity face considerable statistical challenges. The reason is that, in observational data, some variables not accounted for in our regression models (for instance, because they are unmeasured or unmeasurable) may be simultaneously correlated with players’ outcomes and their exposure to toxicity, a phenomenon known as endogeneity [25, p. 513]. For example, teammates may concomitantly use toxic language in reaction to a random event occurring in a match, which might also influence their short-term player engagement. More fundamentally, players mutually affect each other. As a result, whether a player, their teammates, and their opponents engage in toxicity is jointly determined. Ultimately, endogeneity introduces biases in standard ordinary least squares (OLS) estimates and obscures the cause-and-effect relationship between exposure to toxicity and player outcomes. No previous observational study on toxicity in competitive online video games has addressed this causal identification issue.

One way to address endogeneity is with randomized controlled experiments. However, due to ethical and logistical constraints, conducting an experiment that randomly exposes players to toxicity is impossible. Instead, we propose an identification strategy neutralizing the causal identification issues in the available observational data. We implement an instrumental variable or two-stage least squares (2SLS) estimation strategy that leverages the fact that we observe players participating in multiple matches with different players. With this strategy, we isolate variations in outcomes of interest caused by interactions with players who, in prior matches with other players, have employed toxic language more frequently and, consequently, are more likely to use such language in the current game. This approach allows us to reliably assess whether and, if so, to what extent exposure to toxicity causes variations in player engagement and their likelihood of using similar language, distinguishing our findings from the existing literature.

Hypotheses

We formulate five hypotheses regarding the effects of exposure to toxicity depending on its source and the match outcome:

H1. Toxicity from teammates has a weaker effect on player engagement than toxicity from opponents.
H2. Toxicity from teammates spreads more than toxicity from opponents.
H3. Toxicity from teammates in the same party has a weaker effect on player engagement than toxicity from teammates in a different party.
H4. Toxicity from teammates in the same party spreads more than toxicity from teammates in a different party.
H5. Toxicity has a weaker effect when the exposed player’s team wins the match.

There are strong theoretical justifications for these hypotheses. In general, players are less likely to engage in harmful behavior toward teammates, as they share common goals and interests, unlike opponents, whose interests directly conflict with their own. Evidence that cooperation between players reduces aggression in video games supports this assertion [26–28]. In this context, we expect that players are less likely to direct toxicity at teammates, particularly those in the same party. Even when players expose teammates to toxicity as bystanders rather than victims, it is more likely to be perceived as innocent and, consequently, should have a weaker effect on player engagement, if any [8]. Conformity to social norms is one of the primary explanations for peer effects [29]. In general, individual perceptions of these norms are influenced more strongly by those with whom they feel a stronger affinity and connection [30,31]. This should apply to teammates, particularly those in the same party, increasing the likelihood that players will mirror their behavior. The same principle holds if social learning is supposed to drive the spread of toxicity. Finally, a player’s team winning may reduce the effects of toxicity, as success can foster emotional regulation and strengthen psychological resilience [32].

Previous studies generally support these hypotheses. First, they provide evidence that players are more prone to hostile behavior when their teammates, particularly their friends, engage in such actions, suggesting that contagion is more pronounced in these contexts [12,28,33]. However, other studies find that exposure to toxicity from opponents is associated with a larger increase in the likelihood that exposed players engage in similar behavior, highlighting a retaliatory response [14]. Studies indicate that some players prefer retreating when confronted with toxic players [7]. Also, while playing with friends can increase engagement, long-term player retention is negatively affected by playing with toxic friends, particularly for veteran players [9]. Finally, many studies show that toxic behavior is more prevalent when a team is losing, suggesting that players may resort to toxic behavior in circumstances where they otherwise would not [12,34–36].

Data and methodology

Dataset description

Our dataset contains data from a subset of matches in Team Deathmatch mode monitored by ToxMod from November 10 to December 10, 2023. Our sample is not comprehensive, as it only includes matches monitored through ToxMod in a single game mode, among other limitations. It consists of 56,464,489 observations, each representing a player in a game, from 4,167,325 matches and 4,539,599 players. On average, we observe each player participating in 12.44 games.

This data reflects gameplay during the first month after the game’s launch and may not reflect activity later in its lifecycle. Early on, an influx of new players may need time to familiarize themselves with the game, including the toxicity prevailing in gaming culture. Accordingly, it might take some time before new players engage in toxicity [14]. Even veteran players may need time to adapt to newly introduced features. Finally, seasonal factors can significantly affect gameplay, with activity typically peaking during holidays and tapering off as daylight hours increase [37].

Due to technical issues, exposure data is missing for 34.6% of speech acts flagged as toxic by ToxMod. Fig 1 displays the daily evolution of the share of unavailable exposure data throughout our observation window. From November 17, one week after the game’s launch, exposure data became inaccessible for some offenses. Thereafter, the daily share of missing exposure data fluctuated between 5 and 73%, with 35 to 65% of exposure data unavailable on most days.

If exposure data is missing randomly and, in particular, independently of the source of toxicity and the match outcome, it does not introduce bias in our findings. It might still dilute our coefficients’ magnitude, but the small probability of exposure to toxicity suggests that this dilution is negligible. While demonstrating that exposure data is randomly missing is difficult, it seems plausible given the disruption’s cause. To support this conjecture, we present two pieces of evidence that the proportions between exposure probabilities conditional on variables of interest remain unchanged despite the missing exposure data. It follows that exposure data is missing in roughly similar proportions regardless of whether toxicity came from an opponent or a teammate, whether the teammate was in the same party or a different one, or whether the exposed player’s team won or lost.

First, Figs 2 and 3 illustrate the daily evolution of the probability that players are exposed to toxicity in different contexts throughout our study’s timeframe. A dashed vertical line marks the day after which some exposure data becomes unavailable. The proportions between exposure probabilities in various contexts are roughly constant throughout our observation window, including before and after November 17. Consequently, the missing data does not significantly alter the observed exposure patterns.

Second, Figs 4 and 5 illustrate the probability of a player being exposed to toxicity from opponents or teammates, whether in the same party or a different party, depending on whether the player’s team won or lost, during the period from March 4 to April 12, 2024. Over this period, we have comprehensive exposure data for a random subset of matches. These figures indicate that the proportions between exposure probabilities in different contexts, as illustrated in Figs 2 and 3, are consistent between our observation window and a later period during which we have exhaustive exposure data.

Model specification

In this article, we seek to estimate the causal effect exposure to toxicity has on player engagement and their probability of using such language. We define these effects as the variation in the average time players take to enter their next match and the likelihood that they use toxic language in the current game, respectively, caused by exposure to toxicity from another player, holding all other variables constant.

In light of this, we define the following structural model of players’ behavior:

$$y_{i j} = α_{j} + β \cdot x_{i j} + ε_{i j},$$

where:

$y_{i j}$ is the outcome of interest in match i for player j.
$α_{j}$ is a player-specific intercept.
$β$ is a coefficient vector.
$x_{i j}$ is a covariates vector.
$ε_{i j}$ is an error term.

In this model, the outcomes of interest are the time players take to enter their next match and their probability of using toxic language in the current game. The covariates include the number of teammates and opponents—or teammates from the same party and a different party, depending on the model specification—who expose player j to toxicity in match i, the outcome of match i for player j’s team, and interactions between these variables. For reference, Table 1 lists the covariates included in each model specification.

Table 1. Model covariates.

Model specification I	Model specification II
Number of teammates who expose the player to toxicity	Number of teammates in the same party who expose the player to toxicity
Number of opponents who expose the player to toxicity	Number of teammates in a different party who expose the player to toxicity
	Binary variable indicating whether the player is in a party with other players
Binary variable indicating whether the player’s team won or lost the match	Binary variable indicating whether the player’s team won or lost the match
Interactions between variables	Interactions between variables


Our structural model posits that outcomes of interest are primarily affected by two factors: (i) players’ intrinsic tendency to exhibit the outcome of interest, and (ii) the number of other players who expose them to toxicity. The coefficients $β$ reflect the causal effect of exposure to toxicity on outcome variables. They are the estimands of our analysis.

Before addressing causal identification issues, let us first clarify what the time players take to enter their next match captures. Consider a player who ends their current session and plans to return at the same hour the next day. In this scenario, 24 hours will elapse before their next match. Conversely, if a player joins a new match immediately, the elapsed time will be nearly zero. Overall, the time players take to enter their next match captures the interaction of two factors: (i) the probability they end their current session after a match, and (ii) the interval before they return to start a new session.
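One way to make this decomposition concrete is the following sketch. The symbols are introduced here for illustration only and are assumptions: $T$ is the time a player takes to enter their next match, $p$ is the probability that they end their session after the current match, $G$ is the interval before they return to start a new session, and $R$ is the short requeue time when they stay in the session. By the law of total expectation:

$$E[T] = (1 - p) \, E[R] + p \, E[G] \approx p \, E[G] \quad \text{when } E[R] \approx 0 .$$

Under this reading, exposure to toxicity can lengthen the average time to the next match by raising $p$, by raising $E[G]$, or both. The outcome variable alone does not separate the two channels.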

Causal identification issues

Naturally, one might consider estimating the coefficients $β$ using OLS. However, contrary to the standard assumptions in linear regression models, the covariates are not independent of the error terms, resulting in endogeneity.

Endogeneity stems from various sources, each posing a threat to the causal identification of our estimands. One source is model misspecification, as some variables are omitted from the model because they are either unmeasured or unmeasurable. These omitted variables may simultaneously affect the outcomes of interest and the likelihood of being exposed to toxicity. For example, endogeneity might occur if a player and their teammates resort to toxicity in response to an exogenous event in the game, with this random event also affecting the time they take to enter their next match.

Self-selection poses another threat to causal identification. Players sometimes form parties to engage in toxicity or under the expectation that their party members will do so. Also, when two players decide to join forces, it suggests a degree of familiarity between them. This familiarity can change the dynamics of their interactions, influencing both their likelihood of using toxic language and their chances of being exposed to toxicity through one another. In parallel, it may affect their level of engagement, causing them to enter their next match more quickly. When players do not voluntarily team up, their previous interactions can still have a lasting impact.

Endogeneity mechanically arises when estimating the effect of exposure to toxicity on the probability that a player uses toxic language. The reason is that players in a match mutually influence each other. To illustrate, consider a simplified scenario where a player has only one teammate and no opponents. In this case, the dependent variable in some equations appears on the right-hand side of others. Thus, the use of toxic language by players and their teammates is interdependent and jointly determined.

Formally, let us consider the pair formed by players j and k in match i. The two equations determining whether these players engage in toxicity are:

$$Y_{i j} = α_{j} + β Y_{i k} + ε_{i j}$$

$$Y_{i k} = α_{k} + β Y_{i j} + ε_{i k} .$$

To show that $Y_{i k}$ and $ε_{i j}$ are correlated, we substitute the first equation into the second and rearrange the resulting expression to isolate $Y_{i k}$ on the left-hand side:

$$Y_{i k} = α_{k} + β (α_{j} + β Y_{i k} + ε_{i j}) + ε_{i k} \Leftrightarrow (1 - β^{2}) Y_{i k} = α_{k} + β (α_{j} + ε_{i j}) + ε_{i k}$$

$$\Leftrightarrow Y_{i k} = \frac{β}{1 - β^{2}} (α_{j} + ε_{i j}) + \frac{1}{1 - β^{2}} (α_{k} + ε_{i k}) .$$

This equation implies that the error term $ε_{i j}$ directly enters the value of $Y_{i k}$, resulting in a correlation between them. Intuitively, this means that OLS estimates capture a teammate’s effect on a player’s inclination to engage in toxicity and its “reflection,” that is, the influence this player exerts on their teammate.
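The reflection problem can be illustrated with a small simulation. The following is a minimal sketch, not the authors' code. It assumes continuous outcomes rather than binary ones, sets both intercepts to zero, and draws independent standard normal errors. The function names and the value of `BETA` are hypothetical. Under these assumptions, the reduced form above implies that the OLS slope converges to $2β / (1 + β^{2})$ rather than $β$.

```python
import random


def simulate_pairs(beta, n_pairs, seed=0):
    """Draw outcomes for pairs (j, k) from the reduced form of the two-equation system.

    Assumptions for illustration: intercepts are zero and errors are
    independent standard normal draws.
    """
    rng = random.Random(seed)
    denom = 1.0 - beta ** 2
    y_j, y_k = [], []
    for _ in range(n_pairs):
        e_j = rng.gauss(0.0, 1.0)
        e_k = rng.gauss(0.0, 1.0)
        # Reduced form: each outcome depends on BOTH error terms.
        y_j.append((e_j + beta * e_k) / denom)
        y_k.append((beta * e_j + e_k) / denom)
    return y_j, y_k


def ols_slope(x, y):
    """Slope of a simple OLS regression of y on x (with an intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    return s_xy / s_xx


BETA = 0.3  # hypothetical true peer effect
y_j, y_k = simulate_pairs(BETA, n_pairs=100_000)
ols_estimate = ols_slope(y_k, y_j)              # regress Y_ij on Y_ik
theoretical_limit = 2 * BETA / (1 + BETA ** 2)  # large-sample OLS slope under these assumptions

print(f"true beta: {BETA}, OLS estimate: {ols_estimate:.3f}, "
      f"theoretical limit: {theoretical_limit:.3f}")
```

In this toy setting, the OLS slope lands well above the true coefficient because it picks up both the teammate's effect on the player and the player's effect on the teammate.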

Identification strategy

To address the issues outlined above, we define an identification strategy leveraging the fact that we observe players participating in multiple matches with different players. Specifically, we implement an instrumental variable (2SLS) estimation, a standard approach to causal identification. We instrument the variables representing the number of teammates and opponents exposing the player to toxicity in the current match with the sum of their probabilities of having used toxic language in prior matches with other players. This strategy isolates variations in outcome variables caused by interactions with players who, in previous matches with other players, have had a greater tendency to engage in toxicity and, therefore, are more likely to use such language in the current game.

Henceforth, for tractability, we consider a model that treats exposure to toxicity uniformly, regardless of its source. This model has a single coefficient reflecting the average effect of one other player engaging in toxicity. We can readily extend our approach to differentiate between sources of toxicity.

Formally, our identification strategy consists of adding the following equation to our structural model of players’ behavior:

$$x_{i j} = δ_{j} + γ \sum_{k \in 𝒫_{i, - j}} \sum_{ℓ \in ℳ_{i, k, - j}} \frac{x_{ℓ k}^{⋆}}{\# ℳ_{i, k, - j}} + u_{i j},$$

where:

$𝒫_{i, - j}$ is the set of players in match i excluding player j.
$ℳ_{i, k, - j}$ is the set of matches prior to match i in which player k, but not player j, participated, and $\# ℳ_{i, k, - j}$ is the number of such matches.
$x_{ℓ k}^{⋆}$ is a binary variable indicating whether player k used toxic language in match $ℓ$.
$u_{i j}$ is an error term.

The instrumental variable is computed by summing over all players other than player j in match i, indexed by k, the probability with which they have used toxic language in previous matches they participated in without player j, indexed by $ℓ$ . This instrumental variable belongs to the general class of spatial or “leave-one-out” instruments introduced in empirical industrial organization for demand and supply estimation and commonly used for the causal identification of simultaneous equation models [38,39].
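The construction of the instrument can be sketched in a few lines. This is an illustrative sketch, not the authors' implementation. It assumes records of the form `(match_id, player_id, toxic)`, assumes that integer match identifiers increase chronologically, and returns `None` when no other player in the match has an eligible history. The function name and the toy data are hypothetical.

```python
from collections import defaultdict


def leave_one_out_instrument(records, match_id, player_id):
    """Sum, over the other players k in the match, of k's share of prior
    matches WITHOUT the focal player in which k used toxic language.

    Assumption: match identifiers are integers that increase over time.
    """
    participants = defaultdict(set)  # match -> set of players
    toxic = {}                       # (match, player) -> 0 or 1
    for m, p, t in records:
        participants[m].add(p)
        toxic[(m, p)] = t

    total = 0.0
    has_history = False
    for k in sorted(participants[match_id] - {player_id}):
        # Prior matches that include player k but exclude the focal player.
        eligible = [
            m for m, players in participants.items()
            if m < match_id and k in players and player_id not in players
        ]
        if not eligible:
            continue  # player k contributes nothing without usable history
        has_history = True
        total += sum(toxic[(m, k)] for m in eligible) / len(eligible)
    return total if has_history else None


# Hypothetical toy data: (match_id, player_id, used_toxic_language)
toy_records = [
    (1, "A", 0), (1, "B", 1),
    (2, "B", 0), (2, "C", 1),
    (3, "B", 1), (3, "D", 0),
    (4, "A", 0), (4, "B", 0), (4, "C", 0),
]

# For player A in match 4: B was toxic in 1 of 2 eligible matches (2 and 3),
# and C was toxic in its single eligible match (2), so the sum is 0.5 + 1.0.
print(leave_one_out_instrument(toy_records, 4, "A"))  # 1.5
```

Note that match 1 is excluded from player B's history when the focal player is A, because A also took part in it. This exclusion is what keeps the shared history of the two players out of the instrument.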

For an instrument to be valid, it must satisfy two conditions: (i) relevance, meaning that there must be a strong correlation between the instrumental and endogenous explanatory variables, and (ii) exclusion, meaning that the instrumental variables must be independent of the structural model’s error term. We can empirically verify the validity of the first condition by examining the estimates of the first-stage regressions. As a rule of thumb, the F statistic against the null hypothesis that the instruments are irrelevant in the first-stage regressions should have a value greater than ten. Table 2 presents the coefficients for the instrumental and exogenous explanatory variables and the F statistic for all first-stage regressions in our analysis. Each column corresponds to an endogenous explanatory variable, and each row represents an instrumental or exogenous explanatory variable. For all first-stage regressions, the F statistic significantly exceeds ten, indicating a strong first stage.
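The relevance check can be illustrated for the simplest case of one instrument and one endogenous variable. The sketch below is a simplification made for illustration: the first-stage regressions reported in Table 2 include several instruments and exogenous covariates, which this toy version omits. The function name and the four data points are hypothetical.

```python
def first_stage_f(instrument, endogenous):
    """F statistic for the null that the instrument is irrelevant in a
    simple regression of the endogenous variable on the instrument
    (with an intercept). With a single instrument, this equals the
    squared t statistic of the slope.
    """
    n = len(instrument)
    mean_z = sum(instrument) / n
    mean_x = sum(endogenous) / n
    s_zz = sum((z - mean_z) ** 2 for z in instrument)
    s_zx = sum((z - mean_z) * (x - mean_x) for z, x in zip(instrument, endogenous))
    s_xx = sum((x - mean_x) ** 2 for x in endogenous)
    slope = s_zx / s_zz
    explained = slope * s_zx      # explained sum of squares
    residual = s_xx - explained   # residual sum of squares
    # One restriction in the numerator, n - 2 residual degrees of freedom.
    return explained / (residual / (n - 2))


# Hypothetical toy data, chosen so the arithmetic is easy to follow.
toy_instrument = [0.0, 1.0, 2.0, 3.0]
toy_exposure = [0.0, 1.0, 1.0, 2.0]
f_stat = first_stage_f(toy_instrument, toy_exposure)
print(f"first-stage F statistic: {f_stat:.2f}")
```

For these four points, the explained sum of squares is 1.8 and the residual sum of squares is 0.2 with two degrees of freedom, so the statistic is about 18, above the rule-of-thumb threshold of ten.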

Table 2. First-stage regression estimates.

(A) Opponents and teammates.
	Opponents	Teammates	Opponents $\times$ Win	Teammates $\times$ Win
Opponents	0.0082***	0.0015***	–0.0003***	–0.0005***
	(0.000)	(0.000)	(0.000)	(0.000)
Teammates	–0.0001	0.0330***	–0.0003***	–0.0012***
	(0.000)	(0.001)	(0.000)	(0.000)
Opponents $\times$ Win	0.0047***	–0.0012***	0.0156***	0.0014***
	(0.001)	(0.000)	(0.001)	(0.000)
Teammates $\times$ Win	0.0007**	–0.0012	0.0009***	0.0345***
	(0.000)	(0.001)	(0.000)	(0.001)
Win	0.0001***	0.0000***	0.0002***	0.0005***
	(0.000)	(0.000)	(0.000)	(0.000)
F Statistic	365.2	801.8	1615.0	3159.9
(B) Teammates in a different party and the same party.
	Different Party	Same Party	Different Party $\times$ Win	Same Party $\times$ Win
Different Party	0.0271***	–0.0001*	–0.0003***	–0.0001***
	(0.001)	(0.000)	(0.000)	(0.000)
Same Party	–0.0010**	0.0359***	–0.0004**	–0.0045***
	(0.000)	(0.003)	(0.000)	(0.001)
Different Party $\times$ Win	–0.0002	–0.0001	0.0275***	–0.0000
	(0.001)	(0.000)	(0.001)	(0.000)
Same Party $\times$ Win	–0.0002	–0.0005	–0.0002	0.0449***
	(0.001)	(0.004)	(0.000)	(0.003)
Win	–0.0001***	–0.0000***	0.0004***	0.0001***
	(0.000)	(0.000)	(0.000)	(0.000)
Player is in a Party	–0.0000*	0.0007***	–0.0000	0.0003***
	(0.000)	(0.000)	(0.000)	(0.000)
F Statistic	505.1	917.4	2051.4	605.2


Note: * p < 0.1; ** p < 0.05; *** p < 0.01. Standard errors are in parentheses.

On the other hand, we cannot empirically test the validity of the exclusion restriction. Instead, it depends on the assumptions we are ready to make regarding the relationship between the instrumental variables and the structural equation’s error term. We argue that calculating the instrument with the probability of a player using toxic language in previous matches with other players neutralizes the principal sources of endogeneity.

First, the fact that no data from the current match enters the instrumental variables neutralizes endogeneity caused by events occurring in the current game that simultaneously affect the outcomes of interest and exposure to toxicity. For instance, it addresses the case wherein a player and one or more of their teammates use toxic language in reaction to, say, one of their common opponents using such language or another exogenous event.

Second, the fact that no data from the other matches wherein both players participated enters the instrumental variables neutralizes endogeneity from enduring factors reflecting their relationship and simultaneously affecting outcomes of interest and their exposure to toxicity, including but not exclusively through each other.

Third, using only data from past matches to compute the instrumental variables neutralizes the long-term effects of exposure to toxicity on the outcomes of interest. This is especially important when estimating the effect of exposure to toxicity on a player’s probability of using such language. Indeed, whether player j uses toxic language in a match may affect the propensity of one of the other players, say, player k, to use such language in future matches, regardless of whether player j participates in it. More generally, all events in the current game may influence players’ future behavior. Consequently, if data from future matches entered the instrumental variables, it would open a “backdoor” for a player’s use of toxic language or other events in the current game to penetrate the instrument, thereby violating the exclusion restriction.

In interpreting our findings, we must keep in mind that our estimation strategy provides an estimate of the local average treatment effect for “compliers,” defined as those players who were exposed to toxicity because they interacted with other players more likely to use toxic language in previous matches with other players and, consequently, exogenously more likely to use such language in the current game. Compliers do not include players who seek to alter their exposure to toxic language by intentionally deactivating the voice chat to evade it or using toxic language to provoke reactions from other players, for instance. If the effect of exposure to toxicity is heterogeneous, this local average treatment effect might not accurately reflect the average treatment effect for the entire player population.

Estimation

Our model contains player-specific intercepts, also called fixed effects, capturing the inherent tendency of players to exhibit outcomes of interest. Estimating these fixed effects directly is computationally expensive. Consequently, analysts frequently resort to “down-sampling,” which consists of sampling a computationally convenient number of observations and estimating the model with fixed effects only for those. This reduces the statistical precision of the estimates.

Another method exists to overcome the computational cost of estimating fixed effects. Explicitly estimating the fixed effects is superfluous since they are not directly relevant to our analysis. Our reason for including them in the model is to absorb time-invariant variables affecting individual players’ propensity to exhibit the outcomes of interest. This is critical if there is a correlation between a player’s inherent tendency to display the outcomes of interest and their likelihood of being exposed to toxicity.

We can achieve the same end by demeaning the values of the dependent, independent, and instrumental variables for all players at the individual level [40, p. 427]. Upon doing so, we estimate the coefficients $β$ through the standard 2SLS estimation procedure without resorting to any down-sampling.
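The demeaning step followed by 2SLS can be sketched on simulated data. This is a minimal sketch under stated assumptions, not the authors' code. It uses a single endogenous regressor with a single instrument (the just-identified case, where 2SLS reduces to a ratio of cross-products), continuous variables, and a hypothetical data-generating process with five matches per player. All names and parameter values are illustrative.

```python
import random
from collections import defaultdict


def demean_by_player(player_ids, values):
    """Subtract each player's own mean (the within transformation)."""
    totals, counts = defaultdict(float), defaultdict(int)
    for player, value in zip(player_ids, values):
        totals[player] += value
        counts[player] += 1
    return [value - totals[player] / counts[player]
            for player, value in zip(player_ids, values)]


def slope(instrument, regressor, outcome):
    """Just-identified IV slope on demeaned data.

    Passing the regressor as its own instrument gives the OLS slope.
    """
    numerator = sum(z * y for z, y in zip(instrument, outcome))
    denominator = sum(z * x for z, x in zip(instrument, regressor))
    return numerator / denominator


# Hypothetical data-generating process (not the paper's data).
TRUE_BETA = 2.0
rng = random.Random(42)
players, z, x, y = [], [], [], []
for player in range(2000):
    alpha = rng.gauss(0, 1)                   # player fixed effect
    for _ in range(5):                        # five matches per player
        confounder = rng.gauss(0, 1)          # unobserved match-level shock
        z_ij = 0.5 * alpha + rng.gauss(0, 1)  # instrument, correlated with alpha
        x_ij = z_ij + confounder + rng.gauss(0, 1)
        y_ij = alpha + TRUE_BETA * x_ij + confounder + rng.gauss(0, 1)
        players.append(player)
        z.append(z_ij)
        x.append(x_ij)
        y.append(y_ij)

# The within transformation removes alpha without estimating it explicitly.
z_w, x_w, y_w = (demean_by_player(players, v) for v in (z, x, y))

ols_within = slope(x_w, x_w, y_w)  # still biased by the match-level confounder
iv_within = slope(z_w, x_w, y_w)   # instrument removes that remaining bias

print(f"true beta: {TRUE_BETA}, within OLS: {ols_within:.3f}, within IV: {iv_within:.3f}")
```

In this simulation, demeaning alone handles the player-specific intercept but leaves the bias from the unobserved shock, while the instrumented estimate on the demeaned data lands close to the true coefficient. No player-level dummy variables are ever constructed, which is the source of the computational savings.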

We restrict our analysis to observations for which: (i) we observe at least one other player in the current match play at least one other match with other players so that we can compute the value of the instruments for them, and (ii) we observe the player participate in at least two matches so that we can demean the values of the dependent, independent, and instrumental variables for them. These restrictions result in some attrition.

Ethical considerations

Caltech’s Institutional Review Board reviewed and granted an exemption for this study (Approval number: IR23-1395). It does not involve participants prospectively recruited by the authors. The data was collected as part of Activision’s routine commercial activities and does not include any information that could identify individual participants.

Results

Regression estimates are presented in Table 3. The effects of exposure to toxicity are illustrated in Figs 6, 7, 8, 9, 10, and 11. A summary of these effects, along with the average values of the outcome variables, is presented in Table 4. A discussion of these findings follows.

Table 3. Regression estimates.

Variable	Time to enter the next match (hrs.)	Probability of using toxic language
(A) Opponents and teammates.
Opponents	60.683^*** (9.4202)	0.1382^*** (0.0244)
Teammates	16.182^*** (2.8458)	0.0854^*** (0.0099)
Opponents $\times$ Win	–36.596^*** (10.781)	–0.0832^*** (0.0281)
Teammates $\times$ Win	0.8545 (3.8833)	–0.0074 (0.0152)
Win	–0.3794^*** (0.0053)	–0.0001^*** (0.0000)
F Statistic	8,958.2	504.1
(B) Teammates in a different party and the same party.
Different Party	17.936^*** (3.0740)	0.0208^*** (0.0078)
Same Party	12.255^* (7.1299)	0.6949^*** (0.0785)
Different Party $\times$ Win	5.0996 (4.3649)	–0.0045 (0.0111)
Same Party $\times$ Win	–18.705^** (7.5382)	–0.0956 (0.0910)
Win	–0.3811^*** (0.0048)	–0.0001^*** (0.0000)
Player is in a Party	–0.4193^*** (0.0082)	0.0013^*** (0.0000)
F Statistic	12,551.2	8,454.4

Note: * p < 0.1; ** p < 0.05; *** p < 0.01.
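As a reading aid for Table 3, the sketch below shows how the significance stars relate to a coefficient and the value reported in parentheses beside or beneath it. It rests on two assumptions not stated in the table itself: that the parenthesized values are standard errors, and that p-values come from a two-sided test under a large-sample normal approximation. The function names are hypothetical.

```python
import math

def two_sided_p(coef, se):
    """Two-sided p-value for H0: coefficient = 0 under a normal approximation."""
    z = abs(coef / se)
    return math.erfc(z / math.sqrt(2))

def stars(p):
    """Map a p-value to the star notation used in the table note."""
    if p < 0.01:
        return "***"
    if p < 0.05:
        return "**"
    if p < 0.1:
        return "*"
    return ""

# Selected entries from the "Time to enter the next match" column of Table 3.
rows = {
    "Opponents x Win": (-36.596, 10.781),
    "Teammates x Win": (0.8545, 3.8833),
    "Same Party": (12.255, 7.1299),
    "Same Party x Win": (-18.705, 7.5382),
}
for name, (coef, se) in rows.items():
    p = two_sided_p(coef, se)
    print(f"{name:18s} z = {coef / se:6.2f}  p = {p:.4f}  {stars(p)}")
```

Under these assumptions, the stars obtained for the four rows agree with those printed in the table.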

Table 4. Effects of exposure to toxicity.

Source of toxicity	Time to enter the next match (hrs.), Win	Time to enter the next match (hrs.), Loss	Probability of using toxic language (pp.), Win	Probability of using toxic language (pp.), Loss
Opponents	24.09	60.68	5.49	13.81
Teammates	17.04	16.18	7.8	8.54
Teammates in a different party	23.03	17.93	2.08	1.63
Teammates in the same party	–6.46 (ns.)	12.25 (ns.)	69.48	59.93
Average	3.66	4.17	0.078	0.088

Note: ns. denotes estimates that are not statistically significant at the 95% confidence level.

Probability of exposure to toxicity

As a preamble, we consider the probability of exposure to toxicity. Figs 6 and 7 illustrate the likelihood of exposure to toxicity conditional on the match outcome, defined as whether the exposed player’s team won or lost. Fig 6 distinguishes between exposure to toxicity from opponents and teammates, and Fig 7 between toxicity from teammates in a different party and those in the same party.

The probability of exposure to toxicity from opponents or teammates is less than one-tenth of a percent. While some exposure data is missing, implying that the exposure to toxicity could be higher, this suggests that ToxMod primarily identifies the most severe instances of toxicity.

Players are considerably more likely—two to over three times more likely, depending on the match’s outcome—to be exposed to toxicity from teammates than opponents. Furthermore, players are more likely to be exposed to toxicity from opponents when their team wins and teammates when their team loses. Among teammates, players are slightly more likely to be exposed to toxicity from those in the same party. However, this difference in the probability of exposure to toxicity from teammates in a different party and the same party is much smaller than the difference in the likelihood of exposure to toxicity from opponents and teammates.

Effects of toxicity from opponents and teammates

We now turn to the effect of exposure to toxicity on the time players take to enter their next match and their probability of using similar language in the current game. Figs 8 and 9 illustrate the estimated effect of exposure to toxicity from opponents and teammates conditional on the match’s outcome. Figs 8 and 9 depict the effect of exposure to toxicity on the time players take to enter their next match and the probability that players use toxic language in the current game, respectively. Each estimate represents the average marginal effect of exposure to toxicity from one player. We illustrate estimates with their 95% confidence interval.

Exposure to toxicity significantly increases the time before players enter their next match. This effect ranges from 16.18 to 60.68 hours, depending on whether the toxicity originates from opponents or teammates and whether the exposed player’s team won or lost. This delay is substantial compared to the average time players take to enter their next match, which is 3.66 hours when their team wins and 4.17 hours when their team loses. Therefore, exposure to toxicity increases the time players take to enter their next match by a factor of five to 16. The effect of exposure to toxicity from opponents is significantly higher when the exposed player’s team loses, corroborating Hypothesis 5. Conversely, exposure to toxicity from teammates has a higher effect when the exposed player’s team wins, though this difference is not statistically significant. The most pronounced effect is caused by exposure to toxicity from opponents when the player’s team loses. The effect of exposure to toxicity from opponents when the player’s team wins is much smaller. The latter is slightly higher in magnitude (although not significantly different) than the effect of exposure to toxicity from teammates regardless of the match’s outcome. On the whole, these findings support Hypothesis 1.
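The arithmetic behind these figures can be retraced from the coefficients in panel (A) of Table 3. The sketch below is illustrative only: it treats the main coefficient as the effect after a loss and adds the interaction with Win to obtain the effect after a win, and it reads the multiplication factor as (average time + effect) / average time, an interpretation we assume because it reproduces the stated range. The function names are hypothetical.

```python
# Table 3, panel (A), "Time to enter the next match" column (hours).
main_effect = {"Opponents": 60.683, "Teammates": 16.182}
win_interaction = {"Opponents": -36.596, "Teammates": 0.8545}
# Average time to enter the next match (hours), from the last row of Table 4.
average_time = {"Win": 3.66, "Loss": 4.17}

def effect(source, outcome):
    """Marginal effect of exposure from one player, by source and match outcome."""
    extra = win_interaction[source] if outcome == "Win" else 0.0
    return main_effect[source] + extra

def factor(source, outcome):
    """Assumed reading of the factor: time with exposure over average time."""
    return (average_time[outcome] + effect(source, outcome)) / average_time[outcome]

for source in ("Opponents", "Teammates"):
    for outcome in ("Win", "Loss"):
        print(f"{source:10s} {outcome:4s} "
              f"effect = {effect(source, outcome):6.2f} h, "
              f"factor = {factor(source, outcome):5.2f}")
```

The smallest factor arises for toxicity from teammates after a loss and the largest for toxicity from opponents after a loss, which brackets the range quoted in the text.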

Exposure to toxicity also significantly increases the probability that a player uses similar language. This effect ranges from 5.49 to 13.81 percentage points, depending on whether toxicity originates from opponents or teammates and whether the player’s team won or lost. This effect is considerable, given that the observed incidence of toxicity is 0.078% when the exposed player’s team wins and 0.088% when it loses. Irrespective of its source, the effect of exposure to toxicity is higher when the player’s team loses, corroborating Hypothesis 5. The most pronounced effect is caused by exposure to toxicity from opponents when the player’s team loses. Conversely, the least pronounced effect is caused by exposure to toxicity from opponents when the player’s team wins. The latter is significantly smaller than the former. Exposure to toxicity from teammates exerts an effect of intermediate value on the probability that a player uses similar language. Accordingly, Hypothesis 2 is partially verified, at least in the context of the propagation of toxicity when the exposed player’s team loses.

Effects of toxicity from different-party and same-party teammates

Figs 10 and 11 illustrate the estimated effect of exposure to toxicity from teammates in a different party and those in the same party conditional on the match’s outcome. Figs 10 and 11 depict the effect of exposure to toxicity on the time players take to enter their next match and the probability that a player uses toxic language in the current game, respectively. Each estimate represents the average marginal effect of exposure to toxicity from one player.

Exposure to toxicity from teammates in a different party significantly increases the time before players enter their next match. This effect is substantial, with a delay of 17.94 hours after a loss and 23.04 hours after a win, equivalent to multiplying the time players take to enter their next match by a factor of five to seven. In contrast, regardless of the match’s outcome, exposure to toxicity from teammates in the same party does not significantly affect the time players take to enter the next game, supporting Hypothesis 3. In particular, when the exposed player’s team wins, the effect of toxicity from teammates in the same party is negative and, thereby, significantly smaller than the effect of exposure to toxicity from teammates in a different party. These findings partially support Hypothesis 5, at least regarding the impact of exposure to toxicity from teammates in the same party.

Exposure to toxicity also significantly increases the probability that players adopt similar language. The effect of exposure to toxicity from teammates in the same party is particularly pronounced, resulting in a 59.93 to 69.48 percentage point increase in the probability that a player uses toxic language depending on the match’s outcome. In contrast, exposure to toxicity from teammates in a different party has a much smaller effect, with a magnitude of 1.63 to 2.08 percentage points depending on the match’s outcome. These results corroborate Hypothesis 4. Furthermore, all else equal, toxicity spreads more when the exposed player’s team loses than when it wins, supporting Hypothesis 5.

Discussion and conclusion

Our analysis provides valuable insights into the effect of exposure to toxicity on the time before players enter their next match and their likelihood of using similar language. Our findings confirm that toxicity significantly affects player engagement, often negatively. Moreover, toxicity spreads as players exposed to it become more likely to use similar language. These results highlight the video game industry’s vested interest in combating toxicity.

We show that the effects of exposure to toxicity vary significantly with its source—whether it originates from opponents, teammates from a different party, or teammates in the same party—and the match’s outcome. The findings broadly validate our hypotheses. They also have practical implications, guiding video game service operators in targeting their efforts to combat toxicity. Specifically, to minimize the adverse effects of toxicity on player engagement, our analysis advises allocating resources for combating toxicity in the following order of decreasing priority:

1. Toxicity from opponents when the player’s team loses.
2. Toxicity from opponents when the player’s team wins.
3. Toxicity from teammates in a different party when the player’s team wins.
4. Toxicity from teammates in a different party when the player’s team loses.
5. Toxicity from teammates in the same party when the player’s team loses.
6. Toxicity from teammates in the same party when the player’s team wins.

On the other hand, to minimize the proliferation of toxic language, our analysis advises allocating resources for combating toxicity in the following order of decreasing priority:

1. Toxicity from teammates in the same party when the player’s team loses.
2. Toxicity from teammates in the same party when the player’s team wins.
3. Toxicity from opponents when the player’s team loses.
4. Toxicity from opponents when the player’s team wins.
5. Toxicity from teammates in a different party when the player’s team loses.
6. Toxicity from teammates in a different party when the player’s team wins.

These recommendations diverge depending on whether the primary objective is to minimize the negative effect of exposure to toxicity on player engagement or the propagation of toxicity. If the priority is to mitigate the impact of toxicity on player engagement, addressing toxicity from opponents should be a priority. In contrast, toxicity from teammates in the same party has a minimal effect on player engagement. Based solely on this factor, it may not deserve any intervention since its effect is not statistically significant. On the other hand, if our priority is to limit the proliferation of toxicity, addressing toxicity from teammates in the same party becomes a priority since it contributes most to its propagation.

Our findings also have meaningful implications regarding the nature of toxicity in different contexts. We find that exposure to toxicity has a lower effect on player engagement when it comes from teammates, particularly those in the same party as the exposed player when their team wins the match. In parallel, players exposed to toxicity from these teammates are more likely to join the bandwagon and use toxic language themselves. These findings suggest that toxicity from teammates, particularly those in the same party, is less likely to be directed at players when their team wins. There is only one other scenario in which exposure to toxicity has a higher effect on the likelihood that players engage in similar behavior, namely when the toxicity comes from opponents and the exposed player’s team loses the match. In this case, players likely retaliate against their opponents’ toxicity. Remarkably, this behavior is less common when the exposed player’s team wins, possibly because the victory provides a sense of retribution on its own.

Admittedly, this study presents some limitations. As noted above, some exposure data is missing. Although we are confident this does not introduce biases in our findings, we cannot demonstrate it with certainty. In addition, our analysis focuses on a single game and game mode. Our results may not extend to other games and modes, particularly those beyond first-person action video games, let alone entirely different settings such as social networks and online forums. We also focus on one form of toxicity: toxic language in voice chat interactions. Our analysis therefore excludes other expressions of toxicity, including toxic language in text chat interactions, that may occur in competitive online video games.

In conclusion, our work paves the way for exciting research. First, while our analysis focuses on the short-term effects of exposure to toxicity, there is limited evidence of its long-term impact. Addressing this gap would require data spanning a longer timeframe. We should also consider how the effects of exposure to toxicity differ based on factors beyond its source and the match outcome, including players’ experience, skill levels, and cultural influences. Examining a broader range of games and game modes across various genres would help overcome the limitations discussed earlier. Finally, although we have identified where the video game industry should target its resources and interventions, additional evidence is needed regarding the effectiveness of various strategies for preventing toxic behavior to determine the industry’s optimal course of action in these situations [41].

Acknowledgments

The authors thank Andrea Boonyarungsrit, Grant Cahill, Min Kim, Rafal Kocielnik, Jonathan Lane, Zhuofang Li, Gary Quan, Deshawn Sambrano, Feri Soltani, Carly Taylor, and Michael Vance for their invaluable feedback and support in writing this article.

Data Availability

This study analyzes proprietary data collected and owned by Activision, with a detailed description provided in the manuscript. Due to commercial and confidentiality restrictions, this data cannot be shared publicly. For access inquiries, please contact Gary Quan, Expert Technical Project Manager at Activision®/Demonware, at gquan@demonware.net.

Funding Statement

Activision funded this study through a grant, with RMA as the principal investigator. Activision also provided financial support to AM in the form of a salary. Activision collected the data as part of its routine commercial activities but had no involvement in the design of this study, the data analysis, the decision to publish, or the preparation of the manuscript. No other external funding was received for this study.

References

1. ADL Center for Technology and Society. Hate is no game: hate and harassment in online games. 2023. https://www.adl.org/resources/report/hate-no-game-hate-and-harassment-online-games-2023
2. Entertainment Software Association. Essential facts about the U.S. video game industry. 2024. https://www.theesa.com/wp-content/uploads/2024/05/Essential-Facts-2024-FINAL.pdf
3. Lapidot-Lefler N, Barak A. Effects of anonymity, invisibility, and lack of eye-contact on toxic online disinhibition. Comput Hum Behav. 2012;28(2):434–43. doi: 10.1016/j.chb.2011.10.014
4. Adinolf S, Turkay S. Toxic behaviors in esports games: player perceptions and coping strategies. In: Proceedings of the 2018 Annual Symposium on Computer-Human Interaction in Play Companion Extended Abstracts. 2018. p. 365–72.
5. Lee SJ, Jeong EJ, Jeon JH. Disruptive behaviors in online games: effects of moral positioning, competitive motivation, and aggression in League of Legends. Soc Behav Personal: Int J. 2019;47(2):1–9.
6. Hilvert-Bruce Z, Neill JT. I’m just trolling: the role of normative beliefs in aggressive behaviour in online gaming. Comput Hum Behav. 2020;102:303–11.
7. Türkay S, Formosa J, Adinolf S, Cuthbert R, Altizer R. See no evil, hear no evil, speak no evil: how collegiate players define, experience and cope with toxicity. In: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 2020. p. 1–13. doi: 10.1145/3313831.3376191
8. Beres NA, Frommel J, Reid E, Mandryk RL, Klarkowski M. Don’t you know that you’re toxic: normalization of toxicity in online gaming. In: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 2021. p. 1–15.
9. Grandprey-Shores K, He Y, Swanenburg KL, Kraut R, Riedl J. The identification of deviance and its impact on retention in a multiplayer game. In: Proceedings of the 17th ACM Conference on Computer Supported Cooperative Work & Social Computing. 2014. p. 1356–65.
10. Kordyaka B, Jahn K, Niehaves B. Towards a unified theory of toxic behavior in video games. Internet Res. 2020;30(4):1081–102.
11. Kowert R, Kilmer E. Toxic gamers are alienating your core demographic: The business case for community management. 2023. https://www.takethis.org/wp-content/uploads/2023/08/ToxicGamersBottomLineReport_TakeThis.pdf
12. Neto JAM, Yokoyama KM, Becker K. Studying toxic behavior influence and player chat in an online video game. In: Proceedings of the International Conference on Web Intelligence. 2017. p. 26–33.
13. de Mesquita Neto JA, Becker K. Relating conversational topics and toxic behavior effects in a MOBA game. Entertain Comput. 2018;26:10–29.
14. Shen C, Sun Q, Kim T, Wolff G, Ratan R, Williams D. Viral vitriol: Predictors and contagion of online toxicity in World of Tanks. Comput Hum Behav. 2020;108:1–9.
15. Morrier J, Mahmassani A, Alvarez RM. Uncovering the viral nature of toxicity in competitive online video games. arXiv preprint. 2025. https://arxiv.org/abs/2410.00978
16. Manski CF. Economic analysis of social interactions. J Econ Perspect. 2000;14(3):115–36.
17. Alexander C, Piazza M, Mekos D, Valente T. Peers, schools, and adolescent cigarette smoking. J Adolesc Health. 2001;29(1):22–30. doi: 10.1016/s1054-139x(01)00210-5
18. Salmivalli C. Bullying and the peer group: a review. Aggress Violent Behav. 2010;15(2):112–20.
19. Epple D, Romano RE. Peer effects in education: a survey of the theory and evidence. In: Benhabib J, Bisin A, Jackson MO, editors. Handbook of social economics. North-Holland. 2011. p. 1053–163.
20. Kreager DA, Rulison K, Moody J. Delinquency and the structure of adolescent peer groups. Criminology. 2011;49(1):95–127. doi: 10.1111/j.1745-9125.2010.00219.x
21. Sacerdote B. Peer effects in education: how might they work, how big are they and how much do we know thus far? Handbook of the economics of education. Elsevier. 2011. p. 249–77. doi: 10.1016/b978-0-444-53429-3.00004-1
22. Graham BS. Identifying and estimating neighborhood effects. J Econ Literat. 2018;56(2):450–500. doi: 10.1257/jel.20160854
23. Activision Publishing. Call of duty takes aim at voice chat toxicity, details year-to-date moderation progress. 2023. https://www.callofduty.com/blog/2023/08/call-of-duty-modern-warfare-warzone-anti-toxicity-progress-report
24. Kowert R, Woodwell L. Moderation challenges in digital gaming spaces: Prevalence of offensive behaviors in voice chat. 2022. https://www.takethis.org/wp-content/uploads/2022/12/takethismodulatereport.pdf
25. Wooldridge JM. Introductory econometrics: a modern approach. 5th ed. South-Western Cengage Learning. 2013.
26. Velez JA, Mahood C, Ewoldsen DR, Moyer-Gusé E. Ingroup versus outgroup conflict in the context of violent video game play: the effect of cooperation on increased helping and decreased aggression. Commun Res. 2014;41(5):607–26.
27. Velez JA, Greitemeyer T, Whitaker JL, Ewoldsen DR, Bushman BJ. Violent video games and reciprocity: the attenuating effects of cooperative game play on subsequent aggression. Commun Res. 2016;43(4):447–67.
28. McLean D, Waddell F, Ivory J. Toxic teammates or obscene opponents? Influences of cooperation and competition on hostility between teammates and opponents in an online game. J Virt Worlds Res. 2020;13(1):1–15.
29. Henrich J, Boyd R. The evolution of conformist transmission and the emergence of between-group differences. Evol Hum Behav. 1998;19(4):215–41.
30. Charness G, Rigotti L, Rustichini A. Individual behavior and group membership. Am Econ Rev. 2007;97(4):1340–52.
31. Dimant E. Contagion of pro- and anti-social behavior among peers and the role of social proximity. J Econ Psychol. 2019;73:66–88.
32. Rieger D, Wulf T, Kneer J, Frischlich L, Bente G. The winner takes it all: The effect of in-game success and need satisfaction on mood repair and enjoyment. Comput Hum Behav. 2014;39:281–6.
33. Sun XVY, Chen VHH. Toxic behavior in multiplayer online games: the role of witnessed verbal aggression, game engagement intensity, and social self-efficacy. Chin J Commun. 2024:1–19.
34. Kou Y, Nardi B. Regulating anti-social behavior on the internet: the example of League of Legends. In: iConference 2013 Proceedings. 2013. p. 616–22.
35. Kwak H, Blackburn J, Han S. Exploring cyberbullying and other toxic behavior in team competition online games. In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. 2015. p. 3739–48.
36. Märtens M, Shen S, Iosup A, Kuipers F. Toxicity detection in multiplayer online games. In: 2015 International Workshop on Network and Systems Support for Games (NetGames). 2015. p. 1–6.
37. Palomba A. Digital seasons: How time of the year may shift video game play habits. Entertain Comput. 2019;30:100296.
38. Hausman JA. Valuation of new goods under perfect and imperfect competition. In: Bresnahan TF, Gordon RJ, editors. The economics of new goods. University of Chicago Press. 1996. p. 207–48.
39. Nevo A. Measuring market power in the ready-to-eat cereal industry. Econometrica. 2001;69(2):307–42.
40. Greene WH. Econometric analysis. 8th ed. Prentice Hall. 2018.
41. Wijkstra M, Rogers K, Mandryk RL, Veltkamp RC, Frommel J. How to tame a toxic player? A systematic literature review on intervention systems for toxic behaviors in online video games. Proc ACM Hum-Comput Interact. 2024;8(315).

PLoS One. 2025 Jun 11;20(6):e0325462. doi: 10.1371/journal.pone.0325462.r001

Author response to Decision Letter 0

23 Aug 2024

PLoS One. doi: 10.1371/journal.pone.0325462.r002

Decision Letter 0

Bernard Fong

26 Feb 2025

PONE-D-24-33185

Uncovering the Effect of Toxicity on Player Engagement and its Propagation in Competitive Online Video Games

PLOS ONE

Dear Dr. Morrier,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Apr 11 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Bernard Fong

Academic Editor

PLOS ONE

Journal Requirements:

1. When submitting your revision, we need you to address these additional requirements. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please update your submission to use the PLOS LaTeX template. The template and more information on our requirements for LaTeX submissions can be found at http://journals.plos.org/plosone/s/latex.

3. Thank you for stating in your Funding Statement: Activision provided funding for this study through a sponsored research grant. Please provide an amended statement that declares *all* the funding or sources of support (whether external or internal to your organization) received during this study, as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now. Please also include the statement “There was no additional external funding received for this study.” in your updated Funding Statement. Please include your amended Funding Statement within your cover letter. We will change the online submission form on your behalf.

4. Thank you for stating the following in the Competing Interests section: AM contributed to this article while being employed by Activision. The opinions expressed by the authors do not represent the views of Activision. The other authors state that they have no competing interests. We note that one or more of the authors are employed by a commercial company: Activision.

a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form. Please also include the following statement within your amended Funding Statement. “The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.” If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc. Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: “This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests). If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

5. In the online submission form you indicate that your data is not available for proprietary reasons and have provided a contact point for accessing this data. Please note that your current contact point is a co-author on this manuscript. According to our Data Policy, the contact point must not be an author on the manuscript and must be an institutional contact, ideally not an individual. Please revise your data statement to a non-author institutional point of contact, such as a data access or ethics committee, and send this to us via return email. Please also include contact information for the third party organization, and please include the full citation of where the data can be found.

6. Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please move it to the Methods section and delete it from any other section. Please ensure that your ethics statement is included in your manuscript, as the ethics statement entered into the online submission form will not be published alongside your manuscript.

Additional Editor Comments:

Both reviewers believe the paper has good potential and have pointed out some areas for improvement. The authors are advised to systematically address the reviewers' comments and resubmit their revised manuscript.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: General comment:

This manuscript provides a well-structured investigation into the impact of toxicity on player engagement in online gaming, using "Call of Duty: Modern Warfare III" as a case study. The study investigates the influence of toxic language on players’ level of engagement and on their use of similar toxic language in online video gaming. While the study’s specific focus on toxicity in a first-person action video game shows some originality, and its design and methodological rigor are commendable, there are critical areas that require further clarification, elaboration, and adjustment to enhance the robustness and clarity of the findings. Specifically, several major issues need to be addressed, including the confusing logical flow of the literature review, the lack of representative studies in the literature review, an unclear conceptual framework, the lack of stated hypotheses, and the lack of proper academic English writing. In addition, the manuscript provides no line numbers, which makes it more difficult to read and review. These fundamental issues should therefore be revised. I hope my comments below help to clarify what I am thinking and shed light on some ways to tackle these problems moving forward.

Introduction:

Lack of background information: The primary issue with this manuscript is the lack of necessary background information on the context, namely the use of toxic language and behavior in online gaming settings, to justify the rationale for conducting this study. The authors give three reasons to reinforce the significance of this study, but all three lack empirical evidence to support them. For example, at the beginning of the manuscript, the authors could first describe the current state of online gaming (especially first-person action video games), using empirical statistics to show its development in recent years as well as its significant influence on players. Then, the authors might refer to evidence from existing articles to further articulate the widespread phenomenon of toxicity in online gaming, which can also lead to negative effects and behaviors. After that, the authors can move on to the effects to be estimated and the variables involved, including exposure to toxicity, player engagement, and the probability of using similar toxic language. The authors should discuss each hypothesized relationship one by one and provide empirical justification to support the hypotheses.

Literature Gaps: Another major issue in the introduction is the lack of a thorough review of the current literature on the topic. While the introduction provides a rationale, it would benefit from an explicit identification of the specific gaps in the literature that this study seeks to address. Detailing these gaps would position the research more clearly. When discussing the effect of exposure to toxic language in online gaming on player engagement and the subsequent use of similar toxic language, for example, the authors should conduct a thorough review of the existing literature to show that each hypothesis is theoretically supported. For example, when discussing the effect of exposure to toxic language on player engagement, the authors should define toxic language and player engagement. After that, the authors should discuss previous literature on the influence of toxic language on engagement in other areas, such as team sports and other gaming settings. Then, based on these discussions, the authors can finally arrive at the hypothesized relationship between exposure to toxic language and engagement in online gaming. Also, please include a clear statement of each hypothesized relationship based on the preceding discussion, such as “H1-1: Exposure to toxic language in first-person online gaming can significantly influence players’ gaming engagement.” These are essential elements that should be included in this quantitative study.

Finally, the third and fourth paragraphs on page 3 and the second paragraph on page 4 should be moved to the method section, as they describe the research techniques implemented in this study. Moreover, it would be better to clearly state the research purpose and briefly note the significance of the study at the end of the introduction. In addition, a conceptual framework showing the overall hypothesized causal relationships among the variables should be depicted.

Method:

Dataset Scope and Representation: While it’s noted that the dataset contains data from "Call of Duty: Modern Warfare III" over one month, the representativeness of this timeframe needs elaboration. For example, is this month indicative of typical player engagement, or could seasonal factors (like game launches or holidays) influence toxicity levels? Including a rationale for selecting this timeframe would support the study's generalizability.

Moreover, the dataset description mentions that exposure data is unavailable for 34.6% of toxic statements. It would be valuable to expand on why this data is unavailable and how the absence might affect outcome reliability. For instance, is there any indication that missing data is systematically related to certain types of matches or player demographics? Clarifying this could strengthen the data’s integrity.

Rationale for Instrumental Variables (IV): The choice of instrumental variables, especially the “leave-one-out” approach, is intriguing but requires further justification. The authors could provide a brief comparison with alternative approaches, explaining why IV is particularly suited for isolating toxicity's causal effect on player engagement.

Explanation of Variables: The notation used to specify variables, such as $y_{ij}$ for the outcome, $\alpha_j$ for the intercept, and the vector $x_{ij}$, might benefit from clearer definitions. A table listing each variable and its description, along with how each represents the model’s components, would improve accessibility for readers less familiar with econometric modeling.

Down-Sampling Implications: Since the dataset contains millions of observations, a down-sampling technique was employed. Expanding on how down-sampling was conducted—whether through a random sample, stratified by certain variables, or other means—would clarify whether the down-sampled data accurately represents the overall dataset. Furthermore, discussing whether down-sampling could introduce bias in estimated effects is essential. For instance, is there a possibility that high-intensity matches or those with certain toxicity patterns are underrepresented? Addressing this could include a sensitivity analysis to check the robustness of results with varying down-sample sizes.

Ethical issue: It is also necessary to provide a detailed explanation of the ethical review process and the decision, including the ethical application reference number, the procedures, and documents submitted for ethical review.

Results:

Summary of findings: The authors should include a separate paragraph at the beginning of the results section summarizing the findings of the analysis. This guiding paragraph would give the reader a general understanding of the elaboration that follows.

Magnitude of Effect on Match Re-Entry Times: The reported delay in re-entering the next match (up to 60.68 hours in some contexts) requires further interpretation. This substantial effect size suggests strong aversion, but additional analysis explaining the practical implications of this delay is needed. For instance, how does this delay affect overall player retention or attrition? Offering more justification or comparing this delay to normal re-engagement times in other contexts could contextualize its significance.

Summarizing Key Findings in a Table: A summary table that consolidates the main findings—especially the effect sizes for different sources of toxicity and contexts (opponent, same-party, different-party)—would be a helpful quick reference. This table could include columns for source of toxicity, effect on re-engagement time, effect on propagation probability, and statistical significance.

Discussion:

Integration with Existing Literature: While the discussion synthesizes the study’s findings, it would benefit from a deeper integration with prior research on toxic behavior in online environments and social contagion theory. Specifically, linking findings on toxic behavior propagation to similar behaviors observed in social networks, online forums, or even real-world sports settings would situate this study within a broader context. This could also include comparisons to other studies that have observed peer influence in different online gaming contexts.

Explaining Mechanisms of Toxicity Propagation: The study reveals that exposure to toxicity, particularly within same-party contexts, has a strong effect on propagation. Expanding on the psychological or social mechanisms that may drive this phenomenon—such as social learning, peer influence, or in-group favoritism—could provide theoretical insight. Including references to theories of group behavior or behavioral modeling in competitive settings would add depth to the discussion.

Theoretical contributions and practical implications: Based on the results and analysis, the authors should include a paragraph elaborating on the theoretical contributions to the current research area, such as the research gap the study fills, the questions it answers, or the knowledge it adds to the existing literature. A further paragraph illustrating the practical implications for practitioners is also a must. The discussion could go further by offering specific, actionable recommendations. For instance, the authors could suggest targeted moderation strategies, such as real-time intervention when toxicity is detected in same-party contexts or offering player rewards for positive behavior, particularly after a loss, when players are more vulnerable to engaging in toxicity. Moreover, the study’s prioritization framework for addressing toxicity (e.g., focusing on same-party versus different-party toxicity) is insightful. However, it would benefit from further elaboration on how resources should be allocated. Discussing how game developers could prioritize resources for monitoring and intervention based on match outcomes, party composition, or other specific contexts could provide more practical guidance.

Limitations and Future Research Directions: While the study has a solid methodology, there are inherent limitations that should be explicitly acknowledged. For instance, the reliance on one game and a specific subset of player interactions (e.g., Team Deathmatch mode in “Call of Duty: Modern Warfare III”) may limit generalizability to other gaming environments or demographics. Acknowledging these limitations can provide readers with a balanced understanding of the study’s scope.

Data Availability Constraints: The missing exposure data should be revisited in the limitations. While some technical reasons for this missing data are noted, discussing the potential bias this introduces—particularly if certain types of matches or players are more affected by missing data—would reinforce transparency. The authors could suggest that future studies should ensure a more complete data capture or explore alternative methods to account for missing data.

Future Research Opportunities: This study opens several avenues for future research, and the discussion should explicitly highlight them. Examples might include:

• Cross-Game Comparisons: Studies that compare toxicity propagation across various game types or genres could validate whether findings are unique to the competitive first-person shooter genre or generalizable to other online gaming contexts.

• Longitudinal Effects: Future research could investigate the long-term effects of repeated exposure to toxicity. For instance, does continuous exposure reduce player engagement over months or lead to permanent changes in behavior?

• Intervention Efficacy: Follow-up studies could test the effectiveness of different intervention techniques, such as muting toxic players or rewarding positive behavior. Examining these interventions’ impact on player engagement and community health could provide actionable insights for developers.

By addressing these areas, particularly with greater emphasis on actionable recommendations for game developers and a nuanced consideration of social and psychological mechanisms, the manuscript will offer a more robust and practically relevant contribution to the field. Given the current gaps, a major revision is recommended to ensure that the manuscript meets its full potential in terms of academic rigor, practical applicability, and theoretical significance.

Reviewer #2: In this manuscript, the authors present the effect of toxic language on player engagement based on a popular online game (Call of Duty: Modern Warfare III). For the data analysis, the authors drew on a large pool of data (>50 million observations, >4 million matches, and >4 million players) and used ToxMod for monitoring.

Overall, the authors have done great work in correlating, among other things, toxic language with the average time before a player enters a new match, which is crucial for preserving the engagement of large numbers of players.

In online gaming, and in virtual environments generally, we can identify various types of fun (Lazzaro, 4 keys to fun). In online games like COD, we can find hard fun (“serial” and competitive winners) and people fun (https://www.nicolelazzaro.com/the4-keys-to-fun/). “Hard fun”-ners are the most competitive players, while “people fun”-ners are the players who join to play with their friends and take more enjoyment in the social interaction within the game experience. A first correlation that would expand this study is between players’ level/skill/winning ratio (=> hardcore gamers) and their tendency toward toxic language. For hardcore gamers (“hard fun”-ners), the toxicity they face from other players (both teammates and opponents) may simply be part of the competitive element. An example from real-life sports is the great Michael Jordan, who was a famous trash-talker. Trash-talking was a tool that allowed him to both assert his dominance and amplify his competitive edge (https://www.basketballnetwork.net/off-the-court/why-michael-jordan-only-respected-trash-talk-when-the-score-was-tied). This pushed him to be a better player and gave him extra motivation. The same principles apply here too.

That being said, although toxicity is generally considered a bad thing, it may have the opposite effect on engagement for certain types of players. This is not addressed in the paper, and it would be important to address if we are trying to be more holistic.

On the other hand, “people fun” players may feel discouraged by all this toxicity and become less engaged over time. I suspect they would also have a lower level/skill set than the previous type of players. It is worth checking this parameter too.

So, it would be great if the authors had this data and could correlate this information. Their models of the effects of toxicity would be much deeper, since the total outcome would combine the (obviously) negative effects of toxicity with its positive effects on engagement in specific areas or for specific player types.

One more suggestion is to examine the cultural dimension of this issue and its effects. For technical reasons, such as ping/latency, players are usually assigned to servers located relatively close to them. I do not know if it would be possible with the data already acquired, but it would be really interesting, for both scientific and industry reasons, to see whether toxic speech affects players’ engagement in the same way across geographic locations (e.g., North/Western European vs. Mediterranean players, or US areas).

All in all, the authors have done great work analyzing this data and creating the models. But this is a more complex, deeper, multi-variable problem. One-size-fits-all solutions may raise some aspects of engagement but lower others, so it would be important to mention this and, if possible, to research it too.

**********

6. PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: SHI YUCHEN

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, log in and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jun 11;20(6):e0325462. doi: 10.1371/journal.pone.0325462.r003

Author response to Decision Letter 1

21 Mar 2025

Attached is our detailed response to the reviewers' comments.

Attachment

Submitted filename: Response to Reviewers.pdf

pone.0325462.s001.pdf (254.4 KB, PDF)