Skip to main content
PLOS One logoLink to PLOS One
. 2021 Aug 11;16(8):e0255097. doi: 10.1371/journal.pone.0255097

Proximity can induce diverse friendships: A large randomized classroom experiment

Julia M Rohrer 1,2,*, Tamás Keller 3,4,5, Felix Elwert 6
Editor: Federica Maria Origo7
PMCID: PMC8357142  PMID: 34379633

Abstract

Can outside interventions foster socio-culturally diverse friendships? We executed a large field experiment that randomized the seating charts of 182 3rd through 8th grade classrooms (N = 2,966 students) for the duration of one semester. We found that being seated next to each other increased the probability of a mutual friendship from 15% to 22% on average. Furthermore, induced proximity increased the latent propensity toward friendship equally for all students, regardless of students’ dyadic similarity with respect to educational achievement, gender, and ethnicity. However, the probability of a manifest friendship increased more among similar than among dissimilar students—a pattern mainly driven by gender. Our findings demonstrate that a scalable light-touch intervention can affect face-to-face networks and foster diverse friendships in groups that already know each other, but they also highlight that transgressing boundaries, especially those defined by gender, remains an uphill battle.

Introduction

Friendships matter because social networks shape outcomes ranging from health behaviours to criminal activity and socio-economic achievement [17]. Friendship networks, however, are strongly constrained by homophily: Humans have the well-documented tendency to form and maintain relationships with those who resemble themselves along dimensions such as gender, race, ethnicity, age, religion, and education (e.g., [8]). Thus, social networks often lack diversity, and their benefits are distributed unevenly.

Previous research suggests that proximity between individuals can lead to friendships. In the present study, we investigate to which extent proximity can lead to friendships that transgress group boundaries imposed by homophily. For this purpose, we conducted a large-scale field experiment that randomly seated students in 3rd to 8th grade classrooms next to each other (Hungarian primary education, equivalent to ISCED 1 and ISCED 2 levels according to the international classification). To our knowledge, this is the first randomized experiment of this kind outside of the elite context of college, and the first study to explicitly investigate whether the effects of proximity on friendship are modified by similarity along the lines of gender, educational achievement, and ethnicity.

What do we know about the effects of proximity?

Prior research on the causal effects of proximity on friendships has focused on college freshmen. Multiple studies exploited natural experiments, such as alphabetical seating [9, 10]. Others directly randomized the assignment of college roommates [11], or the seating chart during an introductory meeting of psychology freshmen [12]. Most of these studies find that proximity promotes friendship formation. Some studies further assessed downstream outcomes of induced proximity, establishing positive effects of being randomly assigned a Black roommate on White students’ attitudes and behavior [11, 1318].

Open questions

Prior findings are promising but leave two important questions open. First, prior work studied college freshmen, a highly selected and comparatively homogeneous elite in unusual circumstances, especially if they live in dorms. Having recently relocated from the parental home to college, college students must quickly construct a new social network among strangers from scratch [19] and thus may be particularly susceptible to the effects of proximity. This raises the question whether previous findings generalize to other populations in more quotidian and scalable settings. The only recent (non-randomized) study on the effects of being seated next to each other on friendships among school-age children failed to find an effect of proximity on friendship nominations [20, 21].

Second, little is known about the boundary conditions of the effects of proximity. Previous studies have addressed whether or not proximity can lead to inter-ethnic friendships (in the college context), but have neglected other potentially relevant dimensions of socio-cultural diversity. For example, it is unknown whether proximity can lead to friendships among students with different levels of educational achievement, which may expose lower-achieving students to positive role models; or whether proximity can lead to mixed-gender friendships, which may discourage the development of gendered attitudes and communication styles that are linked to power asymmetries in adulthood [2224].

The present study

To investigate the effects of proximity on friendship in general, and the extent to which proximity can promote boundary-crossing friendships among school-age children in particular, we conducted a large pre-registered field experiment. We randomized the seating charts within 182 3rd through 8th grade classrooms in 40 primary schools in rural Hungary for the duration of the Fall 2017 semester (5 months; ordinarily, the majority of seating charts would be designed by teachers, see S2 Text). We assessed best-friend nominations at the beginning of the subsequent Spring 2018 semester to test the expectation that being seated next to each other had a positive causal effect on friendships (deskmate hypothesis). In contrast to previous studies among college students, we thus investigated the effects of proximity in a scalable environment (because nearly all children must attend school), at younger ages, and in groups that already know each other well (from 1st grade onwards).

Since humans tend to form friendships with self-similar others, the friendship-inducing effect of proximity is likely tampered by homophily. We therefore expect that induced proximity should promote friendship especially among individuals who resemble each other (modification-by-similarity hypothesis). In this study, we investigate effect modification by students’ dyadic similarity along the three salient dimensions of gender, educational achievement, and ethnicity (Roma/non-Roma) to quantify the extent to which proximity can promote diverse friendships with respect to these categories.

Materials and methods

Pre-registration

We follow a detailed pre-analysis plan, filed at the RCT registry of the American Economic Associations before the receipt of any outcomes data on April 13, 2018 (see https://doi.org/10.1257/rct.2895-1.0). Deviations from the plan are explained in S1 Text and on the project page on the Open Science framework (https://osf.io/4vjc5/), which archives the data necessary to reproduce our central analyses, all analytic scripts and more detailed results.

Study overview

We executed a large-scale field experiment in primary schools in Hungary (általános iskola). Classroom-seating charts were randomized for the duration of the fall semester (September 2017 through January 2018). Outcome variables were collected through student surveys between February and April 2018. The analytic sample for the central analyses consisted of N = 2,996 students (forming 24,962 dyads) within 182 3rd through 8th grade classrooms at 40 schools. Of these students, 48.2% (N = 1,447) were female; and 22.2% were of Roma ethnicity (N = 666). Ethnicity was missing for 4.5% of the sample (N = 136). Students’ ages ranged from 8 to 17 years (M = 11.88, SD = 1.80); the high maximum age is due to students who had to repeat classes. Class sizes ranged from 10 to 33 students (M = 19.42).

Recruitment and sample

In the spring of 2017, we contacted all primary schools in 7 contiguous counties of central Hungary, excluding the capital city of Budapest, via the heads of the local school districts to elicit information about classroom layouts and seating practices. We aimed to enroll all 3rd through 8th grade classrooms in which (1) teachers would implement our randomized seating chart in three core subjects: Hungarian literature, Hungarian grammar, and mathematics; (2) all students would receive instruction in these subjects together (e.g., no ability grouping); (3) the classroom layout would form a grid of freestanding forward-facing 2-person desks.

After obtaining initial participation agreements from principals and teachers at 55 schools and dropping schools and classrooms that did not meet our inclusion criteria (see pre-analysis plan for details), the pre-analysis plan anticipated a sample of N = 3,814 students across 195 classrooms at 41 schools. The pre-analysis plan also stipulated additional exclusion criteria going forward. Following these pre-registered criteria, we dropped (in this order) 13 classrooms (containing 226 students) in which fewer than 30% of students answered the friendship-nomination item; 391 students who did not answer the friendship-nomination item; 36 students with missing values on at least four of the seven variables comprising the similarity index; and 113 students who were assigned to sit alone (as a robustness check, we also report models including those students). Subsequent data inspection resulted in the exclusion of 11 doubly entered students, 5 students whose classrooms were smaller than the pre-registered minimum size of 10, and 36 students who had left their classrooms before the intervention. The final analytic sample consisted of N = 2,996 students forming 24,962 dyads within 182 3rd through 8th grade classrooms at 40 schools.

Pre-registered balance checks reported in S3 Text indicate excellent balance on all baseline covariates in the final analytic sample.

Intervention and exposure variable

Before the start of the fall semester 2017, we randomly assigned the students within each classroom to freestanding forward-facing two-person desks via unconstrained random partitioning, using a random number generator. We based the randomization on the class rosters from the preceding spring semester and stipulated a replacement algorithm to account for the small number of students who would exit or enter the class roster during the summer, with the aim of plausibly preserving randomization. Teachers were instructed to fill the seats of exited students with entering students from left to right, front to back, in alphabetic order of entering students’ surnames. We call the seating chart resulting from randomization and algorithm-compliant replacements the “intended seating chart.” The intended seating chart underlies all of our analyses (intention-to-treat analyses). Our central experimental exposure variable is coded = 1 for each dyad within a classroom that comprises deskmates in the intended seating chart, and = 0 otherwise.

Teachers were instructed to employ the intended seating chart in the three core subjects of the curriculum—mathematics, Hungarian literature, and Hungarian grammar—from the first day of classes (September 1, 2017) until the end of the fall semester (January 31, 2018). These three subjects form the core of the curriculum and receive the greatest weight in admission to selective secondary schools. They were taught in the same room for all grade levels and accounted for 6 to 10 lessons per week (25 to 45 percent of all lessons). Enforcing the seating chart across all subjects was not possible because (1) in some subjects, classrooms were split into smaller groups (e.g., different foreign languages), and (2) in some subjects, students were not seated in a fixed grid-layout (e.g., physical education and arts). However, seating charts typically apply to all subjects in a given room, and (depending on the grade level) most subjects were taught in the same room. Thus, students assigned to sit next to each other in the three core subjects likely also sat next to each other in other subjects; but we did not verify adherence outside of the core subjects.

Teachers were permitted to reseat students, but were asked to preserve deskmate assignments wherever possible. For example, if a student had to be moved to the front of the classroom because of vision problems, we asked that her deskmate be moved with her. We assessed compliance via teacher reports after the second week of classes and via classroom visits by our field-staff throughout the fall semester; 94.4 percent of the dyads in which students actually sat next to each after the second week of classes comprised students who were supposed to sit next to each in the intended seating chart.

Baseline covariates and similarity index

At the beginning of the study, classroom teachers reported students’ characteristics including student’s gender (male vs. female); ethnicity (Roma Hungarian vs. non-Roma Hungarian); and end-of-semester grades for spring 2017 in Hungarian literature, Hungarian grammar, mathematics, diligence and behavior. Grades ranged from 1 (worst) to 5 (best) for literature, grammar, and mathematics; and from 2 to 5 for diligence and behavior. From these fives grades, we generated the grade point average (GPA). Following the pre-analysis plan, missing teacher reports of baseline covariates (3.2 to 3.6% of the total sample) were filled in with students’ self-reports collected at endline.

To quantify the similarity between students, we calculated Gower’s general coefficient of similarity [25] for each dyad within a classroom. Gower’s index is a simple metric to quantify the similarity between two units along a number of variables that may be qualitative and/or quantitative. We calculated the similarity index based on students’ gender, ethnicity, and baseline grades in literature, grammar, mathematics, behavior, and diligence. Gender and ethnicity were each weighted by a factor of 1; every baseline grade was weighted by a factor of 1/5 (i.e., all grades together received a weight of 1). Gower’s index ranges from 0 (maximum possible dissimilarity along all dimensions) to 1 (perfect similarity along all dimensions). For example, a Roma girl and a non-Roma boy whose grades are all at opposing ends of the scale would receive a similarity index of 0; two Roma girls with exactly the same grades would receive a similarity index of 1. We standardized the similarity index for further analysis.

Friendship nominations

At the end of the study, students were asked to nominate up to 5 of their “best friends” within the classroom as part of a written 45-minute in-class survey. In this study, the primary outcome is students’ reciprocated friendships, coded = 1 if both dyad members nominated each other as best friends and = 0 otherwise. As robustness checks, we also analyzed best-friend nominations within the classroom regardless of reciprocation.

Statistical analyses

Deskmate hypothesis

We begin by evaluating the hypothesis that induced proximity fosters friendship between students. The deskmate effect is the net effect of sitting next to each other on friendship formation and friendship dissolution: students who were seated next to each other may either be induced newly to form a reciprocated tie, or to forego the dissolution of an existing tie.

We model the effect of sitting next to each other on reciprocated friendship nominations using a Bayesian multi-membership multilevel probit model. This is a dyad-level model with one observation for each unordered dyad consisting of students i and j in classroom c,

Friendship{ij}c*=b0+bD*Deskmate{ij}c+Classroomc+s{i,j}Studentsc+ϵ{ij}c (1)

where Friendship{ij}c* is the latent continuous friendship propensity of the dyad; Deskmate{ij}c = 1 if the students in the dyad are deskmates and = 0 otherwise; Classroomc is a vector of classroom fixed effects to account for randomization within classrooms; and ϵ{ij}c~N(0,σϵ2) is the i.i.d. dyad-specific error term. The term ∑s∈{i,j}Studentsc refers to the two i.i.d. random effects for the students s in the dyad, Studentsc~N(0,σStudent2). The latent continuous friendship propensity is linked to manifest friendship via the threshold function Friendship{ij}c = 1 if Friendship{ij}c*>0 and = 0 otherwise.

We interpret the results of this model in two complementary ways to assess the causal effect of sharing a desk: (i) on the latent continuous friendship propensity, Friendship{ij}c*, and (ii) on the probability of a manifest binary friendship nomination, Friendship{ij}c. First, following preferred practice in certain research fields (e.g., psychology), we present bD, the probit coefficient for the deskmate indicator, which estimates the effect of sharing vs. not sharing a desk on the latent propensity for forming a reciprocated friendship. As always in probit (or logit) models, this parameter is identified only up to scale [26], so that the coefficient should only be interpreted qualitatively as evidence about the direction of the deskmate effect on the latent friendship propensity.

Second, following preferred practice in other fields (e.g., economics and sociology), we present the average marginal effect (AME) of sharing vs. not sharing a desk on the probability of forming a manifest reciprocated friendship. Since probit models are non-linear probability models, the effect of deskmate exposures on the probability of manifest friendship nominations likely varies across dyads. The AME is the average of the effects of sitting vs. not sitting next to each other on the probability of forming a reciprocated friendship across all dyads. For clarity, we present AMEs alongside the average of the predicted probabilities (predictive margins) for forming a reciprocated friendship if all dyads were deskmates and if all dyads were not deskmates.

We evaluate the evidence for the deskmate hypothesis by computing Bayesian 95% credible intervals (CI95) around our point estimates (probit coefficients, AMEs, and predictive margins, respectively). Bayesian CI95 are defined as the intervals into which the unobserved parameter falls with 95% probability, incorporating information from the Bayesian priors. While there are fundamental differences between frequentist and Bayesian statistics, from a pragmatic perspective, Bayesian credible intervals and frequentist confidence intervals often lead to very similar numerical results [27]. We interpret credible intervals that exclude zero as evidence for the presence of an effect.

Importantly, since seating charts were randomized within classrooms, the estimated effect of sharing a desk—evaluated either as the effect on the latent friendship propensity or as the AME on the probability of manifest friendships—has a causal interpretation after controlling for classroom fixed effects.

Modification-by-similarity hypothesis

Next, we evaluate the hypothesis that the causal effect of sharing a desk on friendship formation increases when students resemble each other more with respect to baseline characteristics. Since dyadic similarity was not randomized, our analyses do not identify the causal effect of similarity on friendship. Instead, we estimate (i) observational homophily, i.e., the extent to which similar individuals happen to befriend each other regardless of sharing a desk, and (ii) effect modification, i.e., the extent to which the causal deskmate effect varies with observed dyadic similarity (see [28] on the difference between causal interaction and effect modification). The model evaluating observational homophily adds Gower’s (1971) index of similarity to the model of Eq (1). The model evaluating the similarity hypothesis (i.e., effect modification) further adds the product term of the index of similarity and the deskmate indicator.

For graphical representations, we define “low” (“high”) values of Gower’s similarity index as values falling 1 SD below (above) the sample mean across all dyads.

To explore which specific dimensions of the similarity index drive heterogeneity in the deskmate effect, we also estimate secondary models that allow the deskmate effect to vary with indicators of the dyad’s gender constellation (boy-boy, girl-girl, mixed-gender), ethnic constellation (both Roma, both non-Roma, mixed), or GPA. Analyses of modification by GPA incorporate two variables, the absolute GPA difference within the dyad and the dyad’s mean GPA (to control for grade levels in order to isolate the role of grade differences). The analyses follow the same logic as for the index of similarity (i.e., we first add the indicator of similarity on the respective dimension to evaluate observational homophily, and then we further add its product term with the deskmate indicator to evaluate the modification-by-similarity hypothesis).

Special care must be taken when interpreting the coefficients of the product terms between the deskmate indicator and similarity measures for evidence about the modification-by-similarity hypothesis. Qualitative conclusions about interaction effects in non-linear models, such as probit or logit models, can depend on the scale of the outcome [29, 30]. For example, two variables that relate to the outcome additively on one scale may relate to the outcome multiplicatively after the model undergoes a non-linear transformation that changes the scale of the outcome. Hence, when evaluating effect modification or interactions between variables in a probit model, it is possible that two variables (here, the deskmate indicator and dyadic similarity) statistically interact in their effect on the latent continuous outcome (here, the latent friendship propensity) but do not interact in their effect on the probability of the manifest binary outcome (here, the probability of friendship nominations), or vice versa—and this can result simply from mechanically transforming probit coefficients into AMEs after a given model has been estimated on given data.

Despite strong, and at times conflicting, preferences across methodological communities [29, 3133], no outcomes scale is inherently superior to another. Analysts who are interested in how the effect of sharing a desk on the latent friendship propensity is modified by similarity would inspect the probit coefficient on the interaction between the deskmate indicator and the similarity measure. This would make sense, for example, for analysts who want to know whether all groups of students are similarly nudged toward more (or less) positive relations. By contrast, analysts who are interested in effect modification in the effect of sharing a desk on manifest friendship nominations would compare group-specific AMEs. This would make sense, for example, if analysts believe that sharp classifications into “friend” vs. “not a friend” matter for classroom dynamics. Since we are interested in both qualities of the friendship network (latent friendship propensities and manifest nominations), we present probit coefficients and AMEs (accompanied by the relevant predictive margins) alongside each other.

As before, we statistically evaluate our estimates (probit coefficients, AMEs, and predictive margins, respectively) by computing the relevant Bayesian CI95. Additionally, in order to evaluate whether there is evidence for any variation between multiple groups (e.g., those defined by Gower’s index, or by gender, ethnicity, or GPA constellations), we compare models with and without deskmate-by-group interactions by inspecting the difference in their expected predictive accuracies, DiffELPD (a Bayesian measure of model fit), computed via approximate leave-one-out cross-validation. Following convention, we conclude in favor of a model if DiffELPD is at least twice its standard errors, DiffELPD ≥ 2 * SEDiffELPD).

Robustness checks

We explored several robustness checks for the deskmate and similarity hypotheses. First, in addition to reciprocated friendships, we also analyzed friendship nominations regardless of reciprocation. These analyses only differed from the previously described models in that (i) they included twice as many dyads, because every unordered dyad corresponds to two ordered sender-receiver dyads; and (ii) they included random effects for senders and receivers. In contrast to the analysis of reciprocated friendships, lower-level units (dyads) where thus nested within one higher-level unit of the classification sender and one higher-level unit of the classification receiver, resulting in a cross-classified multilevel probit effect model.

Second, we address two methodological concerns (especially in economics) about probit fixed effects models, such as the models introduced above. First, it is known that non-linear fixed effects models can be problematic in small panels (here, small classrooms) [34]. To address this concern, we re-estimated our primary models by substituting class-size indicators for the classroom fixed effect, thus replacing our pre-registered (and potentially problematic) fixed-effects model with a more conventional covariate-adjusted model. This substitution is permissible since the fixed effect is only needed to control for differences across classrooms in the probability that a given dyad is a deskmate dyad. Since this probability only depends on class size, controlling for class size is sufficient for causal identification. Second, to address the more general skepticism about non-linear models in parts of the social sciences, we estimated linear probability models (LPMs) for the probability of manifest friendship nominations. Among other advantages, LPMs, in contrast to probit and logit models, do not rest on distributional assumptions about the structural errors of the latent continuous friendship propensity. These LPMs mirror the specification of the main analyses described above.

In short, results for all three types of models (probit with fixed effects, probit controlling for classroom size, LPM with fixed effects) were extremely similar. The largest absolute difference in the estimated AMEs across models was 1.6 percentage points, with most discrepancies well below 1 percentage points, which does not affect our qualitative conclusions.

We present additional explanations behind all analyses, all model outputs, and a table contrasting the resulting estimates across different model specifications on the Open Science Framework.

Attrition

About 10% of students were omitted from our main analysis, because they did not provide friendship nominations (e.g., because they lacked parental consent for the endline survey, did not attend school on the day of the assessment, or skipped the question). Multivariate non-response models indicated some selective non-response. While gender and ethnicity did not predict missingness (p >.12); a 1 SD increase in GPA predicted a 2.4 percentage point increase in the probability of response (p = .001); and a 1 SD increase in similarity (Gower’s index) predicted a small but statistically significant decrease of 1.5 percentage points in the probability of response (p = .004). To address possible bias from selective attrition, we ran two additional sets of analyses.

First, we estimated a lower bound for the deskmate effect by imputing missing friendships nominations under extremely conservative assumptions: whenever nominations were missing, we assumed that (1) the student did not nominate their deskmate and (2) the student nominated all non-deskmates who had nominated them. This minimized the number of friendships between deskmates and maximized the number of friendships between non-deskmates. Second, we re-ran the central analyses with dyadic non-response weights. The resulting estimates identify the causal effect of interest under the assumptions that our non-response model is correctly specified. A more detailed description, the full analysis code and results of these additional analyses can be found on the Open Science Framework.

Software

All models were estimated in the R [35] software package brms [36, 37] using R Studio [38]. We used the default Bayesian priors in brms, which are non-informative, or very weakly informative. All figures were created in ggplot2 [39].

IRB approval and consent

This study was reviewed and approved by the IRB offices at the Center for Social Sciences, Budapest (data collection and analysis), and at the University of Wisconsin-Madison (data analysis). Consent was obtained at multiple points. School districts, school principals, and teachers provided written consent to participating in the seating chart randomization. Parents provided written consent for the retrieval of administrative records via teachers, and for their children’s participation in the survey.

Results

Deskmate hypothesis: Effect of the intervention on friendships

We analyzed the effect of being seated next to each other (for the duration of one semester) on students’ reciprocated friendships within the classroom (after the end of the semester) using Bayesian multi-membership multilevel probit models. We report results first as probit coefficients for the effects on students’ latent continuous propensity toward friendship (Table 1), and second as average marginal effects (AME) for the effects on students’ predicted probability of a manifest reciprocated friendship. Since AMEs are non-linear functions of the probit coefficients, they answer different questions and may lead to qualitatively different conclusions (see Methods).

Table 1. Results of Bayesian multi-membership multilevel probit models for the effects of sitting next to each other on reciprocated friendship.

Main Analysis Modification by Overall Dyadic Similarity
Estimate 95% CI Estimate 95% CI
b0 -0.96 [-1.20; -0.72] -1.49 [-1.84; -1.14]
bDeskmate 0.27 [0.19; 0.35] 0.29 [0.19; 0.39]
σStudent 0.04 [0.00; 0.10] 0.52 [0.46; 0.58]
bSimilarity 0.83 [0.79; 0.86]
bSimilarity*Deskmate 0.07 [-0.05; 0.18]
NDyads 24,962 24,962
NStudents 2,996 2,996

The results show that sitting next to each other had a large positive effect on students’ friendships. The intervention increased the latent continuous friendship propensity of a dyad (Table 1, Main Analysis, bDeskmate = 0.27; CI95: [0.19, 0.35]), and it increased the predicted probability of a manifest friendship by 7.0 percentage points (CI95: [4.6; 9.4]), from 15.3 percent to 22.3 percent. This evidence confirms the deskmate hypothesis: induced proximity fostered friendships.

Our conclusions remained unchanged when including students in the analysis who were assigned to sit alone, and when analyzing directed friendship nominations (regardless of reciprocation; see detailed results on the Open Science Framework). Excluding dyads who did not adhere to treatment resulted in a slightly larger effect estimate of 7.2 percentage points (CI95: [4.9; 9.8]). In models in which we allowed the deskmate effect to vary between classrooms, we found some variability of the deskmate effects, but the differences were substantively small (SDDeskmate effect = 0.09, CI95: [0.00, 0.23]), see S1 Fig. These models also suggested that the number of students in the classroom did not modify the deskmate effect (bottom tertile, 17 students or fewer: AME = 7.5 percentage points, CI95: [4.5, 10.5], top tertile, more than 20 students: AME = 7.3 percentage points, CI95: [4.4, 10.5]).

Imputing missing outcomes in the most conservative manner results in a lower bound estimate of b = 0.17, CI95: [0.09, 0.24]). In this model, sitting next to each other increased the probability of a manifest friendship by 4.0 percentage points (CI95: [2.0; 6.1]), from 14.6 percent to 18.7 percent. Lastly, applying non-response weights, we estimated that the deskmate effect was b = 0.24, CI95: [0.15, 0.33]). In this model, sitting next to each other increased the probability of a manifest friendship by 5.9 percentage points (CI95: [3.5; 8.4]), from 14.8 percent to 20.8 percent.

Modification-by-similarity hypothesis: Moderating role of overall similarity

We documented observational homophily by inspecting the association between reciprocated friendship and Gower’s index for dyadic similarity between students [25], which included students’ gender (boy vs. girl), educational achievement (baseline grade-point average [GPA]), and ethnicity (Roma vs. non-Roma). As expected, there was a strong association between dyadic similarity and dyads’ tendency to form a reciprocated friendship (bSimilarity = 0.83 per SD of the similarity index, CI95: [0.80, 0.86]; AME = 9.0 percentage points, CI95: [8.5, 9.5], from low [- 1 SD] to average similarity; AME = 20.03 percentage points, CI95: [19.4, 21.4], from average to high [+ 1 SD] similarity; NStudents = 2,996, NDyads = 24,962).

We tested the modification-by-similarity hypothesis by asking whether the causal deskmate effect varied with students’ dyadic similarity. Support for the modification-by-similarity hypothesis was scale dependent, as is often the case in non-linear probability models (see Methods). We did not find evidence that similarity modified the deskmate effect with respect to the latent continuous friendship propensity (see Table 1, Overall Similarity), and model fit did not improve when including it (DiffELPD = 1.2 = 0.7*SEDiff ELPD in favor of the more parsimonious model without the interaction term).

By contrast, students’ dyadic similarity positively modified the deskmate effect with respect to the probability of manifest friendships (Fig 1A and 1B). The AME of sitting next to each other on the probability of manifest friendships was AMELow = 1.5 percentage points (CI95: [0.2, 3.1]) for dyads of low similarity; AMEAverage = 5.7 percentage points (CI95: [3.4, 8.0]) for dyads of average similarity; and AMEHigh = 11.7 percentage points (CI95: [7.7, 15.6]) for dyads with high similarity. The 95% credible intervals for the differences between the deskmate effects for dyads with low, average, and high similarity, respectively, comfortably excluded zero (ΔAMEAverageLow = 4.2 percentage points, CI95: [0.3, 5.7]; ΔAMEHighAverage = 6.1 percentage points, CI95: [2.6, 9.8]).

Fig 1. Model predictions for the effect of sitting next to each other on reciprocated friendships.

Fig 1

Left column displays the predicted friendship probabilities (predictive margins); right column displays the corresponding average marginal deskmate effects (differences in the predicted friendship probabilities) in percentage points. (A) and (B): effect modification by overall similarity (low: -1 SD, high: +1 SD on Gower’s similarity index based on gender, ethnicity, and baseline GPA). (C) and (D): effect modification by the gender composition of the dyad. (E) and (F): effect modification by absolute GPA difference between the students (low: -1 SD, high: +1 SD relative to the mean absolute difference). (G) and (H): effect modification by the ethnic (Roma/non-Roma) composition of the dyad.

Together, these results indicate that sitting next to each other equally increased the latent continuous friendship propensity for all dyads, and it also increased the probability of manifest friendships, even for fairly dissimilar dyads. But since the baseline friendship propensity was much larger among similar dyads (due to homophily), increasing the friendship propensity by a fixed amount pushed more similar dyads than dissimilar dyads across the threshold of manifest friendship. Thus, seating similar students next to each other resulted in more additional reciprocated friendships than did seating dissimilar students next to each other. Imputing missing values in the most conservative manner did not change conclusions regarding the lack of an interaction on latent friendship propensities. Furthermore, we still observed an interaction on the probability of manifest friendships (i.e., 95% credible intervals for the differences between the deskmate effects for dyads with low, average, and high similarity exclude zero), but all average marginal effects were somewhat smaller and the 95% credible interval now contained zero for low-similarity dyads: AMELow = 1.7 percentage points (CI95: [−0.4, 1.9]); AMEAverage = 3.1 percentage points (CI95: [1.2, 5.1]); and AMEHigh = 7.6 percentage points (CI95: [4.0, 11.1]). The same pattern held for analyses applying non-response weights, with average marginal effects falling between the estimates from the pre-registered complete-case analysis and the lower bound analysis: AMELow = 1.1 percentage points (CI95: [−0.1, 2.7]); AMEAverage = 4.8 percentage points (CI95: [2.5, 7.2]); and AMEHigh = 10.6 percentage points (CI95: [6.5, 15.0]).

Modification-by-similarity hypothesis: Moderating influence of gender, educational achievement and ethnicity

To better understand the modifying role of dyadic similarity for the effect of proximity on friendship, we performed separate follow-up analyses along each dimension of similarity. For the estimated coefficients, see S1 Table. A more detailed summary of the results, including all estimates, credible intervals, and model comparisons can be found on the Open Science Framework.

Gender

Results closely mirrored the results for the overall similarity index. There was strong associational homophily: Same-gender dyads (Nboth boys = 6,700, Nboth girls = 5,848) were much more likely to report a reciprocated friendship than mixed-gender dyads (Nmixed gender = 12,414). Evidence for the modification-by-similarity hypothesis was again scale dependent, with no modification of the effect of induced proximity on the latent continuous friendship propensity, but clear differences in the effects of sitting next to each other on the probability of manifest friendships. This can be seen in Fig 1C and 1D. Being seated next to each other increased the probability of a friendship among mixed-gender dyads by AMEmixed gender = 2.3 percentage points (CI95: [1.0, 3.9]), among all-female dyads by AMEboth girls = 9.3 percentage points (CI95: [4.0, 14.8]); and among all-male dyads by AMEboth boys = 13.1 percentage points (CI95: [8.0, 18.3]). Imputing missing values in the most conservative manner, as well as non-response weighting, led to the same pattern of results (albeit with smaller effect estimates).

Educational achievement

There was once again clear evidence for associational homophily: the larger the absolute difference in baseline GPA between two students (controlling for dyad’s mean baseline GPA), the smaller their propensity to report a reciprocated friendship. With respect to the modification-by-similarity hypothesis, we again found no effect modification with respect to the latent continuous friendship propensity. Furthermore, the intervention increased the probability of a manifest friendship for dyads in which students had similar or dissimilar grades (Fig 1E). While the AMEs of sitting next to each other on the probability of a manifest friendship increased slightly with the similarity of students’ grades (Fig 1F), the 95% credible intervals for comparisons between the AMEs computed at different levels of dyadic similarity included zero. Once again, imputing missing values in the most conservative manner, as well as non-response weighting, led to the same pattern of results, with overall smaller effect estimates.

Ethnicity

Again, we observed associational homophily: ethnically-matched dyads (Nboth Non-Roma = 16,811, Nboth Roma = 2,851) had a higher latent propensity for reciprocated friendships than dyads of mixed ethnicity (NMixed ethnicity dyad = 3,932). And again, we found no evidence that the ethnic constellation of the dyad modified the effect of sitting next to each other on the latent continuous friendship propensity. Considering the effects on manifest friendships, we found ambiguous support for effect modification by ethnic match. There was some rather weak evidence that the average marginal effect was higher for non-Roma dyads than for mixed ethnicity dyads (AMEmixed ethnicity dyadAMEboth NonRoma = 5.8 percentage points, CI95: [0.1, 11.2]; see Fig 1G and 1H, although the upper bound of the CI95 crossed zero in two alternative model specifications, reported on the Open Science Framework). There was no evidence that the AME for Roma dyads was higher than the AME for mixed-ethnicity dyads (AMEboth RomaAMEmixed ethnicity dyad = 0.8 percentage points, CI95: [-6.4, 8.2]). Taken together, this is at best ambiguous evidence for the modification-by-similarity hypothesis with respect to ethnicity, which would predict that the deskmate intervention is more effective at promoting friendships among both non-Roma and Roma dyads compared to dyads of mixed ethnicity. Imputing missing values, as well as non-response weighting, led to the same somewhat unclear pattern of results.

Discussion

We executed a large pre-registered field experiment that randomized the seating charts in 182 3rd through 8th grade classrooms. We found clear evidence for a positive causal effect of proximity on friendship: sitting next to each other at the beginning of the semester substantially increased the probability of students’ mutual best-friendship nominations after the semester had ended. This reverts the Null finding of the only recent, non-randomized, proximity intervention among school-age children [20, 21].

Crucially, our study contributes nuanced new findings regarding the interactions between proximity and homophily in friendship formation. First, replicating prior findings about the importance of homophily as a descriptive characteristic of friendship networks, we established that friendships were more likely to occur between students who shared the same gender, similar levels of academic achievement, and the same ethnicity. Next, we newly investigated the extent to which similarity modified the causal effect of being seated next to each other. Encouragingly, we found no evidence that induced proximity affected the latent continuous propensity towards friendship differentially for similar or dissimilar dyads of students. But since the effect of a given increase in the latent propensity toward friendship on the formation of a manifest friendship also depends on the dyad’s baseline propensity toward friendship, and since more similar dyads have a greater baseline propensity toward friendship (homophily), the intervention was more successful among similar students than among dissimilar students. One potential explanation could be that being seated next to each other may be particularly effective at preventing the dissolution of pre-existing ties (as compared to inducing new ties), which are more prevalent among similar dyads; our design, however, does not allow for the identification of different possible mechanistic explanations. The three dimensions of similarity that we investigated contributed to the overall pattern to varying degrees: Gender showed the clearest effect modification (smaller effects among mixed-gender dyads), with a weak but aligned trend for baseline GPA (smaller effects when grade differences were large), and a somewhat misaligned trend for ethnicity (smaller effects in mixed and in Roma dyads).

Induced spatial proximity nevertheless succeeded in inducing some diverse friendships. Randomly seating boys and girls next to each other doubled their probability of nominating each other as best friends (from less than 2 to 4 percentage points). The intervention also substantially increased friendships between students with strong and weak baseline GPAs (from 11 to 17 percentage points). Finally, whether or not seating Roma and non-Roma students next to each other increased friendships across ethnic lines remained unclear in our data; the estimate was beset with statistical uncertainty due to relatively small numbers of Roma students in the sample and sensitive to assumptions about missing data.

It remains, of course, an open question whether our findings generalize to other settings and countries. Our study took place in a less prosperous area of rural Hungary, where students’ standardized reading and math scores fell below the national average, and fewer parents had graduated from college. Furthermore, study participation depended on teachers’ and schools’ willingness to participate; and it is possible that the included schools share certain features (e.g., a certain degree of openness) that made students more susceptible to the effects of induced proximity. Despite these potential concerns regarding external validity, which naturally arise in field experiments, we consider our findings in this particular setting promising.

We conclude that even small changes in spatial proximity can substantially affect friendships, not only among the strangers studied in previous research, but also in groups that already know each other well. This documents that some friendship networks remain malleable long after intra-group friendships have presumably been established. Furthermore, proximity also increases the propensity towards friendships, and the probability of manifest friendships, that transgress certain socio-cultural group boundaries—even as the transformation of latent propensities into manifest friendships remains to some extent an uphill battle against pervasive homophily.

Re-seating students is a low-cost and scalable intervention. Friendships across gender, achievement, and ethnical divides formed at a young age likely contribute to the development of social skills and shape attitudes with lasting consequences. This suggests the exciting possibility that targeted, low cost, and scalable interventions may reshape social networks to foster positive life outcomes for students, decrease segregation, and improve inter-group relations.

Supporting information

S1 Text. Deviations between the pre-analysis plan and the reported analyses.

(DOCX)

S2 Text. Details regarding pre-treatment variables.

(DOCX)

S3 Text. Balance checks.

(DOCX)

S1 Table. Results of Bayesian multi-membership multilevel probit models investigating the modifying role of single dimensions of similarity.

(DOCX)

S1 Fig. Heterogeneity of the deskmate effect across classrooms.

Probit coefficients from random effects model (left panel) as well as the corresponding model-implied friendship probabilities for non-deskmates versus deskmates (right panel). The difference between each predicted probability for deskmates minus the predicted probability for non-deskmates is the classroom-specific AME.

(TIF)

Acknowledgments

We would like to thank Steffen Nestler, Felix Schönbrodt, Alexander J. Etz, Stefan C. Schmukle, Michael Sobel, Benjamin Rosche, and Jingying He for advice. All errors are ours.

Data Availability

The data are available on the Open Science Framework (https://osf.io/4vjc5/).

Funding Statement

This research is funded by a grant from the Hungarian National Research, Development and Innovation Office (NKFIH), Grant number: FK 125358 to Tamás Keller, and by a Vilas Faculty Mid-Career Award from the University of Wisconsin-Madison to Felix Elwert. The support from the János Bolyai Research Scholarship of the Hungarian Academy of Sciences and from the New National Excellence Program (ÚNKP) of the Ministry of Human Capacities are acknowledged (Grant number: ÚNKP-19-4-BCE-07).

References

  • 1.Burt RD, Peterson AV Jr. Smoking cessation among high school seniors. Prev Med. 1998May;27(3):319–27. doi: 10.1006/pmed.1998.0269 [DOI] [PubMed] [Google Scholar]
  • 2.Christakis NA, Fowler JH. Connected: The surprising power of our social networks and how they shape our lives. Little, Brown Spark; 2009. [Google Scholar]
  • 3.Clark AE, Lohéac Y. “It wasn’t me, it was them!” Social influence in risky behavior by adolescents. J Health Econ. 2007;26(4):763–84. doi: 10.1016/j.jhealeco.2006.11.005 [DOI] [PubMed] [Google Scholar]
  • 4.Granovetter MS. The Strength of Weak Ties. Am J Sociol. 1973May1;78(6):1360–80. [Google Scholar]
  • 5.Granovetter M. The Impact of Social Structure on Economic Outcomes. J Econ Perspect. 2005Mar;19(1):33–50. [Google Scholar]
  • 6.Rivera MT, Soderstrom SB, Uzzi B. Dynamics of Dyads in Social Networks: Assortative, Relational, and Proximity Mechanisms. Annu Rev Sociol. 2010Jun1;36(1):91–115. [Google Scholar]
  • 7.Stadtfeld C, Vörös A, Elmer T, Boda Z, Raabe IJ. Integration in emerging social networks explains academic failure and success. Proc Natl Acad Sci U S A. 2019Jan15;116(3):792–7. doi: 10.1073/pnas.1811388115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.McPherson M, Smith-Lovin L, Cook JM. Birds of a Feather: Homophily in Social Networks. Annu Rev Sociol. 2001Aug1;27(1):415–44. [Google Scholar]
  • 9.Byrne D. The Influence of Propinquity and Opportunities for Interaction on Classroom Relationships. Hum Relat. 1961Feb1;14(1):63–9. [Google Scholar]
  • 10.Byrne D, Buehler JA. A note on the influence of propinquity upon acquaintanceships. J Abnorm Soc Psychol. 1955;51(1):147. doi: 10.1037/h0040985 [DOI] [PubMed] [Google Scholar]
  • 11.Boisjoly J, Duncan GJ, Kremer M, Levy DM, Eccles J. Empathy or Antipathy? The Impact of Diversity. Am Econ Rev. 2006Dec;96(5):1890–905. [Google Scholar]
  • 12.Back MD, Schmukle SC, Egloff B. Becoming friends by chance. Psychol Sci. 2008May;19(5):439–40. doi: 10.1111/j.1467-9280.2008.02106.x [DOI] [PubMed] [Google Scholar]
  • 13.Baker S, Mayer A, Puller SL. Do more diverse environments increase the diversity of subsequent interaction? Evidence from random dorm assignment. Econ Lett. 2011Feb1;110(2):110–2. [Google Scholar]
  • 14.Camargo B, Stinebrickner R, Stinebrickner T. Interracial Friendships in College. J Labor Econ. 2010;28(4):861–92. [Google Scholar]
  • 15.Carrell SE, Hoekstra M, West JE. The impact of college diversity on behavior toward minorities. NBER working paper No w [Internet]. 2016;20940. https://pdfs.semanticscholar.org/e037/de006f7b2db16bfb3d9e7a0c37ede1db524a.pdf
  • 16.Corno L, La Ferrara E, Burns J. Interaction, stereotypes and performance. Evidence from South Africa [Internet]. Institute for Fiscal Studies; 2019 Jan [cited 2019 Oct 20]. Report No.: W19/03. https://ideas.repec.org/p/ifs/ifsewp/19-03.html
  • 17.Van Laar C, Levin S, Sinclair S, Sidanius J. The effect of university roommate contact on ethnic attitudes and behavior. J Exp Soc Psychol. 2005Jul1;41(4):329–45. [Google Scholar]
  • 18.Marmaros D, Sacerdote B. How Do Friendships Form? Q J Econ. 2006;121(1):79–119. [Google Scholar]
  • 19.Cutrona CE. Transition to college: Loneliness and the process of social adjustment. Loneliness: A sourcebook of current theory, research, and therapy. 1982;36:291–309. [Google Scholar]
  • 20.van den Berg YHM, Cillessen AHN. Peer status and classroom seating arrangements: a social relations analysis. J Exp Child Psychol. 2015Feb;130:19–34. doi: 10.1016/j.jecp.2014.09.007 [DOI] [PubMed] [Google Scholar]
  • 21.van den Berg YHM, Segers E, Cillessen AHN. Changing peer perceptions and victimization through classroom arrangements: a field experiment. J Abnorm Child Psychol. 2012Apr;40(3):403–12. doi: 10.1007/s10802-011-9567-6 [DOI] [PubMed] [Google Scholar]
  • 22.Leaper C. Exploring the consequences of gender segregation on social relationships. New Dir Child Adolesc Dev. 1994;1994(65):67–86. [Google Scholar]
  • 23.Leaper C, Ayres MM. A meta-analytic review of gender variations in adults’ language use: talkativeness, affiliative speech, and assertive speech. Pers Soc Psychol Rev. 2007Nov;11(4):328–63. doi: 10.1177/1088868307302221 [DOI] [PubMed] [Google Scholar]
  • 24.Mehta CM, Strough J. Sex segregation in friendships and normative contexts across the life span. Dev Rev. 2009Sep1;29(3):201–20. [Google Scholar]
  • 25.Gower JC. A General Coefficient of Similarity and Some of Its Properties. Biometrics. 1971;27(4):857–71. [Google Scholar]
  • 26.Long JS. Regression Models for Categorical and limited dependent variables. Sage; 1997. [Google Scholar]
  • 27.Albers CJ, Kiers HAL, van Ravenzwaaij D. Credible Confidence: A pragmatic view on the frequentist vs Bayesian debate. Collabra: Psychology [Internet]. 2018;4(1). Available from: https://collabra.org/articles/10.1525/collabra.149/ [Google Scholar]
  • 28.VanderWeele TJ. On the distinction between interaction and effect modification. Epidemiology. 2009Nov;20(6):863–71. doi: 10.1097/EDE.0b013e3181ba333c [DOI] [PubMed] [Google Scholar]
  • 29.Ai C, Norton EC. Interaction terms in logit and probit models. Econ Lett. 2003Jul1;80(1):123–9. [Google Scholar]
  • 30.Loftus GR. On interpretation of interactions. Mem Cognit. 1978May1;6(3):312–9. [Google Scholar]
  • 31.Breen R, Karlson KB, Holm A. Interpreting and Understanding Logits, Probits, and Other Nonlinear Probability Models. Annu Rev Sociol. 2018Jul30;44(1):39–54. [Google Scholar]
  • 32.Mize TD. Best practices for estimating, interpreting, and presenting nonlinear interaction effects. Sociological Science. 2019;6:81–117. [Google Scholar]
  • 33.Simonsohn U. Interactions in Logit Regressions: Why Positive May Mean Negative [Internet]. Datacolada. 2017. http://datacolada.org/57
  • 34.Greene W, Han C, Schmidt P. The bias of the fixed effects estimator in nonlinear models. Unpublished Manuscript, Stern School of Business, NYU [Internet]. 2002;29. http://people.stern.nyu.edu/wgreene/nonlinearfixedeffects.pdf
  • 35.R Core Team, Others. R: A language and environment for statistical computing. R Foundation for statistical computing, Vienna. 2013. [Google Scholar]
  • 36.Bürkner P-C. brms: An R Package for Bayesian Multilevel Models Using Stan [Internet]. Vol. 80, Journal of Statistical Software. 2017. p. 1–28. 10.18637/jss.v080.i01 [DOI] [Google Scholar]
  • 37.Bürkner P-C. Advanced Bayesian Multilevel Modeling with the R Package brms [Internet]. Vol. 10, The R Journal. 2018. p. 395–411. 10.32614/RJ-2018-017 [DOI] [Google Scholar]
  • 38.Team R, Others. RStudio: integrated development for R. RStudio, Inc, Boston, MA: URL http://www.rstudio.com. 2015;42:14. [Google Scholar]
  • 39.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Springer; 2016. 260 p. [Google Scholar]

Decision Letter 0

Federica Maria Origo

13 Jan 2021

PONE-D-20-33057

Proximity Can Induce Diverse Friendships: A Large Randomized Classroom Experiment

PLOS ONE

Dear Dr. Rohrer,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

I agree with the reviewers that the paper covers an interesting topic and it is overall well executed. However, both reviewers have some concerns about your research design and/or the interpretation of the results. Furthermore, they suggest a number of ways to improve further your analysis.

More specifically, I agree with Reviewer #2 that you should provide, if available, ex-ante information on pre-treatment students’ friendship and pre-treatment seating rules. This piece of information is crucial especially for pupils in higher grades. Furthermore, you specify in the paper that the students were randomly seated in three main subjects, but t is not clear how you chose these subjects and, most importantly, what happens during the other subjects. I also recommend to discuss more in detail the issues of unobservable teacher characteristics and sample selection raised by the same reviewer.

Finally, as suggested by Reviewer #1, it may be interesting to investigate the existence of heterogeneous effects by class size.  Regarding heterogeneous effects already discussed in this version of the paper, the reviewer provides a number of suggestions to clarify your results.

Please submit your revised manuscript by Feb 25 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Federica Maria Origo

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This paper reports results from a field experiment conducted on a sample of Hungarian pupils enrolled in primary school. It shows that exogenous induced physical proximity might increase the likelihood of friendship formation. This effect seems stronger when the physical proximity is induced on students with a higher degree of "similarity".

I find this paper quite interesting and generally well-executed. Overall, I see the potential for publication. I have just a couple of comments, which are reported below.

1) I would suggest moving the section "Modification-by-Similarity Hypothesis: Moderating Influence of Gender, Educational Achievement and Ethnicity" before section "Modification-by-Similarity Hypothesis: Moderating Role of Overall Similarity". I feel it is better to understand first the characteristics driving the effects' heterogeneity and then an overall assessment via a comprehensive measure of similarity.

2) Looking at the results for each dimension of similarity separately, it seems that gender is the only characteristic that matters in altering the primary effect of being a deskmate. Indeed, the grouped GPA and ethnicity marginal effects are never statistically significantly different from each other (Fig 1, H and F). This aspect should also be clarified in the text. For instance, the last sentence in the abstract gives the impression that ethnicity seems to be a relevant character but is not that clear from the results.

3) With the class fixed effects, you control for the unobservables related to the class. However, could you highlight how the effect changes depending on the size of the class-size? Is the effect stronger in larger classes? It should be appropriate to run an interaction model with an indicator for sample size (for instance, being in the top or bottom tertile of class-size distribution), but given how the results are presented, it could also work a sample splitting.

Reviewer #2: Is the manuscript technically sound, and do the data support the conclusions?

I have some concerns on the design:

- ex-ante information: we miss information on students’ ex-ante friendship relationship within the class. This would have allowed a much cleaner design and test of the research hypotheses. Do the authors have such information?

- how are students’ seat decided usually? Do students choose? Are they in alphabetical order? Do the authors have information on the previous seating scheme within each class?

- why do the authors choose only three subjects? I imagine they are the subjects who represent most of the teaching hours. But, wouldn’t be better to ask to fix students’ seat for all the teaching subjects? It is not clear what happens during the other subjects. Do students change their seating when the subject changes?

Has the statistical analysis been performed appropriately and rigorously?

I list here some concerns:

- selection: it can happen at different levels. First, line 127, recruitment depends on teachers’ decision to implement the protocol. Teachers deciding to take part to the study may have unobserved characteristics that also influence pupils attitudes towards - let’s call them - “several kinds” of friendship. Second, some schools are then dropped because they did not meet the inclusion criteria; third students and schools are dropped from the sample because they did not answer the friendship-nomination item which is used to create the outcome variable. The authors should show that selection is not an issue. The first step in this direction would be to compare observable characteristics across samples.

- can the authors check the robustness without students with self reported measures?

- line 149: have you checked the robustness of results without such 5.6%

- line 329: can the authors comment on the size of the effect? 1.6 compared with 7 percentage points seems quite a relevant difference

- the authors can include equations of the estimated models and tables with the estimation results. This would help to understand their econometric technique and make the reading of the paper more fluent

Is the manuscript presented in an intelligible fashion and written in standard English?

- the authors can include equations of the estimated models and tables with the estimation results. This would help to understand their econometric technique and make the reading of the paper more fluent

- Please, spellcheck the paper because I have spotted some typos: i.e. line 154 “students”; line 155 “with”; line 250 “be friend”; line 461 “replicating”

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Aug 11;16(8):e0255097. doi: 10.1371/journal.pone.0255097.r002

Author response to Decision Letter 0


23 Feb 2021

[Please refer to the Word Document that we uploaded which is properly formatted to ensure readability]

Point-by-point response

Editorial remarks:

E.1. I agree with Reviewer #2 that you should provide, if available, ex-ante information on pre-treatment students’ friendship and pre-treatment seating rules. This piece of information is crucial especially for pupils in higher grades.

We have revised the manuscript to clarify that teachers typically design the seating chart. We provide additional information on typical seating practices in our response to R.2.2 below.

We do not have ex-ante information students’ prior friendships or past seating charts. We agree that information on prior friendships (i.e., lagged outcomes), or prior deskmates (i.e., lagged treatments) would have been interesting, as it would have allowed us to answer additional research questions, and may potentially have increased efficiency.

However, the absence of this information is not a problem for the causal claims we make: We are able to identify the causal effects of having a particular deskmate because we randomized the seating chart. These causal effects are the central focus of our manuscript. Thus, believe that our study makes valuable contributions since it is the first large randomized experiment on the effect of spatial proximity within classrooms on friendships.

E.2. Furthermore, you specify in the paper that the students were randomly seated in three main subjects, but it is not clear how you chose these subjects and, most importantly, what happens during the other subjects.

Thank you for this suggestion! Indeed, this is important background information. We have expanded the relevant manuscript section to include it.

p. 8, starting from line 163: Teachers were instructed to employ the intended seating chart in the three core subjects of the curriculum—mathematics, Hungarian literature, and Hungarian grammar—from the first day of classes (September 1, 2017) until the end of the fall semester (January 31, 2018). These three subjects form the core of the curriculum and receive the greatest weight in admission to selective secondary schools. They were taught in the same room for all grade levels and accounted for 6 to 10 lessons per week (25 to 45 percent of all lessons). Enforcing the seating chart across all subjects was not possible because (1) in some subjects, classrooms were split into smaller groups (e.g., different foreign languages), and (2) in some subjects, students were not seated in a fixed grid-layout (e.g., physical education and arts). However, seating charts typically apply to all subjects in a given room, and (depending on the grade level) most subjects were taught in the same room. Thus, students assigned to sit next to each other in the three core subjects likely also sat next to each other in other subjects; but we did not verify adherence outside of the core subjects.

The decision to limit the intervention instructions to the three core subjects was thus made for the sake of practicality. (We expect that non-adherence to the seating-chart in other subjects would, if it were random, dilute the effects of our intervention and render our estimates conservative. However, since we have no formal result for this expectation, we chose not to include it in the manuscript.)

E.3. I also recommend to discuss more in detail the issues of unobservable teacher characteristics and sample selection raised by the same reviewer.

We address this question below in response to R.2.4, in order to present our answer in the context of the verbatim reviewer comment.

E.4. Finally, as suggested by Reviewer #1, it may be interesting to investigate the existence of heterogeneous effects by class size. Regarding heterogeneous effects already discussed in this version of the paper, the reviewer provides a number of suggestions to clarify your results.

Done! Please see our detailed response below (R.1.3).

E.5. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results.

We are commited to research transparency and reproducibility. We were unaware of the protocols.io platform, which seems like a great fit for laboratory research in particular. For the present project, we have already made all relevant information available on the Open Science Framework (https://osf.io/4vjc5/?view_only=d0e9a887b3da4ebcabd0f9afb7480d65). We hope this is acceptable.

E.6. Please ensure that your manuscript meets PLOS ONE’s style requirements, including those for file naming.

We have carefully checked all materials to ensure that they adhere to the style requirements.

E.7. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

Done! We have provided the following details regarding participant consent in the revised manuscript:

p. 16, line 353: This study was reviewed and approved by the IRB offices at the Centre for Social (data collection and analysis), and at the University of Wisconsin-Madison (data analysis). Consent was obtained at multiple points. School districts, school principals, and teachers provided written consent to participating in the seating chart randomization. Parents provided written consent for the retrieval of administrative records via teachers, and for their children’s participation in the survey.

E.8. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

We are fully committed to provide the data and all other relevant information on the Open Science framework. All materials can already be accessed under https://osf.io/4vjc5/?view_only=d0e9a887b3da4ebcabd0f9afb7480d65. We will make the corresponding project public upon acceptance.

Reviewer #1

This paper reports results from a field experiment conducted on a sample of Hungarian pupils enrolled in primary school. It shows that exogenous induced physical proximity might increase the likelihood of friendship formation. This effect seems stronger when the physical proximity is induced on students with a higher degree of “similarity”.

I find this paper quite interesting and generally well-executed. Overall, I see the potential for publication. I have just a couple of comments, which are reported below.

Thank you for this positive feedback.

R.1.1. I would suggest moving the section “Modification-by-Similarity Hypothesis: Moderating Influence of Gender, Educational Achievement and Ethnicity” before section “Modification-by-Similarity Hypothesis: Moderating Role of Overall Similarity”. I feel it is better to understand first the characteristics driving the effects’ heterogeneity and then an overall assessment via a comprehensive measure of similarity.

We see the merit of this suggestion—the “bottom-up” logic (start with single dimensions, then aggregate) may be easier to follow for some readers. On the other hand, our pre-analysis plan states that the combined index of similarity is of primary interest; the single dimensions are only listed as secondary (exploratory) analyses. Thus, we have respectfully elected to keep the current structure in order to to honor the pre-registered sequence of primary vs. secondary analyses. We thank the reviewer for this thoughtful idea.

R.1.2. Looking at the results for each dimension of similarity separately, it seems that gender is the only characteristic that matters in altering the primary effect of being a deskmate. Indeed, the grouped GPA and ethnicity marginal effects are never statistically significantly different from each other (Fig 1, H and F). This aspect should also be clarified in the text. For instance, the last sentence in the abstract gives the impression that ethnicity seems to be a relevant character but is not that clear from the results.

We agree with the reviewer’s interpretation that gender is the main driver of modification by similarity (on the average-marginal-effect [AME] scale). As for the other two dimensions, the story is somewhat complicated. For example, the pairwise comparisons of the effect by GPA-category may not be statistically significant in themselves, but they align with the trend higher similarity � larger AME; hence this GPA trend will still contribute to the modification by overall similarity.

As requested, we have sharpened the prose, including in the abstract, to be crystal clear.

Abstract: […] the probability of a manifest friendship increased more among similar than among dissimilar students—a pattern mainly driven by gender. Our findings demonstrate that a scalable light-touch intervention can affect face-to-face networks and foster diverse friendships in groups that already know each other, but they also highlight that transgressing boundaries, especially those defined by gender, remains an uphill battle.

p. 22, line 491: But since the effect of a given increase in the latent propensity toward friendship on the formation of a manifest friendship also depends on the dyad’s baseline propensity toward friendship, and since more similar dyads have a greater baseline propensity toward friendship (homophily), the intervention was more successful at inducing manifest friendships among similar students than among dissimilar students. The three dimensions of similarity that we investigated contributed to this pattern to varying degrees: Gender showed the clearest effect modification (smaller effects among mixed-gender dyads), with a weak but aligned trend for baseline GPA (smaller effects when grade differences were large), and a somewhat misaligned trend for ethnicity (smaller effects in mixed and in Roma dyads).

R.1.3. With the class fixed effects, you control for the unobservables related to the class. However, could you highlight how the effect changes depending on the size of the class-size? Is the effect stronger in larger classes? It should be appropriate to run an interaction model with an indicator for sample size (for instance, being in the top or bottom tertile of class-size distribution), but given how the results are presented, it could also work a sample splitting.

Thank you for suggesting this interesting additional analysis. We derived the requested estimates from the model in which the effect was allowed to vary freely from classroom to classroom. These exploratory findings are now briefly summarized in the manuscript:

p. 17, line 381: In models in which we allowed the deskmate effect to vary between classrooms, we found some variability of the deskmate effects, but the differences were substantively small (SDDeskmate effect = 0.09, CI_95: [0.00, 0.23]), see S3 Fig. These models also suggested that the number of students in the classroom did not modify the effect (bottom tertile, 17 students or fewer: AME = 7.5 percentage points, CI_95: [4.5, 10.5], top tertile, more than 20 students: AME = 7.3 percentage points, CI_95: [4.4, 10.5]).

Reviewer #2

R.2.1. I have some concerns on the design:

- ex-ante information: we miss information on students’ ex-ante friendship relationship within the class. This would have allowed a much cleaner design and test of the research hypotheses. Do the authors have such information?

This is a great question. Unfortunately, we do not have information on prior friendships within classes. While such information would have allowed us to ask additional questions and possibly have increased power (maybe this is what the reviewer means by a “cleaner test”), we respectfully submit that this information is not necessary to justify the claims we make in this study. Randomization of the seating chart on its own is sufficient to cleanly identify the causal effect of deskmates on friendship formation. (See also response to E1).

R.2.2. how are students’ seat decided usually? Do students choose? Are they in alphabetical order? Do the authors have information on the previous seating scheme within each class?

From a survey of classroom teachers (N = 160) prior to the intervention, we know that (a) 74% of teachers design the seating chart in their classrooms, (b) some teachers prefer to assign high and low ability students (48.8%), or well and badly behaved students (41.3%), to the same desk. According to teachers’ answers, students’ gender and ethnicity are not important considerations when desiging the seating chart: 75% and 95.6% of teachers reported that these characteristics do not play a role in designing the seating chart. We do not have information on prior deskmate relationship of these students; this information would have allowed us to ask additional questions, but it is not necessary to justify the claims we make in this study—the causal effects we estimate are identified by randomization.

We have revised the body of the text to include that (p. 4, line 91) “ordinarily, the majority of seating charts would be designed by teachers, see S2 Text.” S2 Text contains the entire paragraph above.

R.2.3. why do the authors choose only three subjects? I imagine they are the subjects who represent most of the teaching hours. But, wouldn’t be better to ask to fix students’ seat for all the teaching subjects? It is not clear what happens during the other subjects. Do students change their seating when the subject changes?

This is an important question—and one that we pondered extensively during the design phase of the study.

We fully agree that it would have been optimal to fix students’ seating charts for all subjects. But this was not feasible in practice, given that students sometimes change rooms for different subjects.

We now provide this additional information in the revised manuscript:

p. 8, starting from line 163: Teachers were instructed to employ the intended seating chart in the three core subjects of the curriculum—mathematics, Hungarian literature, and Hungarian grammar—from the first day of classes (September 1, 2017) until the end of the fall semester (January 31, 2018). These three subjects form the core of the curriculum and receive the greatest weight in admission to selective secondary schools. They were taught in the same room for all grade levels and accounted for 6 to 10 lessons per week (25 to 45 percent of all lessons).. Enforcing the seating chart across all subjects was not possible because (1) in some subjects, classrooms were split into smaller groups (e.g., different foreign languages), and (2) in some subjects, students were not seated in a fixed grid-layout (e.g., physical education and arts). However, seating charts typically apply to all subjects in a given room, and (depending on the grade level) most subjects were taught in the same room. Thus, students assigned to sit next to each other in the three core subjects likely also sat next to each other in other subjects; but we did not verify adherence outside of the core subjects.

R.2.4. selection: it can happen at different levels. First, line 127, recruitment depends on teachers’ decision to implement the protocol. Teachers deciding to take part to the study may have unobserved characteristics that also influence pupils attitudes towards - let’s call them - “several kinds“of friendship. Second, some schools are then dropped because they did not meet the inclusion criteria; third students and schools are dropped from the sample because they did not answer the friendship-nomination item which is used to create the outcome variable. The authors should show that selection is not an issue. The first step in this direction would be to compare observable characteristics across samples.

We believe that this comment addresses multiple types of selection (cf. Imai, King, Stuart. 2008—“Misunderstandings between experimentalists and observationalists about causal inference” JRSS-A).

Points 1 (teachers) and 2 (schools) concern “selection” on baseline characteristics. Selection on baseline characteristics is regrettable but par for the course in field experiments, which, to an extent, rely on subjects’ willingness to participate in an intervention. This type of selection only concerns external validity (i.e., generalizability or transportability of results across contexts). It does not threaten the internal validity, i.e., identification of causal effects within the study sample. The trade-off between internal and external validity in favor of achieving internal validity is standard in field experiments (Imai et al. 2008), although it goes without saying that it would be preferable to have both (Imai et al. 2008).

Selection on the outcome (friendship nominations), had it occurred, by contrast, would additionally threaten internal validity (i.e., identification). We are optimistic that this problem, should it exist, is minor in our study. First, we emphasize that all exclusion criteria were pre-registered prior to the receipt of outcome data. This prevents us from “fishing” for desired results. Second, outcome data are missing only for a small share of the sample (391 students, 10.3% of the pre-registered 3,814 students), a share that is in line with other well-regarded randomized field experiments. Third, most missingness in the outcome is owed to lack of parental consent to participate in the endline survey, rather than due to item non-response. One would have to craft very elaborate scenarios to link parental lack of consent to bias in the main analysis. Specifically, it would have to be the case that parental consent for the endline survey is a function of the outcome, i.e., whether or not their child had befriended their deskmate (and even in such a scenario, our analysis would be a valid test of the null hypothesis of no effect). That is not to say that we can rule out any threat of bias; only that we do not think this problem is of special concern for our study.

R.2.5. can the authors check the robustness without students with self reported measures?

We assume that this question referrs to the fact that, following the pre-analysis plan, missing teacher reports of baseline covariates were filled in with students’ self-reports collected at endline (line 189). Note that this decision was pre-registered, and it only affected a small number of students (depending on the subject, between 3.2 and 3.6% of the grades were filled in from self-reports). Nonetheless, to ensure that our results weren’t sensitive to this decision, we re-ran analyses limiting the sample to students for which teacher reports were available.

In brief, results were highly similar. The estimated average marginal effect of the intervention was 7.2 percentage points, 95% CI: [4.8; 9.6] as opposed to 7.0 percentage points, 95% CI: [4.6; 9.4]. Considering effect modification by similarity, once again results were virtually unchanged, for example: AME among low similarity students 1.7 [0.3; 3.3] as compared to 1.5 [0.2; 3.1]; among average similarity students 5.9 [3.5; 8.3] as compared to 5.7 [3.4; 8.0]; among high similarity students 11.6 [7.8; 15.6] as compared to 11.8 [8.0; 15.7].

We can thus be confident that the decision to rely on student self-reports (where necessary) did not affect conclusions. The full analytic output can be found on the Open Science Framework: https://osf.io/sbn6h/.

R.2.6. line 149: have you checked the robustness of results without such 5.6%

We apologize for not understanding this question. Neither on Line 149, nor anywhere in the manuscript or supplement do we refer to the number “5.6.” In the supplemental text, we state that 5.5% of dyads lacked information on ethnicity. These dyads have already been excluded from analysis (as stated in the supplement).

R.2.7. line 329: can the authors comment on the size of the effect? 1.6 compared with 7 percentage points seems quite a relevant difference

We believe this to be a misunderstanding—1.6 compared to 7 percentage points would indeed be quite relevant. However, as we state in the manuscript, this “1.6” refers to the largest discrepancy between estimates. We have clarified the phrasing, line 346: “The largest absolute difference in the estimated AMEs across models was 1.6 percentage points”

To provide the full context for this largest observed difference:

In the focal models (specification as reported in the manuscript), we estimate that the deskmate effect among boy-dyads is 41.28 – 28.14 = 13.14 percentage points (numbers rounded, more precise output reported on the OSF). For gender-mixed dyads, the deskmate effect is 4.02 - 1.69 = 2.33 percentage points. The estimated difference between these deskmate effects is 10.8 percentage points, 95% Credible Interval: 5.48, 16.15.

In our linear probability models, these estimates look slightly different. The deskmate effect among boy-dyads is 42.06 – 27.88 = 14.18 percentage points; among gender-mixed dyads it is 2.94 – 1.15 = 1.80. The estimated difference between these deskmate effects is 12.4 percentage points, 95% Credible Interval: 7.85, 17.10.

The difference between the differences estimated by the two model specifications—12.4 percentage points versus 10.8 percentage points—is 1.6 percentage points, the largest observed discrepancy. We think that this deviation is rather unsurprising given that (1) we would expect the probit model and the linear probability model to behave differently close to zero and given that (2) differences in differences are estimated with rather large uncertainty (see wide credible intervals).

Hence, this small discrepancy does not affect our qualitative conclusions.

R.2.8. the authors can include equations of the estimated models and tables with the estimation results. This would help to understand their econometric technique and make the reading of the paper more fluent

Thank you for this suggestion. We have revised the manuscript to state the main Bayesian multi-membership multilevel model in standard econometric notation, see page 10:

We model the effect of sitting next to each other on reciprocated friendship nominations using a Bayesian multi-membership multilevel probit model. This is a dyad-level model with one observation for each unordered dyad consisting of students i and j in classroom c,

〖Friendship〗_({ij}c)^*=β_0+β_D*〖Deskmate〗_({ij}c)+〖Classroom〗_c+∑_(s∈{i,j})▒〖Student〗_sc +ϵ_({ij}c) (1)

where Friendship_({ij}c)^* is the latent continuous friendship propensity of the dyad; 〖Deskmate〗_({ij}c)=1 if the students in the dyad are deskmates and =0 otherwise; Classroom_c is a vector of classroom fixed effects to account for randomization within classrooms; and ϵ_({ij}c)~N(0,σ_ϵ^2) is the i.i.d. dyad-specific error term. The term ∑_(s∈{i,j})▒〖Student〗_sc refers to the two i.i.d. random effects for the students s in the dyad, Student_sc~N(0,σ_Student^2 ). The latent continuous friendship propensity is linked to manifest friendship via the threshold function Friendship_ijc=1 if Friendship_ijc^*>0 and =0 otherwise.

Furthermore, we added a table with estimation results of the two primary analyses. We also added S5 Table with estimation results of the follow-up analyses of the single dimensions of similarity.

Table 1. Results of Bayesian multi-membership multilevel probit models for the effects of sitting next to each other on reciprocated friendship.

Main Analysis Modification by Overall Dyadic Similarity

Estimate 95% CI Estimate 95% CI

b0 -0.96 [-1.20; -0.72] -1.49 [-1.84; -1.14]

bDeskmate 0.27 [0.19; 0.35] 0.29 [0.19; 0.39]

σStudent 0.04 [0.00; 0.10] 0.52 [0.46; 0.58]

bSimilarity 0.83 [0.79; 0.86]

bSimilarity*Deskmate 0.07 [-0.05; 0.18]

NDyads 24,962 24,962

NStudents 2,996 2,996

This table lists probit coefficient. Average marginal effects were derived from the fitted probit model following the procedure described in the Methods section. We decided to retain the figure presenting the AMEs and believe that this dual-presentation strategy (Table and Figure) offers a good balance for readers from a broad variety of fields.

R.2.9. Please, spellcheck the paper because I have spotted some typos: i.e. line 154 “students“; line 155 “with“; line 250 “be friend“; line 461 “replicating”

Thank you for your careful reading of our manuscript! We have fixed the mistakes in (original) line 154, 155, and 461. Our use of the intransitive verb “to befriend” in line 250 is correct. We have given the entire manuscript another careful read to correct remaining mistakes.

Attachment

Submitted filename: Response to Reviewers V6.docx

Decision Letter 1

Federica Maria Origo

6 Apr 2021

PONE-D-20-33057R1

Proximity Can Induce Diverse Friendships: A Large Randomized Classroom Experiment

PLOS ONE

Dear Dr. Rohrer,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

While one of the two reviewers is happy with how you dealt with her main comments in this version of your paper, the other reviewer has raised a couple of concerns that I fully share.

The main issue that can threat your identification strategy remains self-selection, which may be influenced by treatment status. The reviewer provides an illuminating example on this. If you do not have data to provide further robustness checks on this issue, you should at least discuss whether and how this may influence your estimates. Similarly, you should discuss more clearly the role of pre-existing friendships in driving your results.

Please submit your revised manuscript by May 21 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Federica Maria Origo

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: I think that the authors have done a good job in addressing the reviewers’ comments and the revised version of the paper is much clearer.

I still have some comments:

- about selection: first, I think that, if the authors do not want or can show the observable characteristics of the teachers and schools that selected out of the study, at least they should discuss the external validity of the study; second, as regards selection on the outcome variable, my concern is that selection may be influenced by the treatment status. Just as an example, it may happen that parents whose child was not happy with the intervention because he could not establish a good bond with the deskmate, are more likely to avoid giving consent. This in turn may be more likely to happen among dissimilar pairs. For this reason, it is important to see the characteristics of these observations and make sure that parents’ decision to deny consent (or generally the missings in the outcome) is not related to treatment status.

- related to the above point and to the identification of the effect: I was asking about pre-existing friendships because it is more likely that, within a class, pupils tend to befriend similar peers. Thus, the strongest effect for similar peers may be due to higher likelihood of a pre-existing bond. Since the authors do not have such information, they should acknowledge this caveat when describing their design and above all their results.

- I better explain the question in R.2.6 (I apologise for the mistake, it was line 169): the authors state “94.4 percent of the dyads in which students actually sat next to each after the second week of classes comprised students who were supposed to sit next to each in the intended seating chart” (line 180). Given that the authors have information on compliance, are the results robust (stronger?) if the authors exclude the 5.6% (100-94.4) of the dyads who were not compliant. What if they exclude also the dyads in which students did not actually sat next to each?

- I suggest again to spellcheck the paper: line 207 “1if”; line 512 “the transformations of latent propensities into manifest friendships remains”

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Aug 11;16(8):e0255097. doi: 10.1371/journal.pone.0255097.r004

Author response to Decision Letter 1


21 May 2021

(Please refer to the uploaded file for more readable formatting)

Editorial Remarks

E.1. The main issue that can threat your identification strategy remains self-selection, which may be influenced by treatment status. The reviewer provides an illuminating example on this. If you do not have data to provide further robustness checks on this issue, you should at least discuss whether and how this may influence your estimates.

Reviewer 2 brings up two potential issues of selection: (1) study participation (i.e., selection prior to baseline which does not threaten internal validity but may affect external validity) and (2) selective attrition (i.e., selection after baseline), which may threaten internal validity.

Concerning (1), we added a paragraph to the body of the text that explicitly acknowledges potential threats to external validity, see response to R.2.2.

Concerning (2), we ran extensive additional analyses to investigate and address the matter. Despite low levels of attrition, we did indeed find some evidence for selective attrition, as suggested by the reviewer. We thus ran two additional sets of analyses: lower bound analyses in which missing responses were imputed under the most conservative assumptions; and analyses incorporating non-response weights. Both resulted in somewhat lower point estimates than the 7.0 percentage-points estimate in our pre-registered primary analyses of complete cases: our lower bound estimate is 4.0 percentage points, and our non-response weighted estimate is 6.1 percentage points. In both cases, the 95% CI still comfortably excludes zero. More details can be found in our response to R.2.3. We have adjusted the language throughout the manuscript at multiple points to acknowledge these new and smaller estimates (see also response to R.2.3).

E.2. Similarly, you should discuss more clearly the role of pre-existing friendships in driving your results.

Thanks to the helpful clarification by the reviewer, we now understand that the reviewer’s remark does not relate to the identification of the causal deskmate effect, which is the focus of our study, but to the interpretation of the role of similarity, which is an effect modifier (i.e., it indexes effect heterogeneity in the causal deskmate effect). In short, the interpretation implied by the reviewer is fully compatible with our interpretation (and we added it to the manuscript), see response to R.2.4. for details.

Reviewer #2

R.2.1. I think that the authors have done a good job in addressing the reviewers’ comments and the revised version of the paper is much clearer.

Thank you very much!

R.2.2. About selection: first, I think that, if the authors do not want or can show the observable characteristics of the teachers and schools that selected out of the study, at least they should discuss the external validity of the study

We did not collect extensive data on the teachers and schools who were asked to participate in the study and thus cannot provide a comprehensive analysis of potential limitations of external validity. However, we have school-level data of the schools who decided to participate, which we can compare to all other Hungarian primary schools to give a better idea of the setting of the study.

Of course, the limited external validity of experimental studies always merits a reminder, and so we added a paragraph to the discussion:

p. 24, line 552: “It remains, of course, an open question whether our findings generalize to other settings and countries. Our study took place in a less prosperous area of rural Hungary, where students’ standardized reading and math scores fell below the national average, and fewer parents had graduated from college. Furthermore, study participation depended on teachers’ and schools’ willingness to participate; and it is possible that the included schools share certain features (e.g., a certain degree of openness) that made students more susceptible to the effects of induced proximity. Despite these potential concerns regarding external validity, which naturally arise in field experiments, we consider our findings in this particular setting promising.”

R.2.3 Second, as regards selection on the outcome variable, my concern is that selection may be influenced by the treatment status. Just as an example, it may happen that parents whose child was not happy with the intervention because he could not establish a good bond with the deskmate, are more likely to avoid giving consent. This in turn may be more likely to happen among dissimilar pairs. For this reason, it is important to see the characteristics of these observations and make sure that parents’ decision to deny consent (or generally the missings in the outcome) is not related to treatment status.

Thank you very much for this clarifying explanation. Such selective dropout seems possible and would indeed threaten the identification of the effect of interest (although it is not quite straightforward to gauge the impact, as the students who did not answer the questionnaire are also part of non-deskmate dyads).

We thus ran additional analyses to (1) assess the selectivity of attrition, and (2) to estimate the deskmate effect taking attrition into account. In short, we did find some evidence for selective attrition. Additional analyses that account for selective attrition in various ways (including a worst-case analysis under extreme assumptions) return somewhat smaller estimates of the deskmate effects, although the corresponding 95% credible intervals still comfortably exclude zero.

We added the following to the manuscript:

Methods section, p. 15, line 349: “Attrition. About 10% of students were omitted from our main analysis, because they did not provide friendship nominations (e.g., because they lacked parental consent for the endline survey, did not attend school on the day of the assessment, or skipped the question). Multivariate non-response models indicated some selective non-response. While gender and ethnicity did not predict missingness (p > .12), a 1 SD increase in GPA, the model predicted a 2.4 percentage point increase in the probability of response (p = .001), and a 1 SD increase in similarity (Gower’s index) predicted a small but statistically significant decrease of 1.5 percentage points in the probability of response (p = .004). To address possible bias from selective attrition, we ran two additional sets of analyses.

First, we estimated a lower bound for the deskmate effect by imputing missing friendships nominations under extremely conservative assumptions: whenever nominations were missing, we assumed that (1) the student did not nominate their deskmate and (2) the student nominated all non-deskmates who had nominated them. This minimized the number of friendships between deskmates and maximized the number of friendships between non-deskmates.

Second, we re-ran the central analyses with dyadic non-response weights. The resulting estimates identify the causal effect of interest under the assumptions that our non-response model is correctly specified. A more detailed description, the full analysis code and results of these additional analyses can be found on the Open Science Framework.”

On the OSF, we additionally provide the following details:

Additional Analyses Attrition: […] Second, we calculated (dyadic) non-response weights. Using a probit model that parallels our central analyses, we predicted whether or not dyads’ friendship status was missing from whether or not they were deskmates, baseline covariates (all three combinations of ethnicity, all three combinations of gender, mean dyad GPA and GPA difference), as well as the interaction between the deskmate indicator and the covariates, and classroom fixed effects. From this model, we predicted response weights, and re-ran the central analyses weighting observations with the inverse of the response probabilities. The resulting estimates identify the causal effect of interest under the assumptions that our non-response model is correctly specified. The full analysis code and results of these additional analyses can be found on the Open Science Framework.

Furthermore, the new results are reported throughout the manuscript in the respective sections:

Results section, Deskmate Hypothesis, p. 18, line 407: “Imputing missing outcomes in the most conservative manner results in a lower bound estimate of b = 0.17, CI_95: [0.09, 0.24]). In this model, sitting next to each other increased the probability of a manifest friendship by 4.0 percentage points (CI_95: [2.0; 6.1]), from 14.6 percent to 18.7 percent. Lastly, applying non-response weights, we estimated that the deskmate effect was b = 0.24, CI_95: [0.15, 0.33]). In this model, sitting next to each other increased the probability of a manifest friendship by 5.9 percentage points (CI_95: [3.5; 8.4]), from 14.8 percent to 20.8 percent.”

Results section, Modification-by-Similarity Hypothesis: Moderating Role of Overall Similarity, p. 20, line 457: Imputing missing values in the most conservative manner did not change conclusions regarding the lack of an interaction on latent friendship propensities. Furthermore, we still observed an interaction on the probability of manifest friendships (i.e., 95% credible intervals for the differences between the deskmate effects for dyads with low, average, and high similarity exclude zero), but all average marginal effects were somewhat lower and the 95% credible interval now contained zero for low-similarity dyads: AME_Low=1.7 percentage points (CI_95:[-0.4,1.9]); AME_Average=3.1 percentage points (CI_95:[1.2,5.1]); and AME_High=7.6 percentage points (CI_95:[4.0,11.1]). The same pattern held for analyses applying non-response weights, with average marginal effects falling between the estimates from the complete cases analysis and from the lower bound analysis: AME_Low=1.1 percentage points (CI_95:[-0.1,2.7]); AME_Average=4.8 percentage points (CI_95:[2.5,7.2]); and AME_High=10.6 percentage points (CI_95:[6.5,15.0]).

Results section, modification by gender: Imputing missing values in the most conservative manner, as well as non-response weighting, led to the same pattern of results (albeit with smaller effect estimates).

Results section, modification by educational achievement: Once again, imputing missing values in the most conservative manner, as well as non-response weighting, led to the same pattern of results, with overall smaller effect estimates.

Results section, modification by ethnicity: Imputing missing values, as well as non-response weighting, led to the same somewhat unclear pattern of results.

We believe that the results from these additional analyses warrant some qualifications to the way we present our results. We changed parts of the discussion where effect modification is discussed to accommodate the more conservative estimates.

p. 24, line 544: “Induced spatial proximity nevertheless succeeded in inducing some diverse friendships. Randomly seating boys and girls next to each other doubled their probability of nominating each other as best friends (from less than 2 to 4 percentage points). The intervention also substantially increased friendships between students with strong and weak baseline GPAs (from 11 to 17 percentage points). Finally, whether or not seating Roma and non-Roma students next to each other increased friendships across ethnic lines remained unclear in our data; the estimate was beset with statistical uncertainty due to relatively small numbers of Roma students in the sample and sensitive to assumptions about missing data.”

R.2.4. related to the above point and to the identification of the effect: I was asking about pre-existing friendships because it is more likely that, within a class, pupils tend to befriend similar peers. Thus, the strongest effect for similar peers may be due to higher likelihood of a pre-existing bond. Since the authors do not have such information, they should acknowledge this caveat when describing their design and above all their results.

Thank you for this helpful clarification. We initially thought that the reviewer was concerned about the causal identification of the deskmate effect; we now believe that the reviewer is wondering about the mechanism that may explain effect modification (effect heterogeneity) of the deskmate effect by similarity. We apologize for our earlier misunderstanding.

The reviewer’s hypothesis strikes us as plausible.

Suppose, for example, that the intervention of being seated next to each other may be highly effective in preventing the dissolution of existing friendships, but less effective in inducing new friendships. If similar students are more likely to have pre-existing friendships, then our finding that sitting next to each other, on net, increases the probability of friendship more among similar dyads than among dissimilar dyads could be explained by the greater probability of pre-existing friendships among similar dyads. Assuming that this reasoning describes the actual data-generating mechanism, then pre-existing friendships would be a mediator of the causal effects of similarity on friendship at endline.

This account does not impinge on the causal interpretation of the deskmate effect (either of the average effect, or of the subgroup effects conditional on baseline similarity). We struggled to understand this particular point because we started from the premise that our design does not allow us to causally identify the effects of similarity (as explained in the Method section). But of course, the term “effect modification” may still be evocative of certain types of causal stories. To avoid any such misinterpretation, we re-read the discussion section, took greater care when communicating the effect modification issue, and also included the possible and plausible interpretation suggested by the reviewer.

p. 23, line 531: “But since the effect of a given increase in the latent propensity toward friendship on the formation of a manifest friendship also depends on the dyad’s baseline propensity toward friendship, and since more similar dyads have a greater baseline propensity toward friendship (homophily), the intervention was more successful among similar students than among dissimilar students. One potential explanation could be that being seated next to each other may be particularly effective at preventing the dissolution of pre-existing ties (as compared to inducing new ties), which are more prevalent among similar dyads; our design, however, does not allow for the identification of different possible mechanistic explanations. The three dimensions of similarity that we investigated contributed to the overall pattern to varying degrees:…”

R.2.5. I better explain the question in R.2.6 (I apologise for the mistake, it was line 169): the authors state “94.4 percent of the dyads in which students actually sat next to each after the second week of classes comprised students who were supposed to sit next to each in the intended seating chart” (line 180). Given that the authors have information on compliance, are the results robust (stronger?) if the authors exclude the 5.6% (100-94.4) of the dyads who were not compliant. What if they exclude also the dyads in which students did not actually sat next to each?

Thank you for this clarification! We re-ran analyses excluding the deskmate dyads who did not adhere to the treatment (i.e., dyads who were assigned to sit next to each other but didn’t do so, and dyads who were not assigned to sit next to each other but did do so).

The resulting estimates were indeed stronger than the results of the main pre-reported analysis reported in the manuscript. In the analysis that excluded these dyads, students had a 22.6% probability of being friends with a deskmate (vs. 22.3% in our main analysis) and a 15.3% probability of being friends with a non-deskmate (vs. 15.3%), the average marginal effect of being seated next to each other was 7.2 [4.9; 9.8] percentage points (vs. 7.0, [4.6; 9.4]).

We now briefly mention these additional results in the main body of the text:

p. 18, line 399: “Excluding dyads who did not adhere to treatment resulted in a slightly larger effect estimate of 7.2 percentage points (CI_95: [4.9; 9.8]).”

R.2.6. I suggest again to spellcheck the paper: line 207 “1if”; line 512 “the transformations of latent propensities into manifest friendships remains”

Thank you very much for carefully reading the manuscript and catching these typos. We again spellchecked the whole manuscript and caught two superfluous whitespaces.

Attachment

Submitted filename: Response to Reviewers_V3.docx

Decision Letter 2

Federica Maria Origo

12 Jul 2021

Proximity Can Induce Diverse Friendships: A Large Randomized Classroom Experiment

PONE-D-20-33057R2

Dear Dr. Rohrer,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Federica Maria Origo

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Federica Maria Origo

16 Jul 2021

PONE-D-20-33057R2

Proximity Can Induce Diverse Friendships: A Large Randomized Classroom Experiment

Dear Dr. Rohrer:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Federica Maria Origo

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Deviations between the pre-analysis plan and the reported analyses.

    (DOCX)

    S2 Text. Details regarding pre-treatment variables.

    (DOCX)

    S3 Text. Balance checks.

    (DOCX)

    S1 Table. Results of Bayesian multi-membership multilevel probit models investigating the modifying role of single dimensions of similarity.

    (DOCX)

    S1 Fig. Heterogeneity of the deskmate effect across classrooms.

    Probit coefficients from random effects model (left panel) as well as the corresponding model-implied friendship probabilities for non-deskmates versus deskmates (right panel). The difference between each predicted probability for deskmates minus the predicted probability for non-deskmates is the classroom-specific AME.

    (TIF)

    Attachment

    Submitted filename: Response to Reviewers V6.docx

    Attachment

    Submitted filename: Response to Reviewers_V3.docx

    Data Availability Statement

    The data are available on the Open Science Framework (https://osf.io/4vjc5/).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES