Abstract
Complex problems often require coordinated group effort and can consume significant resources, yet our understanding of how teams form and succeed has been limited by a lack of large-scale, quantitative data. We analyse activity traces and success levels for approximately 150 000 self-organized, online team projects. While larger teams tend to be more successful, workload is highly focused across the team, with only a few members performing most work. We find that highly successful teams are significantly more focused than average teams of the same size, that their members have worked on more diverse sets of projects, and the members of highly successful teams are more likely to be core members or ‘leads’ of other teams. The relations between team success and size, focus and especially team experience cannot be explained by confounding factors such as team age, external contributions from non-team members, nor by group mechanisms such as social loafing. Taken together, these features point to organizational principles that may maximize the success of collaborative endeavours.
Keywords: teamwork, collective dynamics, data science, open source software
1. Introduction
Massive datasets describing the activity patterns of large human populations now provide researchers with rich opportunities to quantitatively study human dynamics [1,2], including the activities of groups or teams [3,4]. New tools, including electronic sensor systems, can quantify team activity and performance [5,4]. With the rise in prominence of network science [6,7], much effort has gone into discovering meaningful groups within social networks [8–15] and quantifying their evolution [15,16]. Teams are increasingly important in research and industrial efforts [3,4,17–21], and small, coordinated groups are a significant component of modern human conflict [22,23]. There are many important dimensions along which teams should be studied, including their size, how work is distributed among their members, and the differences and similarities in the experiences and backgrounds of those team members. Recently, there has been much debate on the ‘group size hypothesis’ that larger groups are more robust or perform better than smaller ones [24–27]. Scholars of science have noted for decades that collaborative research teams have been growing in size and importance [20,28–30]. At the same time, however, social loafing, where individuals apply less effort to a task when they are in a group than when they are alone, may counterbalance the effectiveness of larger teams [31–33]. Meanwhile, case studies show that leadership [3,34–36] and experience [37,38] are key components of successful team outcomes, while specialization and multitasking are important but potentially error-prone mechanisms for dealing with complexity and cognitive overload [39,40]. In all of these areas, large-scale, quantitative data can push the study of teams forward.
Teams are important for modern software engineering tasks, and researchers have long studied the digital traces of open source software projects to better quantify and understand how teams work on software projects [41,42]. Researchers have investigated estimators of work activity or effort based on edit volume, such as different ways to count the number of changes made to a software's source code [43–46]. Various dimensions of success of software projects such as popularity, timeliness of bug fixes or other quality measures have been studied [47–49]. Successful open source software projects show a layered structure of primary or core contributors surrounded by lesser, secondary contributors [50]. At the same time, much work is focused on case studies [45,51] of small numbers of highly successful, large projects [41]. Considering these studies alone runs the risk of survivorship bias or other selection biases, so large-scale studies of large quantities of teams are important complements to these works.
Users of the GitHub web platform can form teams to work on real-world projects, primarily software development but also music, literature, design work and more. A number of important scientific computing resources are now developed through GitHub, including astronomical software, genetic sequencing tools and key components of the Compact Muon Solenoid experiment's data pipeline.1 A ‘GitHub for science’ initiative has been launched2 and GitHub is becoming the dominant service for open scientific development.
GitHub provides rich public data on team activities, including when new teams form, when members join existing teams and when a team's project is updated. GitHub also provides social media tools for the discovery of interesting projects. Users who see the work of a team can choose to flag it as interesting to them by ‘starring’ it. The number of these ‘stargazers’ S allows us to quantify one aspect of the success of the team, in a manner analogous to the use of citations of research literature as a proxy for ‘impact’ [52]. Of course, as with bibliometric impact, one should be cautious and not consider success to be a perfectly accurate measure of quality, something that is far more difficult to objectively quantify. Instead this is a measure of popularity as would be other statistics such as web traffic, number of downloads and so forth [47].
In this study, we analyse the memberships and activities of approximately 150 000 teams, as they perform real-world tasks, to uncover the blend of features that relate to success. To the best of our knowledge this is the largest study of real-world team success to date. We present results that demonstrate (i) how teams distribute or focus work activity across their members, (ii) the mixture of experiential diversity and collective leadership roles in teams, and (iii) how successful teams are different from other teams while accounting for confounds such as team size.
The rest of this paper is organized as follows: in §2, we describe our GitHub dataset; give definitions of a team, team success and work activity/focus of a team member; and introduce metrics to measure various aspects of the experience and experiential diversity of a team's members. In §3, we present our results relating these measures to team success. In §4, we present statistical tests on linear regression models of team features to control for potential confounds between team features and team success. Lastly, we conclude with a discussion in §5.
2. Material and methods
2.1. Dataset and team selection
Public GitHub data covering 1 January 2013 to 1 April 2014 was collected from githubarchive.org in April 2014. In their own words, ‘GitHub Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis’. These activity traces contain approximately 110M unique events, including when users create, join, or update projects. Projects on GitHub are called ‘repositories’. For this work, we define a team as the set of users who can directly update (push to) a repository. These users constitute the primary team members as they have either created the project or been granted autonomy to work on the project. The number of team members was denoted by M. Activity or workload W was estimated from the number of pushes. A push is a bundle of code updates (known as commits), however most pushes contain only a single commit (electronic supplementary material; see also [46]). As with all studies measuring worker effort from lines-of-code metrics, this is an imperfect measure as the complexity of a unit of work does not generally map to the quantity of edits. Users on GitHub can bookmark projects they find interesting. This is called ‘stargazing’. We take the maximum number of stargazers for a team as its measure of success S. This is a popularity measure of success; however, the choice to bookmark a project does imply it offers some value to the user. To avoid abandoned projects, studied teams have at least one stargazer (S>0) and at least two updates per month on average within the githubarchive data. These selection criteria leave N=151 542 teams. We also collect the time of creation on GitHub for each team project. This is useful for measuring confounds: for example, older teams may tend to have both more members and more opportunities to increase success. Of the teams studied, 67.8% were formed within our data window. Beyond considering team age as a potential confounder, we do not study temporal dynamics such as team formation in this work. A small number of studied teams (1.08%) have more than 10 primary members (M>10); those teams were not shown in figures, but they were present in all statistical analyses. Lastly, to ensure our results are not due to outliers, in some analyses we excluded teams above the 99th percentile of S. Despite a strong skew in the distribution of S, these highly popular teams account for only 2.54% of the total work activity of the teams considered in this study (2.27% when considering teams with M≤10 members).
2.1.1. Secondary team
GitHub provides a mechanism for external, non-team contributors to propose work that team members can then choose to use or not. These proposals are called pull requests. (Other mechanisms, such as discussions about issues, are also available to non-team contributors.) These secondary or external team contributors are not the focus of this work and have already been well studied by OSS researchers [41]. However, it is important to ensure that they do not act as confounding factors for our results, as more successful teams will tend to have more secondary contributions than other teams. So we measure for each team Mext, the number of unique users who submit at least one pull request, and Wext, the number of pull requests. We will include these measures in our combined regression models. Despite their visibility in GitHub, pull requests are rare [53]; in our data, 57.7% of teams we study have Wext=0, and when present pull requests are greatly outnumbered by pushes on average: 〈W/Wext|Wext>0〉=42.3 (median 16.0), averaged over all teams with at least one pull request.
2.2. Effective team size
The number of team members, M, does not fully represent the size of a team as the distribution of work may be highly skewed across team members. To capture the effective team size m, accounting for the relative contribution levels of members, we use m=2H, where , and fi=wi/W is the fraction of work performed by team member i. This gives m=M when all fi=1/M, as expected. This simple, entropic measure is known as perplexity in linguistics and is closely related to species diversity indices used in ecology and the Herfindahl–Hirschman index used in economics.
2.3. Experience, diversity and leads
Denote with Ri the set of projects that user i works on (has pushed to). (Projects in Ri need at least twice-monthly updates on average, as before, but may have S=0 so as to better capture i's background, not just successful projects.) We estimate the experience E of a team of size M as
and the experiential diversity D as
where the sums and union run over the M members of the team. Note that D∈[1/M,1). Experience measures the quantity of projects the team works on while diversity measures how many or how few projects the team members have in common, the goal being to capture how often the team has worked together. Lastly, someone is a lead when, for at least one project they work on, they contribute more work to that project than any other member. A non-lead member of team j may be the lead of project k≠j. The number of leads Lk in team k of size Mk is
where Lij=1 if user i is the lead of team j, and zero otherwise. The first sum runs over the Mk members of team k, the second runs over all projects j. Of course, the larger the team the more potential leads it may contain so when studying the effects of leads on team success we only compare teams of the same size (comparing L while holding M fixed). Otherwise, E and D already account for team size.
3. Results
We began our analysis by measuring team success S as a function of team size M, the number of primary contributors to the team's project. As S is, at least partially, a popularity measure, we expect larger teams to also be more successful. Indeed, there was a positive and significant relationship (p<10−10, rank correlation ρ=0.0845) between the size of a team and its success, with 300% greater success on average for teams of size M=10 compared with solos with M=1 (figure 1). This strong trend also holds for the median success (inset). While this observed trend was highly significant, the rank correlation ρ indicates that there remains considerable variation in S that is not captured by team size alone.
Our next analysis reveals an important relationship between team focus and success. Unlike bibliographic studies, where teams can only be quantified as the listed coauthors of a paper, the data here allow us to measure the intrinsic work or volume of contributions from each team member to the project. For each team we measured the contribution wr of a member to the team's ongoing project, how many times that member updated the project (see Material and methods). Team members were ranked by contribution, so w1 counts the work of the member who contributed the most, w2 the second heaviest contributor and so forth. The total work of a team is .
We found that the distribution of work over team members showed significant skew, with w1 often more than two to three times greater than w2 (figure 2a; electronic supplementary material). This means that the workloads of projects are predominantly carried by a handful of team members, or even just a single person. Larger teams perform more total work, and the heaviest contributor carries much of that effort: the inset of figure 2a shows that w1/W, the fraction of work carried by the rank one member, falls slowly with team size, and is typically far removed from the lower bound of equal work among all team members. See the electronic supplementary material for more details. This result is in line with prior studies [51], supporting the plausibility of our definition of a team and our use of pushes to measure work.
This focus in work activity indicates that the majority of the team serves as a support system for a core set of members. Does this arrangement play a role in whether or not teams are successful? We investigated this in several ways. First, we asked whether or not a team was dominated, meaning that the lead member contributed more work than all other members combined (). Highly successful ‘top’ teams, those in the top 10% of the success distribution, were significantly more likely to be dominated than average teams, those in the middle 20% of S, or ‘bottom’ teams, those in the bottom 10% of the S (figure 2b).
Can this result be due to a confounding effect from success? More successful projects will tend to have more external contributors, for example, which can change the distribution of work. For example, in one scenario a team member may be a ‘community manager’ merging in large numbers of external contributions from non-team members. To test this we examined only the 57.7% of teams that had no external contributions (Wext=0) and tested among only those teams whether dominated teams were more successful than non-dominated teams. Within this subset of teams, dominated teams had significantly higher S than non-dominated teams (Mann–Whitney U test (MWU) with continuity correction, p<10−8). The MWU is non-parametric, using ranks of (in this case) S to mitigate the effects of skewed data, and does not assume normality. We conclude from this that external contributions do not fully explain the relationship between workload focus and team success.
Next, we moved beyond the effects of the heaviest contributor by performing the following analysis. For each team we computed its effective team size m, directly accounting for the skew in workload (see Material and methods for full details). This effective size can be roughly thought of as the average number of unique contributors per unit time and need not be a whole number. For example, a team of size M=2 where both members contribute equally will have effective size m=2, but if one member is responsible for 95% of the work the team would have m≈1.22. Note that M and m are positively correlated (ρ=0.985).
Figure 2c shows that (i) all teams are effectively much smaller than their total size would indicate, for all sizes M>1, and (ii) top teams are significantly smaller in effective size (and therefore more focused in their work distribution) than average or bottom teams with the same M. Further, success is significantly, negatively correlated with m, for all M (figure 2d). More focused teams have significantly more success than less focused teams of the same size, regardless of total team size.
Further analyses revealed the importance of team composition and its role in team success.
Team members do not perform their work in a vacuum, they each bring experiences from their other work. Often members of a team will work on other projects. We investigated these facets of a team's composition by exploring (i) how many projects the team's members have worked on, (ii) how diverse the other projects are (whether the team members have many or few other projects in common) and (iii) how many team members were ‘leads’ of other projects.
An estimate of experience, E, the average number of other projects that team members have worked on (see Material and methods), was significantly related to success. However, the trend was not particularly strong (see the electronic supplementary material) and, as we later show via combined modelling efforts, this relationship with success was entirely explainable by the teams' other measurable quantities.
It may be that the volume of experience does not contribute much to the success of a team, but this seems to contradict previous studies on the importance of experience and wisdom [37,38]. To investigate, we turned to a different facet of a team's composition, the diversity of the team's background. Successful teams may tend to be composed of members who have frequently worked together on the same projects in the past, perhaps developing an experiential shorthand. Conversely, successful teams may instead have multiple distinct viewpoints, solving challenges with a multi-disciplinary perspective [54].
To estimate the distinctness of team member backgrounds, the diversity D was measured as the fraction of projects that team members have worked on that are unique (see Material and methods). Diversity is low when all M members have worked on the same projects together (D=1/M), but D grows closer to 1 as their backgrounds become increasingly diverse. A high team diversity was significantly correlated with success, regardless of team size (figure 3). Even small teams seem to have benefited greatly from diversity: high-D duos averaged nearly eight times the success of low-D duos. The relationship between D and S was even stronger for larger teams (figure 3, inset), implying that larger teams can more effectively translate this diversity into success. Even if the raw volume of experience a team has does not play a significant role in the team's success, the diversity of that experience was significantly correlated with team success. See also our combined modelling efforts.
Considerable attention has been paid recently to collective leadership, where decision-making structures emerge from the mass of the group instead of being imposed via a top-down hierarchy [34,36]. The open collaborations studied here have the potential to display collective leadership due to their volunteer-driven, self-organized nature. The heaviest contributor to a team is most likely to occupy such a leadership role. Further, as teams overlap, a secondary member of one team may be the ‘lead,’ or heaviest contributor to another. This poses an interesting question: Even though teams are heavily focused, are teams more successful when they contain many leads, or few? A team with many leads will bring considerable experience, but most of its members may also be unable to dedicate their full attention to the team.
To answer this, we measured L, the number of team members who are the lead of at least one project (1≤L≤M, see Material and methods) and found that teams with many leads have significantly higher success than teams of the same size with fewer leads (figure 4). Only one team member can be the primary contributor to the team, so a team can only have many leads if the other members have focused their work activity on other projects. Team members who are focused on other projects can potentially only provide limited support, yet successful teams tend to arrange their members in exactly this fashion. Of course, the strong focus in work activity (figure 2) is probably interrelated with these observations. However, we will soon show that both remain significantly related to success in combined models.
Expanding on this observation, table 1 illustrates the extreme case of teams of size M with a single lead (L=1) compared with teams of the same size composed entirely of leads (L=M). The latter always displayed significantly higher success than the former (MWU test, see table 1), independent of team size, underscoring the correlations displayed in figure 4. Often the difference was massive: teams of size M=7, for example, averaged more than 1200% higher success when L=7 than when L=1.
Table 1.
no. teams N | mean success S | ||||
---|---|---|---|---|---|
M | L=1 | L=M | L=1 | L=M | MWU p-value |
2 | 14 823 | 8894 | 18.9 | 42.5 | <10−213 |
3 | 6171 | 2261 | 14.5 | 58.3 | <10−210 |
4 | 3063 | 717 | 12.8 | 62.1 | <10−112 |
5 | 1489 | 289 | 12.1 | 94.5 | <10−55 |
6 | 740 | 124 | 12.3 | 85.0 | <10−36 |
7 | 350 | 46 | 9.8 | 120.5 | <10−15 |
8 | 179 | 19a | 7.5 | 224.1 | <10−8 |
9 | 125 | 9 | 22.2 | 316.8 | <0.008 |
10 | 66 | 6 | 17.8 | 163.5 | <0.005 |
aWhen M≥8, the number of teams with L=M is too small (N<20) for us to reasonably conclude the difference in S is significant, despite the small p-values.
These results on team composition cannot be easily explained as a confound with success or secondary contributions as they study specific features and projects of the individuals who comprise a team, those features are not related to the successes of other projects an individual may work on, and they strictly control for total team size M (e.g. we only compare teams with different values of L when they have the same value M). These results further amplify our findings on team focus, and augment important existing research [3,4,36,37,54].
Taken together, our results demonstrate that successful teams tend to be focused (figure 2), successful teams tend to be experientially diverse (figure 3) and successful teams tend to have many leads (figure 4). We have found that teams tend to do best when optimized along all three of these dimensions. Of course, it is necessary to explore the joint effects of quantities, to see if one relationship can be explained by another, which we will do with multivariate statistical models.
4. Combined models and confounds
One important aspect of the individual team measurements is that they do not exist in isolation. For example, successful teams also have high work activity (high W). This can correlate with effective team size m as the potential inequality between team members can grow as their total activity grows. In other words, we need to see how our team measures relate to success together.
To understand the relative effects of these team composition measures, we fitted a linear regression model of success as a function of all explored measures (table 2). Not only did this regression allow us to determine whether a variable was significant or if it was confounded by the other measures, but the coefficients (on the standardized variables) let us measure the relative strengths of each variable. We also included the age of a project T (measured as the time difference between the recorded creation time of the project and the end of our data window; see Material and methods) as this may also be a potential confounding factor (older projects have had more time to gain members and to gain success).
Table 2.
variable x | coefficient | p-value |
---|---|---|
constant, α | 1.351×10−14±0.004951 | 1 |
team size, M | 0.0848±0.013963 | <10−31 |
eff. team size, m | −0.0989±0.012140 | <10−56 |
total work, W | 0.0323±0.004997 | <10−35 |
experience, E | 0.0004068±0.004985 | 0.8729 |
diversity, D | 0.04099±0.006357 | <10−35 |
no. leads, L | 0.1388±0.006921 | 0 |
age, T | 0.1273±0.005014 | 0 |
aVariables are standardized for comparison such that a coefficient βx implies that increasing a variable x by one standard deviation σx corresponds to a βxσS increase in S, holding other variables fixed.
Examining the regression coefficients showed that the number of leads L was the variable most strongly correlated with team success. Team age T, effective team size m and team size M play the strongest roles after L in team success, and all three were also significant in the presence of the other variables. The coefficient on m was negative while for M it was positive, further underscoring our result that, while teams should be big, they effectively should be small. Next, the diversity D of the team, followed by the total work W done on the project, were also significant measures related to success. Finally, overall team experience E was not significant in this model (p>0.1). We conclude that, while S and E are correlated by themselves, any effects of E are explained by the other quantities.
What about secondary contributions, those activities made by individuals outside the primary team? We already performed one test showing that dominated teams are more successful than non-dominated teams even when there are no secondary contributions. Continuing along these lines, we augmented this linear model with two more dependent variables, Mext and Wext. Regressing on this expanded model (see the electronic supplementary material for details) did not change the significance of any coefficients at the p=0.05 level; E remained insignificant (p>0.1). Both new variables were significant (p<0.05). Note that there were no multicollinearity effects in either regression model (condition numbers less than 10). We conclude that secondary contributions cannot alone explain the observations relating team focus, experience and lead number to team success.
5. Discussion
There has been considerable debate concerning the benefits of specialization compared with diversity in the workplace and other sectors [39]. Our discoveries here show that a high-success team forms a diverse support system for a specialist core, indicating that both specialization and diversity contribute to innovation and success. Team members should be both specialists, acting as the lead contributor to a team, and generalists, offering ancillary support for teams led by another member. This has implications when organizations are designing teams and wish to maximize their success, at least as success was measured in these data. Teams tend to do best on average when they maximize M (figure 1b) while minimizing m (figure 2d) and maximizing D (figure 3) and L (figure 4).
Of course, some tasks are too large for a single person or small team to handle, necessitating the need for mega teams of hundreds or even thousands of members. Our results imply that such teams may be most effective when broken down into large numbers of small, overlapping groups, where all individuals belong to a few teams and are the lead of at least one. Doing so will help maximize the experiential diversity of each sub-team, while ensuring each team has someone ‘in charge’. An important open question is what the best ways are to design such pervasively overlapping groups [14], a task that may be project- or domain-specific but which is worth further exploration.
The negative relationship between effective team size m and success S (as well as the significantly higher presence of dominated teams among high success teams) further belies the myth of multitasking [39] and supports the ‘surgical team’ arguments of Brooks [17]. Focused work activity, often by even a single person, is a hallmark of successful teams. This focus both limits the cognitive costs of task switching, and lowers communication and coordination barriers, as so much work is being accomplished by one or only a few individuals. We have provided statistical tests demonstrating that the relationship between focus and success cannot be due to secondary/external team contributions alone.
Work focus could possibly be explained by social loafing where individual members of a group contribute less effort as part of the group than they would alone, yet loafing does not explain the correlation between e.g. leads and success (figure 4). Likewise, our team composition results on group experience, experiential diversity and the number of leads cannot be easily explained as a confound with success or secondary contributions: they study specific features of the individuals who comprise a team, those features are not related to the successes of other projects an individual may work on, and they strictly control for total team size M (except for the number of leads L, so for that measure we only compared teams with the same M). The measures we used for external team contributions, Mext and Wext, may be considered measures of success themselves, and studying or even predicting their levels from team features may prove a fruitful avenue of future work.
Lastly, there are two remaining caveats worth mentioning. We do not specifically control for automatically mirrored repositories (where a computer script copies updates to GitHub). Accurately detecting such projects at scale is a challenge beyond the scope of this work. However, we expect most will either be filtered out by our existing selection criteria or else they will probably only have a single (automated) user that only does the copying. The second concern is work done outside of GitHub or, more generally, mismatched assignments between usernames and their work. This is also challenging to fully address (one issue is that the underlying git repository system does not authenticate users). We acknowledge this concern for our workload focus results, but even it cannot explain the significant trends we observed on team composition such as the density of leads. Noise due to improperly recorded or ‘out-of-band’ work has in principle affected all quantitative studies of online software repositories.
Supplementary Material
Acknowledgements
We thank Josh Bongard, Brian Tivnan, Paul Hines, Michael Szell and Albert-László Barabási for useful discussions, and we gratefully acknowledge the computational resources provided by the Vermont Advanced Computing Core, supported by NASA (NNX-08AO96G).
Footnotes
For examples, see https://github.com/showcases/science.
Data accessibility
All data analysed are made publicly available by the GitHub Archive Project (https://www.githubarchive.org).
Authors' contributions
M.K. participated in data collection and data analysis, and helped draft the manuscript; J.B. conceived the study, designed the study, carried out data collection and analysis, and drafted the manuscript. All authors gave final approval for publication.
Competing interests
We have no competing interests.
Funding
J.B. has been supported by the University of Vermont and the Vermont Complex Systems Center.
References
- 1.Lazer D. et al. 2009. Life in the network: the coming age of computational social science. Science 323, 721–723. (doi:10.1126/science.1167742) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Vespignani A. 2012. Modelling dynamical processes in complex socio-technical systems. Nat. Phys. 8, 32–39. (doi:10.1038/nphys2160) [Google Scholar]
- 3.Dyer JR, Johansson A, Helbing D, Couzin ID, Krause J. 2009. Leadership, consensus decision making and collective behaviour in humans. Phil. Trans. R. Soc. B 364, 781–789. (doi:10.1098/rstb.2008.0233) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Pentland A. 2012. The new science of building great teams. Harv. Bus. Rev. 90, 60–69.23074865 [Google Scholar]
- 5.Choudhury T, Pentland A. 2003. Sensing and modelling human networks using the sociometer. In 2012 16th Int. Symp. on Wearable Computers, p. 216.
- 6.Barabási AL. 2003. Linked: how everything is connected to everything else and what it means for business, science, and everyday life. New York, NY: Plume. [Google Scholar]
- 7.Newman MEJ. 2010. Networks: an introduction. Oxford, UK: Oxford University Press. [Google Scholar]
- 8.Wasserman S, Faust K. 1994. Social network analysis: methods and applications. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 9.Sparrowe RT, Liden RC, Wayne SJ, Kraimer ML. 2001. Social networks and the performance of individuals and groups. Acad. Manage. J. 44, 316–325. (doi:10.2307/3069458) [Google Scholar]
- 10.Girvan M, Newman MEJ. 2002. Community structure in social and biological networks. Proc. Natl Acad. Sci. USA 99, 7821–7826. (doi:10.1073/pnas.122653799) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Watts DJ, Dodds PS, Newman MEJ. 2002. Identity and search in social networks. Science 296, 1302–1305. (doi:10.1126/science.1070120) [DOI] [PubMed] [Google Scholar]
- 12.Newman MEJ, Park J. 2003. Why social networks are different from other types of networks. Phys. Rev. E. 68, 036122 (doi:10.1103/PhysRevE.68.036122) [DOI] [PubMed] [Google Scholar]
- 13.Bird C, Pattison D, D'souza R, Filkov V, Devanbu P. 2008. Latent social structure in open source projects. In Proc. of the 16th ACM SIGSOFT Int. Symp. on Foundations of Software Engineering. SIGSOFT '08/FSE-16, pp. 24–35. New York, NY: ACM.
- 14.Ahn YY, Bagrow JP, Lehmann S. 2010. Link communities reveal multiscale complexity in networks. Nature 466, 761–764. (doi:10.1038/nature09182) [DOI] [PubMed] [Google Scholar]
- 15.Mucha PJ, Richardson T, Macon K, Porter MA, Onnela JP. 2010. Community structure in time-dependent, multiscale, and multiplex networks. Science 328, 876–878. (doi:10.1126/science.1184819) [DOI] [PubMed] [Google Scholar]
- 16.Palla G, Barabási AL, Vicsek T. 2007. Quantifying social group evolution. Nature 446, 664–667. (doi:10.1038/nature05670) [DOI] [PubMed] [Google Scholar]
- 17.Brooks FP., Jr 1995. The mythical man-month. Boston, MA: Pearson Education. [Google Scholar]
- 18.Faraj S, Sproull L. 2000. Coordinating expertise in software development teams. Manage. Sci. 46, 1554–1568. (doi:10.1287/mnsc.46.12.1554.12072) [Google Scholar]
- 19.Rising L, Janoff NS. 2000. The Scrum software development process for small teams. IEEE Softw. 17, 26–32. (doi:10.1109/52.854065) [Google Scholar]
- 20.Milojević S. 2014. Principles of scientific research team formation and evolution. Proc. Natl Acad. Sci. USA 111, 3984–3989. (doi:10.1073/pnas.1309723111) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.de Montjoye YA, Stopczynski A, Shmueli E, Pentland A, Lehmann S. 2014. The strength of the strongest ties in collaborative problem solving. Sci. Rep. 4, 5277 (doi:10.1038/srep05277) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Clauset A, Young M, Gleditsch KS. 2007. On the frequency of severe terrorist events. J. Confl. Resolut. 51, 58–87. (doi:10.1177/0022002706296157) [Google Scholar]
- 23.Bohorquez JC, Gourley S, Dixon AR, Spagat M, Johnson NF. 2009. Common ecology quantifies human insurgency. Nature 462, 911–914. (doi:10.1038/nature08631) [DOI] [PubMed] [Google Scholar]
- 24.Henrich J. 2004. Demography and cultural evolution: how adaptive cultural processes can produce maladaptive losses: the Tasmanian case. Am. Antiquity 69, 197–214. (doi:10.2307/4128416) [Google Scholar]
- 25.Derex M, Beugin MP, Godelle B, Raymond M. 2013. Experimental evidence for the influence of group size on cultural complexity. Nature 503, 389–391. (doi:10.1038/nature12774) [DOI] [PubMed] [Google Scholar]
- 26.Andersson C, Read D. 2014. Group size and cultural complexity. Nature 511, E1 (doi:10.1038/nature13411) [DOI] [PubMed] [Google Scholar]
- 27.Derex M, Beugin MP, Godelle B, Raymond M. 2014. Derex et al. reply. Nature 511, E2 (doi:10.1038/nature13412) [DOI] [PubMed] [Google Scholar]
- 28.Price DdS. 1963. Little science, big science. New York, NY: Columbia University Press. [Google Scholar]
- 29.Pao ML. 1992. Global and local collaborators: a study of scientific collaboration. Inf. Process Manage. 28, 99–109. (doi:10.1016/0306-4573(92)90096-I) [Google Scholar]
- 30.Hudson J. 1996. Trends in multi-authored papers in economics. J. Econ. Perspect. 10, 153–158. (doi:10.1257/jep.10.3.153) [Google Scholar]
- 31.Latane B, Williams K, Harkins S. 1979. Many hands make light the work: the causes and consequences of social loafing. J. Pers. Social Psychol. 37, 822–832. (doi:10.1037/0022-3514.37.6.822) [Google Scholar]
- 32.Harkins SG. 1987. Social loafing and social facilitation. J. Exp. Soc. Psychol. 23, 1–18. (doi:10.1016/0022-1031(87)90022-9) [Google Scholar]
- 33.Karau SJ, Williams KD. 1993. Social loafing: a meta-analytic review and theoretical integration. J. Pers. Soc. Psychol. 65, 681–706. (doi:10.1037/0022-3514.65.4.681) [Google Scholar]
- 34.Denis JL, Lamothe L, Langley A. 2001. The dynamics of collective leadership and strategic change in pluralistic organizations. Acad. Manage. J. 44, 809–837. (doi:10.2307/3069417) [Google Scholar]
- 35.Johnstone RA, Manica A. 2011. Evolution of personality differences in leadership. Proc. Natl Acad. Sci. USA 108, 8373–8378. (doi:10.1073/pnas.1102191108) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Contractor NS, DeChurch LA, Carson J, Carter DR, Keegan B. 2012. The topology of collective leadership. Lead. Q. 23, 994–1011. (doi:10.1016/j.leaqua.2012.10.010) [Google Scholar]
- 37.Katzenbach JR, Smith DK. 1993. The wisdom of teams: creating the high-performance organization. Boston, MA: Harvard Business Press. [Google Scholar]
- 38.Delmar F, Shane S. 2006. Does experience matter? The effect of founding team experience on the survival and sales of newly founded ventures. Strateg. Organ. 4, 215–247. (doi:10.1177/1476127006066596) [Google Scholar]
- 39.Lindbeck A, Snower DJ. 2000. Multitask learning and the reorganization of work: from Tayloristic to holistic organization. J. Lab. Econ. 18, 353–376. (doi:10.1086/209962) [Google Scholar]
- 40.Postrel S. 2002. Islands of shared knowledge: specialization and mutual understanding in problem-solving teams. Organ. Sci. 13, 303–320. (doi:10.1287/orsc.13.3.303.2773) [Google Scholar]
- 41.Crowston K, Wei K, Howison J, Wiggins A. 2012. Free/Libre open-source software development: what we know and what we do not know. ACM Comput. Surv. 44, 1–35. (doi:10.1145/2089125.2089127) [Google Scholar]
- 42.Scholtes I, Mavrodiev P, Schweitzer F. In press From Aristotle to Ringelmann: a large-scale analysis of team productivity and coordination in Open Source Software projects. Empir. Softw. Eng. (doi:10.1007/s10664-015-9406-4) [Google Scholar]
- 43.Albrecht AJ, Gaffney JE Jr. 1983. Software function, source lines of code, and development effort prediction: a software science validation. IEEE Trans. Softw. Eng. 9, 639–648. (doi:10.1109/TSE.1983.235271) [Google Scholar]
- 44.Rosenberg J. 1997. Some misconceptions about lines of code. In Proc. of Fourth Int. Symp. on Software Metrics, pp. 137–142. New York, NY: IEEE.
- 45.Koch S, Schneider G. 2002. Effort, co-operation and co-ordination in an open source software project: GNOME. Inf. Syst. J. 12, 27–42. (doi:10.1046/j.1365-2575.2002.00110.x) [Google Scholar]
- 46.Alali A, Kagdi H, Maletic J. 2008. What's a typical commit? A characterization of open source software repositories. In Proc. of the 16th IEEE Int. Conf. on Program Comprehension, 2008, pp. 182–191. New York, NY: IEEE.
- 47.Crowston K, Howison J, Annabi H. 2006. Information systems success in free and open source software development: theory and measures. Softw. Process: Improv. Pract. 11, 123–148. (doi:10.1002/spip.259) [Google Scholar]
- 48.Subramaniam C, Sen R, Nelson ML. 2009. Determinants of open source software project success: a longitudinal study. Decis. Support Syst. 46, 576–585. (doi:10.1016/j.dss.2008.10.005) [Google Scholar]
- 49.Ghapanchi AH, Aurum A, Low G. 2011. A taxonomy for measuring the success of open source software projects. First Monday 16 (doi:10.5210/fm.v16i8.3558) [Google Scholar]
- 50.Crowston K, Howison J. 2005. The social structure of free and open source software development. First Monday 10 (doi:10.5210/fm.v10i2.1207) [Google Scholar]
- 51.Mockus A, Fielding RT, Herbsleb JD. 2002. Two case studies of open source software development: apache and mozilla. ACM Trans. Softw. Eng. Methodol. 11, 309–346. (doi:10.1145/567793.567795) [Google Scholar]
- 52.Wang D, Song C, Barabási AL. 2013. Quantifying long-term scientific impact. Science 342, 127–132. (doi:10.1126/science.1237825) [DOI] [PubMed] [Google Scholar]
- 53.Kalliamvakou E, Gousios G, Blincoe K, Singer L, German DM, Damian D. 2014. The promises and perils of mining GitHub. In Proc. of the 11th Working Conf. on Mining Software Repositories, pp. 92–101. New York, NY: ACM.
- 54.Horwitz SK, Horwitz IB. 2007. The effects of team diversity on team outcomes: a meta-analytic review of team demography. J. Manag. 33, 987–1015. (doi:10.1177/0149206307308587) [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data analysed are made publicly available by the GitHub Archive Project (https://www.githubarchive.org).