Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2024 Aug 26;8(10):1959–1971. doi: 10.1038/s41559-024-02512-7

Larger colony sizes favoured the evolution of more worker castes in ants

Louis Bell-Roberts 1,, Juliet F R Turner 1, Gijsbert D A Werner 1,2, Philip A Downing 3, Laura Ross 4, Stuart A West 1
PMCID: PMC7616618  EMSID: EMS198583  PMID: 39187609

Abstract

The size–complexity hypothesis is a leading explanation for the evolution of complex life on earth. It predicts that in lineages that have undergone a major transition in organismality, larger numbers of lower-level subunits select for increased division of labour. Current data from multicellular organisms and social insects support a positive correlation between the number of cells and number of cell types and between colony size and the number of castes. However, the implication of these results is unclear, because colony size and number of cells are correlated with other variables which may also influence selection for division of labour, and causality could be in either direction. Here, to resolve this problem, we tested multiple causal hypotheses using data from 794 ant species. We found that larger colony sizes favoured the evolution of increased division of labour, resulting in more worker castes and greater variation in worker size. By contrast, our results did not provide consistent support for alternative hypotheses regarding either queen mating frequency or number of queens per colony explaining variation in division of labour. Overall, our results provide strong support for the size–complexity hypothesis.

Subject terms: Social evolution, Behavioural ecology


Using data from 794 ant species, the authors test alternative causal models to explain the evolution of complexity in ant colonies. They find evidence that larger colony sizes favoured the evolution of greater division of labour, providing support for the size–complexity hypothesis.

Main

Division of labour has played a pivotal role in the evolution of life on earth, by facilitating evolutionary transitions to more complex organisms14. In these transitions, individuals cooperate to form a new higher-level individual (organism). For example, cells formed multicellular organisms, with germline and soma, and insects formed complex colonies termed ‘superorganisms’, with queen and worker castes3. Division of labour is fundamental to these transitions because specialization allows the individuals that form a higher-level organism to perform more diverse functions and to become more reliant upon each other for reproduction (mutual dependence)4. Consequently, a major aim in evolutionary biology is to understand the factors that favour an increased level of division of labour2,5,6.

The size–complexity hypothesis is one of the leading explanations for the evolution of increased division of labour because of the generality of its explanatory power across different levels of life. In lineages that have undergone a major evolutionary transition, such as to obligate multicellularity or superorganismality, the hypothesis predicts that larger numbers of lower-level individuals (subunits) select for increased division of labour1,2,69. This prediction arises because organisms or superorganisms formed by larger numbers of individuals have more tasks that require doing and are better able to maintain the optimal ratio of specialized individuals performing each task2,6,10,11. Support for the size–complexity hypothesis has been provided by examining division of labour in both individual multicellular organisms and insect colonies. Across multicellular species, organisms with larger numbers of cells have more cell types79. Across ants, species with larger colony sizes have more worker castes1217. However, despite these correlations, challenges to the size–complexity hypothesis remain.

First, alternative factors have been suggested to explain variation in the level of division of labour. For example, in ants, variation in both queen mating frequency and number of queens per colony has been predicted to influence the evolution of number of worker castes6,18,19. When colonies are formed by a single monogamous pair, relatedness among colony members will be maximal, which may reduce conflict within the colony and favour the evolution of multiple worker castes6,19. Alternatively, the opposite prediction could also be made in hymenopteran species, where multiple mating by the queen selects for workers to remove (police) the eggs laid by other workers20. Policing could reduce conflict, leading to a positive correlation between the number of castes and queen mating frequency18. The problem of multiple hypotheses is exasperated by the fact that colony size can be correlated with queen mating frequency13,14,21. Consequently, the correlation between colony size and number of castes could just be the result of a relationship caused by variation in queen mating frequency.

Second, in superorganismal social insects, the correlation between colony size and number of worker castes is open to alternative causal explanations. For example, while larger colony sizes may favour the evolution of greater division of labour, it could alternatively be that increased division of labour favours the evolution of larger colony sizes. The problem of causal direction becomes even greater when we consider that there are multiple correlated variables. For example, it could be that an increase in colony size favours an increase in queen mating frequency, and it is an increase in queen mating frequency that favours the evolution of a greater number of worker castes. In this case, colony size and the number of castes would be correlated. However, this pattern would lead to the false impression that larger colony size directly favours increased division of labour and misses the involvement of an intermediate factor. Ultimately, distinguishing between the role of different factors requires phylogenetically based analyses examining the causal direction and order of evolutionary changes22.

We resolve this problem by performing a series of phylogenetic analyses on the number of physical worker castes in ants. We focus on physical castes which are irreversibly fixed for life and determined during development, rather than behavioural differentiation among workers. This is because extreme specialization, involving mutual dependence, requires differences in size and morphology. Ants provide an excellent opportunity for testing the size–complexity hypothesis because they are a single monophyletic clade in which the number of worker castes varies across species from one to four, and there is considerable information on factors that could influence the evolution of castes2327. We analysed data from 794 species to examine the influence of three possible factors: colony size, queen mating frequency and number of queens per colony. We performed phylogenetic regressions, to determine how these three traits are correlated with the number of castes and each other. This step provided an overview that helped to guide further causal analyses and allowed comparison with previous studies. We then examined the likely causal relationships between number of worker castes, queen mating frequency and colony size with three methods: (1) phylogenetic path analysis28, (2) transition rate analysis between pairs of traits29 and (3) ancestral state reconstruction30. These analyses allowed us to tease apart the different roles of each variable. Last, to assess the robustness of our results, we also tested whether the same patterns emerged when examining an alternative possible measure of division of labour—variation in worker size12,13,18,31.

Results

Summary of ant social traits

Across different ant species, there was considerable variation in the characteristics of a colony. The number of discrete physical worker castes varied from one to four, although 76% of species had only one worker caste (Fig. 1a). Ancestral state reconstruction suggested that polymorphic workers, with more than one caste, had evolved from monomorphic ancestors at least 27 times (Fig. 2 and Supplementary Table 1). Colony sizes varied from 7 to 14,750,000, with a median of 300 (Fig. 1b). The effective number of males that inseminated each queen varied from 1 to 26, with 31% of species mating with a single male, 36% mating with between 1 and 2 males and 33% mating with more than 2 males. The median value for queen mating frequency was 1.1 mates (Fig. 1c). The number of queens per colony varied from 1 to 70, with 63% of species possessing a single queen, 18% possessing between 1 and 2 queens and 19% possessing more than 2 queens. The median number of queens per colony was 1 (Fig. 1d). The dataset was not complete for all traits for each species, and so for each analysis we used the maximum amount of data available. Therefore, sample sizes vary between analyses depending on the variables being analysed, as detailed within each section.

Fig. 1. Distribution of colony characteristics across ant species.

Fig. 1

a, Number of worker castes (nspecies = 699). b, Colony size (nspecies = 541). c, Queen mating frequency (nspecies = 126). d, Number of queens per colony (nspecies = 252). e, Variation in worker size quantified using the CV for worker head width (nspecies = 152). f, Ants vary in their number of physical worker castes. Left: Lasius claviger, a species characterized by having just a single physical worker caste. Right: Two distinct physical worker castes of the same species, Eciton burchelli, representing one of the most extreme cases of morphological caste variation (photo by Alex Wild (https://www.alexanderwild.com/)). Axes for colony size and number of queens per colony are plotted on a log10 scale.

Fig. 2. Evolutionary origins of multiple worker castes (worker polymorphism).

Fig. 2

The ancestral state at the root of the ant phylogeny is a single physical worker caste. Coloured circles represent evolutionary increases in number of worker castes, with colour corresponding to the number of castes that have evolved. The number of worker castes present in each species is also displayed around the outside of the phylogeny. Branch lengths are not to scale. Ant phylogenetic lineages, of varying levels, are displayed in black text around the outside of the phylogeny. Analyses were repeated over 400 phylogenetic trees, but we display the results of a single representative tree.

Phylogenetic correlations

We used Bayesian phylogenetic mixed effect models (BPMMs) to test for correlations between the different ant traits. We found that colony size and queen mating frequency were both positively correlated with number of worker castes (Fig. 3a,b; BPMMs: colony size: β = 0.17, credible interval (CI) = 0.11 to 0.24, nspecies = 436; queen mating frequency: β = 0.47, CI = 0.04 to 0.91, nspecies = 104). By contrast, we found no relationship between number of queens per colony and number of worker castes (Fig. 3c; BPMM: β = −0.14, CI = −0.56 to 0.30, nspecies = 192).

Fig. 3. Relationships between colony size, queen mating frequency and number of queens per colony, both among themselves and with number of worker castes.

Fig. 3

a,b, Species with larger colony sizes (nspecies = 436) (a) and higher queen mating frequencies (nspecies = 104) (b) had significantly more worker castes. c,d,f, We did not find a significant association between number of queens per colony and either number of worker castes (nspecies = 192) (c), queen mating frequency (nspecies = 111) (d) or colony size (nspecies = 195) (f). e, Species with higher queen mating frequencies had significantly larger colony sizes (nspecies = 109). ad, Data show mean ± standard error. e,f, Fitted lines are mean regression slopes with 95% CIs from BPMMs using a single phylogenetic tree. Solid regression lines represent significant relationships, while dashed regression lines represent non-significant relationships. Dots represent species averages. Axes for colony size, queen mating frequency and number of queens per colony are plotted on a log10 scale.

We also found that species with higher queen mating frequencies had larger colony sizes (Fig. 3e; BPMM: β = 1.25, CI = 0.50 to 2.03, nspecies = 109, R2 = 0.08). The significant correlations between queen mating frequency, colony size and number of worker castes emphasizes the need to carry out analyses that can examine the underlying causality between these variables. By contrast, we found no relationship between number of queens per colony and colony size (Fig. 3f; BPMM: β = 0.07, CI = −0.37 to 0.49, nspecies = 195, R2 = 0.00). However, we did find a negative relationship, which was close to significance, between number of queens per colony and queen mating frequency (Fig. 3d; BPMM: β = −0.20, CI = −0.39 to 0.01, nspecies = 111, R2 = 0.00). Based on the results of our correlational analysis, we excluded the number of queens per colony from our causality analysis, except for a potential relationship between number of queens per colony and queen mating frequency, where a negative relationship was also observed previously25.

Causality analyses

We then used three different methods to examine the causal relationships that underly these correlations: (1) phylogenetic path analysis, (2) transition rate analysis and (3) ancestral state reconstruction. These forms of analysis required number of worker castes to be modelled as a binary variable, and so we classified species as possessing either a single (monomorphic) or multiple (>1, polymorphic) worker castes.

Phylogenetic path analysis

We used phylogenetic path analysis to test four alternative causal models of the relationships between the following: number of worker castes, colony size, queen mating frequency and number of queens per colony (Extended Data Fig. 1). We found strong evidence in favour of the size–complexity hypothesis. The analysis supported a single causal model, where large colony sizes favoured the evolution of multiple worker castes, and size was the only variable that directly did so (Fig. 4, Extended Data Fig. 2 and Supplementary Tables 2 and 3; ω (relative weight of support for a model within a set of competing models) = 0.67, nspecies = 94). We found no support for alternative causal models where colony size is influenced by number of worker castes or where number of worker castes is influenced by queen mating frequency (Supplementary Table 2; nspecies = 94).

Extended Data Fig. 1. The full set of potential causal models tested in the phylogenetic path analyses illustrated using directed acyclic graphs (DAGs).

Extended Data Fig. 1

In different path analyses, division of labour was modelled either as the number of discrete physical worker castes (Fig. 4) or as variation in worker size (Fig. 6b).

Fig. 4. The best-supported causal model identified by phylogenetic path analysis.

Fig. 4

Larger colony sizes facilitated the evolution of multiple worker castes. We display the results from the analysis of one MCC consensus tree (Supplementary Tables 2 and 3). However, we repeated the analyses on three additional MCC trees and, in each case, found that the conclusions drawn remained unchanged due to the similarity in the presence, direction and magnitude of the causal relationships identified (Extended Data Fig. 2 and Supplementary Tables 2 and 3). Values represent standardized path coefficients with 95% confidence intervals (path coefficients describe the strength and direction of the relationship between two variables in terms of their correlated variance, after accounting for the effects of other variables in the model). Arrows indicate the direction of the relationship between variables, with heavier lines indicating larger coefficients. Number of worker castes was modelled as a discrete binary variable, while colony size, queen mating frequency and number of queens per colony were modelled as continuous variables.

Extended Data Fig. 2. Average of the best-performing models where division of labour analysed as the number of physical worker castes (single caste/multiple castes).

Extended Data Fig. 2

Analysis repeated over four different MCC ant trees from Economo et al. 65: a) NC uniform stem b) NC uniform crown c) FBD stem d) FBD crown. Values represent standardised average coefficients. CN = number of worker castes; CS = colony size; MF = queen mating frequency; QN = number of queens per colony.

Transition rate analysis

We used transition rate analysis as a second method to test the hypothesized causal relationships between number of worker castes and both colony size and queen mating frequency. This method allowed us to test whether the evolution of two traits is correlated, as well as the underlying causal direction, but required all traits to be modelled as discrete binary variables. We classified species as having small or large colony sizes depending upon whether colony size was smaller (≤300) or larger (>300) than the median colony size. We classified species as having low or high mating frequencies depending on whether queens mated more (>2) or less than or equal to (≤2) two times.

We found very strong support for the size–complexity hypothesis, with correlated evolution between colony size and number of worker castes (Supplementary Table 4; Bayes factor (BF) = 20.83, nspecies = 436; BFs quantify the relative support for two competing models where BF > 2 offers positive evidence, > 5 provides strong evidence and >10 very strong evidence that the more complex model performs better than the simpler model29). Transitions to multiple castes only occurred in species with large colony sizes (Fig. 5). In species with only a single worker caste, transition rates from small colony sizes to large colony sizes were slightly elevated in comparison to species with multiple castes (Fig. 5). By contrast, we found that variation in queen mating frequency does not influence the evolution of number of worker castes, as queen mating frequency evolved independently of the number of worker castes (Supplementary Table 5; queen mating frequency and worker castes: BF = −1.39, nspecies = 104).

Fig. 5. Evolutionary transitions between number of worker castes and colony size.

Fig. 5

Multiple worker castes only evolved in lineages with large colony sizes. Values represent the inferred transition rates between number of worker castes and colony size, accompanied by 95% CIs (nspecies = 436). Transition rates were estimated using rjMCMC implemented in BayesTraits. Each possible state is shown as a box, and the direction of transitions between states are shown with arrows, with higher rates of transition indicated by a heavier line.

Ancestral state reconstruction

We used ancestral state reconstruction as a third method to examine the underlying causal relationships and again found support for the size–complexity hypothesis. We found that the ancestors of species that evolved multiple worker castes had colony sizes that were more than 3.5 times larger compared with the ancestors of species that retained one worker caste (Extended Data Fig. 3a; BPMM: ancestral colony size in species with one worker caste, β = 281.67, CI = 40.04 to 1,590.90; ancestral colony size in species with multiple worker castes, β = 1,025.05, CI = 137.17 to 9,064.82; Psingle vs multiple < 0.01, nspecies = 436). By contrast, queen mating frequency did not differ significantly between ancestors that evolved multiple worker castes and those that did not (Extended Data Fig. 3b; BPMM: ancestral queen mating frequency in species with one worker caste, β = 1.99, CI = 0.95 to 3.90; ancestral queen mating frequency in species with multiple worker castes, β = 2.46, CI = 1.10 to 5.55; Psingle vs multiple = 0.16, nspecies = 104). The phylogenetic signal present in both colony size and queen mating frequency across ants allowed us to estimate ancestral values relatively accurately in these traits (Supplementary Table 6; BPMMs: phylogenetic heritability of colony size = 72.61%, CI = 59.67% to 84.02%; phylogenetic heritability of queen mating frequency = 88.94%, CI = 74.01% to 99.86%).

Extended Data Fig. 3. Ancestral state reconstruction: larger colony sizes favour the evolution of multiple worker castes.

Extended Data Fig. 3

(a) The posterior distribution from a BPMM estimating colony size in ancestors with a single worker caste that led to descendants with a single caste (single to single) or multiple castes (single to multiple). Colony size is larger in the ancestors of lineages that evolve multiple castes, Psingle vs multiple = 0.003 (nspecies= 436). (b) The posterior distribution from a BPMM estimating queen mating frequency in ancestors with a single worker caste that led to descendants with a single caste (single to single) or multiple castes (single to multiple). Queen mating frequency was not significantly different in the ancestors of lineages that evolved multiple castes and the ancestors of lineages that retained a single caste, Psingle vs multiple = 0.16 (nspecies= 104).

Division of labour and worker size

Taken together, our analyses on the number of worker castes provide strong support for the size–complexity hypothesis. Larger colony sizes correlate with and appear to lead to the evolution of more worker castes (Figs. 35). However, most ant species do not possess more than one physical worker caste. Despite this, there can be considerable variation in worker size within castes, and there is some evidence that different-sized workers perform different roles even within ant species that have a single physical worker caste32. Consequently, an alternative method for testing the size–complexity hypothesis would be to examine the relationship between colony size and variation in worker size, rather than number of castes, as a measure of non-reproductive division of labour12,13,18,31.

Using data obtained from AntWeb, we examined variation in the head width measurements of 1,064 workers from 152 species and found mixed support for the size–complexity hypothesis (Extended Data Fig. 4). We found a positive correlation between colony size and variation in worker size and that larger colony sizes may favour the evolution of greater variation in worker size (Fig. 6, Extended Data Fig. 5 and Supplementary Tables 7 and 8; BPMM: colony size, β = 0.05, CI = 0.03 to 0.08, nspecies = 122, R2 = 0.14; path analysis: nspecies = 94). However, there was some uncertainty regarding the causal direction of the relationship between colony size and variation in worker size. Among the three supported causal models identified by path analysis, larger colony sizes favoured the evolution of greater variation in worker size in two cases, and greater variation in worker size favoured the evolution of larger colony sizes in the third case (the model fit was compared using the corrected C-statistic Information Criterion (CICc), where models within 2 CICc units of the best model were considered to have equal support; Supplementary Table 7; combined weight for the models supporting the hypothesis that variation in worker size is influenced by colony size: ω = 0.55). Due to the uncertainty regarding the best-supported model, the confidence intervals for the path coefficients in the average model, linking both colony size and queen mating frequency with variation in worker size, were relatively large and overlapped with zero (Fig. 6b). When examining species with only a single worker caste, we found the contrasting result that greater variation in worker size favours the evolution of larger colony sizes (Fig. 6a, Extended Data Fig. 6 and Supplementary Tables 9 and 10; BPMM: β = 0.04, CI = 0.01 to 0.07, nspecies = 84, R2 = 0.08; path analysis: ω = 0.75, nspecies = 60).

Extended Data Fig. 4. Measurement of ant head width.

Extended Data Fig. 4

We downloaded front-view images of ant heads from www.antweb.org and measured at the widest point excluding the eyes (horizontal white line) using the image-processing software ImageJ. © California Academy of Sciences. Clockwise from top left, images by Erin Prado, Erin Prado, Will Ericson, and Z. Lieberman. Specimen codes: CASENT0179923, CASENT0179924, CASENT0280193, CASENT0911107. All images licensed under CC BY 4.0. https://creativecommons.org/licenses/by/4.0/.

Fig. 6. Larger colony sizes may favour the evolution of greater variation in worker size.

Fig. 6

a, Species with larger colony sizes possess significantly greater variation in worker size when analysing either of the following: (1) all species for which data were available (all dots and regression line in black, nspecies = 122) or (2) only species which possess a single worker caste (dots and regression line in light blue, nspecies = 84). Fitted lines represent mean regression slopes from BPMMs. Dots represent species averages. Colony size is plotted on a log10 scale, while variation in worker size is on a square root scale. b, Averaged model from the three supported path analysis models (Supplementary Tables 7 and 8). Larger colony sizes may favour the evolution of greater variation in worker size, but opposite causation remains possible. Analysis performed using all species where data were available for variation in worker size, colony size, number of queens per colony and queen mating frequency (nspecies = 94). We display the results from the analysis of one MCC consensus tree. We repeated the analyses on three additional MCC trees and, in each case, found that the conclusions drawn remained unchanged due to the similarity in the presence, direction and magnitude of the causal relationships identified (Extended Data Fig. 5 and Supplementary Tables 7 and 8). Values represent standardized path coefficients with 95% confidence intervals. Arrows indicate the direction of the relationship between variables, with heavier lines indicating larger coefficients. Variation in worker size, colony size, queen mating frequency and number of queens per colony were modelled as continuous variables.

Extended Data Fig. 5. Average of the best-performing models where division of labour analysed as variation in worker size.

Extended Data Fig. 5

Analysis performed over all species for which data on all four traits was available. Analysis repeated over four different MCC ant trees from Economo et al. 65 a) NC uniform stem b) NC uniform crown c) FBD stem d) FBD crown. Values represent standardised average coefficients. SV = variation in worker size (degree of worker polymorphism); CS = colony size; MF = queen mating frequency; QN = number of queens per colony.

Extended Data Fig. 6. Average of the best-performing models where division of labour analysed as variation in worker size and only using species with a single physical worker caste.

Extended Data Fig. 6

Analysis repeated over four different MCC ant trees from Economo et al. 65: a) NC uniform stem b) NC uniform crown c) FBD stem d) FBD crown. Values represent standardised average coefficients. SV = variation in worker size (degree of worker polymorphism); CS = colony size; MF = queen mating frequency; QN = number of queens per colony.

By contrast, we did not find support for the hypothesis that variation in worker size is influenced by queen mating frequency. This was the case when analysing either all species or just those with a single worker caste (Fig. 6b, Extended Data Fig. 57 and Supplementary Tables 711).

Extended Data Fig. 7. Relationships between queen mating frequency and number of queens per colony with variation in worker size.

Extended Data Fig. 7

Species with higher queen mating frequencies exhibited significantly greater variation in worker size (a, BPMM: β = 0.12, CI = 0.01 to 0.23, nspecies = 105). We did not find a significant association between number of queens per colony and variation in worker size (b, BPMM: β = −0.03, CI = −0.12 to 0.07, nspecies = 106). When reanalysing our data so that we only included species with a single worker caste (c-d), we found no association between either queen mating frequency or number of queens per colony with variation in worker size (c, BPMMs: queen mating frequency: β = −0.00, CI = −0.10 to 0.10, nspecies = 65; d, queen number: β = −0.02, CI = −0.12 to 0.07, nspecies = 68). Fitted lines are mean regression slopes with 95% CIs from BPMMs using a single phylogenetic tree. Solid regression lines represent significant relationships, while dashed regression lines represent non-significant relationships. Dots represent species averages. Axes for queen mating frequency and number of queens per colony are plotted on a log10 scale. Axes for variation in worker size are on a square root scale.

Sensitivity analyses

We examined whether our results were robust to alternative methods of quantifying variation in queen mating frequency and number of queens per colony as categorical rather than continuous variables21,25,33. Detailed statistics for each analysis are presented in the supplementary material (Extended Data Figs. 810 and Supplementary Tables 1119). In all cases, the size–complexity hypothesis was still supported, but some of the other relationships depended upon how data were categorized. Our conclusions regarding the phylogenetic correlations were robust, except for the finding that the number of worker castes did not significantly differ between singly mated and facultatively multiply mated species (Supplementary Table 19). The conclusions of our causal analyses were also robust, except when analysing the number of discrete worker castes using phylogenetic path analysis. We identified a single supported model where the evolution of multiple worker castes was favoured by both larger colony sizes and obligate multiple mating by queens. However, the confidence interval for the path coefficient associated with queen mating frequency overlapped with zero (Extended Data Fig. 8 and Supplementary Tables 12 and 13).

Extended Data Fig. 8. Average of the best-performing models where division of labour analysed as the number of physical worker castes (single caste/multiple castes).

Extended Data Fig. 8

Queen mating frequency (obligately monandrous & facultatively polyandrous or obligately polyandrous) and number of queens per colony (obligately monogynous & facultatively polygynous or obligately polygynous) were analysed as binary traits. Analysis repeated over four different MCC ant trees from Economo et al. 65: a) NC uniform stem b) NC uniform crown c) FBD stem d) FBD crown. However, when using the FBD stem tree for the analysis, the model did not converge, and we exclude this result. Values represent standardised average coefficients. CN = number of worker castes; CS = colony size; MF = queen mating frequency; QN = number of queens per colony.

Extended Data Fig. 10. Average of the best-performing models where division of labour analysed as variation in worker size and only using species with a single physical worker caste.

Extended Data Fig. 10

Queen mating frequency (obligately monandrous & facultatively polyandrous or obligately polyandrous) and number of queens per colony (obligately monogynous & facultatively polygynous or obligately polygynous) were analysed as binary traits. Analysis repeated over four different MCC ant trees from Economo et al. 65: a) NC uniform stem b) NC uniform crown c) FBD stem d) FBD crown. However, when using the NCuniform stem tree for the analysis, the model did not converge, and we exclude this result. Values represent standardised average coefficients. SV = variation in worker size (degree of worker polymorphism); CS = colony size; MF = queen mating frequency; QN = number of queens per colony.

Discussion

We found strong and consistent support for the size–complexity hypothesis, with larger colony sizes appearing to favour the evolution of multiple worker castes (Figs. 36). By contrast, we did not find consistent support for the hypothesis that multiple mating favours the evolution of multiple worker castes (Figs. 3, 4 and 6b). We found no evidence to suggest a direct relationship between the evolution of number of worker castes and number of queens per colony (Fig. 3c). Our conclusions were robust to different analysis methods.

Our results suggest that larger colony sizes favour the evolution of multiple worker castes in ants. Previous studies had highlighted a positive correlation between increased division of labour and both cell number in multicellular organisms and colony size in social insects, but these results were open to multiple explanations79,1214,18,21. Our results reveal a distinct evolutionary pattern where larger colony sizes tend to evolve before multiple worker castes (Figs. 4 and 5). There are at least two reasons why larger colonies may promote the evolution of multiple castes. First, as colony size increases, the number of tasks that need to be performed also increases2. For example, species with larger colonies may experience greater logistical challenges in transporting food or waste within or outside the colony. Second, species with larger colonies are better able to maintain the optimal ratio of the different castes10,34. Consequently, these species face a lower risk of losing essential worker functions if workers are lost.

Our analysis also suggests that larger colony sizes favoured the evolution of greater variation in worker size (Fig. 6). However, there is some uncertainty regarding the direction of this relationship, as causality could potentially be reversed. In addition, we found that this result only applied when analysing across species with variable numbers of physical worker castes. When analysing variation in worker size exclusively in species with a single caste, we found that greater variation in worker size preceded the evolution of larger colony sizes. Consequently, while our analyses of variation in worker size support the size–complexity hypothesis, they suggest that colony size is not responsible for the evolution of variation in worker size in species that lack discrete physical worker castes.

We did not find consistent support for the hypothesis that multiple mating favours the evolution of multiple worker castes. There is a positive correlation between both number of worker castes and variation in worker size with queen mating frequency, regardless of whether it is analysed as a continuous or categorical trait (Fig. 3b). However, when analysing the causal relationship between these variables, we found that higher mating frequency was not consistently associated with evolutionary transitions from single to multiple worker castes (Figs. 4 and 6b, Extended Data Figs. 8 and 9 and Supplementary Tables 5 and 18). This finding is consistent with the interpretation that queen mating frequency is positively correlated with number of worker castes due to both variables being positively associated with colony size (Figs. 3 and 4). Colony size and queen mating frequency may be positively correlated due to the advantages of increased genetic diversity that result from multiple mating21,3541. If species with larger colony sizes are at greater risk of infection from pathogens, for example, resulting from increased traffic of foragers into the nest, they may benefit more from the protection offered against disease by increased genetic diversity4042.

Extended Data Fig. 9. Average of the best-performing models where division of labour analysed as variation in worker size.

Extended Data Fig. 9

Analysis performed over all species for which data on all four traits was available. Queen mating frequency (obligately monandrous & facultatively polyandrous or obligately polyandrous) and number of queens per colony (obligately monogynous & facultatively polygynous or obligately polygynous) were analysed as binary traits. Analysis repeated over four different MCC ant trees from Economo et al. 65 a) NC uniform stem b) NC uniform crown c) FBD stem d) FBD crown. Values represent standardised average coefficients. SV = variation in worker size (degree of worker polymorphism); CS = colony size; MF = queen mating frequency; QN = number of queens per colony.

Excluding ants, possessing more than one physical worker caste is extremely rare in the social Hymenoptera. Multiple worker castes appear to be entirely absent in vespine wasps, while in bees, the presence of two castes has only been identified in some stingless bee species31,43. For example, Tetragonisca angustula possesses two castes—soldiers are both approximately 30% heavier than foragers and have a different morphology43. This contrasts with the much larger variation in ants—for example, major workers in Atta colombica leafcutter ants can be 60 times heavier than the smallest workers44. Why does the causal link between colony size and number of castes that we have identified in ants seemingly not apply in superorganismal species of bees and wasps31? Multiple hypotheses have been proposed, suggesting that factors such as the presence of a powerful sting or body size constraints imposed by having winged workers could have impeded the evolution of multiple worker castes in bees and wasps6,24,45. However, the phylogenetic power available to carry out statistical tests of these hypotheses is low. Other ways to test the size–complexity hypothesis include examining behavioural variation in bees and wasps or the physical caste variation in termites6,31,4648.

To conclude, our results provide strong support for the size–complexity hypothesis in ants, suggesting that larger colony sizes have favoured the evolution of greater division of labour (more castes and greater variation in size). The generality of this hypothesis could be tested by carrying out similar analyses on other forms of division of labour, such as the number of cell types in multicellular organisms9,49,50 or the difference between queens and workers in superorganismal social insects48,51.

Methods

To test whether colony size, queen mating frequency or number of queens per colony explains variation in non-reproductive division of labour in ants, we performed a large-scale phylogenetic comparative analysis. We first tested which factors correlate with number of worker castes and then used three different methods to examine the causal relationships that underly these correlations: phylogenetic path analysis, transition rate analysis and ancestral state reconstruction. We examined causality in these three different ways as it also allowed us to test that our results are robust to different methods of analysis. We also examined whether our results were robust to alternative methods of quantifying variation in queen mating frequency and number of queens per colony. All the species used in the analysis are listed in the supplementary materials (Supplementary Table 20), and we follow the taxonomic classification of ants listed in Bolton’s Catalogue available from AntCat.org52. When analysed as continuous variables, colony size, queen mating frequency and number of queens per colony were log10 transformed for all analyses, while variation in worker size was square root transformed.

Data collection

We collected data on five ant traits: (1) the number of discrete physical worker castes, (2) colony size (number of workers in mature colonies), (3) effective queen mating frequency21,53 (estimated number of mates weighted by the proportion of offspring sired by each male; referred to as ‘queen mating frequency’), (4) observed number of queens per colony and (5) variation in worker size. We started by gathering data from major reviews, comparative studies and books and then performed a topic search of the primary literature using Web of Science2427. We also incorporated data collected by the Global Ant Genomics Alliance consortium54. In total, we collected data for 794 species from 160 different genera.

We collected data on queen mating frequency and number of queens per colony by searching papers published between 1 January 2008 and 2 May 2020. Among these papers, we included data from Hughes et al. 25, which summarizes the available data on these variables in the eusocial Hymenoptera. We assumed that all relevant data published before 1 January 2008 had been captured by Hughes et al. 25. We used the following keywords: ‘Ant’ AND (monandr* OR monogyn* OR polyandr* OR polygyn* OR effective-mating-frequenc* OR mating-frequenc* OR paternity-frequenc* OR mating-system* OR sociogenetic-structure*). We chose to use estimates of effective mating frequency, rather than number of copulations, because it more accurately reflects the genetic relatedness among workers in a colony, considering the potential influence of biased sperm use by queens that have mated multiple times21,55. When multiple estimates of queen mating frequency were available, we calculated species-level averages for queen mating frequency using the harmonic mean. We used the harmonic mean because when mating frequency is low, increases in mating frequency will have greater effects on intra-colony relatedness. Therefore, harmonic means, which assign greater weight to small numbers, better represent the genetic effects of multiple mating on colony-wide relatedness than arithmetic means56. We collected data on the average observed number of queens per colony, rather than the effective number of queens per colony57 (the estimated number of reproductive queens weighted by the respective contribution of each queen to the production of offspring) because this was the only available type of continuous data with a sufficient sample size for a large-scale phylogenetic comparative analysis. When multiple estimates of number of queens per colony were available, we calculated species-level averages. For queen mating frequency and number of queens per colony, we also collected categorical data with three factor levels for each trait from the papers identified in our literature search. These factor levels include obligate single mating (monandry), facultative multiple mating (facultative polyandry) where some monandrous queens still occur in the population and obligate multiple mating (obligate polyandry) where mature colonies are never monandrous. Similarly, for number of queens per colony, these levels were obligate single queen (monogyny), facultative multiple queens (facultative polygyny) and obligate multiple queens (obligate polygyny).

Species were classed as having one to four physical worker castes based on the following criteria: Species were classed as having a single worker caste if they showed limited variation in size or monophasic allometric scaling. Species were classified as having more than one caste if there was variation in body size with non-allometric scaling, where the number of castes depended on the number of different scaling relationships between body parts. When morphometric data were absent, the number of castes was determined using direct estimates provided in the literature. We searched for published data up to and including 13 July 2020, and we used the following keywords: ‘Ant’ AND (worker polymorph* OR worker monomorph* OR (morphometric AND worker* AND caste*) OR subcaste* OR sub-caste* OR worker dimorph* OR major-worker* OR minor-worker* OR worker AND allometr*). We then performed two additional searches to collect missing data for number of worker castes. First, we performed a species-specific literature search for instances where data on queen mating frequency or number of queens per colony was available for a given species but number of castes was not. Our search query included the species name along with the terms ‘(worker polymorph* OR worker monomorph* OR (morphometric AND worker* AND caste*) OR subcaste* OR sub-caste* OR worker dimorph* OR major-worker* OR minor-worker* OR worker AND allometr* OR soldier* OR replete*)’. Second, we performed a search at the genus level for genera where no caste data had been collected. The search query used the same terms as mentioned above, in combination with the genus name.

To collect data on colony size, we performed a literature search focussing on species for which we already possessed data on queen mating frequency or number of queens per colony. Using Google Scholar, we searched for the following keywords in combination with the species name ‘colony size OR colony collection OR worker number’. We adopted this approach based on the methodology outlined in Dornhaus et al. 14. Google Scholar was chosen as it indexes the entire text of research papers. Notably, details regarding colony size are often not the primary focus of papers, resulting in brief mentions within the methods or results sections. Therefore, it is less likely to be recovered by search engines that rely solely on keywords, titles and abstracts to retrieve such information. When multiple measurements existed for a given species, we calculated arithmetic means weighted by sample size.

We collected data on variation in worker size by measuring the width of ant heads12,13,18,31. We quantified the relative variation in worker size for each species by calculating the coefficient of variation (CV), which is achieved by dividing the standard deviation of worker head width by its mean value (CVworkersize=workerheadwidthstandarddeviationworkerheadwidthmean). To obtain images of the species for which we already possessed data on colony size and queen mating frequency, we searched for scaled images on the AntWeb online database (antweb.org)58. We downloaded front-view images of the ants’ heads and measured at the widest point excluding the eyes using the image-processing software ImageJ v1.53i59. In total, we measured 1,064 worker ant heads (Extended Data Fig. 4). The number of workers measured per species ranged from 2 to 33, with a mean of 7.

We excluded 112 species from our dataset for the analysis based on their distinct life history traits. These traits represent evolutionarily derived elaborations of the ancestral full-sibling-colony state at the transition to superorganismal colonies at the root of the ant clade. They included (1) species that formed supercolonies (vast networks of connected nests); (2) social parasites, lacking some or all of the worker castes (although temporary social parasites which only establish new colonies with the assistance of a host species were included in the analysis); (3) those that can reproduce parthenogenetically to produce queens and/or workers; (4) those that reproduce via gamergates (mated workers that reproduce sexually) following the evolutionary loss of the original queen caste; or (5) those that use interlineage hybridization for genetic caste determination (Supplementary Table 21). We gathered the information from papers that had summarized the available data on these five traits6064. We excluded these species from the analysis as they represent secondary reductions of complexity in social organization and are likely experiencing different selection pressures for either the evolution of worker castes, colony size, queen mating frequency or number of queens per colony (Supplementary Table 22). This decision to remove species characterized by evolutionarily derived social systems improved our ability to detect which variables could influence the evolution of the number of worker castes.

Phylogeny

To control for shared evolutionary history in our comparative analyses, we used a posterior sample of 400 phylogenetic trees produced by Economo et al. 65, which included 14,594 species and of which 731 overlapped with the species in our dataset65. However, as time-calibrated, molecular phylogenies are only available for ants at the genus level, the species topology within genera varied widely across the sample of 400 trees. Therefore, we took steps to account for phylogenetic uncertainty in each of our analyses (see each section below for details).

Statistical analyses

All analyses were performed in R v4.2.2 apart from transition rate models that were conducted in Bayestraits V429,66.

Bayesian phylogenetic models

We fitted BPMMs with Markov chain Monte Carlo (MCMC) estimation using the MCMCglmm package v2.3467. Models were run for a minimum of 1,100,000 iterations, with a burn-in of 100,000 and thinning interval of 1,000. However, models were run for as long as necessary to obtain posterior effective sample sizes of at least 300 for all parameters. Overall, the majority of estimates had an effective sample size of at least 1,000. To ensure model convergence, we used the coda package v0.19-4 to calculate the degree of autocorrelation between successive iterations in each chain. We fitted each model independently two times and used Gelman and Rubin’s convergence test to compare within- and between-chain variance68,69. We modelled the number of worker castes as a discrete variable for our regression analyses and used a Poisson error distribution with a log-link function. We modelled colony size, queen mating frequency and number of queens per colony as Gaussian traits. The prior settings used for each analysis are specified in the supplementary R code. To select priors for random effects, we first checked model convergence using inverse-Wishart priors (V = 1, ν = 0.002). However, in situations where the MCMC chain showed poor mixing properties, particularly in cases involving discrete response variables, we examined two different parameter expanded priors: the Fisher prior (V = 1, ν = 1, α.μ = 0, α.V = 1,000) and the χ2 prior (V = 1, ν = 1,000, α.μ = 0, α.V = 1). We specified an inverse-Wishart prior for residual variances (V = 1, ν = 0.002)70. For fixed effects, we used the default priors in MCMCglmm. We report parameter estimates from models as posterior modes along with 95% lower and upper CIs. For BPMMs with a Gaussian error distribution, we also calculated marginal R2 values and report the median value for each analysis from a distribution of R2 values estimated across a sample of 400 trees71. The reported P values, which assess differences between levels, such as species with a single worker caste versus species with multiple worker castes, represent the number of iterations where one level is greater than the other level divided by the total number of iterations.

Correlational analyses

Estimating phylogenetic correlations using MCMCglmm

We used BPMMs to test for correlations between four traits: number of worker castes, colony size, queen mating frequency and number of queens per colony. In addition, we investigated the correlations between variation in worker size and colony size, queen mating frequency and number of queens per colony. When analysing variation in worker size, our analysis was split into two parts. The first part examined correlations across all species for which worker size data were available, while the second part specifically examined species that have a single worker caste. To account for phylogenetic uncertainty in our BPMMs, we repeated each analysis 400 times, each time with a different tree, and combined the posterior samples produced from each tree before parameter estimation. Model convergence was assessed as described in the section ‘Statistical analyses’.

Causality analyses: examining colony size and levels of polyandry that preceded the evolution of multiple worker castes

Phylogenetic path analysis analysing number of discrete physical worker castes

To test how the evolution of number of worker castes was influenced by colony size and queen mating frequency, we used phylogenetic path analysis28,72. This method compares alternative models of the causal relationships between traits, disentangling direct from indirect effects. After removing species with incomplete data for all variables, we analysed the data for a total of 94 species. By comparing four different causal models, we tested between the following possibilities: (1) variation in colony size directly influenced the evolution of number of worker castes, but queen mating frequency did not; (2) variation in number of worker castes influenced the evolution of colony size, which then predicted queen mating frequency; (3) variation in both colony size and queen mating frequency directly influenced the evolution of number of worker castes; and (4) variation in queen mating frequency directly influenced the evolution of number of worker castes, but colony size did not (list of full models in Extended Data Fig. 1). In each of the causal models tested, we assumed that the evolution of queen mating frequency was influenced by variation in both the number of queens per colony and colony size. We made these assumptions because (1) it has been suggested that the evolution of queen mating frequency has been influenced by variation in the number of queens per colony21,25 and (2) larger colony sizes may select for multiple mating40,41. Larger colonies may be at greater risk of infection from pathogens, and multiple mating may offer protection against disease because it increases genetic diversity within colonies37,4042. However, multiple mating is a costly trait, and therefore, we hypothesized that larger colony sizes may evolve first and then select for multiple mating if the benefits of multiple mating outweigh the costs73. We excluded the relationship between the number of queens per colony and either the number of castes or colony size in our models as we found no correlation between these variables in our regression analysis (section ‘Phylogenetic correlations’).

Based on the causal relationships presented in the alternative path analysis models, we generated conditional independence statements that can be formulated and tested as a set of phylogenetic generalized linear models. For each model, we calculated Fisher’s C statistic and conducted the d-sep test28. P values less than 0.05 indicate that a proposed candidate model should be rejected. Next, we compared the fit of different models using CICc scores, where the lowest score represents the best candidate model74. Models that have ΔCICc < 2 were considered to have equal support. We used a model-averaging approach to estimate path coefficients that assigned weights to causal links based on the CICc weight (ω) of the supported models (ΔCICc < 2). When a path was absent in a model, we assumed its coefficients and variance to be zero for the purpose of model averaging. We transformed the number of castes into a binary variable (single worker caste/multiple worker castes) for the analysis as the phylopath package does not support discrete count data as response variables. We then used a binomial family for binary response variables when modelling number of worker castes. To the best of our knowledge, there is currently no established method for combining results obtained across different trees to account for phylogenetic uncertainty in frequentist analyses. Therefore, we performed our analysis four times, each time using a different tree, from a sample of four maximum clade credibility (MCC) trees produced by Economo et al. 65. Each MCC tree was constructed from a sample of 100 trees. We then compared the results from each of the four analyses to determine whether our results remained consistent irrespective of the phylogeny that was used. In the main text, we present the results from a single tree as we found that the conclusions drawn across trees remained unchanged due to the similarity in the presence, direction and magnitude of the causal relationships identified. We present the results using the alternative MCC trees in the supplementary material (Extended Data Fig. 2 and Supplementary Tables 2 and 3). All phylogenetic path analyses were carried out using phylopath v1.1.3 in R75.

Phylogenetic path analysis analysing variation in worker size

We repeated our analysis, examining variation in worker size instead of number of worker castes. We used head width as a measure of worker size and modelled it as continuous variable. Models were constructed using Pagel’s λ76 for the associated error structure, and we compared the same four causal models that were used in the previous path analysis (Extended Data Fig. 1). To investigate whether differences in the number of worker castes were driving the results when analysing worker size, we performed the analysis in two different ways: (1) with the full set of species for which we had data available for all four traits (nspecies = 94) and (2) limiting our analysis to only species that possess a single worker caste (monomorphic; nspecies = 60). To account for phylogenetic uncertainty, we repeated each analysis using the same four MCC trees used in the previous path analysis. In the main text, we present the results from a single tree as we found that the conclusions drawn across trees remained unchanged due to the similarity in the presence, direction and magnitude of the causal relationships identified. We present the results for both analyses using the alternative MCC trees in the supplementary material (Extended Data Figs. 5 and 6 and Supplementary Tables 710).

Testing models of dependent versus independent evolution and estimating evolutionary transition rates using BayesTraits

We tested for correlated evolution between number of worker castes and both colony size and queen mating frequency. We used the Discrete module with reverse jump MCMC estimation implemented in BayesTraits V4. This method requires pairs of binary traits, and so we transformed all traits into binary variables. We classified species as having either a single worker caste or multiple worker castes, to focus on the evolution of multiple castes. We classified species as having queens that mated with less than or equal to two males (≤2 males) or more than two males (>2 males), because (1) the influence of larger numbers of matings on relatedness is diminishing; (2) an effective mating frequency value of 2 leads to the average level of relatedness among workers in the colony being midway between the maximal and minimal values of 0.75 and 0.25 that are possible with a single queen; and (3) mating with 2 males is the threshold at which worker policing is favoured20. We classified species as having either small or large colonies depending upon whether colony size was smaller (≤300) or larger (>300) than the median colony size. To assess the sensitivity of our results to different colony size classification thresholds, we repeated our analysis, but divided species into classes based on the 40th/60th and 60th/40th quantile boundaries. Across these thresholds, we obtained the same result that large colony size was necessary for the evolution of multiple worker castes. Therefore, we present the results of the 40th/60th and 60th/40th only in the supplementary material (Supplementary Tables 23 and 24). In all cases, we tested the fit of the independent model of evolution against the dependent model of evolution using BFs (2 × (log(likelihood of the dependent model) − log(likelihood of independent model))). We estimated the marginal likelihood of both the independent model and the dependent model using a stepping-stone sampler77. As recommended in the Bayestraits manual, we scaled tree branch lengths by a factor of 0.001. This helps prevent the estimated transition rates from becoming very small, which can make the values hard to estimate or search for. We accounted for phylogenetic uncertainty in our analysis by resampling each iteration from the posterior distribution of 400 trees to estimate parameters.

To reduce uncertainty over prior selection, we used a hyper prior approach to seed the mean and variance of an exponential prior29. The values for the hyper priors were drawn from a uniform distribution ranging from 0 to 100, based on the estimated range of transition rates determined by using analyses with maximum likelihood estimation. In addition, we investigated the sensitivity of our models to the selection of priors by running models with gamma priors initialized using hyper priors. We found that the conclusions drawn between models using different priors remained the same. Therefore, we only present the results from the models using the gamma priors in the supplementary materials (Supplementary Tables 25 and 26). All models were run for at least 11,000,000 iterations with 1,000,000 iterations of burn-in and a thinning interval of 5,000. However, some models were run for 110,000,000 with 10,000,000 iterations of burn-in and a thinning interval of 50,000 iterations to improve chain convergence and mixing. Model convergence was assessed using the same approach described in ‘Statistical analyses’.

We identified correlated evolution between colony size and number of worker castes. Therefore, using the transition rates estimated by the model, we examined whether transitions to multiple worker castes were influenced by small or large colony sizes, as well as whether transitions to large colony sizes were influenced by number of worker castes. We found that queen mating frequency and number of worker castes evolved independently. Therefore, we did not explore the potential causal relationship between these traits. To determine the likelihood of transitions occurring, we examined the proportion of models visited by the reverse jump MCMC algorithm in which the rates were set to zero (Supplementary Tables 4, 5, 18 and 2326).

Ancestral state reconstructions

We used ancestral state reconstruction to examine how colony size and queen mating frequency differed between the ancestors of lineages with a single caste and lineages with multiple castes30. First, we reconstructed the ancestral number of worker castes (single/multiple castes) to identify transitions between single and multiple worker castes. Second, we tested whether species that evolved multiple worker castes had greater estimated colony sizes and queen mating frequencies. To account for potential variation in the rate of caste evolution across the ant phylogeny, we used a hidden Markov model technique called hidden rate models (HRMs) to reconstruct the ancestral numbers of castes78. This method enabled us to incorporate heterogeneity in the loss and gain rates of our reconstruction. We used the R package corHMM v2.8 to examine HRMs with one to three rate classes and generated both equal rates and all-rates-different models of evolution for each rate class78. To determine the best HRM model from the candidate set, we examined corrected Akaike Information Criteria (AICc) values for each model across 400 trees using all available species with data on the number of castes. We selected the equal rates model with two rate categories because it had the lowest AICc value over 57.5% of the 400 trees. We then ran the equal rates model with two rate categories for each analysis, examining the relationship between colony size and number of worker castes and queen mating frequency and number of worker castes. Next, we classified each node in the phylogeny depending on which state (single/multiple castes) had the highest likelihood, resulting in four possible classification categories for each node: (1) single caste with all descendants having a single caste, (2) single caste with at least one descendant having multiple castes, (3) multiple castes with all descendants having multiple castes and (4) multiple castes with at least one descendant having a single caste.

The nodal classifications were entered as an explanatory variable in a BPMM and treated as a fixed factor with four levels. Depending on whether we were analysing the relationship between number of worker castes and either colony size or queen mating frequency, we specified either colony size or queen mating frequency as the response variable and used a phylogenetic covariance matrix that was linked to ancestral nodes as a random effect. We removed the global intercept and estimated the colony size and queen mating frequency values before the evolutionary origin of multiple worker castes (comparison of classification 1 versus 2). To account for phylogenetic uncertainty in our analysis, we repeated this process 400 times, once for each tree from our sample of 400 phylogenetic trees. We then combined the resulting posterior samples across the models to calculate parameter estimates. Accurate estimation of ancestral colony size and queen mating frequency values required phylogenetic signal in these traits. Using BPMMs, we estimated the amount of variation in each trait that can be attributed to shared ancestry between species, calculated as phylogenetic heritability (phyloh2=VAVA+VR×100, where VA = phylogenetic variance, VR = residual variance). For each trait, we repeated the analysis 400 times, each time with a different tree, and combined the posterior samples of the variance components before phylogenetic heritability was estimated.

In addition, we estimated the ancestral number of discrete worker castes (one to four worker castes) using HRMs. We ran each model over a sample of 400 trees and used the same model selection approach for HRMs as described above. The symmetric model with 2 rate classes performed the best over 58.5% of trees. We then estimated the frequency of transitions between each number of castes, classifying each node based on the state with the highest likelihood estimate.

Sensitivity analysis

To assess the robustness of our findings to different methods of quantifying queen mating frequency and number of queens per colony, we performed a sensitivity analysis. We repeated our phylogenetic correlations analysis, treating queen mating frequency and number of queens per colony as categorical variables with three factor levels. This approach was chosen based on evidence indicating that these categorizations may reflect discrete reproductive strategies that may have evolved for different reasons and align with previous studies21,25,33,79,80. They were categorized as follows: obligate single mating (monandry), facultative multiple mating (facultative polyandry) and obligate multiple mating (obligate polyandry); and obligate single queen (monogyny), facultative multiple queens (facultative polygyny) and obligate multiple queens (obligate polygyny)79,80.

We also repeated our phylogenetic path analysis and transition rate analysis. However, we transformed queen mating frequency and number of queens per colony into discrete binary variables, as these analytical methods cannot accept categorical variables with more than two factor levels. Species were classified as being obligately polyandrous or not, and obligately polygynous or not. We combined monandry with facultative polyandry, and monogyny with facultative polygyny to avoid excluding a large proportion of species from our analysis. We combined these categories because species that are obligately polyandrous or polygynous have made an evolutionary transition where mature colonies can no longer possess a monandrous or monogynous queen. By contrast, species that are facultatively polyandrous or polygynous may still show monandry or monogyny. Detailed statistics for each analysis are provided in the supplementary material (Extended Data Figs. 810 and Supplementary Tables 1119).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

Reporting Summary (2.2MB, pdf)
Peer Review File (15.8MB, pdf)
Supplementary Tables (151.6KB, xlsx)

Supplementary Tables 1–26.

Acknowledgements

We thank R. Bonifacii, J. Green, T. Scott, A. Dewar and M. Liu for their helpful comments, M. Liu for producing ant silhouettes, C. Cornwallis for advice on performing ancestral state reconstruction and M. Brindle for advice on performing transition rate analysis. We thank the Global Ant Genomics Alliance consortium for making their data on ant life history traits available to us, and the Biotechnology and Biological Sciences Research Council (BB/M011224/1: L.B.-R.) and European Research Council (834164: S.A.W. and J.F.R.T.) for funding.

Extended data

Author contributions

Conceptualization: S.A.W., L.R. and G.D.A.W. Data collection: L.B.-R., J.F.R.T. and L.R. Methodology: L.B.-R., J.F.R.T., S.A.W., G.D.A.W. and P.A.D. Investigation: L.B.-R., J.F.R.T., S.A.W. and G.D.A.W. Visualization: L.B.-R., J.F.R.T. and S.A.W. Funding acquisition: S.A.W. Supervision: S.A.W. and G.D.A.W. Writing—original draft: L.B.-R., J.F.R.T. and S.A.W. Writing—review and editing: L.B.-R., J.F.R.T., S.A.W., G.D.A.W., P.A.D. and L.R.

Peer review

Peer review information

Nature Ecology & Evolution thanks Jacobus Boomsma, Solomon Tin Chi Chak, Joel Vizueta and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Data availability

All data are provided in Supplementary Table 20 and are made available, along with their associated references, via the public repository, Oxford University Research Archive (ORA) at https://ora.ox.ac.uk/objects/uuid:453278a3-61b9-4144-b08a-98d9379d5ce8 (ref. 81).

Code availability

R code for analyses is available via GitHub at https://github.com/LouisBell-Roberts/Larger_colony_sizes_favoured_the_evolution_of_more_worker_castes_in_ants.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

is available for this paper at 10.1038/s41559-024-02512-7.

Supplementary information

The online version contains supplementary material available at 10.1038/s41559-024-02512-7.

References

  • 1.Boomsma, J. J. Domains and Major Transitions of Social Evolution (Oxford Univ. Press, 2023).
  • 2.Bourke, A. F. G. Principles of Social Evolution (Oxford Univ. Press, 2011).
  • 3.Szathmáry, E. & Smith, J. M. The major evolutionary transitions. Nature374, 227–232 (1995). [DOI] [PubMed] [Google Scholar]
  • 4.West, S. A., Fisher, R. M., Gardner, A. & Kiers, E. T. Major evolutionary transitions in individuality. Proc. Natl Acad. Sci. USA112, 10112–10119 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cooper, G. A. & West, S. A. Division of labour and the evolution of extreme specialization. Nat. Ecol. Evol.2, 1161–1167 (2018). [DOI] [PubMed] [Google Scholar]
  • 6.Oster, G. F. & Wilson, E. O. Caste and Ecology in the Social Insects (Princeton Univ. Press, 1978). [PubMed]
  • 7.Bell, G. & Mooers, A. O. Size and complexity among multicellular organisms. Biol. J. Linn. Soc.60, 345–363 (1997). [Google Scholar]
  • 8.Bonner, J. T. Perspective: the size–complexity rule. Evolution58, 1883–1891 (2004). [DOI] [PubMed] [Google Scholar]
  • 9.Fisher, R. M., Shik, J. Z. & Boomsma, J. J. The evolution of multicellular complexity: the role of relatedness and environmental constraints. Proc. R. Soc. B287, 20192963 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cooper, G. A., Liu, M., Peña, J. & West, S. A. The evolution of mechanisms to produce phenotypic heterogeneity in microorganisms. Nat. Commun.11, 14475–14489 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ulrich, Y., Saragosti, J., Tokita, C. K., Tarnita, C. E. & Kronauer, D. J. C. Fitness benefits and emergent division of labour at the onset of group living. Nature560, 635–638 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ferguson-Gow, H., Sumner, S., Bourke, A. F. G. & Jones, K. E. Colony size predicts division of labour in attine ants. Proc. R. Soc. B281, 20141411 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lecocq de Pletincx, N., Dellicour, S. & Aron, S. The evolution of ant worker polymorphism correlates with multiple social traits. Behav. Ecol. Sociobiol.75, 1–11 (2021). [Google Scholar]
  • 14.Dornhaus, A., Powell, S. & Bengston, S. Group size and its effects on collective organization. Annu. Rev. Entomol.57, 123–141 (2012). [DOI] [PubMed] [Google Scholar]
  • 15.Bonner, J. T. Dividing the labour in cells and societies. Curr. Sci.64, 459–466 (1993). [Google Scholar]
  • 16.Anderson, C. & McShea, D. W. Individual versus social complexity, with particular reference to ant colonies. Biol. Rev. Camb. Philos. Soc.76, 211–237 (2001). [DOI] [PubMed] [Google Scholar]
  • 17.Wills, B. D., Powell, S., Rivera, M. D. & Suarez, A. V. Correlates and consequences of worker polymorphism in ants. Annu. Rev. Entomol.63, 575–598 (2018). [DOI] [PubMed] [Google Scholar]
  • 18.Fjerdingstad, E. J. & Crozier, R. H. The evolution of worker caste diversity in social insects. Am. Nat.167, 390–400 (2006). [DOI] [PubMed] [Google Scholar]
  • 19.Frumhoff, P. C. & Ward, P. S. Individual-level selection, colony-level selection, and the association between polygyny and worker monomorphism in ants. Am. Nat.139, 559–590 (1992). [Google Scholar]
  • 20.Ratnieks, F. L. W. Reproductive harmony via mutual policing by workers in eusocial Hymenoptera. Am. Nat.132, 217–236 (1988). [Google Scholar]
  • 21.Boomsma, J. J. & Ratnieks, F. L. W. Paternity in eusocial Hymenoptera. Phil. Trans. R. Soc. Lond. B351, 947–975 (1996). [Google Scholar]
  • 22.Garamszegi, L. Z. Modern Phylogenetic Comparative Methods and Their Application in Evolutionary Biology (Springer, 2014).
  • 23.Bourke, A. F. G. & Franks, N. R. Social Evolution in Ants (Princeton Univ. Press, 1995).
  • 24.Blanchard, B. D. & Moreau, C. S. Defensive traits exhibit an evolutionary trade‐off and drive diversification in ants. Evolution71, 315–328 (2017). [DOI] [PubMed] [Google Scholar]
  • 25.Hughes, W. O. H., Ratnieks, F. L. W. & Oldroyd, B. P. Multiple paternity or multiple queens: two routes to greater intracolonial genetic diversity in the eusocial Hymenoptera. J. Evol. Biol.21, 1090–1095 (2008). [DOI] [PubMed] [Google Scholar]
  • 26.Hölldobler, B. & Wilson, E. O. The Ants (Harvard Univ. Press, 1990).
  • 27.Burchill, A. T. & Moreau, C. S. Colony size evolution in ants: macroevolutionary trends. Insectes Soc.63, 291–298 (2016). [Google Scholar]
  • 28.Gonzalez-Voyer, A. & von Hardenberg, A. An introduction to phylogenetic path analysis. in Modern Phylogenetic Comparative Methods and Their Application in Evolutionary Biology (ed. Garamszegi, L. Z.) 201–229 (Springer, 2014).
  • 29.Pagel, M. & Meade, A. Bayesian analysis of correlated evolution of discrete characters by reversible-jump Markov chain Monte Carlo. Am. Nat.167, 808–825 (2006). [DOI] [PubMed] [Google Scholar]
  • 30.Cornwallis, C. K. et al. Cooperation facilitates the colonization of harsh environments. Nat. Ecol. Evol.1, 0057 (2017). [DOI] [PubMed] [Google Scholar]
  • 31.Grüter, C. et al. Repeated evolution of soldier sub-castes suggests parasitism drives social complexity in stingless bees. Nat. Commun.8, 4 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.West, M. & Purcell, J. Task partitioning in ants lacking discrete morphological worker subcastes. Behav. Ecol. Sociobiol.74, 66 (2020). [Google Scholar]
  • 33.Boomsma, J. J., Kronauer, D. J. C. & Pedersen, J. S. The evolution of social insect mating systems. in Organization of Insect Societies: From Genome to Sociocomplexity (eds. Gadau, J. & Fewell, J. H.) 3–25 (Harvard Univ. Press, 2009).
  • 34.Liu, M., West, S. A. & Cooper, G. A. Relatedness and the evolution of mechanisms to divide labor in microorganisms. Ecol. Evol.11, 14475–14489 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Baer, B. & Schmid-Hempel, P. Experimental variation in polyandry affects parasite loads and fitness in a bumble-bee. Nature397, 151–154 (1999). [Google Scholar]
  • 36.Cremer, S., Armitage, S. A. O. & Schmid-Hempel, P. Social immunity. Curr. Biol.17, 693–702 (2007). [DOI] [PubMed] [Google Scholar]
  • 37.Hamilton, W. D. Kinship, recognition, disease, and intelligence: constraints of social evolution. in Animal Societies: Theories and Facts (eds Ito, Y., Brown, J. L. & Kikkawa, J.) 81–102 (Japan Scientific Societies Press, 1987).
  • 38.Hughes, W. O. H. & Boomsma, J. J. Genetic diversity and disease resistance in leaf‐cutting ant societies. Evolution58, 1251–1260 (2004). [DOI] [PubMed] [Google Scholar]
  • 39.Ugelvig, L. V., Kronauer, D. J. C., Schrempf, A., Heinze, J. & Cremer, S. Rapid anti-pathogen response in ant societies relies on high genetic diversity. Proc. R. Soc. B277, 2821–2828 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Schmid-Hempel, P. Parasites in Social Insects (Princeton Univ. Press, 1998).
  • 41.Loope, K. J., Chien, C. & Juhl, M. Colony size is linked to paternity frequency and paternity skew in yellowjacket wasps and hornets. BMC Evol. Biol.14, 1–12 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Naug, D. & Camazine, S. The role of colony organization on pathogen transmission in social insects. J. Theor. Biol.215, 427–439 (2002). [DOI] [PubMed] [Google Scholar]
  • 43.Grüter, C., Menezes, C., Imperatriz-Fonseca, V. L. & Ratnieks, F. L. W. A morphologically specialized soldier caste improves colony defense in a neotropical eusocial bee. Proc. Natl Acad. Sci. USA109, 1182–1186 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Feener, D. H., Lighton, J. R. B. & Bartholomew, G. A. Curvilinear allometry, energetics and foraging ecology: a comparison of leaf-cutting ants and army ants. Funct. Ecol.2, 509–520 (1988). [Google Scholar]
  • 45.Molet, M., Wheeler, D. E. & Peeters, C. Evolution of novel mosaic castes in ants: modularity, phenotypic plasticity, and colonial buffering. Am. Nat.180, 328–341 (2012). [DOI] [PubMed] [Google Scholar]
  • 46.Noirot, C. H. & Pasteels, J. M. Ontogenetic development and evolution of the worker caste in termites. Experientia43, 851–860 (1987). [Google Scholar]
  • 47.Brown, M. J. F. & Schmid-Hempel, P. The evolution of female multiple mating in social hymenoptera. Evolution57, 2067–2081 (2003). [DOI] [PubMed] [Google Scholar]
  • 48.Revely, L., Eggleton, P., Clement, R., Zhou, C. & Bishop, T. R. The diversity of social complexity in termites. Proc. R. Soc. B291, 20232791 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Fisher, R. M., Cornwallis, C. K. & West, S. A. Group formation, relatedness, and the evolution of multicellularity. Curr. Biol.23, 1120–1125 (2013). [DOI] [PubMed] [Google Scholar]
  • 50.Howe, J., Cornwallis, C. K. & Griffin, A. S. Conflict-reducing innovations in development enable increased multicellular complexity. Proc. R. Soc. B291, 20232466 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Matte, A. & LeBoeuf, A. C. Ruling the unruly: Innovation in ant larval feeding led to increased caste dimorphism and social complexity. Preprint at bioRxiv10.1101/2022.12.08.519655 (2024).
  • 52.Bolton, B. An online catalog of the ants of the world. AntCathttps://antcat.org (2023).
  • 53.Nielsen, R., Tarpy, D. R. & Reeve, H. K. Estimating effective paternity number in social insects and the effective number of alleles in a population. Mol. Ecol.12, 3157–3164 (2003). [DOI] [PubMed] [Google Scholar]
  • 54.Boomsma, J. J. et al. The Global Ant Genomics Alliance (GAGA). Myrmecol. News25, 61–66 (2017). [Google Scholar]
  • 55.Pamilo, P. Polyandry and allele frequency differences between the sexes in the ant Formica aquilonia. Heredity70, 472–480 (1993). [Google Scholar]
  • 56.Wade, M. J. The influence of multiple inseminations and multiple foundresses on social evolution. J. Theor. Biol.112, 109–121 (1985). [DOI] [PubMed] [Google Scholar]
  • 57.Ross, K. G. The breeding system of the fire ant Solenopsis invicta: effects on colony genetic structure. Am. Nat.141, 554–576 (1993). [DOI] [PubMed] [Google Scholar]
  • 58.Antweb. Version 8.91.2. (California Academy of Science, 2023).
  • 59.Schneider, C. A., Rasband, W. S. & Eliceiri, K. W. NIH image to ImageJ: 25 years of image analysis. Nat. Methods9, 671–675 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Helanterä, H. Supercolonies of ants. Myrmecol. News32, 1–22 (2022). [Google Scholar]
  • 61.Buschinger, A. Social parasitism among ants: a review (Hymenoptera: Formicidae). Myrmecol. News12, 219–235 (2009). [Google Scholar]
  • 62.Rabeling, C. & Kronauer, D. J. C. Thelytokous parthenogenesis in eusocial Hymenoptera. Annu. Rev. Entomol.58, 273–292 (2013). [DOI] [PubMed] [Google Scholar]
  • 63.Monnin, T. & Peeters, C. How many gamergates is an ant queen worth? Naturwissenschaften95, 109–116 (2008). [DOI] [PubMed] [Google Scholar]
  • 64.Schwander, T., Lo, N., Beekman, M., Oldroyd, B. P. & Keller, L. Nature versus nurture in social insect caste differentiation. Trends Ecol. Evol.25, 275–282 (2010). [DOI] [PubMed] [Google Scholar]
  • 65.Economo, E. P., Narula, N., Friedman, N. R., Weiser, M. D. & Guénard, B. Macroecology and macroevolution of the latitudinal diversity gradient in ants. Nat. Commun.9, 1778 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.R Core Team. R: a language and environment for statistical computing. http://www.R-project.org/ (R Foundation for Statistical Computing, 2020).
  • 67.Hadfield, J. D. MCMC methods for multi-response generalized linear mixed models: the MCMCglmm R package. J. Stat. Softw.33, 1–22 (2010).20808728 [Google Scholar]
  • 68.Plummer, M., Best, N., Cowles, K. & Vines, K. CODA: convergence diagnosis and output analysis for MCMC. R News6, 7–11 (2006). [Google Scholar]
  • 69.Gelman, A. & Rubin, D. B. Inference from iterative simulation using multiple sequences. Stat. Sci.7, 457–472 (1992). [Google Scholar]
  • 70.de Villemereuil, P., Gimenez, O. & Doligez, B. Comparing parent–offspring regression with frequentist and Bayesian animal models to estimate heritability in wild populations: a simulation study for Gaussian and binary traits. Methods Ecol. Evol.4, 260–275 (2013). [Google Scholar]
  • 71.Nakagawa, S. & Schielzeth, H. A general and simple method for obtaining R2 from generalized linear mixed-effects models. Methods Ecol. Evol.4, 133–142 (2013). [Google Scholar]
  • 72.Shipley, B. Cause and Correlation in Biology: A User’s Guide to Path Analysis, Structural Equations and Causal Inference with R (Cambridge Univ. Press, 2016).
  • 73.Crozier, R. H. & Fjerdingstad, E. J. Polyandry in social Hymenoptera – disunity in diversity? Ann. Zool. Fenn.38, 267–285 (2001). [Google Scholar]
  • 74.Shipley, B. The AIC model selection method applied to path analytic models compared using a d-separation test. Ecology94, 560–564 (2013). [DOI] [PubMed] [Google Scholar]
  • 75.van der Bijl, W. phylopath: easy phylogenetic path analysis in R. PeerJ6, e4718 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Pagel, M. Inferring the historical patterns of biological evolution. Nature401, 877–884 (1999). [DOI] [PubMed] [Google Scholar]
  • 77.Xie, W., Lewis, P. O., Fan, Y., Kuo, L. & Chen, M.-H. Improving marginal likelihood estimation for Bayesian phylogenetic model selection. Syst. Biol.60, 150–160 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Beaulieu, J. M., O’Meara, B. C. & Donoghue, M. J. Identifying hidden rate changes in the evolution of a binary morphological character: the evolution of plant habit in campanulid angiosperms. Syst. Biol.62, 725–737 (2013). [DOI] [PubMed] [Google Scholar]
  • 79.Boomsma, J. J. Beyond promiscuity: mate-choice commitments in social breeding. Phil. Trans. R. Soc. B368, 20120050 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Boomsma, J. J., Huszár, D. B. & Pedersen, J. S. The evolution of multiqueen breeding in eusocial lineages with permanent physically differentiated castes. Anim. Behav.92, 241–252 (2014). [Google Scholar]
  • 81.Bell-Roberts, L. Larger colony sizes favoured the evolution of more worker castes in ants. ORA 10.5287/ora-erx7kmw6a (2024). [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reporting Summary (2.2MB, pdf)
Peer Review File (15.8MB, pdf)
Supplementary Tables (151.6KB, xlsx)

Supplementary Tables 1–26.

Data Availability Statement

All data are provided in Supplementary Table 20 and are made available, along with their associated references, via the public repository, Oxford University Research Archive (ORA) at https://ora.ox.ac.uk/objects/uuid:453278a3-61b9-4144-b08a-98d9379d5ce8 (ref. 81).

R code for analyses is available via GitHub at https://github.com/LouisBell-Roberts/Larger_colony_sizes_favoured_the_evolution_of_more_worker_castes_in_ants.


Articles from Nature Ecology & Evolution are provided here courtesy of Nature Publishing Group

RESOURCES