Abstract
Given the need to predict the outcomes of (co)evolution in host-associated microbiomes, whether microbial and host fitnesses tend to trade-off, generating conflict, remains a pressing question. Examining the relationships between host and microbe fitness proxies at both the phenotypic and genomic levels can illuminate the mechanisms underlying interspecies cooperation and conflict. We examined naturally occurring genetic variation in 191 strains of the model microbial symbiont Sinorhizobium meliloti, paired with each of two host Medicago truncatula genotypes in single- or multi-strain experiments to determine how multiple proxies of microbial and host fitness were related to one another and test key predictions about mutualism evolution at the genomic scale, while also addressing the challenge of measuring microbial fitness. We found little evidence for interspecies fitness conflict; loci tended to have concordant effects on both microbe and host fitnesses, even in environments with multiple co-occurring strains. Our results emphasize the importance of quantifying microbial relative fitness for understanding microbiome evolution and thus harnessing microbiomes to improve host fitness. Additionally, we find that mutualistic coevolution between hosts and microbes acts to maintain, rather than erode, genetic diversity, potentially explaining why variation in mutualism traits persists in nature.
Keywords: interspecies conflict, legume–rhizobium symbiosis, GWAS, mutualism, pleiotropy
1. Introduction
Recent advances in sequencing and microbiome research have revealed the ubiquity of microbial symbioses, meaning that many important host phenotypes, such as plant yield in agriculture or disease-related traits in humans, are actually symbiotic extended phenotypes, their variation being influenced by loci present within interacting microbial symbionts in addition to the host [1–6]. When loci influence fitness-related traits of both host and symbiont, which we henceforth refer to as symbiotic pleiotropy, they determine the degree to which partners' fitnesses are aligned (i.e. same-sign or concordant effects) or in conflict (i.e. opposite-sign or discordant effects). Identifying the loci underlying symbiotic pleiotropy is, therefore, critical not only for illuminating the genetic basis of symbiotic extended phenotypes, but also for predicting how hosts and symbionts coevolve in nature.
In symbiotic mutualisms, wherein partners trade fitness benefits [7], whether fitness conflict or alignment drives the evolution of these interactions is hotly debated [8–13]. Many models of mutualism rely on the key assumption that cooperation is inherently costly and thus that selection should favour less-cooperative, potentially ‘cheating’, partners that forgo paying the costs while continuing to receive benefits [14–17]. At the phenotypic level, cheaters would be seen as symbiont genotypes that gain fitness at the host's expense (i.e. points in the bottom-right quadrant of figure 1a), while fitness conflict would be seen as an overall negative correlation (grey line in figure 1a). By contrast, cooperators would be seen as symbiont genotypes whose increase in fitness is associated with an increase in host fitness (i.e. points in the top right quadrant of figure 1a), while fitness alignment would be seen as an overall positive correlation (orange line in figure 1a). Evidence for fitness conflict within mutualism is mixed: although several studies have identified symbiont genotypes that gain fitness at their host's expense (e.g. [18,19]), recent experimental evolution studies have instead found that microbial adaptation to particular host genotypes is associated with an increase, rather than a decrease, in host fitness [20–22]. Moreover, fitness alignment at the phenotypic level does not necessarily preclude fitness conflict at the genomic level: rather than dichotomous categories of ‘cooperator’ or ‘cheater’, mutualist genomes are best viewed as mosaics of loci [5], some underlying cooperation while others underlie conflict. Whether coevolution resulting from symbiosis leads to more beneficial interactions and greater mutualism stability, or alternatively, more antagonism and less stable interactions, therefore, requires examining the relationships between host and symbiont fitness proxies at both the phenotypic and genomic levels.
Figure 1.
Interpreting patterns of fitness alignment and conflict at both the phenotypic and genomic levels. (a) Phenotypic-level correlations of genotypic (strain) means for rhizobium fitness proxies (e.g. nodule number, nodule size, rhizobium relative fitness) or plant fitness proxies (shoot biomass or leaf chlorophyll A content) on the x- and y-axes, respectively. The positive (orange) or negative (grey) correlations represent fitness alignment or conflict, respectively. (b) Genomic-level correlations of allelic effect sizes determined in genome-wide association studies (GWAS) for rhizobium fitness proxies or plant fitness proxies on the x- and y-axes, respectively. Points appearing in orange or grey quadrants represent pleiotropic variants with concordant (same-sign) or discordant (opposite-sign) effects, respectively, on both host and symbiont. (c) Plant and rhizobium fitness proxies measured, parentheses indicating the experiment type for which proxies were measured (SS, single-strain; MS, multi-strain). (d) Rhizobium fitness proxies corresponding to the pink rhizobium strain. In single-strain experiments, nodule number indicates the number of reproductive cells released per plant, while nodule size indicates the number of reproductive cells released per nodule. In the multi-strain experiment, rhizobium relative fitness is a composite metric that combines competition among strains for nodule occupancy and for host resources once in nodules. Nodules pink versus blue in colour are used here to illustrate two different rhizobium strains competing; however, a total of 89 strains were inoculated together onto plants in the multi-strain experiment. (Online version in colour.)
Genome-wide association studies (GWAS) can be used to reveal the genes, as well as specific segregating mutations (i.e. variants), that underlie variation in symbiotic extended phenotypes in natural populations [20,23–27]. Because they provide an estimate of both the strength and direction of effects of particular alleles on the trait, henceforth referred to as allelic effect sizes, GWAS are especially useful for identifying loci underlying symbiotic pleiotropy, and thus defining the mutational landscape of mutualism evolution in nature. For example, if symbiotic pleiotropy is extensive and its effects on fitness-related traits in the interacting partners tend to be discordant (grey quadrants in figure 1b), then conflict should underlie the evolution of mutualism, allowing the possibility of cheating individuals that are competitively superior, as mutualism theory predicts [14–17]. By contrast, if pleiotropic effects are overwhelmingly concordant (orange quadrants in figure 1b), fitness alignment rather than conflict should be the null hypothesis in mutualism, and models relying on cheating genotypes as the main driver of mutualism evolution may not be suitable. Because GWAS have the power to reveal individual allelic effects, they can be used to identify loci underlying fundamental sources of conflict at the genomic level (e.g. grey quadrants in figure 1b) even if fitness alignment is realized at the phenotypic level (e.g. orange line in figure 1a).
A long-standing mutualism paradox is that host-driven selection for the ‘best’ symbiont genotype should reduce overall symbiont diversity, yet diverse symbiont populations persist in nature (reviewed by Heath & Stinchcombe [28] and Stoy et al. [29]). Identifying patterns of selection acting on loci that determine fitness outcomes in natural populations could be key for resolving this paradox. Studies to date examining patterns of molecular variation have found stabilizing or purifying selection acting on candidate genes associated with partner recognition or quality, rather than patterns suggesting rapid turnover of alleles underlying conflict [30–33]. In a recent in silico GWAS [5], conflict over mutualistic trait optima between hosts and microbes tended to increase genetic variance due to repeated sweeps, while alignment scenarios resulted in stabilizing selection and decreased genetic variance as trait optima were reached [5]. Whether the loci associated with symbiotic pleiotropy identified by GWAS in natural symbiont populations show signatures of purifying selection, consistent with fitness conflict, or stabilizing selection, consistent with fitness alignment, remains to be tested.
Over the past approximately 25 years, legume–rhizobium symbioses have been developed as models for understanding mutualism evolution [34–37]. This interaction is one of the most ecologically and economically important symbioses, contributing upwards of 20 million tonnes of nitrogen (N) to the global N-cycle, and saving billions of dollars that would have otherwise been spent on synthetic N fertilizer production [38]. Legumes house rhizobia within specialized root nodules and supply them with photosynthate, while rhizobia within the nodules convert atmospheric N to a plant-usable form, resulting in a beneficial exchange of resources. Key traits of this symbiosis, such as plant biomass and nodule number, are known to be influenced by variants in the genomes of both host and microbes, as well as the epistatic interactions between them (genotype-by-genotype or G × G interactions; e.g. [1,26,34,39–42]), making this symbiosis an excellent model for understanding mutualistic coevolution.
A quantitative comparison of multiple proxies of rhizobium fitness, as well as their correlations with plant fitness, is needed to reveal interspecies conflict. Seed number, or close correlates such as aboveground (shoot) biomass and leaf chlorophyll A content, are well-established proxies for reproductive fitness in annual plants [23,43,44] (figure 1c). By contrast, estimating rhizobium fitness has been an empirical and conceptual challenge [8–10,45,46]. Early attempts to estimate rhizobium fitness relied on single-strain experiments, whereby plants were inoculated with a single rhizobium strain, and fitness proxies including nodule number and nodule size were measured (figure 1d). Both measures reflect rhizobium fitness because a rhizobium that establishes a nodule in symbiosis can gain a fitness benefit of the order of 105–107 [47], larger nodules release more rhizobia [43,46,48], and, intuitively, strains that produce more nodules will release more rhizobia [1,37,48]. While rhizobium fitness proxies measured in single-strain experiments can be directly correlated with host benefit (e.g. shoot biomass), these measures have been criticized for producing spurious positive correlations due to stronger fitness feedbacks between host plants and rhizobia [9]. By contrast, multi-strain experiments decouple individual strain fitness from host growth and better reflect rhizobia fitness in natural and agricultural soils, where many strains coexist. Recent advances merging population genomics and metagenomics have enabled measuring rhizobium relative fitness, i.e. a strain's ability to compete for nodulation opportunities and extract host resources once in nodules when other strains are present [25,49]. Multi-strain experiments that have estimated rhizobium relative fitness have so far hinted at a surprising lack of correlation between single-strain and multi-strain measures of rhizobium fitness [25], motivating our comprehensive analysis here.
Here, we combine two datasets: first, GWAS that test for the association between rhizobium variants and both plant and rhizobium fitness proxies measured in single-strain experiments (figure 1c), whereby 191 strains of Sinorhizobiumm eliloti collected from natural populations were inoculated individually onto one of two genotypes of the host plant Medicago truncatula. Second, a GWAS based on a new dataset that measured rhizobium relative fitness in a multi-strain experiment (figure 1d), whereby 89 of the 191 S. meliloti strains were inoculated together onto the same two host genotypes. We combine these datasets to first ask: What are the relationships among rhizobium fitness proxies measured in both single-strain (nodule number, nodule size) and multi-strain (rhizobium relative fitness) experiments? Are these proxies genetically distinct and thus likely to evolve independently, or are they linked through pleiotropy, and thus, evolve together? We use this information to next address the potential for genomic conflict in this symbiosis, asking: Do variants tend to have aligned or conflicting effects on host and symbiont fitnesses? Finally, we ask whether there is any evidence that historical selection has differentially shaped loci associated with fitness alignment versus conflict, as we might predict under different models of mutualism evolution.
2. Material and methods
(a) . Study system
Full details are provided in Riley et al. [50] and electronic supplementary material, Methods. Sinorhizobium meliloti is an alphaproteobacterium that undergoes symbiosis with various Medicago spp., fixing atmospheric N in return for plant photosynthate. All 191 strains used here were collected from the native range of the host M. truncatula, spanning Spain to France (as detailed in [50]). Sinorhizobium meliloti has a multipartite genome, including a large (approx. 3.7 Mbp) chromosome and two symbiosis plasmids (pSymA and pSymB, approx. 1.4 Mbp and approx. 1.7 Mbp, respectively); pSymA contains many of the canonical genes for nodulation and N-fixation [51,52]. We used two lines of M. truncatula DZA 315.16 (hereafter DZA) and A17 in separate single-strain experiments and a multi-strain experiment detailed below.
(b) . Single-strain experiments
Full details are provided in Batstone et al. [42], and electronic supplementary material, Methods. Briefly, we conducted two separate experiments in September and November 2018, one for each M. truncatula line, DZA and A17, respectively. At least three replicate seeds of each plant line were planted individually into pots (most treatments having four to five plant replicates) and were singly inoculated with one of 191 strains of S. meliloti described above. Within each experiment, we measured proxies for both plant fitness (shoot biomass, leaf chlorophyll A content) and rhizobium fitness (nodule number per nodule weight; figure 1c,d).
(c) . Multi-strain experiment
We grew A17 and DZA hosts with a multi-strain inoculum composed of 89 of the 191 strains used in the single-strain experiments (described in more detail in electronic supplementary material, Methods). Using the ‘select-and-resequence’ approach [25], this experiment allowed us to generate new data on rhizobium relative fitness, which represents a strain's ability to both compete for nodulation opportunities and extract host resources once in nodules when 88 other strains were present (figure 1d). This fitness proxy was obtained by sequencing pooled nodule samples from each plant at the end of the experiment and estimating each strain's frequency using a haplotype reconstruction method [53]. We then obtained each strain's relative fitness by calculating the fold-change in the frequency of a strain at the end of the experiment relative to its mean frequency at the beginning (see electronic supplementary material, Methods). Because we wanted our rhizobium relative fitness metric to represent which strains will be present in the soil in subsequent generations, our method focuses on measuring strain frequencies of undifferentiated rhizobia in nodule pools; while differentiated bacteroids are responsible for fixing N, they are reproductively sterile, and thus do not contribute to the next generation. Although the plant and microbe growth conditions used here differed slightly from those used in the single-strain experiments (owing to these experiments being conducted in separate places and at separate times), previous work has shown that strain frequencies are stable in response to environmental variation within an experiment, between experiments, and across host generations [49,54]. Thus, environmental variation due to differences in growth conditions is unlikely to significantly influence how strains behave across experiments.
(d) . Phenotypic analyses
As described in Batstone et al. [42], we calculated the estimated marginal means for each fitness proxy (i.e. nodule number, nodule size, rhizobium relative fitness, shoot biomass, leaf chlorophyll A content) in each experiment, correcting for the effect of rack, using the emmeans package (v. 1.4.1, [55]) in the R environment [56]. We then conducted linear pairwise regressions (lm option in base R) for each fitness proxy against the other within each plant line.
(e) . DNA isolation, whole-genome sequencing, and variant calling
Detailed methods are provided in Riley et al. [50] and electronic supplementary material, Methods. We obtained DNA from each of the 191 rhizobium isolates, sequenced their full genomes including the chromosome and two symbiosis plasmids, used a common reference-based assembly pipeline to align sequences and call single nucleotide polymorphisms (SNPs), and filtered the resulting SNPs based on sequence quality, depth, minor allele frequency and missingness, resulting in a total of 36 526 filtered SNPs.
(f) . Genome-wide association tests
Detailed methods are provided in electronic supplementary material, Methods. We conducted linear mixed models (LMMs) implemented in GEMMA (v. 0.98.1, [57]) to quantify allelic effect sizes that represent both the strength and direction of the association between variants and fitness proxies after correcting for rhizobium population structure. We ran ten separate association analyses in GEMMA, one for each of the four fitness proxies measured in single-strain experiments (figure 1c), plus rhizobium relative fitness measured in the multi-strain experiment, and for both host plant lines (DZA, A17; 5 proxies × 2 hosts = 10 runs).
(g) . Genomic analyses
We first identified pleiotropic variants as those that were significantly associated with more than one trait on the same host line, significance determined using a permutation method (described in more detail in electronic supplementary material, Methods). Variants that were significantly associated with two or more rhizobium fitness proxies were categorized as being associated with rhizobium fitness pleiotropy, whereas variants that were significantly associated with at least one rhizobium AND plant fitness proxy were categorized as being associated with symbiotic pleiotropy. We further categorized whether pleiotropic variants had discordant (opposite-sign; +,− or −,+) or concordant (same-sign; +,+ or −,−) effects on pairwise fitness proxy combinations (figure 1b). Finally, to test whether selection acted on genes containing variants associated with rhizobium fitness and symbiotic pleiotropy, we used the R package PopGenome (v. 2.7.5, [58]) to compute several commonly used test statistics that can detect signatures of historical selection and/or departures from neutrality, namely, nucleotide diversity (i.e. π), Tajima's D, as well as Fu & Li's F and D [59,60]. Additional details appear in electronic supplementary material, Methods.
3. Results
(a) . Relationships among rhizobium fitness proxies
In single-strain experiments, when we regressed strain means for nodule number and nodule weight, we found significant negative correlations for both hosts (figure 2a, left), indicating that strains creating larger nodules tended to form fewer total nodules on both host genotypes. At the genomic level, most variants had discordant effects (figure 2a, right), similarly indicating a trade-off whereby variants that were positively associated with nodule weight tended to be negatively associated with nodule number, or vice-versa.
Figure 2.
Trade-offs among rhizobia (Sinorhizobium meliloti) fitness proxies prevail at both the phenotypic (left panels) and genomic levels (right panels). Phenotypic: Genetic correlations between pairwise fitness proxies measured on plant lines DZA (green, top rows) or A17 (pink, bottom rows), based on 89 S. meliloti strains. Dots represent estimated marginal strain means for nodule number and nodule weight, both being measured in single-strain experiments, or medians for rhizobium relative fitness measured in multi-strain experiments. Numbers at top right of each correlation represent Pearson correlation coefficients, while asterisks represent significance: *, p < 0.05; **, p < 0.01; ***, p < 0.001. Genomic: Dots represent allelic effect sizes (i.e. beta-scores calculated in GEMMA); those falling along the diagonal (orange quadrants) or off-diagonal (grey quadrants) represent variants with concordant or discordant effects, respectively. Coloured dots represent variants that were significantly associated with one of the two fitness proxies, while black dots represent pleiotropic variants, i.e. significantly associated with both fitness proxies. Numbers outside and percentages within parentheses at the top left and right of each plot represent the pleiotropic variant counts and the proportion of total significant variants, respectively, that are discordant (left, in grey) or concordant (right, in orange). (Online version in colour.)
Comparing single- and multi-strain experiments, when we regressed strain means for rhizobium relative fitness and nodule number, we found a weak but significant negative relationship for host line DZA only (figure 2b, left), indicating that strains that were more common in the multi-strain experiment formed fewer nodules in the single-strain experiment. At the genomic level on DZA, most variants had discordant effects (figure 2b, right), again indicating a trade-off between nodule number and rhizobium relative fitness. For host line A17, most pleiotropic variants (20/28) had discordant effects (figure 2b, right), despite no relationship between these fitness proxies at the phenotypic level (figure 2b, left).
We found a significant, positive relationship between strain means for rhizobium relative fitness and nodule weight, again for DZA only (figure 2c, left). This result indicates that strains that were more commonly found in nodules in the multi-strain experiment formed larger nodules in the single-strain experiment. At the genomic level, all pleiotropic variants were concordant (i.e. appearing in the top right or bottom left quadrants of figure 2c, right). For A17, while we found a lack of significant phenotypic correlation between nodule weight and rhizobium relative fitness (figure 2c, left), at the genomic level all pleiotropic variants had discordant effects on these fitness proxies (figure 2c, right), indicating a trade-off at this scale.
These results overall support trade-offs at the phenotypic level, plus underlying discordant pleiotropy at the genomic level, suggesting that strains for which plants form numerous small nodules in single-strain experiments are less able to proliferate within and compete for nodules in multi-strain experiments. We additionally found host-dependent relationships among rhizobium fitness proxies, especially at the genomic level when regressing rhizobium relative fitness and nodule weight (figure 2c, right). However, we found fewer total pleiotropic variants associated with this relationship compared with the other two fitness proxy combinations (electronic supplementary material, figure S2A, right), suggesting that nodule weight and rhizobium relative fitness are largely governed by different molecular mechanisms and, thus, likely to evolve independently.
(b) . Relationships between plant and rhizobium fitness proxies
We found no correlation, in either host, between strain means for nodule number and shoot biomass (figure 3a, left), or nodule number and leaf chlorophyll A content on A17 (electronic supplementary material, figure S3A, left). However, we found a significantly negative correlation between nodule number and chlorophyll A for DZA (electronic supplementary material, figure S3A, left). Assuming chlorophyll A is related to the N-fixation efficiency of each strain [61–63], this result suggests that strains forming more numerous (and smaller) nodules tended to fix less N. At the genomic level, most variants had concordant effects for nodule number and shoot biomass on DZA, whereas the opposite was true for the same proxy pair on A17 (figure 3a, right). Most variants had discordant effects on nodule number and chlorophyll A on both hosts (electronic supplementary material, figure S3A, right).
Figure 3.
Fitness alignment between rhizobia (Sinorhizobium meliloti) and host (Medicago truncatula) prevails at both the phenotypic (left panels) and genomic (right panels) levels. Phenotypic: Genetic correlations between pairwise fitness proxies measured on plant lines DZA (green, top rows) or A17 (pink, bottom rows), based on 89 S. meliloti strains. Dots represent estimated marginal strain means for shoot biomass, nodule number and nodule weight, all being measured in single-strain experiments, or medians for rhizobium relative fitness measured in multi-strain experiments. Numbers at the top right of each correlation represent Pearson correlation coefficients, while asterisks represent significance: *, p < 0.05; **, p < 0.01; ***, p < 0.001. Genomic: Dots represent allelic effect sizes (i.e. beta-scores calculated in GEMMA), those falling along the diagonal (orange quadrants) or off-diagonal (grey quadrants) represent variants with concordant or discordant effects, respectively. Coloured dots represent variants that were significantly associated with one of the two fitness proxies, while black dots represent pleiotropic variants, i.e. significantly associated with both fitness proxies. Numbers outside and percentages within parentheses at the top left and right of each plot represent the pleiotropic variant counts and the proportion of total significant variants, respectively, that are discordant (left, in grey) or concordant (right, in orange). (Online version in colour.)
Strain means for nodule weight and shoot biomass were significantly positively correlated for both hosts (figure 3b, left). Nodule weight and chlorophyll A were significantly positively correlated on DZA, but uncorrelated on A17 (electronic supplementary material, figure S3B, left). At the genomic level for both hosts, all pleiotropic variants had concordant effects for nodule weight and shoot biomass (figure 3b, right), as well as nodule weight and chlorophyll A (electronic supplementary material, figure S3B, right).
Finally, rhizobium relative fitness and shoot biomass were significantly positively correlated for DZA but not for A17 (figure 3c, left), whereas rhizobium relative fitness and leaf chlorophyll A content were significantly positively correlated on both hosts (electronic supplementary material, figure S3C, left). At the genomic level, most pleiotropic variants had concordant effects on rhizobium relative fitness and shoot biomass for both hosts (figure 3c, right), and this pattern was even stronger between rhizobium relative fitness and chlorophyll A on DZA, with all but one pleiotropic variant having concordant effects (electronic supplementary material, figure S3C right). Variant effects were mixed on A17 for rhizobium relative fitness and leaf chlorophyll A content (electronic supplementary material, figure S3C, right).
Overall, we found mostly concordant relationships between rhizobium and plant fitnesses at both the phenotypic and genomic levels, suggesting a strong signal of fitness alignment in natural rhizobium populations. However, we note a lack of pleiotropic variants underlying the relationship between leaf chlorophyll A content and both rhizobium fitness proxies measured in single-strain experiments (i.e. nodule number and nodule weight; electronic supplementary material, figure S2C, left and middle), suggesting these proxies are governed by different molecular mechanisms, and thus should evolve independently.
(c) . Selection acts differently on genes associated with alignment versus conflict
Using multiple diversity and neutrality metrics, we found that rhizobium genes associated with fitness alignment exhibit higher nucleotide diversity and stronger signatures of balancing selection compared with any other gene category analysed. For three of the four test statistics, genes associated with concordant symbiotic pleiotropy (i.e. solid orange lines in figure 4) had significantly elevated values relative to the ‘null’ (i.e. distributions in figure 4, all genes containing significant variants identified by GWAS). We did not see any significant deviations from the null for genes associated with discordant symbiotic pleiotropy (i.e. dashed grey lines in figure 4) or for genes associated with both concordant and discordant rhizobium fitness pleiotropy (electronic supplementary material, table S1, figure S4 and Dataset S1).
Figure 4.
Neutrality statistics show elevated values for genes associated with fitness alignment. Vertical lines represent the average values calculated for four separate statistics, grey for discordant and orange for concordant genes. Distributions represent the same statistics calculated for all genes containing significant variants based on GWAS. Dashed and solid lines represent non-significant (p > 0.1) and significant (p < 0.1) differences, respectively, between each focal gene category and all significant genes (i.e. distributions). (Online version in colour.)
(d) . Loci associated with fitness alignment are host-dependent
Comparing the identities and putative functions of variants associated with symbiotic pleiotropy on both host lines revealed little overlap—concordant variants associated with an increase in both high host and symbiont fitness (i.e. fitness alignment) largely differed between the two host genotypes. Specifically, we identified a total of 168 variants associated with symbiotic pleiotropy, corresponding to 128 coding-genes (see electronic supplementary material, Dataset S2 for variant-level and electronic supplementary material, table S2 and Dataset S3 for gene-level summaries); 60 and 93 of these variants were uniquely associated with fitness proxies measured on plant lines A17 and DZA, respectively, while only 15 variants were shared between hosts. We highlight some of the noteworthy genes identified in our analysis for each pleiotropic category in electronic supplementary material, Results.
4. Discussion
Leveraging genomics to quantify the genetic architecture underlying symbiotic extended phenotypes gives us the power to address long-standing issues in mutualism evolution with genome-scale resolution. Our results overall suggest that: (i) fitness alignment between hosts and rhizobia is common at both the phenotypic and genomic levels, with genes associated with fitness alignment showing elevated nucleotide diversity and signatures of balancing selection; and (ii) the lack of a relationship or even trade-offs between rhizobium nodule number and rhizobium relative fitness means that measures of rhizobium fitness in multi-strain experiments should be prioritized when we want to predict rhizobium evolution. We discuss these main points in turn below.
(a) . Alignment of host and symbiont fitnesses
In one-to-many symbioses, such as a single legume associating with a diverse population of rhizobia, less beneficial symbionts are predicted to achieve higher relative fitness compared with more beneficial counterparts [14–16]. While we found evidence for less beneficial rhizobia (i.e. points closer to zero along the y-axes of panels in figure 3), we found little evidence for ‘cheating’ rhizobia genotypes or loci associated with a gain in fitness at the host's expense (i.e. a lack of points in the bottom right quadrants of figure 3 and few discordant loci associated with symbiotic pleiotropy). Instead, we found strong fitness alignment even in environments where multiple strains occupy the same plant.
Fitness alignment is ultimately governed by the degree to which mutualistic trait optima are shared among partners [5], as well as the degree of fitness feedbacks that enforce alignment between partners [11,14–17]. For example, legume host plants have autoregulation of nodulation to limit the formation of costly nodules once sufficient N levels are achieved [64]. No such constraint exists for rhizobia; every nodule formed is expected to lead to greater potential fitness benefits. Thus, a mismatch between the optimum nodule number for a plant versus rhizobium could generate conflict [5,65]. Indeed, the strongest evidence for conflict in our and other studies (e.g. [1,44,48]) comes from regressing plant fitness proxies on nodule number, suggesting conflict over a host's total investment in nodulation. In addition to controlling the total number of nodules formed, legumes can also allocate more carbon to nodules that fix more nitrogen [35,66,67], which acts to couple rhizobium quality and fitness even when multiple strains are present. Such fitness-coupling mechanisms can be disrupted by mismatches among legume and rhizobium genotypes, allowing strains that fix little to no N to proliferate within nodules [19,68,69]. Our observations of abundant alignment between host fitness proxies, nodule size, and rhizobium relative fitness would predict that such mismatches may be rare in nature, although we only included two host lines and neither was likely to naturally co-occur with our rhizobia strains. Overall, our results suggest that trait optima can be shared even in one-to-many interactions, and that fitness feedbacks operating to align host and symbiont fitness can be maintained even in diverse communities irrespective of coevolutionary history.
(b) . Genomic resolution of conflict and alignment in symbiosis
Rather than to identify causal loci underlying symbiotic pleiotropy, the goal of our study was to examine broader patterns of fitness alignment or conflict across the genomes of numerous naturally occurring rhizobium strains. We found abundant concordant pleiotropic variants associated with both host and symbiont fitnesses, alongside evidence that selection has acted to maintain genetic variation within rhizobium genomes through time and/or space. A lack of discordant pleiotropic variants, like we have found, may have resulted from physiological constraints that make such variants impossible (i.e. alleles associated with larger nodules and small host biomass are rare or non-existent), or because of correlational selection that removes discordant variants from the population altogether [70,71]. In addition to direction, the extent of symbiotic pleiotropy (i.e. the number of pleiotropic variants) can inform whether traits are likely to evolve together or independently. Fitness proxy pairs with abundant pleiotropy (e.g. nodule size versus shoot biomass) suggest a highly polygenic basis governed by many small-effect pleiotropic variants; other proxies with little pleiotropy (e.g. nodule number versus chlorophyll) are largely governed by different sets of genetic mechanisms and are thus likely to evolve independently.
One of our more critical findings, that the loci associated with both high host and symbiont fitness benefits (i.e. fitness alignment) largely differed across host genotypes, provides one solution to the mutualism paradox: if host genotypes act as distinct selective environments for rhizobia, meaning that the ‘best’ symbiont genotype differs among host genotypes, then symbiont diversity could be maintained in the face of host selection. Host-dependent loci associated with fitness alignment were also found in experimentally evolved rhizobia isolates [20], and host-mediated balancing selection was previously proposed as a mechanism maintaining rhizobial diversity in native populations of Bradyrhizobium [72]. Our results of heightened nucleotide diversity and signatures of balancing selection acting on genes underlying fitness alignment additionally support the role of host-dependent selection in maintaining diverse symbiont populations in nature.
We nonetheless found several instances of fitness conflict at the genomic level despite alignment at the phenotypic level (e.g. shoot biomass versus rhizobium relative fitness on DZA). Linkage disequilibrium (LD) could lead to discordant alleles being ‘packaged’ into multilocus genotypes that show fitness alignment. Such LD results from multiple, non-mutually exclusive factors including epistatic interactions among individual variants that render discordant variants effectively neutral, and/or past selection favouring allelic combinations that increase both host and symbiont fitness (and disfavour discordant combinations; [73,74]).
Overall, these findings highlight the polygenic nature of symbiotic extended phenotype variation in nature—where the collective action of individual mutations, their additive and nonadditive effects [42], and a history of selection shapes the trait variation currently present in natural populations [75,76]. However, because of the limitations inherent within GWAS, including LD which makes it difficult to pinpoint causal variants as well as false positive and negative associations, we acknowledge that the function of any specific locus identified here needs to be further validated in follow-up experiments. Additionally, we only focus on allelic substitutions rather than presence/absence variation generated via gene gain and loss, the latter of which previous studies have found to be associated with exploitative traits in rhizobia (e.g. [67]). Whether fitness conflict is predominantly associated with genetic variation generated via gene gain/loss rather than allelic substitution remains to be tested in our dataset.
(c) . Trade-offs among rhizobium fitness proxies and the rhizobium competition problem
Understanding how microbial symbioses, which are ubiquitous and important in nature, evolve (or coevolve) requires accurate estimates of symbiont fitness in ecologically realistic settings. Given our evidence at both the phenotypic and genomic levels for trade-offs among rhizobium fitness proxies, and because diverse strains of rhizobia co-occur in nature, relative fitness proxies should be used whenever possible [25]. Nevertheless these proxies are not replacements for those measured in single-strain experiments because they cannot be used to assign individual genetic means for whole-plant measures of host benefit (plant biomass and seed number) to individual strains (e.g. [1,26,39,42]), necessitating that host benefit and rhizobium fitness be measured on separate individuals.
Together our results suggest that the genetic architectures associated with rhizobium fitness proxies, and their relationships, are host-dependent, and thus that their evolutionary trajectories are influenced not only by variation in the rhizobium's genome, but also by variation in host traits. For example, host genotypes could differ in their ability to exert sanctions or partner choice, quantitative traits known to vary in host populations [44,77]. At the genomic level, distinct variants underlying rhizobium fitness pleiotropy on each host genotype suggest that the genetic mechanisms (i.e. genes, pathways, metabolic processes) governing the relationship between fitness proxies are largely non-overlapping when rhizobia interact with different hosts. Such host genotype-dependent shifts in the rhizobium genetic architecture of these symbiotic extended phenotypes are supported by transcriptomics studies of G × G interactions [25,40] and GWAS revealing distinct sets of candidate genes in different host backgrounds [26,42]. Similar genetic variation exists in hosts [78] and undoubtedly interacts with the variants we identify here, and thus, should be accounted for if we want to uncover the multi-genomic basis of symbiotic extended phenotypes.
Data accessibility
Strains and plant lines are available upon request. All raw data and analysis code are available on GitHub (https://github.com/rtbatstone/complex_genetics_of_symbiosis—see ‘Genetics_conflict_cooperation’ folder). The electronic supplementary material [79] contains additional methods, results, figures and tables, as well as descriptions of datasets, all of which can be accessed both from bioRχiv (https://www.biorxiv.org/content/10.1101/2021.07.19.452989v6.supplementary-material) and from GitHub (https://github.com/rtbatstone/complex_genetics_of_symbiosis/tree/main/Genetics_conflict_cooperation/Supp_files). Raw sequence reads and assemblies for the 191 strains are available at NCBI (BioProject: PRJNA501532), with metadata available in [50].
Acknowledgements
For their insightful feedback on the manuscript, we thank four anonymous reviewers and an associate editor. For greenhouse help, we thank Alex Riley, Hanna Lindgren, Cassandra Allsup, Laura Goralka, Mannix Burns, Brendan Epstein and Amy-Marshall Colon, as well as the greenhouse staff, particularly Debbie Black. For feedback on the results and analyses, we thank Peter Tiffin, Brendan Epstein, and Robert Williamson.
Authors' contributions
R.T.B.: conceptualization, data curation, formal analysis, funding acquisition, investigation, methodology, project administration, software, validation, visualization, writing—original draft, writing—review and editing; L.T.B.: conceptualization, funding acquisition, investigation, methodology, validation, writing—original draft, writing—review and editing; K.D.H.: conceptualization, funding acquisition, validation, writing—original draft, writing—review and editing.
All authors gave final approval for publication and agreed to be held accountable for the work performed therein.
Conflict of interest declaration
We declare we have no competing interests.
Funding
We acknowledge funding from the National Science Foundation (K.D.H.: IOS-1645875 and NPGI-1401864; L.T.B.: IOS-1724993 and IOS-1856744), a Carl R. Woese Institute for Genomic Biology postdoctoral fellowship to R.T.B. and strain sequencing by Joint Genome Institute (CSP-1223795). This work was additionally supported by the USDA National Institute of Food and Agriculture Federal Appropriations under Project PEN04760 and Accession no. 1025611.
References
- 1.Heath KD. 2010. Intergenomic epistasis and coevolutionary constraint in plants and rhizobia. Evolution 64, 1446-1458. ( 10.1111/j.1558-5646.2009.00913.x) [DOI] [PubMed] [Google Scholar]
- 2.Friesen ML, Porter SS, Stark SC, Von Wettberg EJ, Sachs JL, Martinez-Romero E. 2011. Microbially mediated plant functional traits. Annu. Rev. Ecol. Evol. Syst. 42, 23-46. ( 10.1146/annurev-ecolsys-102710-145039) [DOI] [Google Scholar]
- 3.Gould AL, et al. 2018. Microbiome interactions shape host fitness. Proc. Natl Acad. Sci. USA 115, E11 951-E11 960. ( 10.1073/pnas.1809349115) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lynch J, Hsiao E. 2019. Microbiomes as sources of emergent host phenotypes. Science 365, 1405-1409. ( 10.1126/science.aay0240) [DOI] [PubMed] [Google Scholar]
- 5.O'Brien AM, Jack CN, Friesen ML, Frederickson ME. 2021. Whose trait is it anyways? Coevolution of joint phenotypes and genetic architecture in mutualisms. Proc. R. Soc. B 288, 20202483. ( 10.1098/rspb.2020.2483) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Batstone RT. 2021. Genomes within genomes: nested symbiosis and its implications for plant evolution. New Phytol. 234, 28-34. ( 10.1111/nph.17847) [DOI] [PubMed] [Google Scholar]
- 7.Bronstein JL. 2015. Mutualism. Oxford, UK: Oxford University Press. [Google Scholar]
- 8.Friesen ML. 2012. Widespread fitness alignment in the legume–rhizobium symbiosis. New Phytol. 194, 1096-1111. ( 10.1111/j.1469-8137.2012.04099.x) [DOI] [PubMed] [Google Scholar]
- 9.Kiers ET, Ratcliff WC, Denison RF. 2013. Single-strain inoculation may create spurious correlations between legume fitness and rhizobial fitness. New Phytol. 198, 4-6. ( 10.1111/nph.12015) [DOI] [PubMed] [Google Scholar]
- 10.Friesen ML, Heath KD. 2013. One hundred years of solitude: integrating single-strain inoculations with community perspectives in the legume–rhizobium symbiosis. New Phytol. 198, 7-9. ( 10.1111/nph.12173) [DOI] [PubMed] [Google Scholar]
- 11.Frederickson ME. 2013. Rethinking mutualism stability: cheaters and the evolution of sanctions. Q. Rev. Biol. 88, 269-295. ( 10.1086/673757) [DOI] [PubMed] [Google Scholar]
- 12.Jones EI, et al. 2015. Cheaters must prosper: reconciling theoretical and empirical perspectives on cheating in mutualism. Ecol. Lett. 18, 1270-1284. ( 10.1111/ele.12507) [DOI] [PubMed] [Google Scholar]
- 13.Frederickson ME. 2017. Mutualisms are not on the verge of breakdown. Trends Ecol. Evol. 32, 727-734. ( 10.1016/j.tree.2017.07.001) [DOI] [PubMed] [Google Scholar]
- 14.Bull JJ, Rice W. 1991. Distinguishing mechanisms for the evolution of co-operation. J. Theor. Biol. 149, 63-74. ( 10.1016/S0022-5193(05)80072-4) [DOI] [PubMed] [Google Scholar]
- 15.West SA, Kiers ET, Pen I, Denison R. 2002. Sanctions and mutualism stability: when should less beneficial mutualists be tolerated? J. Evol. Biol. 15, 830-837. ( 10.1046/j.1420-9101.2002.00441.x) [DOI] [Google Scholar]
- 16.West SA, Kiers ET, Simms EL, Denison RF. 2002. Sanctions and mutualism stability: why do rhizobia fix nitrogen? Proc. R. Soc. Lond. B 269, 685-694. ( 10.1098/rspb.2001.1878) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Foster KR, Kokko H. 2006. Cheating can stabilize cooperation in mutualisms. Proc. R. Soc. B 273, 2233-2239. ( 10.1098/rspb.2006.3571) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Sachs J, Ehinger M, Simms E. 2010. Origins of cheating and loss of symbiosis in wild Bradyrhizobium. J. Evol. Biol. 23, 1075-1089. ( 10.1111/j.1420-9101.2010.01980.x) [DOI] [PubMed] [Google Scholar]
- 19.Crook MB, Lindsay DP, Biggs MB, Bentley JS, Price JC, Clement SC, Clement MJ, Long SR, Griffitts JS. 2012. Rhizobial plasmids that cause impaired symbiotic nitrogen fixation and enhanced host invasion. Mol. Plant Microbe Interact. 25, 1026-1033. ( 10.1094/MPMI-02-12-0052-R) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Batstone RT, O'Brien AM, Harrison TL, Frederickson ME. 2020. Experimental evolution makes microbes more cooperative with their local host genotype. Science 370, 476-478. ( 10.1126/science.abb7222) [DOI] [PubMed] [Google Scholar]
- 21.Quides KW, Weisberg AJ, Trinh J, Salaheldine F, Cardenas P, Lee H-H, Jariwala R, Chang JH, Sachs JL. 2021. Experimental evolution can enhance benefits of rhizobia to novel legume hosts. Proc. R. Soc. B 288, 20210812. ( 10.1098/rspb.2021.0812) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Li E, Howell SEL, Brady M, Xu X, Mcneil K. 2021. Rapid evolution of bacterial mutualism in the plant rhizosphere. Nat. Commun. 12, 3829. ( 10.1038/s41467-021-24005-y) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Friesen ML, et al. 2014. The ecological genomic basis of salinity adaptation in tunisian Medicago truncatula. BMC Genom. 15, 1160. ( 10.1186/1471-2164-15-1160) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Klinger CR, Lau JA, Heath KD. 2016. Ecological genomics of mutualism decline in nitrogen-fixing bacteria. Proc. R. Soc. B 283, 20152563. ( 10.1098/rspb.2015.2563) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Burghardt LT, Epstein B, Guhlin J, Nelson MS, Taylor MR, Young ND, Sadowsky MJ, Tiffin P. 2018. Select and resequence reveals relative fitness of bacteria in symbiotic and free-living environments. Proc. Natl Acad. Sci. USA 115, 2425-2430. ( 10.1073/pnas.1714246115) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Epstein B, Abou-Shanab RAI, Shamseldin A, Taylor MR, Guhlin J, Burghardt LT, Nelson M, Sadowsky MJ, Tiffin P. 2018. Genome-wide association analyses in the model rhizobium Ensifer meliloti. mSphere 3, e00386-18. ( 10.1128/mSphere.00386-18) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Porter SS, Faber-Hammond J, Montoya AP, Friesen ML, Sackos C. 2019. Dynamic genomic architecture of mutualistic cooperation in a wild population of Mesorhizobium. ISME J. 13, 301-315. ( 10.1038/s41396-018-0266-y) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Heath KD, Stinchcombe JR. 2014. Explaining mutualism variation: a new evolutionary paradox? Evolution 68, 309-317. ( 10.1111/evo.12292) [DOI] [PubMed] [Google Scholar]
- 29.Stoy KS, Gibson AK, Gerardo NM, Morran LT. 2020. A need to consider the evolutionary genetics of host–symbiont mutualisms. J. Evol. Biol. 33, 1656-1668. ( 10.1111/jeb.13715) [DOI] [PubMed] [Google Scholar]
- 30.Kimbrel JA, Thomas WJ, Jiang Y, Creason AL, Thireault CA, Sachs JL, Chang JH. 2013. Mutualistic co-evolution of type III effector genes in Sinorhizobium fredii and Bradyrhizobium japonicum. PLoS Pathog. 9, e1003204. ( 10.1371/journal.ppat.1003204) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Grillo MA, De Mita S, Burke PV, Solórzano-Lowell KL, Heath KD. 2016. Intrapopulation genomics in a model mutualist: population structure and candidate symbiosis genes under selection in Medicago truncatula. Evolution 70, 2704-2717. ( 10.1111/evo.13095) [DOI] [PubMed] [Google Scholar]
- 32.Yoder JB. 2016. Understanding the coevolutionary dynamics of mutualism with population genomics. Am. J. Bot. 103, 1742-1752. ( 10.3732/ajb.1600154) [DOI] [PubMed] [Google Scholar]
- 33.Epstein B, Tiffin P. 2021. Comparative genomics reveals high rates of horizontal transfer and strong purifying selection on rhizobial symbiosis genes. Proc. R. Soc. B 288, 20201804. ( 10.1098/rspb.2020.1804) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Parker MA. 1995. Plant fitness variation caused by different mutualist genotypes. Ecology 76, 1525-1535. ( 10.2307/1938154) [DOI] [Google Scholar]
- 35.Kiers ET, Rousseau RA, West SA, Denison RF. 2003. Host sanctions and the legume–rhizobium mutualism. Nature 425, 78-81. ( 10.1038/nature01931) [DOI] [PubMed] [Google Scholar]
- 36.Simms EL, Taylor DL, Povich J, Shefferson RP, Sachs JL, Urbina M, Tausczik Y. 2006. An empirical test of partner choice mechanisms in a wild legume–rhizobium interaction. Proc. R. Soc. B 273, 77-81. ( 10.1098/rspb.2005.3292) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Heath KD, Tiffin P. 2009. Stabilizing mechanisms in a legume–rhizobium mutualism. Evolution 63, 652-662. ( 10.1111/j.1558-5646.2008.00582.x) [DOI] [PubMed] [Google Scholar]
- 38.Foyer CH, et al. 2016. Neglecting legumes has compromised human health and sustainable food production. Nat. Plants 2, 16112. ( 10.1038/nplants.2016.112) [DOI] [PubMed] [Google Scholar]
- 39.Burdon J, Thrall P. 1999. Spatial and temporal patterns in coevolving plant and pathogen associations. Am. Nat. 153, S15-S33. ( 10.1086/303209) [DOI] [PubMed] [Google Scholar]
- 40.Heath KD, Burke PV, Stinchcombe JR. 2012. Coevolutionary genetic variation in the legume-rhizobium transcriptome. Mol. Ecol. 21, 4735-4747. ( 10.1111/j.1365-294X.2012.05629.x) [DOI] [PubMed] [Google Scholar]
- 41.Burghardt LT, Guhlin J, Chun CL, Liu J, Sadowsky MJ, Stupar RM, Young ND, Tiffin P. 2017. Transcriptomic basis of genome by genome variation in a legume-rhizobia mutualism. Mol. Ecol. 26, 6122-6135. ( 10.1111/mec.14285) [DOI] [PubMed] [Google Scholar]
- 42.Batstone RT, Lindgren HK, Allsup CM, Goralka LA, Riley AB, Grillo MA, Marshall-Colon A, Heath KD. 2021. The complex genetics of symbiotic extended phenotypes across multiple environments in a model mutualism. bioRχiv, 2021.08.03.454976. ( 10.1101/2021.08.03.454976) [DOI] [PMC free article] [PubMed]
- 43.Heath KD, Tiffin P. 2007. Context dependence in the coevolution of plant and rhizobial mutualists. Proc. R. Soc. B 274, 1905-1912. ( 10.1098/rspb.2007.0495) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Simonsen AK, Stinchcombe JR. 2014. Standing genetic variation in host preference for mutualist microbial symbionts. Proc. R. Soc. B 281, 20142036. ( 10.1098/rspb.2014.2036) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Denison RF. 2000. Legume sanctions and the evolution of symbiotic cooperation by rhizobia. Am. Nat. 156, 567-576. ( 10.1086/316994) [DOI] [PubMed] [Google Scholar]
- 46.Ratcliff WC, Underbakke K, Denison RF. 2011. Measuring the fitness of symbiotic rhizobia. Symbiosis 55, 85-90. ( 10.1007/s13199-011-0150-2) [DOI] [Google Scholar]
- 47.Denison RF, Kiers ET. 2011. Life histories of symbiotic rhizobia and mycorrhizal fungi. Curr. Biol. 21, R775-R785. ( 10.1016/j.cub.2011.06.018) [DOI] [PubMed] [Google Scholar]
- 48.Porter SS, Simms EL. 2014. Selection for cheating across disparate environments in the legume-rhizobium mutualism. Ecol. Lett. 17, 1121-1129. ( 10.1111/ele.12318) [DOI] [PubMed] [Google Scholar]
- 49.Burghardt LT, Epstein B, Tiffin P. 2019. Legacy of prior host and soil selection on rhizobial fitness in planta. Evolution 73, 2013-2023. ( 10.1111/evo.13807) [DOI] [PubMed] [Google Scholar]
- 50.Riley AB, Grillo MA, Epstein B, Tiffin P, Heath KD. 2022. Discordant population structure within rhizobium divided genomes and between rhizobia and legume hosts in their native range. Authorea. ( 10.22541/au.164637769.91740255) [DOI] [PubMed] [Google Scholar]
- 51.Finan TM, et al. 2001. The complete sequence of the 1,683-kb pSymB megaplasmid from the N2-fixing endosymbiont Sinorhizobium meliloti. Proc. Natl Acad. Sci. USA 98, 9889-9894. ( 10.1073/pnas.161294698) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Galibert F, et al. 2001. The composite genome of the legume symbiont Sinorhizobium meliloti. Science 293, 668-672. ( 10.1126/science.1060966) [DOI] [PubMed] [Google Scholar]
- 53.Kessner D, Turner TL, Novembre J. 2013. Maximum likelihood estimation of frequencies of known haplotypes from pooled sequence data. Mol. Biol. Evol. 30, 1145-1158. ( 10.1093/molbev/mst016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Burghardt LT, Epstein B, Hoge M, Trujillo D, Tiffin P. 2021. Environment dependence of rhizobial relative fitness in the legume-rhizobia symbiosis. bioRχiv, 2020.11.20.392183. ( 10.1101/2020.11.20.392183) [DOI] [PMC free article] [PubMed]
- 55.Searle SR, Speed FM, Milliken GA. 1980. Population marginal means in the linear model: an alternative to least squares means. Am. Stat. 34, 216-221. ( 10.1080/00031305.1980.10483031) [DOI] [Google Scholar]
- 56.R Core Team. 2016. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. See http://www.R-project.org/. [Google Scholar]
- 57.Zhou X, Stephens M. 2014. Efficient multivariate linear mixed model algorithms for genome-wide association studies. Nat. Methods 11, 407-409. ( 10.1038/nmeth.2848) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Pfeifer B, Wittelsbürger U, Ramos-Onsins SE, Lercher MJ. 2014. PopGenome: an efficient Swiss army knife for population genomic analyses in R. Mol. Biol. Evol. 31, 1929-1936. ( 10.1093/molbev/msu136) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Tajima F. 1989. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123, 585-595. ( 10.1093/genetics/123.3.585) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Fu YX, Li WH. 1993. Statistical tests of neutrality of mutations. Genetics 133, 693-709. ( 10.1093/genetics/133.3.693) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bullock D, Anderson D. 1998. Evaluation of the Minolta SPAD 502 chlorophyll meter for nitrogen management in corn. J. Plant Nutr. 21, 741-755. ( 10.1080/01904169809365439) [DOI] [Google Scholar]
- 62.Swiader JM, Moore A. 2002. SPAD-chlorophyll response to nitrogen fertilization and evaluation of nitrogen status in dryland and irrigated pumpkins. J. Plant Nutr. 25, 1089-1100. ( 10.1081/PLN-120003941) [DOI] [Google Scholar]
- 63.Weese DJ, Heath KD, Dentinger BT, Lau JA. 2015. Long-term nitrogen addition causes the evolution of less-cooperative mutualists. Evolution 69, 631-642. ( 10.1111/evo.12594) [DOI] [PubMed] [Google Scholar]
- 64.Reid DE, Ferguson BJ, Hayashi S, Lin YH, Gresshoff PM. 2011. Molecular mechanisms controlling legume autoregulation of nodulation. Ann. Bot. 108, 789-795. ( 10.1093/aob/mcr205) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Sachs JL, Quides KW, Wendlandt CE. 2018. Legumes versus rhizobia: a model for ongoing conflict in symbiosis. New Phytol. 219, 1199-1206. ( 10.1111/nph.15222) [DOI] [PubMed] [Google Scholar]
- 66.Regus JU, Quides KW, O'neill MR, Suzuki R, Savory EA, Chang JH, Sachs JL. 2017. Cell autonomous sanctions in legumes target ineffective rhizobia in nodules with mixed infections. Am. J. Bot. 104, 1299-1312. ( 10.3732/ajb.1700165) [DOI] [PubMed] [Google Scholar]
- 67.Westhoek A, et al. 2021. Conditional sanctioning in a legume–rhizobium mutualism. Proc. Natl Acad. Sci. USA 118, e2025760118. ( 10.1073/pnas.2025760118) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Price PA, Tanner HR, Dillon BA, Shabab M, Walker GC, Griffitts JS. 2015. Rhizobial peptidase Hrrp cleaves host-encoded signaling peptides and mediates symbiotic compatibility. Proc. Natl Acad. Sci. USA 112, 15 244-15 249. ( 10.1073/pnas.1417797112) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Wendlandt CE, et al. 2022. Negotiating mutualism: a locus for exploitation by rhizobia has a broad effect size distribution and context-dependent effects on legume hosts. J. Evol. Biol. 35, 844-854. ( 10.1111/jeb.14011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Guillaume F, Whitlock MC. 2007. Effects of migration on the genetic covariance matrix. Evolution 61, 2398-2409. ( 10.1111/j.1558-5646.2007.00193.x) [DOI] [PubMed] [Google Scholar]
- 71.Saltz JB, Hessel FC, Kelly MW. 2017. Trait correlations in the genomics era. Trends Ecol. Evol. 32, 279-290. ( 10.1016/j.tree.2016.12.008) [DOI] [PubMed] [Google Scholar]
- 72.Parker MA. 2012. Legumes select symbiosis island variants in Bradyrhizobium. Mol. Ecol. 21, 1769-1778. ( 10.1111/j.1365-294X.2012.05497.x) [DOI] [PubMed] [Google Scholar]
- 73.Sinervo B, Svensson E. 2002. Correlational selection and the evolution of genomic architecture. Heredity 89, 329-338. ( 10.1038/sj.hdy.6800148) [DOI] [PubMed] [Google Scholar]
- 74.Jones AG, Arnold SJ, Bürger R. 2003. Stability of the G-matrix in a population experiencing pleiotropic mutation, stabilizing selection, and genetic drift. Evolution 57, 1747-1760. ( 10.1111/j.0014-3820.2003.tb00583.x) [DOI] [PubMed] [Google Scholar]
- 75.Conner JK, Hartl DL. 2004. A primer of ecological genetics. Sunderland, MA: Sinauer. [Google Scholar]
- 76.Lynch M, Walsh B. 1998. Genetics and analysis of quantitative traits, vol. 1. Sunderland, MA: Sinauer. [Google Scholar]
- 77.Batstone RT, Dutton EM, Wang D, Yang M, Frederickson ME. 2017. The evolution of symbiont preference traits in the model legume Medicago truncatula. New Phytol. 213, 1850-1861. ( 10.1111/nph.14308) [DOI] [PubMed] [Google Scholar]
- 78.Stanton-Geddes J, et al. 2013. Candidate genes and genetic architecture of symbiotic and agronomic traits revealed by whole-genome, sequence-based association genetics in Medicago truncatula. PLoS ONE 8, e65688. ( 10.1371/journal.pone.0065688) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Batstone RT, Burghardt LT, Heatha KD. 2022. Phenotypic and genomic signatures of interspecies cooperation and conflict in naturally occurring isolates of a model plant symbiont. Figshare. ( 10.6084/m9.figshare.c.6070338) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Strains and plant lines are available upon request. All raw data and analysis code are available on GitHub (https://github.com/rtbatstone/complex_genetics_of_symbiosis—see ‘Genetics_conflict_cooperation’ folder). The electronic supplementary material [79] contains additional methods, results, figures and tables, as well as descriptions of datasets, all of which can be accessed both from bioRχiv (https://www.biorxiv.org/content/10.1101/2021.07.19.452989v6.supplementary-material) and from GitHub (https://github.com/rtbatstone/complex_genetics_of_symbiosis/tree/main/Genetics_conflict_cooperation/Supp_files). Raw sequence reads and assemblies for the 191 strains are available at NCBI (BioProject: PRJNA501532), with metadata available in [50].




