Abstract
In the mutualism between leguminous plants and rhizobial bacteria, rhizobia live inside root nodules, creating potential for host genes to shape the rhizobial selective environment. Many host genes that affect symbiosis have been identified; however, the extent to which these genes affect selection acting on rhizobia is unknown. In this study, we inoculated 18 Medicago truncatula symbiotic mutants (including mutants that alter Nodule Cysteine-Rich (NCR) peptide production, plant defence, and nodule number regulation) with a mixture of 86 Sinorhizobium meliloti strains. Most mutations resulted in reduced host benefits, but the effects on rhizobial benefit (i.e. relative strain fitness) varied widely, revealing widespread host-by-strain fitness interactions. Genome-wide association analyses identified variants on rhizobial replicons pSymA and pSymB as important in mediating strain fitness responses to host mutations. Whereas most top variants affected rhizobial fitness with one host mutation (limited effect variants), nine affected fitness across six or more host mutations. These pervasive variants occurred primarily on pSymA, the symbiotic replicon, and include fixL and some metabolic genes. In contrast to the limited effect variants, variants with pervasive positive effects on strain fitness when host genes were mutated tended to adversely affect fitness in wild-type hosts. Competition assays across Medicago genotypes confirmed a pervasive role for one candidate (malonyl-CoA synthase), and AlphaFold multimer modelling suggests that many rhizobial top candidates could interact with host NCR peptides. Our results reveal how host genetic mutations alter strain fitness, setting the stage for improving rhizobial inoculants and breeding legume hosts better adapted to multi-strain environments.
Keywords: rhizobia, medicago, GWAS, select and resequence, strain fitness
Introduction
Many bacteria live in close association with eukaryotic hosts, and a subset of these bacterial taxa significantly influence host fitness [1]. If host genetic variation alters selective pressures on bacteria, that variation can shape bacterial evolution. These genome x genome (GxG) interactions are an essential prerequisite for co-evolution, and a potential mechanism for maintaining genetic variation in bacterial and host populations [2]. Most examples of (GxG) interaction involve pathogens, including a host immune receptor and microbial targets (e.g. human HLA and HIV epitopes [3]) or plant R genes [4]. However, examinations of quantitative pathogen resistance leveraging techniques such as co-GWAS (Genome Wide Association Studies) [5, 6] suggest complex genetic architectures underlie these host-pathogen interactions [7, 8]. More recently, studies of (GxG) interactions have expanded to mutualisms [9–12]. These studies suggest that (GxG) interactions in mutualisms may be equally, if not more, genetically complex than in parasitism [13–18]. To what extent and in what ways host genes alter bacterial selection has important implications for bacterial evolution and deploying beneficial microbes in translational contexts such as agriculture [19, 20]. Here, we take a novel approach to measuring (GxG) interactions by measuring how over a dozen host mutations alter the selective landscape experienced by scores of strains of mutualistic bacteria.
Legume hosts and rhizobium bacteria are a tractable system for studying how individual host genes alter the selection of their microbial mutualists. Rhizobia inhabit diverse environments, including soil, the rhizosphere, and root nodules, where they experience selection pressures that shape their fitness [21]. In nodules, rhizobia convert atmospheric nitrogen into plant assimilable forms and receive photosynthates for growth [22, 23]. Host genetic variation can influence rhizobial fitness within root nodules [16, 24–26] in ways likely shaped by nodule development and subsequent nitrogen fixation [27]. Decades of work have resulted in extensive collections of mutant lines in model legumes, including Lotus and Medicago, which have been exhaustively used to dissect the functional genetics of nodule formation and function. Most research on these mutants has focused on their effect on host benefits (e.g. shoot biomass) and investment (e.g. nodule number/size). Far less attention has been paid to the effect of host mutations on rhizobia [28]. However, doing so is necessary to clarify the potential for host genes to shape the rhizobial fitness.
Measuring the effect of host genes on rhizobia requires considering three aspects of rhizobial fitness—absolute fitness, relative fitness, and nodule strain diversity. Because most nodules are initiated by a single rhizobium, rhizobial population sizes in nodules indicate the rewards an individual bacterium receives from forming a nodule [28, 29]. However, measuring population size alone does not enable an estimate of fitness difference among strains. Assessing strain-level relative fitness requires comparing strain frequencies before and after exposure to a selective environment, such as host nodules. A Select and Resequence (S&R) approach leverages whole-genome shotgun (WGS) sequencing to enable high-throughput tracking of strain frequencies [25, 30]. This method allows for the direct quantification of selection pressures on rhizobia, revealing how selective an environment is (nodule strain diversity) and which strains are enriched or depleted compared to the initial community (relative strain fitness) [25, 30]. A key advantage of the S&R method is that, because all strains are already sequenced, GWAS can be used to identify genetic variants associated with rhizobial relative fitness [16, 25, 30, 31].
Work in the legume rhizobium mutualism suggests that host genetic variation can alter these metrics of rhizobial fitness [16, 24, 25, 30]. For instance, host mutations that increase plant defence responses (nad1) reduce the number of rhizobia released from each nodule, whereas disrupting genes that cause hosts to form more nodules (sunn1, sunn4, rdn1) can increase rhizobia released at the plant level [32]. Other host genes like rj2 and rj4 in soybean restrict the diversity of strains that form nodules [33, 34]. Disruption of the Sinorhizobium gene encoding HRRP peptidase, potentially involved in processing host NCR peptides, can restrict the rhizobial strains in nodules by modulating host discrimination or defence thresholds [35]. Lastly, disruptions within a host gene can alter relative strain fitness. For instance, members of the NPD gene family in Medicago truncatula differentially influenced which strain frequencies increased in nodules relative to the inoculated strain community [30]. Taken together, these studies highlight the potential for single host genes to reshape the selective environment and alter the evolution of rhizobia.
Here, we characterize how genetically disrupting host genes alters selection on their nitrogen-fixing rhizobium symbionts in planta. To accomplish this goal, we chose 18 existing M. truncatula mutant lines with disruptions in the known symbiotic genes. These genes affect a wide range of host phenotypes and encompass the developmental trajectory of nodulation. We inoculated these mutants with a mixture of 86 Sinorhizobium meliloti strains (Fig. 1). We studied standard host investment and benefit traits, as well as multiple aspects of rhizobial fitness (Table 1). We then used a S&R approach to measure the frequencies of rhizobial strains (relative strain fitness) in nodules [25] and followed up with GWAS, to identify variants in rhizobium genes that altered strain fitness when host genes were perturbed. We examined the following questions: (i) Which host genotypes alter rhizobial fitness? (ii) Do altered host genotypes have strain-specific effects on rhizobial fitness (i.e. is there evidence of genome x genome interactions)? (iii) If yes, which rhizobium genes harbour genetic variation underlying the observed shifts in strain fitness, and are any of those variants selected in multiple host mutations? And (iv) Are any host investment or benefit traits correlated with rhizobial fitness traits? Whereas few host genotypes altered rhizobial population sizes or nodule strain diversity, many hosts did shift relative strain fitness. Most rhizobial genes associated with strain fitness shifts were found to be important within specific hosts, although 18 variants, (primarily on pSymA) were important with six or more host mutations, suggesting candidate genes that may play fundamental roles in in-plant rhizobial fitness.
Figure 1.
Schematic overview of the experiment used to evaluate rhizobial fitness and host traits across different host mutant backgrounds. A set of 86 Sinorhizobium strains was cultured and combined into a mixed inoculum. This inoculum was applied to Medicago truncatula wild-type (A17) and host mutants with gene disruptions that act at different stages of symbiosis: Early, late, and autoregulation of nodulation (AON). The number of host mutants per stage is indicated in parentheses. Host and rhizobial traits were measured at the nodule and whole-plant scales. The experimental conditions are clarified under “experimental details.” Each treatment included five pots per host mutant with three plants per pot, grown in randomized blocks under nitrogen (N)-free conditions for 8 weeks. Traits measured include indicators of rhizobial benefit, host benefit, and host investment (refer to Table 1 for more details). Different coloured nodules indicate nodules formed by different rhizobial strains.
Table 1.
Definitions of the symbiotic traits examined.
| Symbiotic traits | Category | Interpretation |
|---|---|---|
| Colony forming units per nodule | Rhizobial benefit | Individual strain fitness increase from host association |
| Colony forming units per plant | Rhizobial benefit | Host level contribution to rhizobial population growth |
| Nodule strain diversity | Rhizobial benefit | Shannon indices (H) of nodule community reflects host selectivity |
| Strain relative fitness | Rhizobial benefit | Change in strain frequency relative to the inoculum (positive values indicate enriched rhizobial strains) |
| Nodules per plant | Host investment | Number of symbiotic associations initiated by host |
| Average area per nodule | Host investment | Host investment in each nodule |
| Total nodule area per plant | Host investment | Cumulative host investment in nodules |
| Total biomass per plant | Host benefit | Proxy for nitrogen acquired by the host from associating with rhizobia |
| Root-to-shoot ratio | Host benefit | Lower values indicate reduced root investment suggesting plant nitrogen demand was met. |
Materials and methods
Host symbiotic mutants
The development of symbiosis in nodules is complex, with each step regulated by multiple hosts and rhizobium genes [27, 36, 37]. To encompass these steps, we chose 18 M. truncatula symbiosis mutants (Table S1), which we grouped into three categories: early-acting, late-acting, and regulation of nodule number (Autoregulation of nodule number; AON) [38]. The early-acting genes (hcl, ipd3, latd, dmi1–2, dmi2–3, and dmi3–1) affect the initial processes, such as Nod Factor recognition and nodule organogenesis [39–43]. The late-acting genes (nad1, dnf1, dnf2, dnf4, dnf2/dnf1, dnf5/dnf2, dnf6, dnf7) influence nitrogen fixation [32, 44], whereas the regulation of nodule number genes are in the AON (Autoregulation Of Nodulation) pathway (sunn1, sunn4, rdn1) [45, 46]. Whereas these categorizations help set expectations for the impact of the host mutations, each gene may have additional pleiotropic effects. Together, these mutants provide a broad view of the effects of host symbiosis genes on their rhizobial partners in multi-strain inoculations. Seeds for all host mutant lines were obtained from Dr. Dong Wang, Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst. The host mutant lines used here were generated through forward genetic screens and thus may carry additional background mutations. Our aim is not to assign strict causality to individual host mutations but rather to examine how alterations in the host genetic landscape broadly influence rhizobial fitness.
Initial inoculum preparation
We created a mixed inoculum using 86 sequenced S. meliloti (henceforth S. meliloti) strains, which were part of a collection isolated from soils from 21 locations in Spain and France, using the nodules of Medicago plants as a “trap” for capturing S. meliloti isolates [47]. We grew each strain separately for 72 h in Tryptone-Yeast extract (TY) media at 28°C, shaking at 120 rpm, before mixing equal volumes of each culture to create an initial inoculum with roughly equal proportions of each strain. To inoculate plants, we added 60 ml of the multi-strain mixture to 6 litres of sterile 0.85% NaCl solution and applied 10 [7] cells per pot. This density was chosen to reflect S. meliloti populations in field soil [48] and to avoid triggering quorum-sensing mechanisms [49], which typically occur at higher cell densities. We measured rhizobial density in the inoculum by plating a dilution series and counting the number of colonies. We measured the strain frequencies in the inoculum by centrifuging eight 1 ml aliquots at 18 000 × g for 2 min, removing the supernatant, and then freezing the pellet at −20°C before extracting DNA. Initial communities were sequenced, and frequencies inferred (Fig. S1) as outlined in the Rhizobial Fitness section below.
Greenhouse experiment to measure multi-strain symbiotic phenotypes
We grew five replicate pots of each of the 18 Medicago mutants plus wildtype A17 (Fig. 1). Before planting, seeds were scarified by gently abrading with sandpaper, surface sterilized by soaking in a 10% bleach solution for 90 s and rinsed in de-ionized water. Seeds were stratified on wet filter paper at 4°C in the dark for 2 days, then planted in pots containing steam-sterilized Sunshine Mix #5. Germinated seedlings were thinned to three plants per pot. All pots were inoculated 48 h after seed planting with the initial 86-strain inoculum. Following inoculation, pots were randomized into five spatial blocks with one representative of each genotype per block. Plants were watered as needed and fertilized weekly using increasing concentrations of N-free Fårhaeus medium [50]. Uninoculated plants were included to monitor cross-contamination. Pots were harvested and traits measured 8 weeks post-inoculation. We measured nodule number per plant, average area per nodule and total nodulated area per plant as measures of host investment, total biomass per plant and root-to-shoot ratio as measures of host benefit, and population size of rhizobia released per plant and per nodule, nodule strain diversity and relative strain fitness in nodule pools as measures of rhizobial fitness. After 8 weeks, we harvested inoculated plants and pooled nodules from ~3 plants per pot to ensure enough nodules (>100 /pot) to limit stochastic variation and saturate diversity. None of the uninoculated plants formed nodules. We photographed nodule pools per replicate to assess nodule number and size using ImageJ. Plant biomass was measured by weighing the plant roots and shoots after oven-drying at 75°C for 3 days. Because not all plants survived in every pot, the number of plants occasionally varied. To control for this variation, all host traits were normalized per plant for analysis.
Rhizobial fitness
We surface-sterilized the nodules from each pot in 10% bleach for 1 min, rinsed the nodules in sterile water, and homogenized them for 90 s in 4 ml sterile 0.85% sodium chloride solution using an Omni TH tissue homogenizer. We estimated the number of rhizobia released from the nodule pools by dilution plating a 500–1000 ul aliquot of the homogenate and counting the number of colonies formed. We then estimated the absolute fitness of the rhizobia on both a per-plant and per-nodule level by dividing the total number of colonies by the number of plants or nodules harvested from each pot and plotting the log10 transformation of this value. The remaining nodule homogenate was spun at 400 × g for 10 min at 10°C. to pellet the bacteroids and the plant debris. We collected the supernatant enriched for undifferentiated rhizobial cells, pelleted them at 18000 × g for 5 min, pipetted off the supernatant before freezing the pellets at −20°C until DNA extraction. DNA was extracted using a Qiagen DNeasyPlant extraction kit following the manufacturer’s protocol, except we used 3 μL of RNase in the first step and 50 μL of nuclease water for the final elution.
Samples were sequenced using Illumina short-read technology at the Huck Genomic Core Facility at The Pennsylvania State University. Indexed libraries were generated using the Illumina DNA Prep Kit. 12X libraries were pooled and sequenced on a 150 × 150 paired-end NextSeq High Output run, resulting in an average read depth of ~75. We trimmed the reads using TrimGalore! (v0.4.1, www.bioinformatics.babraham.ac.uk/projects/trim_galore/) with default settings, except with minimum read length: 100 bp; quality threshold: 30 (−−quality); and minimum adapter match: 3 (−stringency), used bwa mem (v0.7.17) with default settings to align reads to the S. meliloti USDA1106 genome (NCBI BioProject: PRJNA388336) and estimated strain frequencies using HARP [51]. Strain frequencies were used to (i) estimate nodule strain diversity using Shannon’s H index (via the diversity function from the vegan package in R) and (ii) calculate relative strain fitness of each strain in each replicate of a particular host mutant by taking the log2 of the ratio of the final strain frequency in the nodules to the initial frequency in the inoculum (median initial frequency = 0.0092, 5%–95% quantile: 0.006–0.018). The log transformation puts increases and decreases in proportional representation on the same scale (e.g. a fourfold decrease and increase will have a value of −2 and 2, respectively). Strains with frequency estimates of 0 in the selected communities were assigned a fitness value of −8, less than the lowest measurable reductions in other strains [25].
Statistical analyses
To assess the effects of the host mutants and the symbiotic stage on plant traits and rhizobial fitness, we used linear models and analysis of variance (ANOVA) for the univariate traits and redundancy analysis (RDA) for the multivariate trait of relative strain fitness. All analyses were conducted in the R statistical environment (version 4.3.2). For each univariate trait, we fit two linear models using the lm () function: one that includes host genotype and block, and a second that includes the symbiotic disruption stage and block as additive predictors. We compared variation explained by the two models using the adjusted R [2] (anova() function) to infer whether models that included mutant identity explained more variance than the symbiotic stage alone. Because the genotypes are nested within the stage of symbiosis, we ran two separate linear models to avoid introducing collinearity. We then compared the variance explained by each model. To ensure this comparison was not biased by differences in the number of levels (degrees of freedom) in each predictor, we used the adjusted R [2], which corrects for model complexity and sample size. This approach allows us to assess the relative explanatory power of broad functional categories (stage) versus specific genetic backgrounds (genotypes) on rhizobial community structure. We log10-transformed average nodule area/plant and total nodule area traits to reduce heteroscedasticity of variance. To assess pairwise differences between A17 and each mutant for each trait, we used Tukey’s Honest Significant Difference (HSD) tests. Comparisons with an adjusted P value <0.05 were considered statistically significant. To investigate how host genotype or symbiotic stage altered the relative fitness of rhizobial strains (a multivariate trait), we performed an RDA (rda() function of the vegan package, v2.6–4) using relative strain fitness as the response matrix and host genotype or symbiotic stage as the explanatory variable. The significance of each model was assessed by extracting the adjusted R2 and via permutation-based ANOVA (1000 permutations) using the anova() function with by = "terms." To test whether the relative strain fitness within each mutant differed from A17, we ran the model for each mutant-A17 pairwise contrast. Predictors with P value<0.05 indicated significant changes in strain composition.
Genome-wide association study on relative strain fitness shifts in mutants vs. wildtype nodules
The relative strain fitness shift of each strain for each host mutant was calculated by subtracting the relative strain fitness in A17 from the relative strain fitness in the host mutants. We conducted 15 GWAS (three mutants formed no nodules) to identify rhizobial variants underlying host mutation-dependent strain fitness shifts for each nodulating mutant. To be thorough, we also conducted a GWAS on strain fitness in each host genotype individually, which highlights variants associated with strain success in host nodules generally, as well as the effect of the perturbations. For each of these GWAS, we ran a linear mixed model (LMM) using GEMMA (version 0.98.3). Analyses were run separately for the three replicons (chromosome, pSymA, pSymB) to account for the different population structures. We used a published database of 30 076 SNPs representing previously defined linkage-disequilibrium (LD) groups within S. meliloti [16]. SNP filtering was performed with bcftools-1.18 [52] and PLINK 1.9 [53] based on minor allele frequency (0.05) and missingness (0.2). We selected the top five variant groups for each mutation-driven fitness shift tested based on their P value ranks, cross-compared in the other 15 GWAS. To control for multiple tests while enabling comparisons across replicons that differed in variant group size from ~3000 to 12 000, we adopted a Bonferroni-corrected P value cutoff of P < 10−5 to approximate a nominal P < .05 across replicons. This choice of a consistent intermediate threshold aligns with our goal of examining selection landscapes rather than identifying individual causal variants. This allowed us to categorise a rhizobial candidate as 'pervasive' if it ranked in the top 10 within six or more host mutants or 'limited effect' if it appeared in five or fewer. We focused on ranks rather than P-values for this comparative analysis, prioritizing variants with the most evidence, regardless of differences in power or genetic architecture across host genetic perturbations.
Functional validation of malonyl CoA synthase gene through competition assays
To validate the functional role of a candidate gene identified through GWAS, we mutated the malonyl-CoA synthase gene within the genetically amenable S. meliloti Rm1021 background (mutant strain named RmND1439). We competed the mutant against the corresponding wild type Rm1021, across four host genotypes (A17, dnf3, dnf7, and sunn4). The mutant and wild-type strains were cultured individually and mixed in equal proportions to create the initial inoculum. Strain representation in the inoculum and post-inoculation nodule pools was quantified by performing a dilution series and counting colony-forming units (CFU) on selective and non-selective plates. Competitive fitness was expressed as the log₂ fold change in the proportion of RmND1439 recovered from the nodules relative to its proportion in the initial inoculum. Detailed methods for strain construction and competition assays are provided in the Supplementary Methods.
In-silico prediction of the rhizobial candidate gene functions through protein–protein interaction using AlphaFold-Multimer and Foldseek
To obtain additional information about the potential for the protein products of the candidate rhizobial genes to interact with host proteins such as the NCR peptides directly, we performed in silico protein–protein interaction predictions using AlphaFold-Multimer [54, 55]. The rhizobial candidate proteins were putatively annotated using FoldSeek [56]. To infer candidate gene involvement during symbiosis, we also listed the nodule-enrichment scores of each rhizobial candidate from the Symbimics database (https://iant.toulouse.inra.fr/symbimics/) database [57]. These scores quantify the likelihood of the rhizobial candidate gene involvement during root nodule symbiosis, with higher values indicating stronger nodule-specific gene expression. Detailed procedures for structural modelling, interaction scoring, and data filtering are provided in the Supplementary Methods.
Results
Effect of symbiotic mutations on rhizobial population sizes
Community inoculation resulted in nodulation across all host mutants except for dmi1–2, dmi2–3, and dmi3–1, consistent with the expectation that these genes are required for nodule formation (Fig. 2A). For the 15 mutants that formed nodules, we measured the number of rhizobia released from the nodules and tested if each mutant differed significantly from A17 plants. For instance, ipd3 releases ~2.1 × 105 rhizobial cells per nodule and ~ 2.435 × 107 per plant, a 10-fold decrease compared to A17 (Fig. 2B and C). For both, rhizobia released per nodule and plant, symbiosis stage did not explain nearly as much variation in rhizobia population sizes as individual host mutations (Table S2). Only two mutants significantly reduced the rhizobial population size at both per plant and nodule levels: ipd3 (per plant P = .0001; per nodule P = .04) and nad1 (per plant P = .02; per nodule P = .03), and no mutations increased population size (Table S3). Because ipd3 and nad1 do not produce significantly different numbers of nodules compared to A17 (P > .05), the smaller per-plant rhizobial population size is primarily due to reduced reproduction of rhizobia within nodules. This makes sense given that nad1 host mutant is known to evoke a defense-like response in nodules that kills reproductively active bacteria, and ipd3 host mutant has aborted infection threads, the part of the nodule in which reproductively competent rhizobia are found [43, 58].
Figure 2.
Host symbiotic mutations influence rhizobial fitness. (A) Host mutations altered nodule numbers in consistent and predictable ways. (B and C) Host genotype also influenced rhizobial fitness, as measured by log₁₀-transformed colony-forming units per nodule and per plant (CFU/nodule) (B) and CFU /plant (C). (D) Nodule strain diversity was largely unaffected by host mutations. Closed triangles indicate a significant increase or decrease relative to wild-type A17 (based on Tukey-adjusted P values). Boxplots are color-coded by the symbiotic stage disrupted by each host mutation (see Table S1). (E) Heatmap showing the median relative fitness of each of 86 S. meliloti strains across the different host genotypes. Strains enriched relative to their initial inoculum appear in shades of red; depleted strains appear in shades of blue. Host genotypes are hierarchically clustered based on similarity in strain fitness profiles. Strains are ranked by their relative fitness in A17. Mutants in bold differ significantly from A17 (see Table S5 for RDA paired with the PERMANOVA results).
Host symbiotic genes shape relative strain success
The total rhizobial population released from individual nodules and plants does not tell us whether the effects of the mutations are strain specific. We examined strain-specific effects of the mutants in two ways. First, we evaluated the nodule strain diversity, as measured by Shannon’s H, in each of the 15 nodulating mutants compared to A17. This analysis revealed that dnf7 was the only mutant with significantly reduced nodule strain diversity (P = .009, Fig. 2C, Table S3). Second, we compared the relative fitness of each strain in the nodules of each host mutant to A17 (Fig. 2E). Hierarchical clustering revealed that strain relative fitness did not cluster solely by symbiotic stage and that the latd nodule strain fitness was quite different from all other hosts, including wildtype. The latd host mutant results should be interpreted carefully—latd uniquely had a single estimate for rhizobial fitness traits, because plants formed so few nodules that we had to combine nodules across all pots to create a large enough sample for sequencing. A redundancy analysis with PERMANOVAs reinforced this finding. Whereas the symbiosis stage explained some variation in relative strain fitness, symbiotic genes had strong individual effects (Fig. S2, Table S4 and S5). Both early-acting (ipd3 and nad1) and late-acting (all dnf mutants except dnf4, which was marginal at P = .06, and dnf1) significantly altered rhizobial relative fitness in nodules (Fig. 2E). The dnf mutants did not cluster together in the RDA space, reinforcing that defects in nitrogen fixation did not produce uniform effects on rhizobial enrichment. Some strains consistently exhibited high or low fitness across all host genotypes (e.g. MAG322 vs. MAG27), but the fitness of most strains depended on host symbiotic mutations and often changed the rank order of relative strain fitness of the strains compared to wildtype (Fig. S3). Strain shifts were often specific to individual host mutants; e.g. MAG726B, MAG701B, and MAG201 had high fitness in dnf4 but low fitness in latd. The host-dependent changes in strain fitness allow us to use GWAS to identify rhizobial allelic variants associated with the observed strain fitness shifts caused by each mutation relative to wildtype.
Rhizobial genes underlying strain fitness shifts in response to host mutations were primarily located on extrachromosomal replicons
To determine genetic variants associated with shifts in strain fitness when the host genes were disrupted, we performed 16 GWAS, one for the shifts in strain fitness caused by each of the 15 mutants (relative fitness in mutant-relative fitness in A17) and for comparison, one for relative fitness in A17 (Fig. 3, Fig. S4). We found nine linkage groups on pSymA and 17 linkage groups on pSymB that were strongly associated with strain fitness shifts (P < 10−5) (Table 2, Table S6). In contrast, linkage groups on the chromosome had weaker effects (none with P < 10−5), except for LD group 838 (P = 6.38 × 10−6), which was associated with increased strain fitness with dnf6 and latd. The P values of variants across different replicons may differ in part due to replicon-specific patterns of relatedness among strains. However, the impact of population structure on GWAS power is not always straightforward to predict. For instance, the chromosome may exhibit long-range linkage disequilibrium, resulting in large linkage blocks that tag multiple genes, which can both help or hinder association resolution. For completeness, we ran a second GWAS to identify the genes underlying strain fitness in each host mutant individually (Fig. S5, Table S7). This second analysis identified bacterial variants that are associated with strain fitness in each host genotype, rather than variants associated with doing better or worse relative to WT. Both approaches identified variants that were distinct from those important in wild-type hosts when we compared lists of the top 0.1% of candidate SNPs based on P values (Fig. S6). However, candidate SNPs identified by the strain fitness approach showed >50% overlap with A17 for seven host mutations, whereas the strain fitness shift approach identified largely non-overlapping SNPs (Fig. S6). Below, we focus our discussion on candidates from the fitness shift analyses because it aligns with our goal of identifying the genomic basis of rhizobial responses to changes in the host environment and allows us to focus on those bacterial loci whose effects change depending on the host genetic environment.
Figure 3.
GWAS of strain fitness shifts in host mutants. (A) The rhizobial strain fitness shifts for each of the 86 strains, calculated by subtracting individual relative strain fitness measured in A17 from the relative strain fitness in the host mutants. The heatmap is hierarchically based on the similarity of median “shift” values across hosts. Shades of orange denote positive fitness shift compared to A17, whereas shades of purple denote negative fitness shift compared to A17. (B) Manhattan plot indicating the SNP (single-nucleotide polymorphism) positions, based on S. meliloti USDA1106 association with the relative strain fitness shifts across the 15 host mutants and strain fitness in A17. The dotted line indicates the P-value threshold used for identifying candidate genes (P < 10−5), and significant associations are color-coded according to host mutation and A17. The GWAS was run separately on each replicon to account for different population structures. The “∆” indicates “strain fitness shift” in that host mutant. For comparison, we also ran the GWAS on A17 relative strain fitness.
Table 2.
Candidate rhizobial genes in small variant groups (three genes or fewer) associated with (P < 10−5) variation in rhizobial relative strain fitness. See Table S6 for information on all variant groups.
| LD group1 | Host mutants | Location | Alt. allele freq2 | Eff. alt. vars in mutants | Eff alt. vars in WT | Variants | Genes ID's3 |
|---|---|---|---|---|---|---|---|
| 9034 | dnf1,dnf5/dnf2, dnf2, dnf4, dnf7, ipd3, nad1,rdn1 sunn1, sunn4 | pSymA | 0.07 | + | − | 1 | CDO30_RS23065:SGNH/GDSL hydrolase family protein |
| 6978.alt4 | dnf1, dnf1/dnf2, dnf2, dnf4, dnf7, dnf5/dnf2, ipd3, rdn1, sunn1 | pSymA | 0.07 | + | none | 1 | CDO30_RS23160:NAD(P)H-binding protein |
| 5279 | dnf1, dnf2, dnf7, ipd3, nad1,rdn1, sunn1 | pSymA | 0.07 | + | − | 1 | CDO30_RS21460:malonyl-CoA synthase |
| 7969 | dnf1, dnf2, dnf6, nad1, sunn4 | pSymA | 0.06 | + | none | 1 | CDO30_RS22325:hypothetical protein |
| 6968 | dnf4, dnf7,ipd3, sunn1, sunn4 | pSymA | 0.10 | + | none | 1 | CDO30_RS23125:SDR family oxidoreductase (pseudogene) |
| 10 122 | dnf1, dnf1/dnf2, dnf2, sunn1 | pSymB | 0.07 | + | − | 1 | CDO30_RS27255:hypothetical protein |
| 12 516 | dnf1/dnf2, rdn1, sunn1 | pSymB | 0.06 | + | none | 2 | CDO30_RS28530:ABC transporter ATP-binding protein; CDO30_RS28525:RkpR |
| 10 601 | dnf7, ipd3, ltd | pSymB | 0.05 | + | none | 1 | CDO30_RS28740:pepT, peptidase T |
| 3770 | rdn1, sunn4 | pSymA | 0.14 | + | − | 1 | CDO30_RS18080:fixL, oxygen sensor histidine kinase fixL |
| 12 013 | dnf1/dnf2, ipd3 | pSymB | 0.05 | + | none | 1 | CDO30_RS28675:hypothetical protein |
| 10 882 | dnf4 | pSymB | 0.05 | + | none | 1 | CDO30_RS30150:glycosyltransferase |
| 9671 | dnf3 | pSymB | 0.07 | + | none | 3 | CDO30_RS27435:four-carbon acid sugar kinase family protein |
| 10 140 | dnf3 | pSymB | 0.08 | + | none | 1 | CDO30_RS27435:four-carbon acid sugar kinase family protein |
| 10 913 | dnf3 | pSymB | 0.05 | + | none | 2 | CDO30_RS27440:xanthine dehydrogenase family protein molybdopterin-binding subunit; CDO30_RS27445:xanthine dehydrogenase family protein subunit M |
| 10 915 | dnf3 | pSymB | 0.05 | + | none | 42 | CDO30_RS27440:xanthine dehydrogenase family protein molybdopterin-binding subunit; CDO30_RS27445:xanthine dehydrogenase family protein subunit M |
| 8294 | dnf4 | pSymA | 0.07 | + | none | 1 | CDO30_RS19605:stc4, stachydrine N-demethylase reductase subunit Stc4 |
| 8754 | dnf5/dnf2 | pSymB | 0.1 | + | none | 2 | CDO30_RS32445:hypothetical protein |
| 9239 | dnf5/dnf2 | pSymB | 0.09 | + | none | 1 | CDO30_RS32445:hypothetical protein |
| 10 903 | ipd3 | pSymB | 0.05 | + | none | 1 | CDO30_RS27310:ABC transporter ATP-binding protein |
| 838 | latd | chr | 0.09 | + | none | 1 | CDO30_RS12270:glutamine synthetase family protein |
| 4107 | nad1 | pSymA | 0.11 | − | none | 1 | CDO30_RS22005:amidase |
| 4386 | nad1 | pSymA | 0.09 | − | none | 2 | CDO30_RS22060:Atu4866 domain-containing protein;CDO30_RS22055:class I SAM-dependent methyltransferase; |
| 9865 | rdn1 | pSymB | 0.07 | + | none | 7 | CDO30_RS32685:four-carbon acid sugar kinase family protein; CDO30_RS32675:NAD(P)-dependent oxidoreductase; CDO30_RS32705:LacI family transcriptional regulator |
| 9926 | rdn1 | pSymB | 0.07 | + | none | 4 | CDO30_RS31555:MASE1 domain-containing protein; CDO30_RS32090:TRAP transporter large permease; CDO30_RS32070:class I SAM-dependent methyltransferase |
| 9941 | rdn1 | pSymB | 0.07 | + | none | 1 | CDO30_RS25485:pyridoxamine 5′-phosphate oxidase family protein |
| 10 728 | rdn1 | pSymB | 0.06 | + | none | 1 | CDO30_RS32910:LysR family transcriptional regulator |
| 12 841 | sunn1 | pSymB | 0.07 | + | none | 1 | CDO30_RS28530:ABC transporter ATP-binding protein |
Notes:
1LD group numbers were obtained from1616.
2AF= non-reference (alternate) allele frequency.
3Locus tags are from the S. meliloti USDA1106 reference annotation available on MAGE (http://www.genoscope.cns.fr/agc/microscope/home/index.php).
4A tri allelic SNP.
GWAS identifies rhizobial genes with pervasive and specific effects on strain fitness shifts
To study the genomic landscape underlying rhizobial fitness shifts in response to host mutations, we identified the top five variants for each host mutant (in total, 114 variant groups) based on P value ranks (Table S6). Then we compared the P value ranks and effect sizes of these groups across all the host mutants (Fig. 4, Fig. S7, Table S8). Most of the candidates on the chromosome were classified as “limited effect” variant groups because they ranked among the top 10 candidates, in five or fewer host mutations (Table S8). Exceptions included a massive chromosomal variant group (901) with 144 variants in 65 genes, among the top 10 candidates among nine host mutants. Six variants with pervasive effects were found on pSymB. On pSymB, four small variant groups were top-ranked in at least three host mutants (ABC transporter and RkpR, a dihydrodipicolinate synthase family protein, peptidase T, and a hypothetical protein (Fig. S7). However, the seven pervasive variants on pSymA were most interesting to us because they spanned similar hosts and had strong statistical support. These candidates included malonyl-CoA synthase, SGNH/GDSL hydrolase family protein, fixL, an oxygen sensor histidine kinase, a short-chain oxidoreductase, and a transport protein belonging to the MFS (Major Facilitator Superfamily). We validated one pervasive candidate, the malonyl-CoA synthase, by knocking it out in S. meliloti Rm1021 genetic background (See Supplemental Methods) and competing the mutant (RmND1439) against the wild type strain across four host genotypes (sunn4, dnf3, dnf7, and A17). RmND1439 was almost completely absent from nodules, indicating a functional version of this gene is crucial for strain fitness in all hosts tested (Fig. S8). This result supports the gene’s pervasive effect and confirms that our approach can identify rhizobial genes influencing strain performance. We further observed that the non-reference variant often increased strain fitness within the host mutants. In addition, we noted a trend where the pervasive variants that conferred higher relative fitness in the mutant hosts, tended to have lower fitness in wildtype A17. For example, the malonyl-COA synthase had an alternate allele associated with increased strain fitness (positive beta) in several host mutant backgrounds (ipd3, dnf1, dnf2, dnf6, dnf7, nad1, rdn1, and sunn1). In contrast, the same allele was associated with decreased fitness (negative beta) in A17 (Fig. 4, Table S8). This pattern suggests a host-dependent allelic effect with the allele conferring a fitness advantage in mutant hosts becoming disadvantageous in the wild-type host.
Figure 4.
Rhizobial candidate genes underlying strain fitness shifts predominantly occur on pSymA. Heatmaps based on the inverse P value ranks (red-yellow) and β values, which denote the effect sizes (red-blue) for the top five variant groups on the pSymA across the 15 host mutants and A17. Rows represent candidate variant groups, and columns represent host mutants. The host mutants are colour coded based on the stage of disruption they affect. Progressive dark shades (low P value ranks) indicate a stronger association between the variant group and strain fitness shift in that host mutant. Heatmap based on the effect sizes (β values) for the same candidate variant groups was ordered to match the P value ranking. Shades of red indicate host mutants where the corresponding variant has a positive effect on strain fitness shift, and shades of blue indicate negative effects as compared to A17. Colour intensity corresponds to effect magnitude. Alternate allele frequencies (i.e. not present in the reference genome USDA1106) for each candidate are also shown in parentheses next to the gene names. Pervasiveness of a variant group is defined as the number of times a variant group is among the top 10 candidates (based on the P value ranks) within a host mutant. A variant group is classified as “pervasive” if it is in the top 10 across six or more host genotypes. In contrast, a variant is classified as “limited effect” if it is in the top 10 candidates across five or fewer host genotypes. We have mentioned the number of times a variant appears in the top 10 under the “Pervasiveness” column in the figure and have indicated the pervasive genes in bold. (see Fig. S5 for the other replicons.)
Most limited-effect variant groups (82 of 94) were small (having fewer than three genes). The majority of these were on pSymB (Table S8). Those with strong statistical support (P value<10−5) included variants in or nearby genes coding two ABC transporters (sunn1, ipd3, dnf1/dnf2, rdn1), glutamine synthetase (latd) methyltransferases (nad1), transcriptional regulators (rdn1), a couple of hypothetical proteins (ipd3, dnf1/dnf2,dnf5/dnf2,dnf2,sunn1)and genes involved in purine metabolism (dnf3). A rhizobial glutamine synthetase gene, which converts glutamate and ammonia into glutamine to assimilate nitrogen, was the strongest candidate for variation in strain fitness shift in the latd genotype, which encodes a nitrate transporter [59]. This was unexpected because rhizobial glutamine synthetases are generally thought to be strongly downregulated in nodules where host-driven nitrogen assimilation processes dominate [57]. To further investigate the potential functional relevance of these candidates, we utilized AlphaFold Multimer, coupled with Foldseek analysis, to predict possible rhizobial protein- NCR peptide interactions. Host NCR peptides are a logical starting point for putative plant protein-rhizobial protein interactions because (i) several of our hosts are NCR mutants [27], (ii) NCRs are known to have strain-specific effects [60] and (iii) the large NCR family is derived from antimicrobial peptides, but we only know what a handful of them do [61–63]. We found that more than half of the top rhizobial candidate genes associated with strain fitness shifts (20 out of 35) had a high likelihood of physically interacting with one or more NCR peptides (Table S9 and S10). These included rhizobial candidates associated with fitness shifts in NCR-compromised mutants whose annotations suggest that rhizobia might adjust their membrane transport, redox balance, and surface defences when host NCR activity changes (e.g. xanthine dehydrogenase family protein subunit M, a Teicoplanin resistance protein-like, and an ABC transporter ATP-binding protein, which could be an NCR efflux protein according to Foldseek analysis). These findings provide additional support for the biological relevance of these candidates in mediating rhizobium–host interactions.
Correlations of host investment and benefit traits with rhizobial traits
Because mutualistic partners affect each other’s fitness, we were curious if any host investment or benefit traits were correlated with rhizobial fitness. At the level of individuals pots, nodule area, plant biomass, and root-to-shoot ratio covaried with nodule strain diversity and position on the first RDA axis describing shifts in strain relative fitness (Fig. 5A). Total nodulated area per plant covaried with the second RDA axis describing shifts in strain relative fitness. Perhaps unsurprisingly, the host trait that covaried with the number of CFUs released per plant was the number of nodules per plant. Across the first PC of trait space, we found the wild-type plants grouped with the AON mutants, hcl, and three dnfs [3, 4, 6]. In contrast, the second PC tended to separate the early-acting, AON, and wildtype plants from the late-acting mutants. Genotypic trait correlations confirmed these patterns (Fig. 5B). Host mutants that were larger and had lower root-to-shoot ratios, (i.e. those that benefited more from rhizobial N-fixation), tended to release more rhizobia (CFUs) from individual nodules, more rhizobia on a whole-plant level, and had nodule strain communities that resembled those in A17 nodules (Fig. 5B). Most host traits correlated strongly with strain diversity and position along the first rhizobial relative fitness RDA axis; The exception was average nodule area/plant, which was most strongly correlated with host position on the second RDA axis. Details on mutation-specific effects on host traits can be found in Fig. S8 and Table S3. Our results suggest that symbiotic gene disruptions strongly shape both partners' fitness in a manner that reinforces mutualism—mutations that reduce rhizobial fitness also reduce host fitness.
Figure 5.

Covariation and correlation structure among host and rhizobial traits. (A) Principal component analysis (PCA) of host investment traits, benefit traits, and rhizobial fitness outcomes, conducted at the level of individual pots. Arrows represent the loadings of each trait on the first two principal components, which together explain 51.2% and 16.7% of the total variance, respectively. To avoid loss of representation due to missing data, values for two genotypes (latd and rdn1) were averaged across replicates prior to PCA, as these genotypes had incomplete trait data that would otherwise be excluded.
(B) All pairwise combinations of genotypic trait correlations (Pearsons). Each ellipse represents a pairwise Pearson correlation between traits, with ellipse elongation and color saturation indicating the strength and direction of the correlation (blue = positive, red = negative). Strong correlations may reflect shared genetic regulation or pleiotropy.
Discussion
By combining a S&R experiment with GWAS, we demonstrate that mutations in known host symbiotic genes create distinct environments that shape rhizobial strain success and identify rhizobial candidate genes underlying strain fitness shifts in response to host mutations. By applying a multi-strain inoculation approach here, we have created a more natural ecological setting to evaluate host-imposed selection on naturally-occurring rhizobial genetic variants. Within this ecological framework, we found rhizobial genetic variants with both pervasive and limited effects across host mutants, and that symbiotic stage categories provide little clarity about how rhizobial genes influence strain fitness. We also found that mutations that increased rhizobial fitness tend to increase host investment and benefit traits, which highlight a directional effect where host genetic variations reshape selection on rhizobial partners. Our findings underscore the complexity of mutualism, where both host and rhizobial genomes jointly determine the outcomes of symbiosis at the community and individual strain levels, adding to knowledge of how Medicago hosts determine rhizobial fitness [16, 30].
Host genetic mutations impact rhizobial absolute fitness
Legumes releasing rhizobia from their senescing nodules directly contribute to the soil rhizobial population [28, 48, 64]. We found that most mutations did not affect the number of rhizobia released from each nodule, and those that did, ipd3 and nad1, released fewer rhizobia per nodule. The reduced number of rhizobia per nodule with these two mutants is not unexpected, given that disrupting nad1 leads to overactive plant defence responses in nodules and disrupting ipd3 traps rhizobia within infection threads, likely reducing survival [32, 43]. Whereas our results suggest that the effects of host genetic changes on rhizobial fitness may sometimes align with known gene functions, we also observed broader positive correlations between rhizobia released per nodule and average nodule area across mutations. Increasing the number of rhizobia the host supports represents one pathway to nurture robust populations of rhizobia, and identifying the host allelic variation that increases this amplification could set the stage for targeted interventions aimed at bolstering soil rhizobial populations in agricultural fields [65]. However, the host mutant lines used in this study were generated through forward genetic screens and, therefore, will likely carry additional background mutations. Thus, our aim is not to assign strict causality to individual mutations but rather to examine how alterations in the host genetic landscape broadly influence rhizobial fitness.
Rhizobial fitness shifts depend on host genetics
A large group of NCR peptides is one of the major known host factors for altering strain survival and growth in nodules [63, 66–68]. Although late-acting NCRs can be important determinants of host strain specificity in M. truncatula, our investigation reveals a more complex landscape [27, 69]. Our analyses of two NCR mutations revealed differing effects on strain fitness within nodules. The dnf7 mutant, which lacks functional NCR169 [62], was the only mutant background where we observed reduced nodule strain diversity, and it clustered with ipd3, which also supported a smaller rhizobial population. In contrast, dnf4, which does not produce NCR211 and influences rhizobial differentiation during bacteroid formation [70], showed shifts in strain relative fitness similar to dnf3 and dnf6, suggesting these uncharacterized mutants may also involve NCR-related processes. A few rhizobia genes (bacA, hrrP, yejABEF) are known to affect the efficacy of NCRs [35, 66, 71]; however, none of those genes were top candidates in our GWAS, suggesting additional mechanisms of host restriction of rhizobia [61, 72, 73].
Rhizobial variants with pervasive and limited effects across host mutants
Among the ~120 top rhizobial variants, ~18% affected rhizobial fitness in context of six or more host mutations (pervasive genes), and ~82% were associated in context of five or fewer host mutations (limited effect genes). Pervasive effect alleles tended to exhibit fitness trade-offs; alleles linked to higher fitness with many host mutations tended to be linked with lower fitness with wild-type hosts [16]. Many of the GWAS candidates were identified with only one or a few host mutations, with the strongest associations on pSymB. Unlike the pervasive variants, most limited-effect variants had little to no effect on strain fitness in the wild-type background. Most candidate genes with pervasive associations were highly expressed in the nitrogen fixation zone (Zone III) [57]. For example, the fixL gene, which regulates the expression of nitrogen fixation genes in response to low oxygen levels, malonyl-CoA synthase i.e. induced in the rhizosphere of alfalfa, responsible for malonate catabolism within root nodules [74], and a NAD(P)H-binding protein from the conserved NmrA domain family that shows high similarity to a transcriptional regulator, involved in nitrogen metabolite repression in fungi [75]. These genes represent strong candidates for uncovering novel mechanisms that influence symbiont performance across diverse host contexts [76].
Beyond these candidates, most associations appeared mutation-specific, indicating that each mutated host background creates a unique challenge for rhizobia reproduction. Notable candidates include: A Class I SAM-dependent methyltransferase associated with strain fitness in nad1 mutants, suggesting the importance of epigenetic modifications for rhizobial survival under host defences [32], and stc4, encoding stachydrine N-demethylase reductase, which is linked to enhanced strain fitness in dnf4 mutants, possibly by protecting differentiating rhizobia [70]. Whereas these genes are candidates for altering strain fitness in some host mutant contexts and not others, our experiments do not pinpoint the stage at which each host gene exerts the strongest selective filter. One way to follow up on these results is with assays comparing rhizobia mutated in the candidate genes identified in our GWAS on wildtype and mutant hosts, which would allow one to identify the selective stage- rhizosphere, infection thread, or nodule- where each gene exerts its influence on strain fitness.
Considerations for interpreting mixed-strain experiments
Final strain fitness in nodules is a composite trait that reflects host genetics and microbial traits such as rhizosphere colonization, competition for infection, and proliferation within nodules. As such, shifts in relative strain fitness across host genotypes could also be influenced by differential competitive abilities among strains before infection of each host genotype. Thus, whereas our design allows us to detect which host symbiotic genes alter strain fitness, it does not allow us to disentangle the relative contributions of microbial competition versus host filtering. This limitation is especially relevant in complex communities, where strain–strain interactions influence host effects [21]. Whereas data suggest the effects of microbe × microbe interactions on strain relative fitness in our system are weak [26], we caution that it is important to interpret our host-driven fitness shifts as emergent outcomes of host × microbe × microbe interactions. An additional consideration is that our experiment is designed to measure how growing plants of a single genotype affects the fitness of rhizobial strains. Linking these fine-scale host effects to long-term dynamics of rhizobial adaptation is an important overarching goal in plant-microbe interactions that requires incorporating additional ecological and evolutionary contexts, including the importance of soil selection [77] or the cooccurrence of host genotypes across time and space [13, 78].
Conclusion
Despite legumes being used for soil fertility improvement and actively managed for centuries, biological nitrogen fixation in agriculture remains inconsistent spatially and temporally [79–82]. This challenge has driven interest and investment in developing microbial inoculants to enhance crop yield, stress tolerance, and disease and pest management [20]. Rhizobia have been included in microbe-based bioinoculant formulations for centuries [76]. However, most of those strain selection decisions are dominated by strain performances observed under single-strain assays (inoculating a single plant with a single strain) in controlled environments. Thus, strains that show promise as potential bioinoculants under controlled conditions often fail to reach densities high enough to influence crop yields in realistic agricultural systems [83]. Formulating ways to sustain rhizobial fitness in agricultural fields is a critical avenue for plants to continue acquiring nitrogen from their associated rhizobia [84].
Advances in host-mediated microbiome engineering have demonstrated that plant genes can be leveraged to assemble trait-optimized microbial communities, with the selection of beneficial microbes being dependent on the host genotype [81, 85]. The findings in this study suggest a route towards expanding the search for host genes that alter rhizobial fitness. If natural allelic variants of these genes also favor colonization by high-performing symbionts, it will enable the development of “designer plants” with enriched mutualistic microbiomes [86]. Here, we identify host genes that drive strain-specific selection and show that even functionally similar mutations can have divergent effects on strain fitness in multi-strain contexts [21]. Whereas our study focused on loss-of-function mutations, studies on segregating natural genetic variation in Medicago suggest that specific host genes can differentially enrich rhizobia but find limited evidence for widespread balancing or positive selection [16]. Though we do not expect to find these loss-of-function mutations segregating in natural populations, this work lays a critical foundation: identifying the host genes that influence rhizobial fitness. A next step could be to systematically test the effect of naturally-occurring alleles within these genes on rhizobial fitness, to pinpoint the variants that promote or reduce fitness and the mechanisms by which they do it.
Supplementary Material
Acknowledgements
We thank Prof. Dong Wang of the University of Massachusetts Amherst for generously providing the M. truncatula mutant seeds. We also appreciate the assistance of Lillian Cherry and Jennifer E. Harris with the tedious task of picking and processing nodules. Prof. Katy Heath of the University of Illinois Urbana-Champaign and Dr. Mike Grillo (currently Assistant Professor at Loyola University, Chicago) for providing strains and sequencing data.
Contributor Information
Sohini Guha, Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, United States.
Regina B Bledsoe, Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, United States.
Jeremy Sutherland, Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, United States.
Brendan Epstein, Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN 55108, United States.
Gwendolyn M Fry, Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, United States.
Vikram Venugopal, Stowers Institute for Medical Research, Kansas City, MO 64110, United States.
Siva Sankari, Stowers Institute for Medical Research, Kansas City, MO 64110, United States.
Alejandra Gil-Polo, Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, United States.
Garrett Levin, Department of Microbiological Sciences, North Dakota State University, Fargo, ND 58102, United States.
Barney A Geddes, Department of Microbiological Sciences, North Dakota State University, Fargo, ND 58102, United States.
Nevin D Young, Department of Plant Pathology, University of Minnesota, St. Paul, MN 55108, United States.
Peter Tiffin, Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN 55108, United States.
Liana T Burghardt, Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, United States.
Author contributions
LTB, NY, PT idea conception and funding, LTB experimental design, LTB and RGB plant care; RB, GF, LTB data collection for main experiment; GL, AG, BG constructed the malonyl-CoA synthase mutant, SG did the competition assays; VV and SS did the Alphafold Multimer and Foldseek analysis; SG, LTB, RGB, BE, JS did the S&R and GWAS analysis; LTB, SG, RB drafted the manuscript; SG, LTB, RB, JS, BE, PT, NY, GF, BG, GL, VV, AG, and SS reviewed the final version of the manuscript.
Conflicts of interest
Barney A. Geddes is a cofounder of Lilac Agriculture Inc.
Funding
This work was supported by the National Science Foundation grants IOS-1856744 to PT, NY, and LTB, and IOS-2243819, 1724993, 2243817, and 2243819 to PT and LTB. It was completed using computing resources provided by the Penn State Institute for Computational and Data Science, and sequencing was performed by the Huck Genomics Core Facility. LTB’s work was supported by the USDA National Institute of Food and Agriculture and Hatch Appropriations under Project #PEN04760 and Accession #1025611. The authors acknowledge the Huck Institutes’ Genomics Core Facility (RRID:SCR_023645) for providing library prep and sequencing services.
Data availability
All the raw reads from sequencing the rhizobial strains in the initial community and in the rhizobial genomic DNA extracted from the nodule pools of the individual host mutants have been deposited in NCBI under the Bioproject PRJNA401437 (Submission ID: SUB14622715). All raw data and analysis code have been uploaded to Github. (https://github.com/BurghardtLab/HostMutants2021_C86meliloti).
References
- 1. Friesen ML, Porter SS, Stark SC et al. Microbially mediated plant functional traits. Annu Rev Ecol Evol Syst 2011;42:23–46. [Google Scholar]
- 2. Percy DM, John N. Thompson, the geographic mosaic of coevolution. Syst Biol 2007;56:149–51. [Google Scholar]
- 3. Bartha I, Carlson JM, Brumme CJ et al. A genome-to-genome analysis of associations between human genetic variation, HIV-1 sequence diversity, and viral control. elife 2013;2:e01123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Jones JDG, Dangl JL. The plant immune system. Nature 2006;444:323–9. [DOI] [PubMed] [Google Scholar]
- 5. Amandine C, Ebert D, Stukenbrock E et al. Unraveling coevolutionary dynamics using ecological genomics. Trends Genet TIG 2022;38:1003–12. 10.1016/j.tig.2022.05.008 [DOI] [PubMed] [Google Scholar]
- 6. Demirjian C, Vailleau F, Berthomé R et al. Genome-wide association studies in plant pathosystems: success or failure? Trends Plant Sci 2023;28:471–85. 10.1016/j.tplants.2022.11.006 [DOI] [PubMed] [Google Scholar]
- 7. Corwin JA, Copeland D, Feusier J et al. The quantitative basis of the arabidopsis innate immune system to endemic pathogens depends on pathogen genetics. PLoS Genet 2016;12:e1005789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Newberry EA, Minsavage GV, Holland A et al. Genome-wide association to study the host-specificity determinants of Xanthomonas perforans. Phytopathology 2023;113:400–12. 10.1094/PHYTO-08-22-0294-R [DOI] [PubMed] [Google Scholar]
- 9. Medina M, Sachs JL. Symbiont genomics, our new tangled bank. Genomics 2010;95:129–37. 10.1016/j.ygeno.2009.12.004 [DOI] [PubMed] [Google Scholar]
- 10. Clare EL, Schiest FP, Leitch AR et al. The promise of genomics in the study of plant-pollinator interactions. Genome Biol 2013;14:207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Mutualism. Oxford, United Kingdom: Oxford University Press, 2015. [Google Scholar]
- 12. Batstone RT. Genomes within genomes: nested symbiosis and its implications for plant evolution. New Phytol 2022;234:28–34. 10.1111/nph.17847 [DOI] [PubMed] [Google Scholar]
- 13. Heath KD, Stinchcombe JR. Explaining mutualism variation: a new evolutionary paradox? Evolution 2014;68:309–17. 10.1111/evo.12292 [DOI] [PubMed] [Google Scholar]
- 14. Turkarslan S, Stopnisek N, Thompson AW et al. Synergistic epistasis enhances the co-operativity of mutualistic interspecies interactions. ISME J 2021;15:2233–47. 10.1038/s41396-021-00919-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Batstone RT, Lindgren H, Allsup CM et al. Genome-wide association studies across environmental and genetic contexts reveal complex genetic architecture of symbiotic extended phenotypes. MBio 2022;13:e01823–2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Epstein B, Burghardt LT, Heath KD et al. Combining GWAS and population genomic analyses to characterize coevolution in a legume-rhizobia symbiosis. Mol Ecol 2023;32:3798–811. 10.1111/mec.16602 [DOI] [PubMed] [Google Scholar]
- 17. Carlson C, Frederickson ME. An emerging view of coevolution in the legume–rhizobium mutualism. Mol Ecol 2023;32:3793–7. 10.1111/mec.17055 [DOI] [PubMed] [Google Scholar]
- 18. Gohar D, Põldmaa K, Pent M et al. Genomic evidence of symbiotic adaptations in fungus-associated bacteria. iScience 2025;28:112253. 10.1016/j.isci.2025.112253 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Denison RF, Kiers ET, West SA. Darwinian agriculture: when can humans find solutions beyond the reach of natural selection? Q Rev Biol 2003;78:145–68. [DOI] [PubMed] [Google Scholar]
- 20. Busby PE, Soman C, Wagner MR et al. Research priorities for harnessing plant microbiomes in sustainable agriculture. PLoS Biol 2017;15:e2001793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Burghardt LT, diCenzo GC. The evolutionary ecology of rhizobia: multiple facets of competition before, during, and after symbiosis with legumes. Curr Opin Microbiol 2023;72:102281. [DOI] [PubMed] [Google Scholar]
- 22. Oldroyd GED, Murray JD, Poole PS et al. The rules of engagement in the legume-rhizobial symbiosis. Annu Rev Genet 2011;45:119–44. 10.1146/annurev-genet-110410-132549 [DOI] [PubMed] [Google Scholar]
- 23. Poole P, Ramachandran V, Terpolilli J. Rhizobia: from saprophytes to endosymbionts. Nat Rev Microbiol 2018;16:291–303. 10.1038/nrmicro.2017.171 [DOI] [PubMed] [Google Scholar]
- 24. Heath KD, Tiffin P. Context dependence in the coevolution of plant and rhizobial mutualists. Proc R Soc B Biol Sci 2007;274:1905–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Burghardt LT, Epstein B, Guhlin J et al. Select and resequence reveals relative fitness of bacteria in symbiotic and free-living environments. Proc Natl Acad Sci USA 2018;115:2425–30. 10.1073/pnas.1714246115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Burghardt LT, Epstein B, Hoge M et al. Host-associated rhizobial fitness: dependence on nitrogen, density, community complexity, and legume genotype. Appl Environ Microbiol 2022;88:e00526–2. 10.1128/aem.00526-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Roy S, Liu W, Nandety RS et al. Celebrating 20 years of genetic discoveries in legume nodulation and symbiotic nitrogen fixation. Plant Cell 2020;32:15–41. 10.1105/tpc.19.00279 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Burghardt LT. Evolving together, evolving apart: measuring the fitness of rhizobial bacteria in and out of symbiosis with leguminous plants. New Phytol 2020;228:28–34. 10.1111/nph.16045 [DOI] [PubMed] [Google Scholar]
- 29. Ratcliff WC, Underbakke K, Denison RF. Measuring the fitness of symbiotic rhizobia. Symbiosis 2011;55:85–90. [Google Scholar]
- 30. Burghardt LT, Trujillo DI, Epstein B et al. A select and resequence approach reveals strain-specific effects of Medicago nodule-specific PLAT-domain genes. Plant Physiol 2020;182:463–71. 10.1104/pp.19.00831 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Epstein B, Abou-Shanab RAI, Shamseldin A et al. Genome-wide association analyses in the model rhizobium Ensifer meliloti. mSphere 2018;3:e00386–18. 10.1128/mSphere.00386-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Wang C, Yu H, Luo L et al. NODULES WITH ACTIVATED DEFENSE 1 is required for maintenance of rhizobial endosymbiosis in Medicago truncatula. New Phytol 2016;212:176–91. 10.1111/nph.14017 [DOI] [PubMed] [Google Scholar]
- 33. Yang S, Tang F, Gao M et al. R gene-controlled host specificity in the legume–rhizobia symbiosis. Proc Natl Acad Sci USA 2010;107:18735–40. 10.1073/pnas.1011957107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Tang F, Yang S, Liu J et al. Rj4, a gene controlling nodulation specificity in soybeans, encodes a thaumatin-like protein but not the one previously reported. Plant Physiol 2016;170:26–32. 10.1104/pp.15.01661 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Price PA, Tanner HR, Dillon BA et al. Rhizobial peptidase HrrP cleaves host-encoded signaling peptides and mediates symbiotic compatibility. Proc Natl Acad Sci USA 2015;112:15244–9. 10.1073/pnas.1417797112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Oldroyd GED. Speak, friend, and enter: signalling systems that promote beneficial symbiotic associations in plants. Nat Rev Microbiol 2013;11:252–63. 10.1038/nrmicro2990 [DOI] [PubMed] [Google Scholar]
- 37. Wheatley RM, Ford BL, Li L et al. Lifestyle adaptations of rhizobium from rhizosphere to symbiosis. Proc Natl Acad Sci USA 2020;117:23823–34. 10.1073/pnas.2009094117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Ferguson BJ, Mens C, Hastwell AH et al. Legume nodulation: the host controls the party. Plant Cell Environ 2019;42:41–51. 10.1111/pce.13348 [DOI] [PubMed] [Google Scholar]
- 39. Bright LJ, Liang Y, Mitchell DM et al. The latd gene of Medicago truncatula is required for both nodule and root development. Mol Plant-Microbe Interactions 2005;18:521–32. [DOI] [PubMed] [Google Scholar]
- 40. Hogg BV, Cullimore JV, Ranjeva R et al. The DMI1 and DMI2 early symbiotic genes of Medicago truncatula are required for a high-affinity nodulation factor-binding site associated to a particulate fraction of roots. Plant Physiol 2006;140:365–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Messinese E, Mun J-H, Yeun LH et al. A novel nuclear protein interacts with the symbiotic dmi3 calcium- and calmodulin-dependent protein kinase of Medicago truncatula. Mol Plant-Microbe Interactions 2007;20:912–21. [DOI] [PubMed] [Google Scholar]
- 42. Smit P, Limpens E, Geurts R et al. Medicago LYK3, an entry receptor in rhizobial nodulation factor signaling. Plant Physiol 2007;145:183–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Ovchinnikova E, Journet E-P, Chabaud M et al. Ipd3 controls the formation of nitrogen-fixing symbiosomes in pea and Medicago spp. Mol Plant-Microbe Interactions 2011;24:1333–44. [DOI] [PubMed] [Google Scholar]
- 44. Starker CG, Parra-Colmenares AL, Smith L et al. Nitrogen fixation mutants of Medicago truncatula fail to support plant and bacterial symbiotic gene expression. Plant Physiol 2006;140:671–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Schnabel E, Journet E-P, de Carvalho-Niebel F et al. The Medicago truncatula sunn gene encodes a clv1-like leucine-rich repeat receptor kinase that regulates nodule number and root length. Plant Mol Biol 2005;58:809–22. 10.1007/s11103-005-8102-y [DOI] [PubMed] [Google Scholar]
- 46. Schnabel EL, Kassaw TK, Smith LS et al. The root determined nodulation1 gene regulates nodule number in roots of Medicago truncatula and defines a highly conserved, uncharacterized plant gene family. Plant Physiol 2011;157:328–40. 10.1104/pp.111.178756 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Riley AB, Grillo MA, Epstein B et al. Discordant population structure among rhizobium divided genomes and their legume hosts. Mol Ecol 2023;32:2646–59. 10.1111/mec.16704 [DOI] [PubMed] [Google Scholar]
- 48. Denison RF, Kiers ET. Life histories of symbiotic rhizobia and mycorrhizal fungi. Curr Biol 2011;21:r775–85. 10.1016/j.cub.2011.06.018 [DOI] [PubMed] [Google Scholar]
- 49. Darch SE, West SA, Winzer K et al. Density-dependent fitness benefits in quorum-sensing bacterial populations. Proc Natl Acad Sci USA 2012;109:8259–63. 10.1073/pnas.1118131109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Fåhraeus G. The infection of clover root hairs by nodule bacteria studied by a simple glass slide technique. Microbiology 1957;16:374–81. [DOI] [PubMed] [Google Scholar]
- 51. Kessner D, Turner TL, Novembre J. Maximum likelihood estimation of frequencies of known haplotypes from pooled sequence data. Mol Biol Evol 2013;30:1145–58. 10.1093/molbev/mst016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 2011;27:2987–93. 10.1093/bioinformatics/btr509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Purcell S, Neale B, Todd-Brown K et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81:559–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Jumper J, Evans R, Pritzel A et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021;596:583–9. 10.1038/s41586-021-03819-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Varadi M, Bertoni D, Magana P et al. AlphaFold protein structure database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res 2024;52:D368–75. 10.1093/nar/gkad1011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Van Kempen M, Kim SS, Tumescheit C et al. Fast and accurate protein structure search with Foldseek. Nat Biotechnol 2024;42:243–6. 10.1038/s41587-023-01773-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Roux B, Rodde N, Jardinaud M et al. An integrated analysis of plant and bacterial gene expression in symbiotic root nodules using laser-capture microdissection coupled to RNA sequencing. Plant J 2014;77:817–37. 10.1111/tpj.12442 [DOI] [PubMed] [Google Scholar]
- 58. Domonkos Á, Kovács S, Gombár A et al. NAD1 controls defense-like responses in Medicago truncatula symbiotic nitrogen fixing nodules following rhizobial colonization in a baca-independent manner. Gen 2017;8:387. 10.3390/genes8120387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Harris JM, Dickstein R. Control of root architecture and nodulation by the LATD/NIP transporter. Plant Signal Behav 2010;5:1365–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Wang Q, Liu J, Li H et al. Nodule-specific cysteine-rich peptides negatively regulate nitrogen-fixing symbiosis in a strain-specific manner in Medicago truncatula. Mol Plant-Microbe Interactions 2018;31:240–8. [DOI] [PubMed] [Google Scholar]
- 61. Guefrachi I, Nagymihaly M, Pislariu CI et al. Extreme specificity of NCR gene expression in Medicago truncatula. BMC Genomics 2014;15:712. 10.1186/1471-2164-15-712 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Horváth B, Domonkos Á, Kereszt A et al. Loss of the nodule-specific cysteine rich peptide, NCR169, abolishes symbiotic nitrogen fixation in the Medicago truncatula dnf7 mutant. Proc Natl Acad Sci USA 2015;112:15232–7. 10.1073/pnas.1500777112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Montiel J, Downie JA, Farkas A et al. Morphotype of bacteroids in different legumes correlates with the number and type of symbiotic NCR peptides. Proc Natl Acad Sci USA 2017;114:5041–6. 10.1073/pnas.1704217114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Brewin NJ. Development of the legume root nodule. Annu Rev Cell Biol 1991;7:191–226. [DOI] [PubMed] [Google Scholar]
- 65. Gopalakrishnan S, Sathya A, Vijayabharathi R et al. Plant growth promoting rhizobia: challenges and opportunities. 3 Biotech 2015;5:355–77. 10.1007/s13205-014-0241-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Haag AF, Baloban M, Sani M et al. Protection of Sinorhizobium against host cysteine-rich antimicrobial peptides is critical for symbiosis. PLoS Biol 2011;9:e1001169. 10.1371/journal.pbio.1001169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Czernic P, Gully D, Cartieaux F et al. Convergent evolution of endosymbiont differentiation in dalbergioid and inverted repeat-lacking clade legumes mediated by nodule-specific cysteine-rich peptides. Plant Physiol 2015;169:1254–65. 10.1104/pp.15.00584 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Mergaert P, Uchiumi T, Alunni B et al. Eukaryotic control on bacterial cell cycle and differentiation in the rhizobium –legume symbiosis. Proc Natl Acad Sci USA 2006;103:5230–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Walker L, Lagunas B, Gifford ML. Determinants of host range specificity in legume-rhizobia symbiosis. Front Microbiol 2020;11:585749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Kim M, Chen Y, Xi J et al. An antimicrobial peptide essential for bacterial survival in the nitrogen-fixing symbiosis. Proc Natl Acad Sci USA 2015;112:15238–43. 10.1073/pnas.1500123112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Nicoud Q, Barrière Q, Busset N et al. Sinorhizobium meliloti functions required for resistance to antimicrobial NCR peptides and bacteroid differentiation. MBio 2021;12:e00895–21. 10.1128/mBio.00895-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Burghardt LT, Guhlin J, Chun CL et al. Transcriptomic basis of genome by genome variation in a legume-rhizobia mutualism. Mol Ecol 2017;26:6122–35. 10.1111/mec.14285 [DOI] [PubMed] [Google Scholar]
- 73. Lima RM, Kylarová S, Mergaert P et al. Unexplored arsenals of legume peptides with potential for their applications in medicine and agriculture. Front Microbiol 2020;11:1307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Karunakaran R, East AK, Poole PS. Malonate catabolism does not drive N2 fixation in legume nodules. Appl Environ Microbiol 2013;79:4496–8. 10.1128/AEM.00919-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Stammers DK, Ren J, Leslie K et al. The structure of the negative transcriptional regulator NmrA reveals a structural superfamily which includes the short-chain dehydrogenase/reductases. EMBO J 2001;20:6619–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Mus F, Crook MB, Garcia K et al. Symbiotic nitrogen fixation and the challenges to its extension to nonlegumes. Appl Environ Microbiol 2016;82:3698–710. 10.1128/AEM.01055-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Burghardt LT, Epstein B, Tiffin P. Legacy of prior host and soil selection on rhizobial fitness in planta. Evolution 2019;73:2013–23. 10.1111/evo.13807 [DOI] [PubMed] [Google Scholar]
- 78. Wippel K, Tao K, Niu Y et al. Host preference and invasiveness of commensal bacteria in the lotus and Arabidopsis root microbiota. Nat Microbiol 2021;6:1150–62. 10.1038/s41564-021-00941-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Kaminsky LM, Trexler RV, Malik RJ et al. The inherent conflicts in developing soil microbial inoculants. Trends Biotechnol 2019;37:140–51. 10.1016/j.tibtech.2018.11.011 [DOI] [PubMed] [Google Scholar]
- 80. Soumare A, Diedhiou AG, Thuita M et al. Exploiting biological nitrogen fixation: a route towards a sustainable agriculture. Plants 2020;9:1011. 10.3390/plants9081011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Trivedi P, Batista BD, Bazany KE et al. Plant–microbiome interactions under a changing world: responses, consequences and perspectives. New Phytol 2022;234:1951–9. 10.1111/nph.18016 [DOI] [PubMed] [Google Scholar]
- 82. Batstone RT, Ibrahim A, MacLean LT. Microbiomes: getting to the root of the rhizobial competition problem in agriculture. Curr Biol 2023;33:R777–80. 10.1016/j.cub.2023.05.053 [DOI] [PubMed] [Google Scholar]
- 83. Parnell JJ, Berka R, Young HA et al. From the lab to the farm: an industrial perspective of plant beneficial microorganisms. Front Plant Sci 2016;7:1–7. 10.3389/fpls.2016.01110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Porter SS, Sachs JL. Agriculture and the disruption of plant–microbial symbiosis. Trends Ecol Evol 2020;35:426–39. 10.1016/j.tree.2020.01.006 [DOI] [PubMed] [Google Scholar]
- 85. Batstone RT, O’Brien AM, Harrison TL et al. Experimental evolution makes microbes more cooperative with their local host genotype. Science 2020;370:476–8. 10.1126/science.abb7222 [DOI] [PubMed] [Google Scholar]
- 86. Batstone RT, Burghardt LT, Heath KD. Phenotypic and genomic signatures of interspecies cooperation and conflict in naturally occurring isolates of a model plant symbiont. Proc R Soc B Biol Sci 2022;289:20220477. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the raw reads from sequencing the rhizobial strains in the initial community and in the rhizobial genomic DNA extracted from the nodule pools of the individual host mutants have been deposited in NCBI under the Bioproject PRJNA401437 (Submission ID: SUB14622715). All raw data and analysis code have been uploaded to Github. (https://github.com/BurghardtLab/HostMutants2021_C86meliloti).




