ABSTRACT
Associations between leguminous plants and symbiotic nitrogen-fixing rhizobia are a classic example of mutualism between a eukaryotic host and a specific group of prokaryotic microbes. Although this symbiosis is in part species specific, different rhizobial strains may colonize the same nodule. Some rhizobial strains are commonly known as better competitors than others, but detailed analyses that aim to predict rhizobial competitive abilities based on genomes are still scarce. Here, we performed a bacterial genome-wide association (GWAS) analysis to define the genomic determinants related to the competitive capabilities in the model rhizobial species Sinorhizobium meliloti. For this, 13 tester strains were green fluorescent protein (GFP) tagged and assayed versus 3 red fluorescent protein (RFP)-tagged reference competitor strains (Rm1021, AK83, and BL225C) in a Medicago sativa nodule occupancy test. Competition data and strain genomic sequences were employed to build a model for GWAS based on k-mers. Among the k-mers with the highest scores, 51 k-mers mapped on the genomes of four strains showing the highest competition phenotypes (>60% single strain nodule occupancy; GR4, KH35c, KH46, and SM11) versus BL225C. These k-mers were mainly located on the symbiosis-related megaplasmid pSymA, specifically on genes coding for transporters, proteins involved in the biosynthesis of cofactors, and proteins related to metabolism (e.g., fatty acids). The same analysis was performed considering the sum of single and mixed nodules obtained in the competition assays versus BL225C, retrieving k-mers mapped on the genes previously found and on vir genes. Therefore, the competition abilities seem to be linked to multiple genetic determinants and comprise several cellular components.
IMPORTANCE Decoding the competitive pattern that occurs in the rhizosphere is challenging in the study of bacterial social interaction strategies. To date, the single-gene approach has mainly been used to uncover the bases of nodulation, but there is still a knowledge gap regarding the main features that a priori characterize rhizobial strains able to outcompete indigenous rhizobia. Therefore, tracking down which traits make different rhizobial strains able to win the competition for plant infection over other indigenous rhizobia will improve the strain selection process and, consequently, plant yield in sustainable agricultural production systems. We proved that a k-mer-based GWAS approach can efficiently identify the competition determinants of a panel of strains previously analyzed for their plant tissue occupancy using double fluorescent labeling. The reported strategy will be useful for detailed studies on the genomic aspects of the evolution of bacterial symbiosis and for an extensive evaluation of rhizobial inoculants.
KEYWORDS: GWAS, competition, Sinorhizobium meliloti, rhizobia, legume
INTRODUCTION
The nitrogen-fixing symbiotic interaction between rhizobia and legumes (mostly Fabaceae) is a classic example of a mutualistic association (1). It starts with the mutual recognition of molecular signals, specifically flavonoids released from the plant roots and Nod factors produced by rhizobia (2). Nod factors induce a molecular response in plant root cells, which ultimately leads to rhizobium entry in the radical tissue and intracellular colonization (3). Molecular signaling also drives the development of new structures on plant roots, called nodules, where intracellular rhizobia differentiate into bacteroids, the form responsible for dinitrogen fixation (4–6). In a single nodule (a mass of a few hundreds of milligrams), up to 106 bacterial cells can be recovered, whereas in the soil, free-living rhizobia do generally not exceed 103 to 104/g of soil (7, 8).
As in a trade framework, the benefit for the rhizobium is obtaining a protected environment where it can reproduce (under control) and receive carbon and energy supplies from the plant, whereas the reward for the plant is the availability of fixed nitrogen (7, 9). Since rhizobial transmission is horizontal, plants may also be colonized by poorly effective (low-nitrogen-fixing) strains. However, host plants could control the colonization by inefficient strains via sanctioning root nodules and limiting their growth (10–13). Moreover, the presence of multiple strains within a nodule (mixed nodule) may occur (14, 15). Under these circumstances, inefficient rhizobia can behave as cheaters, decreasing the overall nitrogen-fixing performance (7, 14). Consequently, understanding the mechanisms underlying strain competition has great importance for fully understanding the evolution of symbiosis (16, 17) and predicting rhizobial inoculant efficiency (18). The genetic bases of competitiveness among rhizobial strains are still elusive. To date, most of the studies have identified symbiotic genetic determinants from experiments carried out with mutants of a few different natural strains (7).
The link between a phenotype and its genetic basis, hence predicting phenotypes from the sole genomic information, is one of the challenges of biology (19). Genome-wide association studies (GWASs) are commonly used for identifying the putative functional role of a set of allelic variations in groups of individuals. In bacteria, GWASs have been applied to several species for predicting complex (i.e., multigenic) phenotypes, such as antibiotic tolerance and host interaction (20–22). However, most of the studies investigated phenotypes under strong selective pressure, whereas it is still challenging to determine the genetic basis of phenotypes under mild selection (23). The identification of the genetic determinants in host-bacterium interactions is essential for the improvement of sustainable agriculture: GWASs on plant holobionts (the ensemble of the plant and the other organisms living in or around it) have been proposed (73–74), aiming to provide the basis for future breeding programs, which includes, among the plant traits, the recruitment of the “good” microbiome.
Recent studies have reported the feasibility of experimental setups combining symbiotic assays with genome sequencing approaches and GWASs in rhizobia to define the genetic determinants of symbiotic performances (24–27). In the model rhizobium Sinorhizobium meliloti, association analyses have been employed to explore the genetic basis of various phenotypic traits, including antibiotic resistance and symbiotic and metabolic traits (24). A select-and-resequence approach has successfully been applied to measure the fitness of a set of 101 S. meliloti strains with two genotypes of the host plant Medicago truncatula (25). However, the predictive value of single rhizobial genotypes (i.e., genomes) toward the expected fitness in terms of competitive capabilities to establish a successful symbiosis is still unclear. To date, although many genetic details of the symbiotic interaction are known for single strain colonization (28), we still do not know which rhizobial features increase the chances to win the competition for plant infection, outcompeting other indigenous rhizobia. Unearthing these genetic determinants can advance our knowledge on the genomic aspects of the evolution of bacterial symbiosis and may have a direct application in the screening and amelioration of rhizobial inoculants for sustainable agricultural production systems.
The main objective of this work was to identify a set of candidate genes which may increase rhizobial competition capabilities. We chose as model rhizobial species Ensifer (syn. Sinorhizobium) meliloti; abundant molecular genetics data and tools are available for this bacterium (29), a good number of strains was sequenced, and preliminary data on symbiotic performances and competition are available (14, 20, 21, 25, 30). The competition phenotype was measured by performing a series of nodulation assays where pairs of fluorescently labeled S. meliloti strains were used to infect alfalfa (Medicago sativa) plants. The obtained data were then coupled with the genomic sequences of the same strains to perform a genome-wide association analysis.
RESULTS
Construction of fluorescently tagged S. meliloti strains.
To set up in vitro tests for measuring competition capabilities, a panel of 16 S. meliloti strains was selected. Three well-characterized strains for competition capabilities (S. meliloti BL225C, AK83, and Rm1021) were chosen (14) and used as reference competitors versus 13 S. meliloti strains (tester strains) whose genome sequences were available (see Table S1 in the supplemental material). The phylogenetic relationships among the 13 S. meliloti strains were evaluated (Fig. S1A), and their pangenome was analyzed (Fig. S1B to D). The pangenome was composed of 15,419 genes: 4,278 were shared by all strains (core genome) and 6,622 were strain specific (Fig. S1D). For all above-mentioned 13 tester strains, green fluorescent protein (GFP) derivatives were constructed by cloning the pHC60 plasmid, which constitutively expresses the GFP. Strains S. meliloti Rm1021, BL225C, and AK83 were tagged with red fluorescent protein (RFP) by using the pBHR mRFP plasmid. Preliminary single inoculation assays were performed, showing that all strains were able to form nodules on the roots of alfalfa plants (Fig. S2). For all but two strains (M270 and T073), nitrogenase activity inside the nodules was detected (Fig. S2D), in agreement with previous results that showed low nitrogen fixation abilities in symbiotic interaction with M. truncatula (31). Differences in nodulation, plant growth, and nitrogenase activity among strains were observed. Strains S. meliloti AK58, RU11/001, SM11, USDA1157, GR4, and CCMM B554 showed the highest values of nitrogenase activity and plant growth promotion (Fig. S2).
Competition capabilities for nodule colonization differ in relation to competitor counterparts.
Tagged S. meliloti strains were used in a set of competition experiments: each with a GFP-tagged strain (13 strains in total) versus an RFP-tagged competitor strain (S. meliloti BL225C, AK83, or Rm1021; 13 × 3, total of 39). A large variability in nodule colonization was observed among and within the three sets of competition experiments (versus Rm1021, versus AK83, and versus BL225C). The three competition tests showed differences in the number of total nodules produced per plant (P value < 0.001, Fig. S3A); in the competition experiments with AK83, the highest number of nodules was observed (Fig. S3A).
Competition capabilities were evaluated as single nodule occupancy (nodules colonized by a single strain) of the tested strain in respect to a reference strain; good competitors were characterized by a single nodule occupancy higher than 60%, medium competitors between 20 and 60%, and weak competitors below 20%. In competition with Rm1021, most of the strains outperformed (Fig. 1A). This competition test was characterized by a high average value of nodule occupancy of the tested strains, equal to 65.12%. Most of the strains showed a single nodule occupancy higher than 60%, with the highest value (93.4%) observed for GR4. Three strains (HM006, Rm41, and M270) exhibited medium competition capabilities, and two strains (T073 and USDA1157) poorly performed (Fig. 1A and Table S2). Among the three strains with medium competition capabilities, HM006 and Rm41 displayed values close to those of high-performing strains (58.6 and 50.5%, respectively), whereas M270 nodule occupancy was 37%. Medium-low performance of strains M270 and T073 (T073 competition was characterized by the presence of mixed nodules or nodules with strain Rm1021 only) may be related to nodule sanctioning (plant limiting nutrients to inefficient nodules), as T073 and M270 were unable to fix nitrogen (Fig. S2D).
In the competition experiments with AK83, conversely to the pattern highlighted with Rm1021, a general decrease in nodule occupancy of the 13 strains tested was observed (Fig. 1B), resulting in a lower average value of nodule occupancy (21.99%; Fig. 1B and Table S2). Except for GR4, showing the highest percentage of occupancy (63.9%), all strains displayed weak-medium competitive capabilities (nodule occupancy lower than 60%; Fig. 1B and Table S2). The lowest values of nodule occupancy were detected for Rm41 and T073 (1.7 and 1.8%, respectively).
Lastly, in the competition with BL225C (Fig. 1C and Table S2), the average value of nodule occupancy of the strains tested was equal to 39.45%. The most competitive strains were GR4, KH35c, KH46, and SM11, showing nodule occupancies ranging from 63.4 to 68.9%. The lowest percentage of nodule occupancy, 0.4%, was detected for T073.
In both competition with AK83 and that with BL225C, a higher abundance of mixed nodules (nodules infected by both S. meliloti strains) was observed compared to competition with Rm1021 (Fig. S3B). Considering single nodule occupancy of the whole data set (mean values), we may roughly divide the strains into two groups: one containing highly competitive strains (formed by GR4, KH35c, AK58, KH46, SM11, RU11/001, CCMM B554, and HM006) and one with low-performance strains (composed of T073, USDA1157, Rm41, M270, and 2011) (Fig. S4A; permutational multivariate analysis of variance [PERMANOVA], Bonferroni-corrected P value = 0.0009). The same subdivision could be observed also considering the sum of single occupied nodules and mixed nodules, with the only exception of strain HM006, which moved from the highly competitive group to the low one (Fig. S4B; PERMANOVA, Bonferroni-corrected P value = 0.0005).
Putative genetic determinants associated with good competition capabilities versus S. meliloti BL225C.
Short DNA oligomers with constant length k, termed k-mers, allow capture of a large set of genetic variants in a population, including single nucleotide polymorphisms (SNPs) and insertions/deletions (indels) (20, 30). To pinpoint genetic determinants that might be responsible for the competition capability variation among S. meliloti strains, we performed an association analysis for each competition assay using PhenotypeSeeker (32) (Table 1). Nested cross-validation analyses were also performed to evaluate the competition phenotype predictabilities (see Text S1 in the supplemental material). The genome sequences of the tested strains and a matrix based on the competition phenotype, considered the single nodule occupancy (Table S2), were used to identify specific k-mers significantly associated with competition phenotypes (P value < 0.05). Due to the greater extent of positive performances, ranging from 50.5 to 93.4%, shown by most of the strains tested versus Rm1021 (Fig. 1A; Table S2), a large amount of total k-mers significantly associated with this competition phenotype was obtained (Table 1). In contrast, the number of total k-mers identified in tested strains related to the competition versus BL225C was smaller in comparison with other data sets (Table 1). Moreover, the wider P value range for significantly associated k-mers was found in the analysis of competition versus BL225C, which was also characterized by higher P values (Table 1). Therefore, for the subsequent steps, we selected the set of k-mers related to the competition versus BL225C. This set was mapped on the genomes of the 13 tested strains to retrieve the genetic determinants associated with the competition phenotype.
TABLE 1.
Data set | Total k-mers (P value < 0.05) | P value range | |
---|---|---|---|
vs Rm1021 | Single nodule occupancy | 4.39E+05 | 4.99E−02–4.31E−03 |
(Single and mixed nodule occupancy) | (4.36E+05) | (4.88E−02–1.07E−02) | |
vs BL225C | Single nodule occupancy | 1.82E+05 | 4.98E−02–1.35E−05 |
(Single and mixed nodule occupancy) | (3.60E+05) | (4.96E−02–5.14E−05) | |
vs AK83 | Single nodule occupancy | 2.92E+05 | 4.97E−02–1.05E−03 |
(Single and mixed nodule occupancy) | (1.70E+05) | (4.99E−02–6.15E−04) |
The numbers of total k-mers associated with the competition phenotype (P value < 0.05) for the three competition experiments and their range of P values are reported.
Among the top k-mers (P value threshold < 0.001, see supplemental File S2A at https://doi.org/10.5061/dryad.x95x69pj5), 51 k-mers (P value = 1.31 × 10−4) mapped in genomes of the four strains that showed single nodule occupancy higher than 60% (S. meliloti GR4, SM11, KH35c, and KH46) in the competition test versus BL225C (see supplemental File S1 at https://doi.org/10.5061/dryad.x95x69pj5, highlighted in bold). These best k-mers tagged 103 predicted protein-coding sequences (CDSs); one k-mer may tag multiple genes (see supplemental File S2A at https://doi.org/10.5061/dryad.x95x69pj5). Among the 103 CDSs, a set of orthologous genes was identified (see supplemental File S2A at https://doi.org/10.5061/dryad.x95x69pj5). These orthologous gene hits were mostly tracked in S. meliloti GR4, KH35c, and KH46 genomes (Fig. S5A) and were predominantly located on the symbiosis-required megaplasmid pSymA (ranging from 93.3% to 100%; Fig. S5B, C, and D), particularly in a specific region of 26 kb, present in the genome of these three strains only (Fig. 2A). In contrast, 60% of the orthologous gene hits in the SM11 genome were located on the chromid pSymB (Fig. S5E).
The distribution of the candidate function of gene hits within the most competitive strain genomes was not uniform. Enrichment for COG categories E (amino acid transport and metabolism), C (energy production and conversion), H (coenzyme transport and metabolism), and I (lipid transport and metabolism) was found in S. meliloti GR4, KH35c, and KH46. The most represented orthologous gene groups were related to the coenzyme F420 biosynthesis process, transmembrane transport via ABC-type systems for branched-chain amino acids, and pyrimidine nucleotide biosynthetic processes (Fig. 2B). Further, a putative caffeine dehydrogenase engaged in the pathway of caffeine transformation via C-8 oxidation and for the two subunits (PntA and PntB) of a presumptive proton-translocating NAD(P) transhydrogenase liable for NADPH balancing mechanisms (Table 2) was found. Other presumed functions common to the three strains (GR4, KH35c, and KH46) were related to amino acid degradation, carbohydrate metabolism, and oxidation-reduction, as well as transcriptional regulation by a GntR-type regulator (Table 2). The number of orthologous gene groups with functional annotation tagged by the 51 best k-mers was lower in SM11. Except for the orthologous group related to the fatty acid metabolic process, the candidate functions of gene hits identified in SM11 were exclusive (Table 2).
TABLE 2.
COG ID | COG class(es) | COG functional category | Prokka annotation/other annotation | Biological process(es) |
---|---|---|---|---|
COG0210 | L | Superfamily I DNA or RNA helicase | ATP-dependent DNA helicase PcrA | Mismatch repair, nucleotide excision repair |
COG0391 | GH | Archaeal 2-phospho-l-lactate transferase/bacterial gluconeogenesis factor, CofD/UPF0052 family | Putative phosphoenolpyruvate transferase FbiA | Coenzyme F420 biosynthesis |
COG0410 | E | ABC-type branched-chain amino acid transport system, ATPase component | High-affinity branched-chain amino acid transport ATP-binding protein LivF | High-affinity branched-chain amino acid transport |
COG0418 | F | Dihydroorotase | Dihydroorotase PyrC | Pyrimidine nucleotide biosynthesis |
COG0624 | E | Acetylornithine deacetylase/succinyl-diaminopimelate desuccinylase or related deacylase | Probable N-formyl-4-amino-5-aminomethyl-2-methylpyrimidine deformylase | Unknown function |
COG0683 | E | ABC-type branched-chain amino acid transport system, periplasmic component | Leucine-, isoleucine-, valine-, threonine-, and alanine-binding protein Brac | Branched-chain amino acid transport |
COG1024 | I | Enoyl-CoA hydratase/carnitine racemase | Putative fatty acid oxidation complex subunit alpha–enoyl-CoA hydratase FadJ | Fatty acid metabolism |
COG1028 | IQR | NAD(P)-dependent dehydrogenase, short-chain alcohol dehydrogenase family | Putative NAD-dependent glycerol dehydrogenase | Glycerol metabolic process |
COG1060 | H | 2-Iminoacetate synthase ThiH (tyrosine cleavage enzyme, thiamine biosynthesis) | FbiC F0 synthase | Coenzyme F0 biosynthesis |
COG1129 | G | ABC-type sugar transport system, ATPase component | Ribose import ATP-binding protein RbsA, CUT2 family | Ribose transmembrane transport |
COG1171 | E | Threonine dehydratase | Diaminopropionate ammonia-lyase | Cellular amino acid catabolic process |
COG1172 | G | Ribose/xylose/arabinose/galactoside ABC-type transport system, permease component | Autoinducer 2 import system permease protein LsrD | AI-2 transport system |
COG1282 | C | NAD/NADP transhydrogenase beta subunit | NAD(P) transhydrogenase subunit beta PntB | Oxidation-reduction process; nicotinate and nicotinamide metabolism |
COG1529 | C | CO or xanthine dehydrogenase, Mo-binding subunit | Putative caffeine dehydrogenase subunit alpha | Oxidation-reduction process |
COG1788 | I | Acyl-CoA:acetate/3-ketoacid-CoA transferase, alpha subunit | Glutaconate CoA-transferase GctA subunit A | Glutamate catabolic process (via hydroxyglutarate) |
COG1802 | K | DNA-binding transcriptional regulator, GntR family | HTH-type transcriptional repressor RspR | DNA-binding transcriptional regulation |
COG2057 | I | Acyl-CoA:acetate/3-ketoacid-CoA transferase, beta subunit | Glutaconate CoA-transferase GctB subunit B | Glutamate catabolic process (via hydroxyglutarate) |
COG2141 | HR | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | F420-dependent glucose-6-phosphate dehydrogenase | Carbohydrate metabolic process, oxidation-reduction process |
COG2188 | K | DNA-binding transcriptional regulator, GntR family | Putative transcriptional repressor | DNA-binding transcriptional regulation |
COG2205 | T | K+-sensing histidine kinase KdpD | Two-component system sensor histidine kinase KdpD | Two-component regulatory system K+ sensing that regulates kdpABCF operon for potassium transport |
COG3158 | P | K+ transporter | Low affinity potassium transport system protein Kup | Potassium ion transport |
COG3288 | C | NAD/NADP transhydrogenase alpha subunit | NAD(P) transhydrogenase subunit alpha PntA | Oxidation-reduction process; nicotinate and nicotinamide metabolism |
COG description of gene hits identified by 51 k-mers (P value 1.13 × 10−4) in the most competitive strains (Sinorhizobium meliloti GR4, KH35c, KH46, SM11). Function/annotation are reported according to the annotation performed with Prokka in this work and/or using original annotation.
Among the 103 CDSs tagged by the 51 best k-mers, predicted protein-coding sequences (CDSs) with no assigned function were also identified (see supplemental File S2A at https://doi.org/10.5061/dryad.x95x69pj5). A large part of these tagged CDSs could be identified in the SM11 genome and was almost entirely located on the SM11 chromosome (Fig. S5F and J). In contrast, CDSs that were found in GR4, KH35c, and KH46 genomes were located on homologs of the symbiosis-required megaplasmid pSymA (Fig. S5G, H, and I).
Regulatory regions were also analyzed, and we considered bona fide promoter sequence hit mapping within 600 nucleotides upstream of the CDS start (30). Ten of the 51 best k-mers analyzed pinpointed 15 regulatory regions (see supplemental File S3A at https://doi.org/10.5061/dryad.x95x69pj5). Seven regulatory region hits were associated with CDSs with no assigned function (see supplemental File S3A at https://doi.org/10.5061/dryad.x95x69pj5). These regulatory region hits were tracked exclusively in GR4 and SM11 genomes (Fig. S5P) and were mainly located on the chromosomes of the two strains (Fig. S6Q and R). Eight regulatory region hits of putatively orthologous gene targets were identified (see Fig. S5C and see supplemental File S3A at https://doi.org/10.5061/dryad.x95x69pj5) and were entirely located on symbiosis-required megaplasmid pSymA in S. meliloti GR4, KH35c, and KH46 (Fig. S5L, M, and N) and on pSymB of S. meliloti SM11 (Fig. S5O). In GR4, KH35c, and KH46, the regulatory region hits were associated with genes encoding proteins whose functions (COG1529 and COG2188) have previously been observed (Fig. 2B and C and Table 3). In SM11, the regulatory region hits were associated with a LysR-type orthologous gene and with a gene belonging to COG2308, possibly involved in the biosynthesis of small peptides (Table 3).
TABLE 3.
COG ID | COG class | COG functional category | Prokka annotation/product | Biological process(es) |
---|---|---|---|---|
COG2308 | S | Uncharacterized conserved protein, circularly permuted ATP-grasp superfamily | Uncharacterized putative protein | Function unknown |
COG2188 | K | DNA-binding transcriptional regulator, GntR family | Putative transcriptional repressor | DNA-binding transcriptional regulation |
COG0583 | K | DNA-binding transcriptional regulator, LysR family | HTH-type transcriptional regulator DmlR | Transcriptional regulator of dmlA (aerobic growth on d-malate as the sole carbon source) |
COG1529 | C | CO or xanthine dehydrogenase, Mo-binding subunit | Putative caffeine dehydrogenase subunit alpha | Oxidation-reduction process |
COG description of supposed target orthologous genes of regulatory region hits identified by 10 k-mers (P value 1.13 × 10−4) in the most competitive strains (Sinorhizobium meliloti GR4, KH35c, KH46, SM11).
Most of the genes associated with the 51 best k-mers belonged to the accessory genome, and only a few (nine genes) were within the core genome of the strains tested, suggesting a higher scoring of indel k-mers. All reported genes were also checked with SYMbiMICS (https://iant.toulouse.inra.fr/symbimics/) to examine their levels of transcription within nodules (33). As expected for most of the genes, we did not obtain any match because they were not present in the genome of S. meliloti 2011. However, the expression of all those genes that are shared with S. meliloti 2011 has been observed in different zones of the nodule (33), and depending on the gene, FI, FIId, and ZIII were the zones with higher levels of expression (see supplemental File S2A at https://doi.org/10.5061/dryad.x95x69pj5).
Putative genetic determinants associated with increased competition and coinfecting nodule capabilities in assays versus S. meliloti BL225C.
Association analysis was also performed applying the competition matrix with the sum of single and mixed nodules (Table S2) as well as modeling analysis (see Text S1). Significantly associated-phenotype k-mers (P value < 0.05) were identified for each competition data set (see supplemental File S1B at https://doi.org/10.5061/dryad.x95x69pj5). Similar to the previous analysis, the set of k-mers related to competition versus BL225C showed the widest range of P values (Table 1). Among these, 51 top k-mers (P value = 5.14 × 10−5, Table S4b) were tracked down in the genomes of six strains that showed competition capabilities higher than 80% (evaluated as the nodule occupancy of single and mixed nodules; S. meliloti GR4, SM11, KH35c, KH46, AK58, and RU11/001; see supplemental File S1B at https://doi.org/10.5061/dryad.x95x69pj5, highlighted in bold). Fifty of the 51 top k-mers were mapped on 202 CDSs (see supplemental File S2B at https://doi.org/10.5061/dryad.x95x69pj5). Among these 202 CDSs, 99 were orthologous gene hits. The presence of gene hits with annotated function was uniformly distributed in all six strains (Fig. S6A); they were mainly located on the symbiosis-required megaplasmid pSymA (ranging from 52.6 to 84.4%; Fig. S6C, D, E, F, and G). Thirty-three orthologous gene hits were identified in both association analyses performed (Table 4 and see supplemental File S2B at https://doi.org/10.5061/dryad.x95x69pj5, highlighted in bold). In particular, these common genes were located on the specific region of the symbiotic megaplasmid previously reported (Fig. 2A) and exclusively found in the genomes of GR4, KH35c, and KH46 (Fig. 2A and see supplemental File S2B at https://doi.org/10.5061/dryad.x95x69pj5). Consequently, “common” candidate functions were outlined by both sets of k-mers as linked to the competition phenotype ascertained in GR4, KH35c, and KH46 (Table 4). Moreover, additional genes located in the same genomic region, which were not retrieved among the top k-mers in the single nodule occupancy data set, were found (Fig. 2A).
TABLE 4.
COG ID | COG class(es) | COG functional category | Prokka annotation/other annotation | Biological process |
---|---|---|---|---|
COG0154 | J | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Acylamidase | Aminoacyl-tRNA biosynthesis (glutaminyl-tRNAGln and l-asparaginyl-tRNAAsn) |
COG0156 | H | 7-Keto-8-aminopelargonate synthetase or related enzyme | 8-Amino-7-oxononanoate synthase | Biotin biosynthetic process |
COG0242 | J | Peptide deformylase | Peptide deformylase | Protein biosynthesis (cotranslational protein modification) |
COG0376 | P | Catalase (peroxidase I) | Catalase-peroxidase KatG | Hydrogen peroxide catabolic process; response to oxidative stress |
COG0391* | GH | Archaeal 2-phospho-l-lactate transferase/bacterial gluconeogenesis factor, CofD/UPF0052 family | Putative phosphoenolpyruvate transferase FbiA | Coenzyme F420 biosynthesis |
COG0410* | E | ABC-type branched-chain amino acid transport system, ATPase component | High-affinity branched-chain amino acid transport ATP-binding protein LivF | High-affinity branched-chain amino acid transport |
COG0418* | F | Dihydroorotase | Dihydroorotase PyrC | Pyrimidine nucleotide biosynthesis |
COG0494 | V | 8-Oxo-dGTP pyrophosphatase MutT and related housecleaning NTP | GDP-mannose pyrophosphatase NudK/nudix-type nucleoside diphosphatase (YffH/AdpP family) | Nucleoside phosphate metabolic process; ribose phosphate metabolic process |
COG0507 | L | ATP-dependent exoDNase (exonuclease V), alpha subunit, helicase superfamily I | ATP-dependent RecD-like DNA helicase | Recombinational DNA repair |
COG0516 | F | IMP dehydrogenase/GMP reductase | Inosine-5′-monophosphate dehydrogenase | Purine metabolism |
COG0559 | E | Branched-chain amino acid ABC-type transport system, permease component | High-affinity branched-chain amino acid transport system permease protein LivH | Branched-chain amino acid transport |
COG0582 | L | Integrase | Tyrosine recombinase XerC | Chromosome segregation; DNA recombination |
COG0630 | U | Type IV secretory pathway ATPase VirB11 (archaellum biosynthesis ATPase) | Type IV secretion system protein VirB11 | Protein transport; infection |
COG0683* | E | ABC-type branched-chain amino acid transport system, periplasmic component | Leucine-, isoleucine-, valine-, threonine-, and alanine-binding protein Brac | Branched-chain amino acid transport |
COG1024* | I | Enoyl-CoA hydratase/carnitine racemase | Putative fatty acid oxidation complex subunit alpha–enoyl-CoA hydratase FadJ | Fatty acid metabolism |
COG1028 | IQR | NAD(P)-dependent dehydrogenase, short-chain alcohol dehydrogenase family | Putative dihydroanticapsin 7-dehydrogenase | Short-chain dehydrogenase |
COG1060* | H | 2-Iminoacetate synthase ThiH (tyrosine cleavage enzyme, thiamine biosynthesis) | FbiC FO synthase | Coenzyme F0 biosynthesis |
COG1086 | MO | NDP-sugar epimerase, includes UDP-GlcNAc-inverting 4,6-dehydratase FlaA1 and capsular polysaccharide biosynthesis protein EpsC | UDP-N-acetyl-alpha-d-glucosamine C6 dehydratase | Protein glycosylation |
COG1137 | M | ABC-type lipopolysaccharide export system, ATPase component | Lipopolysaccharide export system ATP-binding protein LptB | Lipopolysaccharide transmembrane transport |
COG1171* | E | Threonine dehydratase | Diaminopropionate ammonia-lyase | Cellular amino acid catabolic process |
COG1172 | G | Ribose/xylose/arabinose/galactoside ABC-type transport system, permease component | Ribose import permease protein RbsC | Ribose transmembrane transport |
COG1176 | E | ABC-type spermidine/putrescine transport system, permease component I | Putrescine transport system permease protein PotH | Putrescine transport |
COG1228 | Q | Imidazolonepropionase or related amidohydrolase | Putative imidazolonepropionase | Nucleotide metabolism |
COG1282* | C | NAD/NADP transhydrogenase beta subunit | NAD(P) transhydrogenase subunit beta PntB | Oxidation-reduction process; nicotinate and nicotinamide metabolism |
COG1317 | NU | Flagellar biosynthesis/type III secretory pathway protein FliH | Type III secretion protein L | Protein transport; infection |
COG1478 | H | F420-0:gamma-glutamyl ligase (F420 biosynthesis) | Bifunctional F420 biosynthesis protein FbiB | Coenzyme F420 biosynthesis (factor 420 polyglutamylation) |
COG1529* | C | CO or xanthine dehydrogenase, Mo-binding subunit | Putative caffeine dehydrogenase subunit alpha | Oxidation-reduction process |
COG1629 | P | Outer membrane receptor proteins, mostly Fe transport | Heme/hemopexin utilization protein C | Siderophore transmembrane transport |
COG1793 | L | ATP-dependent DNA ligase | Multifunctional nonhomologous end joining protein LigD | DNA damage and repair |
COG1920 | H | 2-Phospho-l-lactate guanylyltransferase, coenzyme F420 biosynthesis enzyme | Phosphoenolpyruvate guanylyltransferase FbiD | Coenzyme F420 biosynthesis |
COG2081 | R | Predicted flavoprotein YhiN | Putative protein/predicted flavoprotein (HI0933-like protein) | Unknown function |
COG2898 | S | Lysylphosphatidylglycerol synthetase, C-terminal domain, DUF2156 family | Putative phosphatidylglycerol lysyltransferase | Lipid metabolism |
COG2948 | U | Type IV secretory pathway, VirB10 components | Type IV secretion system protein VirB10 | Protein transport; infection |
COG3288* | C | NAD/NADP transhydrogenase alpha subunit | NAD(P) transhydrogenase subunit alpha PntA | Oxidation-reduction process; nicotinate and nicotinamide metabolism |
COG3451 | U | Type IV secretory pathway, VirB4 component | Type IV secretion system protein VirB4 | Protein transport; infection |
COG4178 | R | ABC-type uncharacterized transport system, permease and ATPase components | Putative inner membrane ABC transporter ATP-binding protein YddA | Lipid transport |
COG description of gene hits identified by 50 top k-mers (P value 5.14 × 10−5) in the most competitive strains as single occupants and coinhabitants of mixed nodules (Sinorhizobium meliloti GR4, SM11, KH35c, KH46, AK58, and RU11/001). Function/annotation are reported according to the annotation performed with Prokka in this work and/or using original annotation. Asterisks indicate orthologous gene hits identified in both association analyses.
Similar to previous results, the frequency of identified candidate functions was dissimilar among the six strains (Fig. 3A). The most represented orthologous gene groups were related to the type IV secretion system (T4SS) (specifically VirB10 and VirB11 proteins) and the nucleoside phosphate metabolic process, which are common to all strains except for KH46 (Fig. 3A and B; Table 4). The species S. meliloti is characterized by the presence of different T4SSs. In particular, in S. meliloti KH35c, three T4SSs are present: a, b, and f (31). The organization of vir genes in the operon, identified in this work, was characterized by the presence of a homolog of virD4 (Fig. 3A) and the lack of virG/virF genes (typical of T4SSb) and rctA (which is typical of T4SSa, which is common in all S. meliloti strains). Therefore, the vir genes identified were probably related to the T4SSf family. Candidate functions related to putrescine transport were also observed in all strains except for KH46. The k-mers related to type IV secretion system protein VirB4 were tracked only in AK58, RU11/001, and SM11 strains (Fig, A; Table 4). Besides, these three strains (AK58, RU11/001, and SM11) showed similar distribution profiles of candidate functions (Fig. 3B), with the exclusive presence of orthologous genes related to aminoacyl-tRNA biosynthesis and protein biosynthesis, as well as purine and lipid metabolism (Fig. 3B; Table 4). Several other functions were exclusively detected in single strains only (see Text S1).
Predicted protein-coding sequences (CDSs) with no assigned function, picked by the 50 top k-mers, were also identified (see supplemental File S2B at https://doi.org/10.5061/dryad.x95x69pj5), mostly in the AK58, RU11/001, and SM11 genomes (Fig. S6H). The distribution of these CDSs among strain replicons was different compared to the single nodule occupancy results (Fig. S5 and S6). In strains SM11 and RU11/001, a large part of the CDS with no assigned function was located on symbiosis-required megaplasmid pSymA (Fig. S6M and N), whereas in GR4, KH46, and KH35c, most of the CDSs were also located on the chromosome and other accessory replicons (Fig. S5J, K, and L).
Fourteen of the 51 top k-mers analyzed pinpointed 29 regulatory regions hit (Fig. 3C and Table 5; see also supplemental File S3B at https://doi.org/10.5061/dryad.x95x69pj5). Among these, 12 regulatory region hits on putatively orthologous gene targets were identified, which were located on pSymA in all S. meliloti strains (Fig. S6Q to U). Six of them were found in both association analyses performed in the genomes of GR4, KH35c, and KH46 (see supplemental File S3B at https://doi.org/10.5061/dryad.x95x69pj5). Seventeen regulatory regions were associated with CDS targets with no assigned function (see supplemental File S3B at https://doi.org/10.5061/dryad.x95x69pj5) and tracked to a greater extent in RU11/001, SM11, KH46, and AK58 (Fig. S6V). The distribution of these regulatory regions among replicons was not similar among strains (Fig. S6). In strains GR4 and KH35c, all regulatory region hits were located on the chromosome (Fig. S6X and Z), whereas in KH46, they were mainly found on pSymB (Fig. S6Y), and in RU11/001 and SM11, they were found on pSymA (Fig. S6A1 and B1).
TABLE 5.
COG ID | COG class(es) | COG functional category | Prokka annotation/product | Biological process(es) |
---|---|---|---|---|
COG2132 | DMP | Multicopper oxidase with three cupredoxin domains (includes cell division protein FtsP and spore coat protein CotA) | Putative blue copper oxidase CueO | Detoxification of copper ion |
COG2188 | K | DNA-binding transcriptional regulator, GntR family | Putative transcriptional repressor | DNA-binding transcriptional regulation |
COG0559 | E | Branched-chain amino acid ABC-type transport system, permease component | High-affinity branched-chain amino acid transport system permease protein LivH | Branched-chain amino acid transport |
COG1529 | C | CO or xanthine dehydrogenase, Mo-binding subunit | Putative caffeine dehydrogenase subunit alpha | Oxidation-reduction process |
COG2124 | QV | Cytochrome P450 | Putative pentalenolactone synthase (cytochrome P450 CYP2 subfamily) | Pentalenolactone biosynthetic process |
COG description of supposed target orthologous genes of regulatory region hits identified by 14 top k-mers (P value 5.14 × 10−5) in the most competitive strains as single occupants and coinhabitants of mixed nodules (Sinorhizobium meliloti GR4, SM11, KH35c, KH46, AK58, and RU11/001).
DISCUSSION
Rhizobium-legume symbiosis is a paradigmatic example of a bacterium-plant association. The ability to colonize plant tissue is under selective pressure, and rhizobial strains which efficiently colonize host plants can more effectively promote plant growth, giving rise to a partial “fitness alignment” between the host and the symbiont (34, 35). However, in nature, several strains compete for forming symbiotic associations with the host plant, and often, nodules are simultaneously colonized by different strains, which in turn may have different efficiencies in promoting plant growth; some of them also behave as “cheaters” (7, 14, 18, 36). In this sense, measuring the competitiveness for plant and nodule colonization and predicting this phenotype from rhizobial genome sequences are of paramount importance for understanding the evolution of rhizobium-plant symbiosis and developing effective inoculant strains for increasing agricultural yields of legume crops (7).
In a recent work, the first step of symbiotic colonization was elucidated by analyzing gene expression patterns related to the response of root exudates among different rhizobial strains and plant varieties (37). Here, we addressed the possibility of identifying some of the genetic determinants involved in strain competitiveness for symbiotic nodule formation (the functional-symbiotic structure). The host plant-rhizobial symbiont system of alfalfa and S. meliloti was employed, and direct measurements of competitiveness were obtained through the analysis of nodule occupancy. This experimental design pointed out a wide variety of strain responses to the diverse competitive conditions, identifying three different competition patterns and outlining a highly complex phenotype that strongly depends on the engaged competitor. The strains used in this work were originally isolated from different Medicago species. However, they displayed good competition capabilities in nodulation of alfalfa, indicating that nodulation competitiveness is not strictly bound to the host genotype.
An abundant presence of mixed nodules was also observed in all three sets of experiments, confirming previous results (11, 13, 37) and suggesting that the possibility to coinfect nodules by different strains could be more widespread than expected.
Strains M270 and T073 were characterized by low nitrogen fixation rates and showed low competition capabilities. Conversely, the differential responses of strains with medium-high nitrogen-fixation efficiency advances the idea that a greater competitive ability is not correlated with a high nitrogen fixation efficiency in alfalfa–S. meliloti interaction, as previously suggested (38, 39). Concerning the competition versus BL225C, except for GR4 and SM11, this assumption seemed to be particularly true for highly efficient N-fixer strains (USDA1157, CCMM B554, RU11/001, 2011, and AK58), which turned out to be medium-weak competitors.
Subsequently, a method based on k-mers was used for the evaluation of the genetic determinants responsible for an increased competition phenotype. The number of GWASs has progressively increased in recent years (20, 40, 41) because of the flexibility in capturing different types of genetic variants and overcoming the alignment of sequences to a reference genome. Among the different pipelines adopted for this kind of analysis, PhenotypeSeeker is one of the up-to-date tools that use machine learning for predicting phenotypes from the sole genome sequences (32). The k-mers related to the competition versus BL225C (single nodule occupancy) taken into account were significantly correlated with the phenotype of interest and mapped on the genomes of the four most competitive strains (GR4, KH35c, KH46, and SM11). Therefore, they may be considered the most informative k-mers for tagging the genetic variants associated with remarkable competition capabilities. The same approach was used considering the sum of single and mixed nodules. In this case, the group formed by highly competitive strains was larger and included strains AK58 and RU11/001. Moreover, 40% of the k-mers previously identified considering only single nodules could also be retrieved in this second analysis.
The largest part of the genes putatively associated with competitiveness was harbored by the megaplasmid pSymA (or pSymA homologs depending on the strain), which is the genomic element carrying all the genes necessary for symbiosis (e.g., nod, fix, and nif genes) (42). According to a previous study, pSymA harbors the largest part of genomic diversity in S. meliloti (43), largely contributing to the phenotypic diversity among strains; considering the obtained results, it may also be linked to competition capabilities.
Previous studies have highlighted the importance of exopolysaccharide production, motility, and signaling for responses to root exudates, symbiosis establishment, and competition (25, 27, 37). In the work of Burghardt et al., a variation in allele frequency for genes whose function is related to cell motility, nitrogen fixation, and nodule formation was observed (25). Conversely, the genes (with functional annotation) found in this work, putatively associated with competitiveness (for single nodule occupancy), were mostly related to biosynthesis and transport functions. Concerning the differences between our results and those from the study by Burghardt et al. (25), we should point out that the experimental settings were different. In the work of Burghardt et al. (25), a mix of 101 strains was used to inoculate plants, whereas we performed competition experiments with two strains at a time. Additionally, none of the strains used were common in the two studies. Concerning the association analysis, Burghardt et al. focused on single nucleotide polymorphisms (SNPs), identifying as responsible for variation in relative fitness those alleles showing the highest variation in frequency in response to selection (25). In our approach, we observed a stronger signal related to indels, with most of the genes retrieved belonging to the accessory genome. However, one SNP associated with the gene htpG (one of the genes with the strongest signal identified in the work of Burghardt et al. [25]) was also retrieved in our analysis by four k-mers, albeit with a lower P value (3.09 × 10−04). These four k-mers separate strains into two groups: one formed by 10 strains and the other one composed of strains T073, 2011, and USDA1157, which are the strains with the lowest competition capabilities in the assay versus BL225C. In both experiments, a sterilized substrate was used, and therefore, we need to consider that under soil conditions, the competition and the advantage in nodule entry may differ depending on the soil physical/chemical features and the presence of an indigenous microbiota.
Many k-mers were related to COG1060, which, together with COG0391, was linked to the presence of the fbi operon in strains KH46, KH35c, and GR4. The fbi operon is widely distributed in aerobic soil bacteria and is responsible for the synthesis of the functional versatile redox factor F420 (44, 45). This cofactor is involved in the redox modification of many organic compounds, facilitating low-potential two-electron redox reactions (44, 45). Moreover, the presence of this cofactor is linked to several important processes such as persistence, antibiotic biosynthesis (tetracyclines, lincosamides, and thiopeptides), and prodrug activation (46), possibly increasing the fitness of these strains. However, the role of this function over symbiotic competition deserves further attention.
Another group of COGs highly represented was related to ABC transporters. The S. meliloti genome encodes a large number of ABC uptake and export systems (47, 48). This feature is probably linked to the selective adaptation to oligotrophic soils (48, 49). According to our association analysis, a group of k-mers tagged putative genes encoding ATPase and permease subunits of branched-chain amino acid ABC transport complex Bra/Liv (50). In S. meliloti, a double mutant for the two main amino acid ABC transport complexes (aap bra) showed no reduction in N2 fixation efficiency and no influence on the plant phenotype, suggesting that in bacteroids, branched-chain amino acid auxotrophy, called “symbiotic auxotrophy,” does not occur (51). However, an attenuated competitive phenotype was found in S. meliloti mutated in the livM gene, which encodes the permease subunit of the Bra/Liv complex (52). It is therefore reasonable to suppose that this complex may provide a noteworthy benefit in the competition dynamics, ensuring a higher supply of amino acids under free-living rhizosphere conditions and increasing strain competitiveness (52, 53). Other COGs related to the ABC transporter were COG1129 and COG1172. Proteins grouping in COG1129 are ATPase components of an ABC-type ribose import system. In Rhizobium leguminosarum, a putative ribose ABC transporter (RbsA, RL2720) was induced by the presence of arabinogalactan, and it was specifically overexpressed in the alfalfa rhizosphere (50). The COG1172, detected by both association analyses, contains a ribose/xylose/arabinose/galactoside ABC-type transport system permease component, highlighting the importance of efficient carbon uptake in the rhizosphere to outcompete other bacteria. The COG1172 is also related to the import of autoinducer signaling molecules in the quorum sensing process, whose connection with S. meliloti competitive behavior has been reported (54).
The type IV secretion system was also found in our association analysis (T4SSf). Interestingly, T4SSa (the type commonly found in S. meliloti) is not required for symbiosis (55), whereas T4SSb has been linked to competitive advantage in S. meliloti for nodule occupancy in M. truncatula (56) and is associated with competition for rhizosphere colonization (57). We cannot exclude a role of T4SS in the management of host defenses, as suggested for Mesorhizobium loti (31, 58, 59). However, the role of T4SSf in the competition should be further elucidated. Several COGs were related to the metabolism of different compounds: pyrimidine (COG0418; dihydroorotase), glutamate (COG1788 and COG2057; glutaconate coenzyme A [CoA] transferase), amino acids (COG1171; threonine dehydratase), fatty acids (COG1024; enoyl-CoA hydratase/carnitine racemase), and glycerol (COG1028; glycerol dehydrogenase), reinforcing the importance of metabolic versatility for nodule colonization (60, 61). Testing those functions through metabolic model reconstructions of the different strains may clarify the importance of metabolism in the competition for nodule colonization (62).
One group of k-mers fell within transcriptional regulation genes, suggesting their involvement in a fine-tuning bacterial response to the presence of other competitors and/or a quick response to variations in the external conditions.
Other COGs retrieved had less-clear connections with competitiveness, and it will require further studies to infer their possible role in this process.
A substantial part of k-mers mapped on hypothetical genes with unknown functions. Although many genes required for rhizobial adaptation to the rhizosphere are not yet characterized, transcriptomic analysis of rhizobia isolated from the rhizosphere revealed the expression of many hypothetical genes (49). This finding suggests that in the pangenome of S. meliloti, several functions potentially important in the fitness associated with the symbiotic interaction, and possibly in plant growth promotion, are yet to be discovered. However, the list of candidate genes (hypothetical and not) identified in this work needs to be experimentally validated to confirm their effectiveness in competition and to understand how their function influences strain competitiveness.
Most of the identified k-mers mapped on genes belonging to the accessory genome, and only a few genes of these lists were common with S. meliloti 2011 genes. This strain has been used for transcriptomic analysis within nodules, applying laser capture microdissection coupled to transcriptome sequencing (RNAseq) (33). Thanks to the SYMbiMICS website, we observed that these common genes were expressed within all different zones of the nodule infected by S. meliloti 2011 (33), indicating that they may play an active role in the symbiotic process.
Rhizobial competitiveness is a cornerstone for plant colonization, making the selection of highly competitive rhizobia fundamental for sustainable agricultural production. Here, we report the feasibility and reliability of using a k-mer-based GWAS approach to detect genes associated with this complex quantitative phenotype in the plant symbiont S. meliloti. Several functions possibly contribute to ameliorate competitiveness, indicating that many different bricks, increasing rhizobial versatility, pave the way for success in competition. This approach may provide the basis for a large-scale screening of putative competitiveness capabilities among pairs of strains, based on genome sequences. Interestingly, the evidence that most of the genes putatively associated with competition reside on the megaplasmid pSymA can offer the possibility to extend the creation of ad hoc hybrid strains by mobilizing the pSymA megaplasmid from different hosts (63) to develop novel ameliorated inoculants (7).
MATERIALS AND METHODS
Bacterial strains, plasmids, and growth conditions.
The strains and plasmids used in this work are listed in Table S3 in the supplemental material. Escherichia coli strains were grown in liquid or solid Luria-Bertani (LB) medium (Sigma-Aldrich) at 37°C (64), supplemented with tetracycline (10 μg/ml). Sinorhizobium meliloti strains were cultured in broth or agar tryptone yeast (TY) medium with 0.2 g/liter CaCO3 at 30°C (65), supplemented with streptomycin (500 μg/ml in broth and agar media), rifampin (50 μg/ml), and tetracycline (1 μg/ml in liquid broth medium, 2 μg/ml in agar medium), when necessary.
Construction of S. meliloti fluorescently tagged strains.
The S. meliloti strains were tagged with green fluorescent protein (GFP) or red fluorescent protein (RFP). Donor E. coli S17-1 strains containing plasmid pHC60 (harboring a constitutively expressed GFP [66]) or pBHR mRFP (harboring a constitutively expressed RFP [67]) were used for biparental conjugations with rifampin-resistant derivative S. meliloti strains. Spontaneous rifampin-derivative S. meliloti strains were isolated by plating aliquots of 100 μl of cell suspension of 109 cells on agar TY medium with rifampin (50 μg/ml). Conjugal transfer was performed as previously described (68), and GFP and RFP expression was verified for each single strain by fluorescence microscopy with a Leica DM L (Leica, Germany) equipped with an N plan oil-immersion objective (100×/1.25 oil).
Competition assay.
Medicago sativa (cv. Maraviglia) plantlets were germinated and grown as described in the nodulation assay section (Text S1). The S. meliloti strains were grown at 30°C to the late exponential phase (optical density at 600 nm [OD600] = 0.6 to 0.8) in TY with opportune antibiotics. Subsequently, each culture was washed twice in nitrogen-free solution (69) and diluted to a final concentration of approximately 5 × 104 CFU/ml. The inoculum mixtures were prepared with equal volumes of cellular suspensions of two different fluorescently tagged strains. A total of 39 competition experiments were set up (13 GFP-tagged strains × 3 RFP-tagged strains), and we assumed an equal amount of fluorescence emission by all strains used. Six plants for each competition experiment were inoculated with 1 ml of inoculum mixtures per seedling. After 4 weeks, nodule fluorescence was detected using a fluorescence stereomicroscope, Stereo Discovery V12 (Zeiss, Oberkochen, Germany), equipped with a charge-coupled-device (CCD) camera controlled by the AxioVision software for image acquisition. For each plant, we determined the total number of nodules and the numbers of green (occupied by GFP-tagged strain only), red (occupied by RFP-tagged strain only), and yellow (occupied by both strains) nodules. For nodule color determination, each nodule of each single plant was imaged with filters for GFP (Zeiss filter set 38HE; excitation 470/40, emission 525/50) and DsRed (Zeiss filter set 43HE; excitation 550/25 and emission 605/70). The obtained images were processed with the ImageJ software (70). Nodule occupancy was expressed as the ratio of the number of nodules (green, red, or mixed) to the total number of nodules present on the roots of each plant. For each competition, we calculated the mean of the nodule occupancy for the six replicates.
Statistical analysis.
Statistical data analysis was performed with the RStudio software (71). The Shapiro test was applied to evaluate data distribution; analysis of variance (ANOVA) and Tukey’s post hoc tests or nonparametric Kruskal-Wallis and post hoc Dunn tests were performed using the FSA and rcompanion packages. The principal-component analysis (PCA) and PERMANOVA were performed using the competition mean values with the PAST software (72).
PhenotypeSeeker analysis.
The single nodule occupancy and the mean value of the sum of the single occupied nodules and the mixed nodules of the strains, assessed in the three competition experiments, were converted into continuous matrices of equivalent values between 0 and 1 for each data set. The FASTA genome sequences of 13 strains and the obtained matrices were used as input to count all k-mers for each set of competition; k-mer length was set to 13 nucleotides. In the first filtering step, the k-mers that were present in or missing from fewer than two samples (“–min 2–max 2”; default) were rejected. Clonal population structure correction was performed. The k-mers were tested for the analyses of association with the phenotype according to the weighted Welch two-sample t test, and k-mers with a P value higher than 0.05 were automatically discarded. Linear regression models were achieved using 1,000 top k-mers, with the lowest P value, for all three data sets. We applied 3-fold nested cross-validation, in which three different random combinations of training and test sets were used, and the model evaluation metrics were averaged over 3-fold train/test splits. The 3-fold explicitly indicates that each strain was included once into the test set and twice into the training set.
Nodulation and acetylene reduction assays and annotation and phylogenetic analyses, as well as mapping procedures, are reported in Text S1.
ACKNOWLEDGMENTS
We acknowledge the financial support for this project provided by transnational funding bodies, partners of the H2020 ERA-NETs SUSFOOD2 and CORE Organic Cofund, under the Joint SUSFOOD2/CORE Organic Call 2019. Erki Aun and Maido Remm were funded by the institutional grant IUT34-11 from the Estonian Ministry of Education and Research and the EU ERDF grant no. 2014-2020.4.01.15-0012 (Estonian Center of Excellence in Genomics and Translational Medicine). A. Mengoni is supported by the MICRO4Legumes grant (Italian Ministry of Agriculture).
A. Bellabarba, F. Pini, A. Mengoni, and C. Viti conceived and planned the research; A. Bellabarba and F. Pini performed competition assays and symbiotic assays; E. Azzarello and F. Decorosi contributed to competition assays; E. Aun and M. Remm performed the PhenotypeSeeker analysis and contributed to linear regression model interpretation; A. Bellabarba and G. Bacci performed the bioinformatics analyses; A. Bellabarba, F. Pini, A. Mengoni, L. Giovannetti, and C. Viti interpreted the data; A. Bellabarba, F. Pini, and A. Mengoni prepared the manuscript; all authors participated in editing the manuscript.
We declare no competing interests.
Contributor Information
Carlo Viti, Email: carlo.viti@unifi.it.
Alessio Mengoni, Email: alessio.mengoni@unifi.it.
Marnix Medema, Wageningen University.
REFERENCES
- 1.Masson-Boivin C, Sachs JL. 2018. Symbiotic nitrogen fixation by rhizobia—the roots of a success story. Curr Opin Plant Biol 44:7–15. doi: 10.1016/j.pbi.2017.12.001. [DOI] [PubMed] [Google Scholar]
- 2.Masson-Boivin C, Giraud E, Perret X, Batut J. 2009. Establishing nitrogen-fixing symbiosis with legumes: how many rhizobium recipes? Trends Microbiol 17:458–466. doi: 10.1016/j.tim.2009.07.004. [DOI] [PubMed] [Google Scholar]
- 3.Oldroyd GED, Murray JD, Poole PS, Downie JA. 2011. The rules of engagement in the legume-rhizobial symbiosis. Annu Rev Genet 45:119–144. doi: 10.1146/annurev-genet-110410-132549. [DOI] [PubMed] [Google Scholar]
- 4.Haag AF, Arnold MFF, Myka KK, Kerscher B, Dall’Angelo S, Zanda M, Mergaert P, Ferguson GP. 2013. Molecular insights into bacteroid development during Rhizobium–legume symbiosis. FEMS Microbiol Rev 37:364–383. doi: 10.1111/1574-6976.12003. [DOI] [PubMed] [Google Scholar]
- 5.Sprent JI. 2009. Legume nodulation: a global perspective. John Wiley & Sons, Hoboken, NJ. [Google Scholar]
- 6.Hirsch AM. 1992. Developmental biology of legume nodulation. New Phytol 122:211–237. doi: 10.1111/j.1469-8137.1992.tb04227.x. [DOI] [PubMed] [Google Scholar]
- 7.Checcucci A, DiCenzo GC, Bazzicalupo M, Mengoni A. 2017. Trade, diplomacy, and warfare: the quest for elite rhizobia inoculant strains. Front Microbiol 8:2207. doi: 10.3389/fmicb.2017.02207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Trabelsi D, Pini F, Aouani ME, Bazzicalupo M, Mengoni A. 2009. Development of real-time PCR assay for detection and quantification of Sinorhizobium meliloti in soil and plant tissue. Lett Appl Microbiol 48:355–361. doi: 10.1111/j.1472-765X.2008.02532.x. [DOI] [PubMed] [Google Scholar]
- 9.Werner GDA, Strassmann JE, Ivens ABF, Engelmoer DJP, Verbruggen E, Queller DC, Noe R, Johnson NC, Hammerstein P, Kiers ET. 2014. Evolution of microbial markets. Proc Natl Acad Sci U S A 111:1237–1244. doi: 10.1073/pnas.1315980111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kiers ET, Rousseau RA, West SA, Denison RF. 2003. Host sanctions and the legume–rhizobium mutualism. Nature 425:1095–1098. doi: 10.1038/nature01931. [DOI] [PubMed] [Google Scholar]
- 11.Denison RF. 2000. Legume sanctions and the evolution of symbiotic cooperation by rhizobia. Am Nat 156:567–576. doi: 10.1086/316994. [DOI] [PubMed] [Google Scholar]
- 12.Porter SS, Simms EL. 2014. Selection for cheating across disparate environments in the legume-rhizobium mutualism. Ecol Lett 17:1121–1129. doi: 10.1111/ele.12318. [DOI] [PubMed] [Google Scholar]
- 13.Pini F, East AK, Appia-Ayme C, Tomek J, Karunakaran R, Mendoza-Suarez M, Edwards A, Terpolilli JJ, Roworth J, Downie JA, Poole PS. 2017. Bacterial biosensors for in vivo spatiotemporal mapping of root secretion. Plant Physiol 174:1289–1306. doi: 10.1104/pp.16.01302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Checcucci A, Azzarello E, Bazzicalupo M, Galardini M, Lagomarsino A, Mancuso S, Marti L, Marzano MC, Mocali S, Squartini A, Zanardo M, Mengoni A. 2016. Mixed nodule infection in Sinorhizobium meliloti–Medicago sativa symbiosis suggest the presence of cheating behavior. Front Plant Sci 7:835. doi: 10.3389/fpls.2016.00835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kiers ET, Rousseau RA, Denison RF. 2006. Measured sanctions: legume hosts detect quantitative variation in rhizobium cooperation and punish accordingly. Evol Ecol Res 8:1077–1086. [Google Scholar]
- 16.Gano-Cohen KA, Wendlandt CE, Stokes PJ, Blanton MA, Quides KW, Zomorrodian A, Adinata ES, Sachs JL. 2019. Interspecific conflict and the evolution of ineffective rhizobia. Ecol Lett 22:914–924. doi: 10.1111/ele.13247. [DOI] [PubMed] [Google Scholar]
- 17.Gano-Cohen KA, Wendlandt CE, Al Moussawi K, Stokes PJ, Quides KW, Weisberg AJ, Chang JH, Sachs JL. 2020. Recurrent mutualism breakdown events in a legume rhizobia metapopulation. Proc Biol Sci 287:20192549. doi: 10.1098/rspb.2019.2549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Mendoza-Suárez MA, Geddes BA, Sánchez-Cañizares C, Ramírez-González RH, Kirchhelle C, Jorrin B, Poole PS. 2020. Optimizing Rhizobium-legume symbioses by simultaneous measurement of rhizobial competitiveness and N2 fixation in nodules. Proc Natl Acad Sci U S A 117:9822–9831. doi: 10.1073/pnas.1921225117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kirschner MW, Gerhart JC, Norton J. 2005. The plausibility of life. Yale University Press, New Haven, CT. [Google Scholar]
- 20.Lees JA, Vehkala M, Välimäki N, Harris SR, Chewapreecha C, Croucher NJ, Marttinen P, Davies MR, Steer AC, Tong SYC, Honkela A, Parkhill J, Bentley SD, Corander J. 2016. Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes. Nat Commun 7:12797. doi: 10.1038/ncomms12797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Moradigaravand D, Palm M, Farewell A, Mustonen V, Warringer J, Parts L. 2018. Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data. PLoS Comput Biol 14:e1006258. doi: 10.1371/journal.pcbi.1006258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rahman A, Hallgrímsdóttir I, Eisen M, Pachter L. 2018. Association mapping from sequencing reads using k-mers. Elife 7:e32920. doi: 10.7554/eLife.32920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Power RA, Parkhill J, de Oliveira T. 2017. Microbial genome-wide association studies: lessons from human GWAS. Nat Rev Genet 18:41–50. doi: 10.1038/nrg.2016.132. [DOI] [PubMed] [Google Scholar]
- 24.Epstein B, Abou-Shanab RAI, Shamseldin A, Taylor MR, Guhlin J, Burghardt LT, Nelson M, Sadowsky MJ, Tiffin P. 2018. Genome-wide association analyses in the model rhizobium Ensifer meliloti. mSphere 3:e00386-18. doi: 10.1128/mSphere.00386-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Burghardt LT, Epstein B, Guhlin J, Nelson MS, Taylor MR, Young ND, Sadowsky MJ, Tiffin P. 2018. Select and resequence reveals relative fitness of bacteria in symbiotic and free-living environments. Proc Natl Acad Sci U S A 115:2425–2430. doi: 10.1073/pnas.1714246115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Burghardt LT. 2020. Evolving together, evolving apart: measuring the fitness of rhizobial bacteria in and out of symbiosis with leguminous plants. New Phytol 228:28–34. doi: 10.1111/nph.16045. [DOI] [PubMed] [Google Scholar]
- 27.Boivin S, Ait Lahmidi N, Sherlock D, Bonhomme M, Dijon D, Heulin-Gotty K, Le-Queré A, Pervent M, Tauzin M, Carlsson G, Jensen E, Journet EP, Lopez-Bellido R, Seidenglanz M, Marinkovic J, Colella S, Brunel B, Young P, Lepetit M. 2020. Host-specific competitiveness to form nodules in Rhizobium leguminosarum symbiovar viciae. New Phytol 226:555–568. doi: 10.1111/nph.16392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Jones KM, Kobayashi H, Davies BW, Taga ME, Walker GC. 2007. How rhizobial symbionts invade plants: the Sinorhizobium-Medicago model. Nat Rev Microbiol 5:619–633. doi: 10.1038/nrmicro1705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.diCenzo GC, Zamani M, Checcucci A, Fondi M, Griffitts JS, Finan TM, Mengoni A. 2019. Multidisciplinary approaches for studying rhizobium-legume symbioses. Can J Microbiol 65:1–33. doi: 10.1139/cjm-2018-0377. [DOI] [PubMed] [Google Scholar]
- 30.Galardini M, Mengoni A, Brilli M, Pini F, Fioravanti A, Lucas S, Lapidus A, Cheng J-F, Goodwin L, Pitluck S, Land M, Hauser L, Woyke T, Mikhailova N, Ivanova N, Daligault H, Bruce D, Detter C, Tapia R, Han C, Teshima H, Mocali S, Bazzicalupo M, Biondi EG. 2011. Exploring the symbiotic pangenome of the nitrogen-fixing bacterium Sinorhizobium meliloti. BMC Genomics 12:235. doi: 10.1186/1471-2164-12-235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sugawara M, Epstein B, Badgley BD, Unno T, Xu L, Reese J, Gyaneshwar P, Denny R, Mudge J, Bharti AK, Farmer AD, May GD, Woodward JE, Médigue C, Vallenet D, Lajus A, Rouy Z, Martinez-Vaz B, Tiffin P, Young ND, Sadowsky MJ. 2013. Comparative genomics of the core and accessory genomes of 48 Sinorhizobium strains comprising five genospecies. Genome Biol 14:R17. doi: 10.1186/gb-2013-14-2-r17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Aun E, Brauer A, Kisand V, Tenson T, Remm M. 2018. A k-mer-based method for the identification of phenotype-associated genomic biomarkers and predicting phenotypes of sequenced bacteria. PLoS Comput Biol 14:e1006434. doi: 10.1371/journal.pcbi.1006434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Roux B, Rodde N, Jardinaud MF, Timmers T, Sauviac L, Cottret L, Carrere S, Sallet E, Courcelle E, Moreau S, Debelle F, Capela D, de Carvalho-Niebel F, Gouzy J, Bruand C, Gamas P. 2014. An integrated analysis of plant and bacterial gene expression in symbiotic root nodules using laser-capture microdissection coupled to RNA sequencing. Plant J 77:817–837. doi: 10.1111/tpj.12442. [DOI] [PubMed] [Google Scholar]
- 34.Friesen ML. 2012. Widespread fitness alignment in the legume-rhizobium symbiosis. New Phytol 194:1096–1111. doi: 10.1111/j.1469-8137.2012.04099.x. [DOI] [PubMed] [Google Scholar]
- 35.Westhoek A, Field E, Rehling F, Mulley G, Webb I, Poole PS, Turnbull LA. 2017. Policing the legume-Rhizobium symbiosis: a critical test of partner choice. Sci Rep 7:1419. doi: 10.1038/s41598-017-01634-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kiers ET, Ratcliff WC, Denison RF. 2013. Single-strain inoculation may create spurious correlations between legume fitness and rhizobial fitness. New Phytol 198:4–6. doi: 10.1111/nph.12015. [DOI] [PubMed] [Google Scholar]
- 37.Fagorzi C, Bacci G, Huang R, Cangioli L, Checcucci A, Fini M, Perrin E, Natali C, DiCenzo GC, Mengoni A. 2021. Nonadditive transcriptomic signatures of genotype-by-genotype interactions during the initiation of plant-rhizobium symbiosis. mSystems 6:e00974-20. doi: 10.1128/mSystems.00974-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Amarger N. 1981. Competition for nodule formation between effective and ineffective strains of Rhizobium meliloti. Soil Biol Biochem 13:475–480. doi: 10.1016/0038-0717(81)90037-7. [DOI] [Google Scholar]
- 39.Bourion V, Heulin-Gotty K, Aubert V, Tisseyre P, Chabert-Martinello M, Pervent M, Delaitre C, Vile D, Siol M, Duc G, Brunel B, Burstin J, Lepetit M. 2017. Co-inoculation of a pea core-collection with diverse rhizobial strains shows competitiveness for nodulation and efficiency of nitrogen fixation are distinct traits in the interaction. Front Plant Sci 8:2249. doi: 10.3389/fpls.2017.02249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Drouin A, Giguère S, Déraspe M, Marchand M, Tyers M, Loo VG, Bourgault AM, Laviolette F, Corbeil J. 2016. Predictive computational phenotyping and biomarker discovery using reference-free genome comparisons. BMC Genomics 17:754. doi: 10.1186/s12864-016-2889-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Davis JJ, Boisvert S, Brettin T, Kenyon RW, Mao C, Olson R, Overbeek R, Santerre J, Shukla M, Wattam AR, Will R, Xia F, Stevens R. 2016. Antimicrobial resistance prediction in PATRIC and RAST. Sci Rep 6:27930. doi: 10.1038/srep27930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Galibert F, Finan TM, Long SR, Puhler A, Abola P, Ampe F, Barloy-Hubler F, Barnett MJ, Becker A, Boistard P, Bothe G, Boutry M, Bowser L, Buhrmester J, Cadieu E, Capela D, Chain P, Cowie A, Davis RW, Dreano S, Federspiel NA, Fisher RF, Gloux S, Godrie T, Goffeau A, Golding B, Gouzy J, Gurjal M, Hernandez-Lucas I, Hong A, Huizar L, Hyman RW, Jones T, Kahn D, Kahn ML, Kalman S, Keating DH, Kiss E, Komp C, Lelaure V, Masuy D, Palm C, Peck MC, Pohl TM, Portetelle D, Purnelle B, Ramsperger U, Surzycki R, Thebault P, Vandenbol M, Vorholter FJ, Weidner S, Wells DH, Wong K, Yeh KC, Batut J. 2001. The composite genome of the legume symbiont Sinorhizobium meliloti. Science 293:668–672. doi: 10.1126/science.1060966. [DOI] [PubMed] [Google Scholar]
- 43.Galardini M, Pini F, Bazzicalupo M, Biondi EG, Mengoni A. 2013. Replicon-dependent bacterial genome evolution: the case of Sinorhizobium meliloti. Genome Biol Evol 5:542–558. doi: 10.1093/gbe/evt027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ney B, Ahmed FH, Carere CR, Biswas A, Warden AC, Morales SE, Pandey G, Watt SJ, Oakeshott JG, Taylor MC, Stott MB, Jackson CJ, Greening C. 2017. The methanogenic redox cofactor F420 is widely synthesized by aerobic soil bacteria. ISME J 11:125–137. doi: 10.1038/ismej.2016.100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Greening C, Ahmed FH, Mohamed AE, Lee BM, Pandey G, Warden AC, Scott C, Oakeshott JG, Taylor MC, Jackson CJ. 2016. Physiology, biochemistry, and applications of F420 - and Fo-dependent redox reactions. Microbiol Mol Biol Rev 80:451–493. doi: 10.1128/MMBR.00070-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Bashiri G, Antoney J, Jirgis ENM, Shah MV, Ney B, Copp J, Stuteley SM, Sreebhavan S, Palmer B, Middleditch M, Tokuriki N, Greening C, Scott C, Baker EN, Jackson CJ. 2019. A revised biosynthetic pathway for the cofactor F420 in prokaryotes. Nat Commun 10:1558. doi: 10.1038/s41467-019-09534-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Torres-Quesada O, Millán V, Nisa-Martínez R, Bardou F, Crespi M, Toro N, Jiménez-Zurdo JI. 2013. Independent activity of the homologous small regulatory RNAs AbcR1 and AbcR2 in the legume symbiont Sinorhizobium meliloti. PLoS One 8:e68147. doi: 10.1371/journal.pone.0068147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Mauchline TH, Fowler JE, East AK, Sartor AL, Zaheer R, Hosie AH, Poole PS, Finan TM. 2006. Mapping the Sinorhizobium meliloti 1021 solute-binding protein-dependent transportome. Proc Natl Acad Sci U S A 103:17933–17938. doi: 10.1073/pnas.0606673103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ramachandran VK, East AK, Karunakaran R, Downie JA, Poole PS. 2011. Adaptation of Rhizobium leguminosarum to pea, alfalfa and sugar beet rhizospheres investigated by comparative transcriptomics. Genome Biol 12:R106. doi: 10.1186/gb-2011-12-10-r106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Hosie AH, Allaway D, Galloway CS, Dunsby HA, Poole PS. 2002. Rhizobium leguminosarum has a second general amino acid permease with unusually broad substrate specificity and high similarity to branched-chain amino acid transporters (Bra/LIV) of the ABC family. J Bacteriol 184:4071–4080. doi: 10.1128/JB.184.15.4071-4080.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Prell J, Bourdes A, Kumar S, Lodwig E, Hosie A, Kinghorn S, White J, Poole P. 2010. Role of symbiotic auxotrophy in the Rhizobium-legume symbioses. PLoS One 5:e13933. doi: 10.1371/journal.pone.0013933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Pobigaylo N, Szymczak S, Nattkemper TW, Becker A. 2008. Identification of genes relevant to symbiosis and competitiveness in Sinorhizobium meliloti using signature-tagged mutants. Mol Plant Microbe Interact 21:219–231. doi: 10.1094/MPMI-21-2-0219. [DOI] [PubMed] [Google Scholar]
- 53.van Dillewijn P, Soto MJ, Villadas PJ, Toro N. 2001. Construction and environmental release of a Sinorhizobium meliloti strain genetically modified to be more competitive for alfalfa nodulation. Appl Environ Microbiol 67:3860–3865. doi: 10.1128/AEM.67.9.3860-3865.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.McIntosh M, Krol E, Becker A. 2008. Competitive and cooperative effects in quorum-sensing-regulated galactoglucan biosynthesis in Sinorhizobium meliloti. J Bacteriol 190:5308–5317. doi: 10.1128/JB.00063-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Jones KM, Lloret J, Daniele JR, Walker GC. 2007. The type IV secretion system of Sinorhizobium meliloti strain 1021 is required for conjugation but not for intracellular symbiosis. J Bacteriol 189:2133–2138. doi: 10.1128/JB.00116-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Nelson MS, Chun CL, Sadowsky MJ. 2017. Type IV effector proteins involved in the Medicago-Sinorhizobium symbiosis. Mol Plant Microbe Interact 30:28–34. doi: 10.1094/MPMI-10-16-0211-R. [DOI] [PubMed] [Google Scholar]
- 57.Nelson MS, Sadowsky MJ. 2015. Secretion systems and signal exchange between nitrogen-fixing rhizobia and legumes. Front Plant Sci 6:491. doi: 10.3389/fpls.2015.00491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Gourion B, Berrabah F, Ratet P, Stacey G. 2015. Rhizobium-legume symbioses: the crucial role of plant immunity. Trends Plant Sci 20:186–194. doi: 10.1016/j.tplants.2014.11.008. [DOI] [PubMed] [Google Scholar]
- 59.Hubber A, Vergunst AC, Sullivan JT, Hooykaas PJJ, Ronson CW. 2004. Symbiotic phenotypes and translocated effector proteins of the Mesorhizobium loti strain R7A VirB/D4 type IV secretion system. Mol Microbiol 54:561–574. doi: 10.1111/j.1365-2958.2004.04292.x. [DOI] [PubMed] [Google Scholar]
- 60.Jacob AI, Adham SAI, Capstick DS, Clark SRD, Spence T, Charles TC. 2008. Mutational analysis of the Sinorhizobium meliloti short-chain dehydrogenase/reductase family reveals substantial contribution to symbiosis and catabolic diversity. Mol Plant Microbe Interact 21:979–987. doi: 10.1094/MPMI-21-7-0979. [DOI] [PubMed] [Google Scholar]
- 61.Ding H, Yip CB, Geddes BA, Oresnik IJ, Hynes MF. 2012. Glycerol utilization by Rhizobium leguminosarum requires an ABC transporter and affects competition for nodulation. Microbiology (Reading) 158:1369–1378. doi: 10.1099/mic.0.057281-0. [DOI] [PubMed] [Google Scholar]
- 62.diCenzo GC, Tesi M, Pfau T, Mengoni A, Fondi M. 2020. Genome-scale metabolic reconstruction of the symbiosis between a leguminous plant and a nitrogen-fixing bacterium. Nat Commun 11:2574. doi: 10.1038/s41467-020-16484-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Checcucci A, diCenzo GC, Ghini V, Bazzicalupo M, Becker A, Decorosi F, Döhlemann J, Fagorzi C, Finan TM, Fondi M, Luchinat C, Turano P, Vignolini T, Viti C, Mengoni A. 2018. Creation and characterization of a genomically hybrid strain in the nitrogen-fixing symbiotic bacterium Sinorhizobium meliloti. ACS Synth Biol 7:2365–2378. doi: 10.1021/acssynbio.8b00158. [DOI] [PubMed] [Google Scholar]
- 64.Sambrook J, Fritsch EF, Maniatis T. 1989. Molecular cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. [Google Scholar]
- 65.Beringer JE. 1974. R factor transfer in Rhizobium leguminosarum. J Gen Microbiol 84:188–198. doi: 10.1099/00221287-84-1-188. [DOI] [PubMed] [Google Scholar]
- 66.Cheng HP, Walker GC. 1998. Succinoglycan is required for initiation and elongation of infection threads during nodulation of alfalfa by Rhizobium meliloti. J Bacteriol 180:5183–5191. doi: 10.1128/JB.180.19.5183-5191.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Smit P, Raedts J, Portyanko V, Debellé F, Gough C, Bisseling T, Geurts R. 2005. NSP1 of the GRAS protein family is essential for rhizobial Nod factor-induced transcription. Science 308:1789–1791. doi: 10.1126/science.1111025. [DOI] [PubMed] [Google Scholar]
- 68.Pini F, Spini G, Galardini M, Bazzicalupo M, Benedetti A, Chiancianesi M, Florio A, Lagomarsino A, Migliore M, Mocali S, Mengoni A. 2014. Molecular phylogeny of the nickel-resistance gene nreB and functional role in the nickel sensitive symbiotic nitrogen fixing bacterium Sinorhizobium meliloti. Plant Soil 377:189–201. doi: 10.1007/s11104-013-1979-3. [DOI] [Google Scholar]
- 69.Poole PS, Blyth A, Reid CJ, Walters K. 1994. myo-Inositol catabolism and catabolite regulation in Rhizobium leguminosarum bv. viciae Microbiology 140:2787–2795. doi: 10.1099/00221287-140-10-2787. [DOI] [Google Scholar]
- 70.Schneider CA, Rasband WS, Eliceiri KW. 2012. NIH Image to ImageJ: 25 years of image analysis. Nat Methods 9:671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.R Development Core Team. 2011. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
- 72.Hammer Ø, Harper DAT, Ryan PD. 2001. PAST: paleontological statistics software packages for education and data analysis. Palaeontol Electron 4:XIX–XX. [Google Scholar]
- 73.Kroll S, Agler MT, Kemen E. 2017. Genomic dissection of host–microbe and microbe–microbe interactions for advanced plant breeding. Curr Opin Plant Biol 36:71–78. doi: 10.1016/j.pbi.2017.01.004. [DOI] [PubMed] [Google Scholar]
- 74.Beilsmith K, Thoen MPM, Brachi B, Gloss AD, Khan MH, Bergelson J. 2019. Genome-wide association studies on the phyllosphere microbiome: embracing complexity in host–microbe interactions. Plant J 97:164–181. doi: 10.1111/tpj.14170. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.