Skip to main content
mSystems logoLink to mSystems
. 2026 Jan 16;11(2):e01605-25. doi: 10.1128/msystems.01605-25

Genomic signatures in Variovorax enabling colonization of the Populus endosphere

Delaney G Beals 1,#, Dana L Carper 1,#, Leah H Hochanadel 1, Sara S Jawdy 1, Dawn M Klingeman 1, Bryan T Piatkowski 1,2, David J Weston 1, Mitchel J Doktycz 1, Dale A Pelletier 1,
Editor: Ákos T Kovács3
PMCID: PMC12911387  PMID: 41543249

ABSTRACT

Microbial colonization of plant roots involves strong selective pressures that shape the structure and function of root-associated communities. In particular, the endosphere represents a highly selective environment requiring host entry and in planta persistence. However, strain-specific microbial traits that enable endosphere colonization remain poorly understood. Here, we use a defined, genome-resolved community of 28 Variovorax strains isolated from the roots of Populus deltoides and Populus trichocarpa (poplar trees) to determine which strains partition between rhizosphere and endosphere compartments and to identify the genomic traits associated with endosphere specialization. By combining strain-resolved metagenomic profiling, comparative genomics, and functional assays, we demonstrate that dominant endosphere colonizers are enriched in genes related to nutrient metabolism, redox balance, transcriptional regulation, and a conserved L-fucose utilization pathway experimentally shown to enhance root colonization. Not all strains succeed through the same strategy. Community-wide functional profiling revealed a distinct and reduced set of traits in the endosphere, including orthogroups associated with low-abundance strains that were overlooked in strain-level analyses. These findings reveal that multiple ecological strategies, such as metabolic competition, regulatory adaptation, and niche specialization, can support endosphere colonization. Our results advance the understanding of how bacterial colonization traits are distributed and deployed within a plant microbiome and suggest that host filtering selects for distinct, and sometimes complementary, microbial strategies. This work supports a shift toward mechanistic, genome-resolved models of microbiome assembly and offers a framework for linking microbial function to host colonization success.

IMPORTANCE

Plants often depend on diverse microbial partners to support their growth, resilience, and adaptation to changing environments. Among these microbes, some bacteria inhabit the rhizosphere (the narrow zone around roots where microbes interact with the plant) while others are able to enter and persist within root tissues. The traits that distinguish these two lifestyles remain poorly understood. In this study, we examined a group of related Variovorax strains from poplar tree root microbiomes to ask why some rhizosphere-associated strains also become successful endosphere colonizers. We found that strains appear to succeed through different strategies: some may benefit from rapid growth on plant-derived carbon sources, while others may rely on stress tolerance or fine-tuned regulation. These results suggest that there is no single path from the rhizosphere into the root interior, but rather multiple strategies shaped by the host environment. Understanding this diversity can inform efforts to design resilient plant-microbe communities.

KEYWORDS: endosphere colonization, strain-resolved metagenomics, comparative genomics, niche differentiation, synthetic community, functional trait enrichment

INTRODUCTION

Plant-associated microbial communities play a central role in plant growth, health, and environmental responsiveness (14). Among these communities, root-associated bacteria inhabiting the rhizosphere and endosphere are particularly important. The rhizosphere, a carbon-rich zone surrounding the plant root, is influenced by root exudates and supports a diverse and dynamic assemblage of microbes, with individual plants often hosting between 50 and 1,000 bacterial operational taxonomic units (OTUs) (5, 6). A subset of these microbes can further colonize internal root tissues as endophytes, forming close associations with the host that may confer additional benefits to the plant (7).

Compared to the rhizosphere, the endosphere harbors lower microbial diversity, suggesting strong ecological filtering by the host. Colonization of the endosphere represents an ecological selection bottleneck, requiring bacteria not only to reach the root interior but also to survive the selective pressures of the host environment, establish themselves amid competing microbes, and maintain a functional presence long enough to influence plant physiology (811). Endophytes that pass through these ecological filters are often associated with enhanced host growth, increased abiotic stress tolerance, and protection against pathogens (12, 13). Despite the importance of these benefits, we still lack the framework for predicting which rhizosphere-associated bacteria can successfully colonize the endosphere, and what traits enable them to do so.

Prior studies have identified a wide range of bacterial traits that facilitate plant root colonization, including chemotaxis, secretion systems, and metabolic versatility (12, 1416). More detailed studies in specific systems, such as tomato plant root tip colonization, have revealed that competitive colonization is shaped by a combination of metabolic capabilities, regulatory systems, cell surface structures, and secretion pathways (17, 18). While these findings highlight the multifactorial nature of bacterial adaptation to the root environment, most have been framed at a broad taxonomic scale, leaving open how specific genetic features influence niche preference within root-associated habitats. In particular, little is known about how these adaptations vary among closely related strains within the same genus or how that variation contributes to differential colonization of the rhizosphere versus the endosphere.

To address these gaps, we aimed to identify bacterial genomic signatures associated with endosphere colonization from the rhizosphere and test whether they can explain differences in ecological behavior among closely related strains. Developing such a framework is key for predicting microbiome assembly from genomic content and ultimately for rationally designing beneficial microbial consortia. We focused on Variovorax, a bacterial genus frequently identified as a core member of root microbiomes across diverse plant hosts, including Populus (19), Arabidopsis (20), and Brassica (21). Although Variovorax strains are often characterized as plant growth-promoting bacteria (2226), their relative preference to colonize internal root tissues varies substantially, suggesting functional differentiation within the genus. This makes Variovorax a useful system for dissecting the genomic and ecological bases of niche specialization.

In this study, we examine Variovorax bacterial strains previously isolated from the rhizosphere, root surface, and endosphere of Populus deltoides and Populus trichocarpa trees growing in natural environments. Because these strains originate from natural Populus root environments, they represent ecologically relevant variation in host-associated traits. We therefore hypothesized that endosphere isolates may retain functions that facilitate root colonization under controlled experimental conditions. Building on this hypothesis, we aimed to determine how individual plant-associated Variovorax lineages differ in their colonization patterns across root compartments and what genomic traits distinguish endosphere-competent strains.

To test this, we assembled a defined community (DefCom) of 28 fully sequenced Variovorax strains (19) and inoculated axenic Populus plants, tracking colonization outcomes with strain-resolved metagenomics. This design parallels the “select-and-resequence” framework developed for plant-associated bacterial systems (27, 28), adapted here to quantify strain-resolved colonization dynamics within the Populus root microbiome. This community-based approach enabled high-throughput assessment of strain performance under direct competition in a shared host environment, providing a scalable alternative to pairwise inoculation assays. By integrating comparative genomics with orthogroup-level functional profiling, we identified both conserved and strain-specific determinants of endosphere colonization and experimentally validated one such function, L-fucose metabolism. These findings highlight the complexity of microbial adaptation to plant-associated niches and demonstrate how genome content can be linked to ecological outcomes at the strain level.

RESULTS

Populus hosts are colonized by diverse Variovorax lineages

To determine which members of the Variovorax DefCom preferentially colonize Populus roots, we inoculated axenic P. deltoides and P. trichocarpa plants with a defined 28-strain community ( Table S1) and tracked colonization in the rhizosphere and endosphere after 3 weeks of growth (Fig. 1A). The strains represent a broad range of phylogenetic diversity within the genus, spanning multiple clades of the Variovorax tree (Fig. 1B), including several closely related groups and more distantly related lineages based on average nucleotide identity (Fig. S1). Nineteen of the 28 strains were originally isolated from Populus endospheres in natural environments, while the remainder were recovered from roots and rhizospheres. These source designations provide contextual information but are not interpreted as fixed ecological categories, as an isolation source alone cannot resolve the genetic basis of endosphere adaptation. Because many Variovorax genomes in the DefCom share high nucleotide identity, metagenomic reads were mapped to all DefCom reference genomes using unique mapping to prevent double-counting of reads that align equally well to multiple strains.

Fig 1.

Experimental workflow depicting 28 Variovorax strains colonizing Populus roots analyzed through metagenomic mapping. The phylogenetic tree illustrates evolutionary relationships between bacterial isolates with origins in different plant compartments.

(A) Experimental overview of DefCom inoculation and root colonization. A defined community of 28 Variovorax strains isolated from Populus roots was introduced into sterile Populus microcosms. Colonization in the rhizosphere and endosphere was assessed after 3 weeks by strain-resolved metagenomic mapping. (B) Maximum-likelihood phylogeny of Variovorax isolates based on 1,147 single-copy orthologs. Colored boxes indicate the Populus genotype and plant compartment from which each strain was isolated (19).

Endosphere colonization outcomes are dominated by a small subset of Variovorax strains

Despite equal starting abundances, endosphere colonization outcomes were uneven: five strains (BK151, YR750, BK613, CF313, and OV700) accounted for ~80% of total mapped reads in the endosphere compartment (Fig. 2A). To examine how strains partition between root compartments, we calculated the log₂ fold change in normalized abundance between the endosphere and rhizosphere for each strain in both P. deltoides and P. trichocarpa (Fig. 2B), representing their relative enrichment between compartments. Across hosts, BK151, BK370, BK613, and GV051 were significantly enriched in the endosphere. In P. deltoides, enrichment was the strongest for BK151, BK370, and BK613, whereas in P. trichocarpa, additional GV lineage strains (GV035, GV051, and GV004) were significantly enriched. Several other strains exhibited host-specific enrichment patterns that paralleled their isolation source (Fig. 1B), with P. deltoides isolates (CF313, PDC80, and YR634) tending to be more abundant in P. deltoides roots, while GV lineage strains (GV035, GV051, and GV004) were more enriched in P. trichocarpa, although these differences were not significant.

Fig 2.

Bacterial strain distribution data comparing Populus root compartments. Dominant strains show distinct preferences for either endosphere or rhizosphere with consistent colonization patterns across both tree species, despite varying abundance levels.

(A) Mean strain abundance (counts per million, CPM ± SD) in the endosphere (n = 7) and rhizosphere (n = 8) across Populus species. Strains are ordered by descending endosphere abundance; those to the left of the vertical dotted line together account for ~80% of all endosphere reads. (B) Differential abundance of strains in the endosphere versus rhizosphere of P. deltoides and P. trichocarpa. Bars show log₂ fold change (endosphere vs. rhizosphere) based on CPM-normalized data, representing relative enrichment between compartments; asterisks indicate significant enrichment after false discovery rate (FDR) correction (FDR < 0.05). Positive values denote endosphere enrichment, and negative values denote rhizosphere enrichment. (C) Relationship between strain abundance and compartment enrichment. Individual points represent strains; filled circles correspond to values from P. deltoides microcosms and open circles to P. trichocarpa.

Host-specific enrichment patterns suggest multiple colonization strategies

To assess how abundance and enrichment together describe endosphere colonization behavior across Populus hosts, we compared strain-level abundance against log₂ fold change between the endosphere and rhizosphere (Fig. 2C). This analysis revealed three general colonization strategies. Strong colonizers such as BK151, BK370, and BK613 combined moderate-to-high endosphere abundance with strong endosphere enrichment in both tree species. Niche specialists, including OV329 in P. trichocarpa, were endosphere-enriched despite moderate abundance, while generalists such as YR750 and CF313 maintained high abundance in both endosphere and rhizosphere compartments.

Together, these patterns identified six strains (CF313, OV700, YR750, BK370, BK151, and BK613) as the dominant endosphere-associated members of the DefCom community. However, neither isolation history nor phylogenetic relatedness predicted their success. Only three of these strains were originally recovered from Populus endospheres, and pairwise average nucleotide identity (ANI) comparisons showed that strong colonizers were distributed both within a highly similar (≥98.5%) ANI cluster and across more divergent clades (Fig. S1). Although formal species boundaries have not been established for Variovorax, these values indicate that endosphere competence varies within and between putative species and reflects specific functional traits rather than taxonomic origin.

To resolve these traits, we carried out a comparative genomic analysis using ANI-defined lineages. We applied a ≥97% ANI threshold to group all 28 DefCom strains into lineages (Fig. S1), preventing near-identical genomes from being treated as independent observations. These lineage assignments established the evolutionary units used to identify genomic features associated with endosphere colonization.

Comparative genomics reveals common traits among dominant endosphere colonizers

Using these lineage definitions, we next compared the KEGG Ortholog (KO) profiles across endosphere- and non-endosphere-associated lineages. A KO was considered enriched if it occurred in at least 2 endosphere lineages and in no more than 4 of 11 non-endosphere lineages, resulting in 131 endosphere-enriched KOs (Fig. 3A; Table S2). The top enriched KOs (Fig. 3B) highlight a set of functions shared across endosphere-associated lineages. Several of the strongest signals were linked to host-derived sugar metabolism, including L-rhamnose mutarotase (K03534) and the L-fucose degradation enzymes K18333 and K18334, which indicate a conserved capacity for utilizing deoxyhexoses abundant in plant tissues. A coordinated set of Type III secretion system proteins (K03219 and K03222 through K03230) was also enriched, consistent with a role for secretion and interaction machinery in endosphere colonization.

Fig 3.

Integrated analysis of Variovorax strains shows KEGG ortholog distribution across lineages. Data reveal endosphere-enriched genes and confirm L-fucose pathway importance for Populus root colonization.

(A) Heatmap of the number of genes annotated to each KO across DefCom Variovorax strains. Columns show KOs detected in at least one genome; rows show strains used in this study. Color intensity indicates the number of unique genes annotated with each KO per strain (white = absent, dark blue = higher counts); capped at five genes for visualization. Letter labels on the left denote Variovorax shared lineages (≥97% ANI); unlabeled strains represent unique lineages. (B) Top 20 KOs enriched in the Populus endosphere based on lineage-level frequency. Endosphere-associated lineages included lineage B (BK151, BK370, and BK613), CF313, OV700, and YR750; all remaining lineages were treated as non-endosphere. KO frequencies were calculated across all ANI-defined lineages, and enrichment was defined as presence in ≥2 endosphere lineages and in ≤4 of 11 non-endosphere lineages. (C) Growth of Variovorax sp. CF313 wild-type (WT) and L-fucose pathway mutant (CF313Δfuc) over 72 h in 3-(N-morpholino)propanesulfonic acid (MOPS) minimal medium supplemented with 0.1% glucose or 0.1% L-fucose. (D) Root colonization by wild type and CF313Δfuc. Box-and-whisker plots show total CFUs recovered from Populus roots. The fucose pathway mutant exhibited significantly reduced colonization (P = 0.0022).

Across all 131 enriched KOs, several broad functional themes emerged. Endosphere-associated lineages were enriched in carbohydrate and deoxyhexose utilization; high-affinity nutrient transport for nitrogen, sulfur, and metals; and detoxification pathways such as oxidative stress defenses and metal efflux. Secretion and export systems, including Type III, IV, and VI secretion machinery and ATP-binding cassette (ABC) transporters, were similarly overrepresented. Numerous AraC, LysR, and DeoR regulators and several two-component systems indicated expanded regulatory capacity. Together, these signatures reveal a strategy that integrates metabolic specialization, nutrient acquisition, environmental sensing, and interaction functions that differentiate successful endosphere-colonizing Variovorax lineages.

L-fucose utilization pathway enables endosphere colonization

Several endosphere-enriched KOs belonged to the L-fucose utilization pathway, including L-fucose dehydrogenase (K18333) and L-fuconate dehydratase (K18334) (Fig. 3B). To test whether this pathway contributes to endosphere colonization by strain CF313, we generated a deletion of the L-fuconolactonase gene (PMI12_01353). L-fuconolactonase was selected because it is encoded at the start of the fucose utilization locus in CF313 and initiates the sequence of reactions used by Variovorax to process fucose (Fig. S2). The resulting CF313Δfuc strain failed to grow on L-fucose as a sole carbon source, while the wild type showed robust growth under the same conditions (Fig. 3C). When inoculated into sterile P. deltoides microcosms, the mutant exhibited significantly reduced colonization compared to wild-type CF313, with a statistically significant decrease in CFUs recovered from root tissue relative to the inoculum (P = 0.0022) (Fig. 3D).

Several non-endosphere strains also encoded the complete L-fucose utilization pathway (Fig. S2), raising the question of whether pathway presence alone is predictive of endosphere enrichment. To explore this, we tested all 28 Variovorax strains in MOPS minimal medium supplemented with 0.1% L-fucose as the sole carbon source. Three strains, CF313, OV700, and YR750, were able to grow under these conditions, with OD₆₀₀ increasing at least 3-fold over 72 h (Fig. S3). Other strains, which included those with intact fucose pathways, such as BK151, BK613, BK370, and YR634, showed little to no growth. These findings highlight that gene presence does not necessarily imply functional activity and suggest that preferential colonization of the endosphere may depend not only on specific metabolic capabilities but also on how effectively those functions are deployed under host-associated conditions.

Orthogroup-based profiling reveals reduced and distinct functional capacity in the endosphere

While strain-based analyses can identify dominant endosphere colonizing strains, they may overlook functions encoded by low-abundance strains that nonetheless possess traits relevant to endosphere adaptation. We hypothesized that such traits may be present in additional Variovorax strains that were not classified as enriched at the strain level and that examining gene content across the community could help reveal them. To test this, we inferred orthogroups (sets of homologous genes grouped by shared evolutionary origin) from amino acid sequences of all 28 Variovorax strains. Metagenomic reads from DefCom-inoculated root compartments were then mapped to each genome and aggregated at the orthogroup level. This allowed us to quantify their relative abundances as counts per million (CPM) and detect orthogroups associated with endosphere enrichment, independent of whether an individual Variovorax strain was considered dominant.

To assess the extent to which the Populus endosphere selects for a distinct set of microbial traits, we analyzed alpha and beta diversity of orthogroup abundance profiles (expressed as counts per million, CPM) across compartments and host species. The endosphere exhibited significantly reduced richness and evenness in orthogroup profiles compared to the rhizosphere (ANOVA and Wilcoxon tests; P < 0.05) (Fig. 4A), consistent with selective filtering for a narrower set of traits. Non-metric multidimensional scaling (NMDS) ordination of Bray–Curtis dissimilarities revealed clear clustering by root compartment and host species, with both factors contributing significantly to variation in orthogroup composition (PERMANOVA; P < 0.05) (Fig. 4B). Compartment explained 21% of the variance, and host species explained 14%. These results indicate that specific orthogroups differ in abundance between compartments, suggesting that the endosphere imposes selective constraints on community functional potential.

Fig 4.

Visualization of Variovorax orthogroup diversity across Populus root compartments. Boxplots show significant differences between endosphere and rhizosphere, while NMDS reveals samples cluster by compartment type rather than tree species.

(A) Alpha diversity of orthogroup profiles across tree types and compartments. Boxplots show orthogroup richness and Shannon diversity for each tree species (left) and compartment (right). Significance was assessed using ANOVA and Wilcoxon tests; asterisks indicate significant compartment differences (***, **), and “n.s.” denotes non-significant tree species comparisons. (B) NMDS ordination of Variovorax orthogroup functional profiles based on Bray–Curtis dissimilarities of metagenomic CPM values. Points represent root-associated samples colored by compartment (endosphere vs. rhizosphere) and shaped by tree species (P. deltoides vs. P. trichocarpa).

Differential abundance analysis identifies conserved endosphere-enriched orthogroups

To identify the gene groups consistently associated with endosphere enrichment, we first aggregated all detected orthogroups across the community, regardless of their strain of origin, and compared their normalized read abundances between endosphere and rhizosphere metagenomes. Statistical enrichment of orthogroups in the endosphere was then assessed using multiple complementary approaches, including DESeq2 (using raw counts), limma-voom (with per-copy normalization), and metagenomeSeq (zero-inflated modeling with CSS normalization). This conservative strategy yielded 249 orthogroups consistently enriched in the endosphere across all three methods (Fig. S1, Table S3). These orthogroups included genes annotated as ABC transporters, superoxide dismutases, and transcriptional regulators (LysR, AraC, and TetR families). Functional annotation of these orthogroups using Clusters of Orthologous Group (COG) categories showed that transcription, amino acid transport and metabolism, and functions of unknown category contained many endosphere-enriched orthogroups (Fig. 5A). Additional enriched categories included signal transduction and inorganic ion transport and metabolism (Fig. S5).

Fig 5.

Volcano plots reveal orthogroup enrichment patterns between plant endosphere and rhizosphere compartments. A heatmap shows nine KEGG Orthologs present in endosphere-associated Variovorax bacterial lineages identified through multiple statistical methods.

(A) Differential representation of orthogroups across COG functional categories in endosphere and rhizosphere compartments. Volcano plots show log₂ fold change (endosphere vs. rhizosphere) versus adjusted P-value from limma-voom analyses of raw orthogroup counts. Orthogroups enriched in the endosphere (log₂FC > 0, FDR < 0.05) are shown in green, those enriched in the rhizosphere (log₂FC < 0, FDR < 0.05) in yellow, and non-significant orthogroups in gray. Vertical dashed lines indicate log₂FC thresholds of ±1, and the horizontal line, the FDR threshold of 0.05. The 10 most enriched orthogroups in each COG category in order by log₂FC are listed. (B) Comparison of KOs enriched across analysis frameworks. The heatmap displays the presence of nine KOs identified as significantly enriched in the endosphere by three statistical methods (DESeq2, limma-voom, and metagenomeSeq) and that were more frequent in ANI-defined endosphere-associated lineages. Rows represent Variovorax strains (endosphere-enriched strains in bold), and columns are grouped by ANI-defined lineages indicated by brackets above the heatmap. Strains that lacked all nine KOs are excluded.

To place these orthogroup patterns in a broader functional context, we focused on KOs that were enriched both in the orthogroup-level metagenomic analysis and the earlier lineage-level KO frequency analysis of endosphere-enriched functions (Fig. 5B). This intersection yielded a compact set of nine functions spanning DNA replication (K06919), transcriptional regulation (K07726), iron uptake (K07224 and K16301), stress response and redox metabolism (K08351), carbohydrate utilization (K09992), nitrogen metabolism (K10536), and two conserved domains of unknown function (K09712 and K09980). These functions were consistently found in the dominant endosphere lineage B (BK151, BK370, and BK613) and in YR750, with sporadic representation in a subset of non-endosphere strains. The strongest lineage specificity was observed for K09712 (DUF333), which was restricted to lineage B and YR750 and never detected in strains lacking endosphere enrichment. In contrast, iron-uptake (K07224 and K16301), stress-response (K08351), and carbohydrate-utilization functions (K09992) were enriched in endosphere-associated lineages but were not exclusive to them. Collectively, this set of nine KOs points to a recurrent combination of regulatory, nutrient-acquisition, redox, and carbohydrate-metabolism functions that distinguishes successful endosphere colonizers from the broader Variovorax pool.

Endosphere-enriched orthogroups suggest divergent colonization strategies when bacteria are in different Populus hosts

Endosphere-enriched orthogroups showed host-specific patterns, suggesting divergent colonization strategies between P. deltoides and P. trichocarpa (Fig. 6). Differential abundance analyses were performed separately for each host using limma-voom, comparing endosphere and rhizosphere metagenomes within each species. Orthogroups significantly enriched in each host were then compared to identify functions unique to or shared between hosts. In P. deltoides, enriched functions were biased toward host interaction and membrane remodeling, such as type V secretion system substrates, outer membrane autotransporters, and patatin-like phospholipases, as well as redox-related enzymes like multicopper oxidases and NADPH:quinone reductases. These suggest a colonization strategy shaped by direct engagement with host surfaces and oxidative stress mitigation. In contrast, P. trichocarpa favored specialized metabolic functions, including aromatic compound degradation, nitrogen assimilation, and sulfur metabolism. A set of 39 orthogroups was consistently enriched in both hosts, largely comprising core functions such as central carbon metabolism (e.g., glutamate dehydrogenase and succinyl-CoA synthetase), transporters, proteases, and diverse transcriptional regulators (Table S4). Together, these patterns suggest that while both hosts select for a common set of metabolic and regulatory functions, their distinct enrichment signatures reflect different physiological filters shaping endosphere colonization.

Fig 6.

Comparative visualization of microbial orthogroup enrichment between endosphere and rhizosphere across Populus species. UpSet plot and Venn diagram show distribution patterns. Bar chart reveals functional microbiome differentiation in root compartments.

(A) UpSet plot showing significantly enriched orthogroups identified by limma-voom analysis of endosphere versus rhizosphere metagenomes for each Populus host species. Analyses were conducted separately for P. deltoides and P. trichocarpa to identify orthogroups enriched within each host; shared enrichments represent functions independently significant in both. Inset: Venn diagram of orthogroups detected (presence/absence) in P. deltoides (P. delt) and P. trichocarpa (P. trich) endosphere (endo) and rhizosphere (rhizo) metagenomes. The inset depicts all orthogroups found in each tree × compartment combination, including those identified as not significantly enriched. (B) Top significant orthogroups enriched in the endosphere (green) or rhizosphere (yellow) of P. deltoides and P. trichocarpa based on log2 fold-change estimates from limma-voom.

DISCUSSION

Variovorax is consistently enriched in the Populus rhizosphere; however, only a subset of strains succeeds in colonizing the endosphere. Because all strains in our experiment were originally isolated from Populus roots or rhizospheres and introduced at equal starting abundance, their relative enrichment in the endosphere reflects compartmental specialization among naturally Populus-associated strains rather than absolute colonization potential. Endosphere colonization is a multistep ecological challenge requiring microbial access to the root, competition within the rhizosphere, survival of the chemically selective internal environment, and the capacity to engage the host in functionally meaningful ways (29, 30). Our strain-resolved metagenomic analyses confirm that endophytes represent a selected subset of the broader rhizosphere community, and by resolving strain-level contributions, our analysis reveals fine-scale functional variation that is often masked in OTU-based or bulk metagenomic data. This approach shows that ecological filtering leaves reproducible genomic signatures that can illuminate the traits necessary for successful endosphere colonization within Populus.

Endosphere colonization outcomes among Variovorax strains are shaped less by genomic potential than by how those functions are regulated and expressed within the host. In our defined community, experimental results showed that related strains exhibited different patterns of enrichment between root compartments. Comparative phylogenetic and ANI analyses reinforce this observation, showing that dominant endosphere colonizers occur across both closely related and more divergent lineages. The presence of successful colonizers throughout the Variovorax tree indicates that endosphere enrichment can occur both within closely related clades and independently across more diverse lineages. This pattern suggests that endosphere competence is further shaped by regulatory and ecological factors that extend beyond phylogenetic boundaries. Supporting this distinction, the conservation of canonical plant-beneficial genes such as ACC deaminase and IAA degradation (31) across all Variovorax genomes underscores that these broadly distributed traits are not sufficient to explain preferential colonization of the endosphere. Instead, the orthogroups enriched in the endosphere were dominated by transcriptional regulators and other regulatory functions, suggesting that dynamic control of gene expression, rather than static gene inventories, is central to colonization of the endosphere.

Our results reveal two ecological strategies for endosphere colonization: metabolic competitiveness and persistence through regulatory adaptation. Strains that reached high abundance and grew robustly on fucose exemplify rapid resource capture, underscoring competition for host-derived carbon as a major driver of success, a pattern consistent with synthetic community studies of Populus rhizosphere bacteria (32, 33). In contrast, strains that did not grow on fucose despite possessing the pathway were instead enriched in functions drawn from the consensus set of nine endosphere-associated KOs, including iron-uptake systems, redox and stress-response enzymes, nitrogen and trehalose metabolism, and regulatory proteins. These traits reflect a strategy centered on maintaining cellular homeostasis and adjusting gene expression within the chemically selective host environment (3437). Flexible colonizers such as YR750 encoded both fucose utilization and the regulatory traits found in persistence-oriented strains, suggesting the capacity to use multiple strategies depending on environmental conditions. Taken together, these outcomes show that endosphere persistence can arise through distinct ecological routes, either dominance via rapid substrate use or endurance through regulatory control, and that success is not determined by genomic content alone but by how strains deploy their traits in the host environment.

Host species further shaped which traits were favored, indicating that Populus species impose distinct selective filters on their microbial partners. P. deltoides endophytes were biased toward redox-active enzymes and membrane remodeling functions, while P. trichocarpa endophytes favored transcriptional regulators and amino acid metabolism. These distinctions underscore that genomic signatures of colonization are not uniform across hosts but are contingent on plant context. More broadly, differences in the traits favored by Populus species align with emerging evidence that endosphere colonization in long-lived woody hosts may operate through mechanisms distinct from those described in annual crops and model plants, where colonization is often framed through pathogen-associated secretion and invasion systems (3840). While not directly tested here, our results may help clarify this distinction by revealing regulatory and metabolic traits linked to stable associations in tree endophytes.

Beyond host genotype, the ecological context in which colonization occurs also shapes microbial success. Fucose, a major component of plant cell-wall polysaccharides and a prominent sugar in root mucilage at the growing root tip (4143), may become available to microbes through root cell turnover or the release of soluble carbohydrates in root exudates, which contain sugars and other primary metabolites (44). Although we did not quantify fucose concentrations, host-specific differences in carbohydrate availability and utilization could help explain variation in colonization patterns across Populus species. In natural soils, these resource-driven interactions occur within the broader microbiome, where metabolic competition, antagonism, and cooperation among community members further influence which strains gain access to the endosphere. Together, these factors highlight that colonization outcomes are emergent properties of both host filtering and community ecology and that predictive models of root microbiome assembly must integrate metabolic, regulatory, and ecological dimensions.

Our results demonstrate that Variovorax colonization of the Populus endosphere is governed by strategies of metabolic competitiveness, regulatory adaptation, or a combination of both and that host species further influences which traits are successful. While stochastic influences cannot be completely excluded, the reproducible enrichment of specific traits across replicates and host species indicates that selection is the main force shaping colonization outcomes. Enriched genes highlight ecological differentiation; however, gene content alone does not fully explain colonization outcomes, emphasizing the importance of regulatory control and host context. Future work integrating genomic profiles with dynamic transcriptomic and proteomic measurements, coupled with predictive modeling, will be essential to determine how colonization strategies are deployed under native conditions. Ultimately, this genome-resolved perspective advances mechanistic understanding of niche-specific root microbiome assembly and informs efforts to design microbial consortia that promote plant health and resilience across variable environments.

MATERIALS AND METHODS

Genome collection and phylogenetic analysis

Genomes of 28 Variovorax strains previously isolated (19) from the rhizosphere and endosphere of P. deltoides and P. trichocarpa were obtained from the Integrated Microbial Genomes & Microbiomes (IMG/M) database (45) (Table S1). The two- or three-letter prefix in each strain name corresponds to the original isolation site of that strain. Single copy orthologs (N = 1,447) were identified using Broccoli v1.2 (46), and coding sequences were aligned using translator v1.1-2 (47) and MAFFT v7.508 (48). Alignments were trimmed using ClipKIT v1.3.0 (49) to retain sites with less than 30% gaps and then concatenated. Phylogenetic reconstruction was performed using IQ-TREE2 v2.2.0.3 (50) using ModelFinder (51) to perform model selection and determine the best partitioning scheme. Tree topology support was evaluated using the ultrafast bootstrap approximation with 1,000 bootstrap replicates (52). Pairwise ANI was calculated using the fastANI (53) module implemented in KBase (54) to assess species-level relatedness among the 28 genomes.

Bacterial defined community (DefCom) assembly and preparation

All bacterial strains were streaked from frozen glycerol stocks to rich medium plates (Reasoner’s 2A [R2A], Franklin Lakes, BD Difco, NJ, USA) (55). A single colony from each plate was used to inoculate 5 mL of R2A and incubated overnight at 30°C and 200 rpm by continually shaking. Once turbid, bacterial suspensions were pelleted, washed with sterile water to remove remaining media, and then diluted with sterile water to an OD600 = 0.01. Equal volumes of each OD-normalized strain culture were combined into a single DefCom inoculum, and 10 mL of the DefCom inoculum was used to inoculate soil for plant experiments.

Plant inoculation and growth

Axenic P. deltoides (genotype WV94) and P. trichocarpa (genotype BESC-819) shoots were generated from surface-sterilized tissue-culture cuttings and maintained under sterile conditions following a standard procedure (56). Rooted shoots were transplanted into sterile polycarbonate vessels (3 × 3 × 4″) containing 120 cm³ of inert clay (Pro’s Choice Rapid Dry, Alpharetta, GA) supplemented with 80 mL of 1× Modified Hoagland’s solution. Twelve DefCom-inoculated and 12 uninoculated control replicates were prepared for each tree species. Immediately prior to transplanting, 10 mL of DefCom inoculum was added to the clay and mixed thoroughly for 30 s. Microcosms were sealed and maintained under controlled growth chamber conditions (16 h photoperiod, 21°C/17°C day/night, 60% RH) for 3 weeks. Uninoculated controls remained sterile, confirmed by plating rinse fractions on R2A medium.

Root colonization

After harvest, roots were gently washed twice by submerging in sterile deionized water (20 mL each) to remove adhering clay; wash fractions were collected for rhizosphere analysis. Roots were then placed in 15 mL sterile PBS and sonicated for 30 s at 50–60 Hz. The buffer was collected as the rhizoplane fraction. This sonication step was repeated twice with fresh PBS, after which the washed roots were designated as the endosphere fraction and used for downstream enrichment. Following the final sonication, roots were designated as the endosphere fraction and prepared for downstream enrichment using a protocol adapted from Pelletier and coworkers (57).

An ethanol- and UV-sterilized (15 min) grinder (KSM2; Braun, Kronberg, Germany) was used to disrupt and homogenize the endosphere roots in 20 mL sterile Milli-Q. The homogenate was poured through sterile miracloth and collected in a 50-mL conical tube. The endosphere homogenates were centrifuged at 500 × g for 5 min at 10°C (Spinchron R; Beckman Coulter, Brea, CA). The supernatants were transferred to new conical tubes and centrifuged at 5,500 × g for 20 min at 10°C (Sorvall Evolution RC; Thermo Scientific, Carlsbad, CA). The supernatants were discarded, and the pellet was resuspended in 20 mL of bacterial cell extraction (BCE) buffer (50 mM Tris-HCl [pH 7.5], 1% Triton X-100, 2 mM 2-mercaptoethanol). The suspension was filtered through sterile miracloth and transferred to a sterile 50-mL Oak Ridge tube (Nalgene, Rochester, NY). The suspensions were centrifuged at 10,000 × g for 10 min at 10°C, and the supernatants were discarded. The pellet was resuspended in 20 mL BCE buffer, filtered again through sterile miracloth, and centrifuged at 10,000 × g for 10 min at 10°C. The supernatant was discarded, and the pellet was resuspended in 1 mL of 50 mM Tris-HCl (pH 7.5). The suspension was centrifuged at 10,000 × g for 3 min, the supernatant removed, and the pellet resuspended in 1 mL of 50 mM Tris-HCl (pH 7.5). The final pellet was stored at –20°C for DNA extraction.

DNA extraction and metagenomic sequencing

DNA from the four endosphere and rhizosphere samples was extracted using the DNeasy Plant Pro Kit (Qiagen, Venlo, the Netherlands) according to the manufacturer’s instructions for each DefCom/Populus species. Library preparation was performed using the Nextera XT DNA Library Preparation Kit (Illumina), following the manufacturer’s protocol. Prepared libraries were quantified using the Qubit dsDNA BR Assay Kit (Invitrogen, Carlsbad, CA, USA), and quality was assessed on an Agilent 2100 Bioanalyzer using a DNA 7500 Chip Kit (Agilent Technologies, Santa Clara, CA, USA). Libraries were pooled at equal molar concentrations and paired-end sequenced (2 × 251 bp) on an Illumina MiSeq instrument with v2 chemistry.

Strain abundance and differential enrichment analysis

Reads were demultiplexed on the MiSeq and processed using Atropos v1.1.28 (58) for adapter trimming and quality filtering (q = 15, length < 150). To exclude host-derived DNA sequences from the differential enrichment analysis, filtered reads were aligned to the appropriate Populus genome, either P. trichocarpa (Phytozome genome ID: 444, NCBI taxonomy ID: 3694) (59) or P. deltoides (Phytozome genome ID: 445; NCBI taxonomy ID: 3696) (60). Reads that were not mapped onto the Populus genome were imported into R (v4.0.2) and were mapped to all DefCom Variovorax genomes using Rsubread v2.2.4 (61), allowing up to 16 equally best mapping locations (nBestLocations = 16). To minimize double-counting among closely related strains (>98% ANI), features were counted using unique-mapping mode (countMultiMappingReads = FALSE), which excludes reads that map equally well to multiple reference genomes. For comparison, a separate analysis using fractional assignment of multi-mapped reads (countMultiMappingReads = TRUE, fraction = TRUE) was performed to evaluate the robustness of enrichment patterns. Overall enrichment patterns were consistent across mapping strategies, and any inconsistencies were limited to strains sharing ≥99% nucleotide identity. All subsequent analyses and figures use the unique-mapping counts.

Read counts were normalized using TMM implemented in edgeR v3.30.3 (62). Voom (63) was used to transform count data to log2-counts per million (logCPM), estimate the mean-variance relationship, and use this to compute appropriate observational-level weights. Contrasts of plant species, plant compartment, and their interaction were carried out using limma v3.44.3 (64) with Benjamini-Hochberg correction for multiple testing. Log₂ fold changes (endosphere vs. rhizosphere) were used to quantify differential compartment association rather than absolute colonization efficiency, since all strains were inoculated at equal initial abundance.

Functional enrichment analysis (KO/COG)

Functional annotation of all 28 Variovorax genomes was performed using EggNOG-mapper v2 (65, 66) to assign KOs, COGs, and additional functional categories. Strains were first grouped into ANI-defined lineages using a ≥97% ANI threshold, and lineages containing at least one dominant endosphere colonizer were classified as endosphere-associated lineages. For each KO, presence within a lineage was scored as 1 if the KO occurred in any genome in that lineage and 0 otherwise, and KO frequency was calculated as the proportion of lineages in each group (endosphere or non-endosphere) that contained the annotation. A KO was considered endosphere-enriched if it occurred in at least 2 endosphere-associated lineages and in no more than 4 of 11 non-endosphere lineages. This lineage-based framework standardized functional comparisons across genomes and enabled subsequent enrichment analyses and visualization of trait distributions at both the lineage and orthogroup levels.

Mutant construction and validation

Flanking regions of approximately 500 bp were amplified on either side of the L-fuconolactonase gene (PMI12_01353) using primers with homologous overlaps. All primers for cloning were designed using the NEBuilder design tool (New England Biolabs (NEB), Ipswitch, MA), and the oligos were synthesized by Integrated DNA Technologies (Integrated DNA Technologies (IDT), Coralville, IA) or Eurofins Genomics (Eurofins Genomics, Louisville, KY). Gibson Assembly Master Mix (NEB) was then used according to the manufacturer’s protocol to clone the amplified fragments into pMO130 suicide vector backbone (Addgene plasmid #27388; http://n2t.net/addgene: 27388; RRID:Addgene_27388), which carries kanamycin resistance, and the sacB gene, which confers sucrose sensitivity (67). The assembled plasmid was transformed into NEB 5-alpha (NEB) Escherichia coli cells by heat shock. The plasmid construct was then purified using a miniprep kit (Qiagen, Valencia, CA) and transformed by electroporation into the conjugative E. coli strain WM3064 (68). Mating with Variovorax strain CF313 was accomplished by mixing washed donor and recipient cells at a 1:10 ratio, concentrating by centrifugation, and depositing onto a sterile filter on an R2A plate. These were allowed to mate overnight at 30°C. Cells were subsequently washed from the filter and plated on R2A medium supplemented with 50 μg/mL kanamycin sulfate to select for plasmid-integrated CF313. One resulting kanamycin-resistant colony was grown overnight in liquid R2A at 30°C with 200 rpm shaking without selection to allow for final chromosomal integration to occur. An overnight culture (10 μL) was streaked to single colonies on R2A medium with 10% sucrose for counterselection. Several resulting colonies were then patched onto R2A media and screened using PCR.

Growth curves

Overnight R2A cultures were pelleted, washed, and resuspended in filter-sterilized 1× MOPS minimal medium supplemented with either 0.1% glucose or 0.1% L-fucose. In 96-well plates, 150 μL of medium and 2 μL of inoculum (adjusted to an initial OD₆₀₀ = 0.01) were combined. Parallel cultures were also grown in R2A medium on the same plate for comparison. OD₆₀₀ measurements were recorded every 10 min for 72 h at 30°C with orbital shaking using a microplate spectrophotometer. Each condition was assayed in triplicate technical replicates.

Plant colonization by wild-type and mutant strains

Bacterial inoculants were prepared using overnight cultures of CF313 and CF313Δfuc diluted in sterile water to an OD₆₀₀ of 0.01. Individual strains or combined strains (10 mL) were thoroughly mixed into sterile inert clay supplemented with 1× Hoagland’s solution in a microcosm. Newly rooted axenic tissue culture plants of P. deltoides (genotype WV94) were individually established in each microcosm and maintained in a controlled-condition growth chamber (16 h photoperiod, 21°C) for 3 weeks.

Plants were gently removed from the boxes in a sterile hood, and roots were dipped several times in sterile water to remove clay and then patted dry with sterile paper towels. After recording root weights, roots were sonicated for 3 min in 15 mL of 0.1× PBS buffer. The buffer was decanted, and the wash was repeated three times with fresh buffer. Washed roots were resuspended in 10 mL 0.1× PBS and homogenized using a Geno/Grinder (SPEX Sample Prep). Homogenates were briefly centrifuged to pellet large debris, and a 10-fold serial dilution was prepared in 0.1× PBS up to 10⁻⁷. For each dilution, 10 μL was spotted in triplicate onto MOPS minimal medium plates supplemented with either 0.1% glucose or 0.1% L-fucose, and plates were allowed to dry in a sterile hood. Colonies were counted after visible growth, and CFUs per gram of root tissue were calculated based on the dilution factor and plated volumes.

Orthogroup analysis

OrthoFinder (v3.0.1b1) (69) was used to infer orthogroups from amino acid sequences of all 28 Variovorax genomes. Functional annotation of all coding sequences was performed with EggNOG-mapper v2 (65, 66), providing KOs, COGs, and additional functional assignments, which were then associated with their respective orthogroups. Metagenomic reads were mapped to the orthogroups, and counts were normalized to counts per million (CPM) to quantify functional abundance. Orthogroup alpha and beta diversity (Bray–Curtis) was visualized using NMDS (vegan package in R). Enriched orthogroups were identified using DESeq2 (70), limma-voom (63), and metagenomeSeq (71), and significance required agreement across all three methods (FDR < 0.05).

ACKNOWLEDGMENTS

This research was sponsored by the Genomic Science Program, U.S. Department of Energy, Office of Science, Biological and Environmental Research, as part of the Plant Microbe Interfaces Science Focus Area (https://pmiweb.ornl.gov/). Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U.S. Department of Energy under contract DE-AC05-00OR22725.

We thank the anonymous reviewers for their insightful comments that greatly improved the manuscript.

Contributor Information

Dale A. Pelletier, Email: pelletierda@ornl.gov.

Ákos T. Kovács, Universiteit Leiden, Leiden, Netherlands

DATA AVAILABILITY

Raw metagenomic sequencing reads have been deposited in the NCBI Sequence Read Archive under BioProject accession number PRJNA1322484. The R code for preprocessing, analysis, and visualization is available at doi: https://doi.org/10.5281/zenodo.17466836.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/msystems.01605-25.

Supplemental figures. msystems.01605-25-s0001.docx.

Figures S1 to S5.

DOI: 10.1128/msystems.01605-25.SuF1
Supplemental tables. msystems.01605-25-s0002.xlsx.

Tables S1 to S4.

DOI: 10.1128/msystems.01605-25.SuF2

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. Berendsen RL, Pieterse CMJ, Bakker PAHM. 2012. The rhizosphere microbiome and plant health. Trends Plant Sci 17:478–486. doi: 10.1016/j.tplants.2012.04.001 [DOI] [PubMed] [Google Scholar]
  • 2. Hassani MA, Durán P, Hacquard S. 2018. Microbial interactions within the plant holobiont. Microbiome 6:58. doi: 10.1186/s40168-018-0445-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Philippot L, Raaijmakers JM, Lemanceau P, van der Putten WH. 2013. Going back to the roots: the microbial ecology of the rhizosphere. Nat Rev Microbiol 11:789–799. doi: 10.1038/nrmicro3109 [DOI] [PubMed] [Google Scholar]
  • 4. Turner TR, James EK, Poole PS. 2013. The plant microbiome. Genome Biol 14:209. doi: 10.1186/gb-2013-14-6-209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Goswami M, Deka S. 2022. Rhizodeposits: an essential component for microbial interactions in rhizosphere, p 129–151. In Singh UB, Rai JP, Sharma AK (ed), Re-visiting the rhizosphere eco-system for agricultural sustainability. Springer Nature, Singapore. [Google Scholar]
  • 6. Timm CM, Campbell AG, Utturkar SM, Jun S-R, Parales RE, Tan WA, Robeson MS, Lu T-Y, Jawdy S, Brown SD, Ussery DW, Schadt CW, Tuskan GA, Doktycz MJ, Weston DJ, Pelletier DA. 2015. Metabolic functions of Pseudomonas fluorescens strains from Populus deltoides depend on rhizosphere or endosphere isolation compartment. Front Microbiol 6:1118. doi: 10.3389/fmicb.2015.01118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Hardoim PR, van Overbeek LS, Berg G, Pirttilä AM, Compant S, Campisano A, Döring M, Sessitsch A. 2015. The hidden world within plants: ecological and evolutionary considerations for defining functioning of microbial endophytes. Microbiol Mol Biol Rev 79:293–320. doi: 10.1128/MMBR.00050-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Fernández-González AJ, Villadas PJ, Gómez-Lama Cabanás C, Valverde-Corredor A, Belaj A, Mercado-Blanco J, Fernández-López M. 2019. Defining the root endosphere and rhizosphere microbiomes from the World Olive Germplasm Collection. Sci Rep 9:20423. doi: 10.1038/s41598-019-56977-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Gottel NR, Castro HF, Kerley M, Yang Z, Pelletier DA, Podar M, Karpinets T, Uberbacher E, Tuskan GA, Vilgalys R, Doktycz MJ, Schadt CW. 2011. Distinct microbial communities within the endosphere and rhizosphere of Populus deltoides roots across contrasting soil types. Appl Environ Microbiol 77:5934–5944. doi: 10.1128/AEM.05255-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Mukhtar S, Mehnaz S, Malik KA. 2021. Comparative study of the rhizosphere and root endosphere microbiomes of cholistan desert plants. Front Microbiol 12:618742. doi: 10.3389/fmicb.2021.618742 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Yamamoto K, Shiwa Y, Ishige T, Sakamoto H, Tanaka K, Uchino M, Tanaka N, Oguri S, Saitoh H, Tsushima S. 2018. Bacterial diversity associated with the rhizosphere and endosphere of two halophytes: glaux maritima and salicornia europaea. Front Microbiol 9. doi: 10.3389/fmicb.2018.02878 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Compant S, van der Heijden MGA, Sessitsch A. 2010. Climate change effects on beneficial plant-microorganism interactions. FEMS Microbiol Ecol 73:197–214. doi: 10.1111/j.1574-6941.2010.00900.x [DOI] [PubMed] [Google Scholar]
  • 13. Gaiero JR, McCall CA, Thompson KA, Day NJ, Best AS, Dunfield KE. 2013. Inside the root microbiome: bacterial root endophytes and plant growth promotion. American J of Botany 100:1738–1750. doi: 10.3732/ajb.1200572 [DOI] [PubMed] [Google Scholar]
  • 14. Brader G, Compant S, Mitter B, Trognitz F, Sessitsch A. 2014. Metabolic potential of endophytic bacteria. Curr Opin Biotechnol 27:30–37. doi: 10.1016/j.copbio.2013.09.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Levy A, Salas Gonzalez I, Mittelviefhaus M, Clingenpeel S, Herrera Paredes S, Miao J, Wang K, Devescovi G, Stillman K, Monteiro F, et al. 2018. Genomic features of bacterial adaptation to plants. Nat Genet 50:138–150. doi: 10.1038/s41588-017-0012-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Lugtenberg B, Kamilova F. 2009. Plant-growth-promoting rhizobacteria. Annu Rev Microbiol 63:541–556. doi: 10.1146/annurev.micro.62.081307.162918 [DOI] [PubMed] [Google Scholar]
  • 17. Lugtenberg BJJ, Dekkers L, Bloemberg GV. 2001. Molecular determinants of rhizosphere colonization by Pseudomonas. Annu Rev Phytopathol 39:461–490. doi: 10.1146/annurev.phyto.39.1.461 [DOI] [PubMed] [Google Scholar]
  • 18. Lugtenberg BJJ, Dekkers LC. 1999. What makes Pseudomonas bacteria rhizosphere competent. Environ Microbiol 1:9–13. doi: 10.1046/j.1462-2920.1999.00005.x [DOI] [PubMed] [Google Scholar]
  • 19. Carper DL, Weston DJ, Barde A, Timm CM, Lu T-Y, Burdick LH, Jawdy SS, Klingeman DM, Robeson MS, Veach AM, Cregger MA, Kalluri UC, Schadt CW, Podar M, Doktycz MJ, Pelletier DA. 2021. Cultivating the bacterial microbiota of populus roots mSystems 6:e01306–20. doi: 10.1128/mSystems.01306-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Bulgarelli D, Rott M, Schlaeppi K, Ver Loren van Themaat E, Ahmadinejad N, Assenza F, Rauf P, Huettel B, Reinhardt R, Schmelzer E, Peplies J, Gloeckner FO, Amann R, Eickhorst T, Schulze-Lefert P. 2012. Revealing structure and assembly cues for Arabidopsis root-inhabiting bacterial microbiota. Nature 488:91–95. doi: 10.1038/nature11336 [DOI] [PubMed] [Google Scholar]
  • 21. Taye ZM, Helgason BL, Bell JK, Norris CE, Vail S, Robinson SJ, Parkin IAP, Arcand M, Mamet S, Links MG, Dowhy T, Siciliano S, Lamb EG. 2019. Core and differentially abundant bacterial taxa in the rhizosphere of field grown Brassica napus genotypes: implications for canola breeding. Front Microbiol 10:3007. doi: 10.3389/fmicb.2019.03007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Belimov AA, Hontzeas N, Safronova VI, Demchinskaya SV, Piluzza G, Bullitta S, Glick BR. 2005. Cadmium-tolerant plant growth-promoting bacteria associated with the roots of Indian mustard (Brassica juncea L. Czern.). Soil Biol Biochem 37:241–250. doi: 10.1016/j.soilbio.2004.07.033 [DOI] [Google Scholar]
  • 23. Belimov AA, Dodd IC, Hontzeas N, Theobald JC, Safronova VI, Davies WJ. 2009. Rhizosphere bacteria containing 1‐aminocyclopropane‐1‐carboxylate deaminase increase yield of plants grown in drying soil via both local and systemic hormone signalling. New Phytologist 181:413–423. doi: 10.1111/j.1469-8137.2008.02657.x [DOI] [PubMed] [Google Scholar]
  • 24. Finkel OM, Salas-González I, Castrillo G, Conway JM, Law TF, Teixeira P, Wilson ED, Fitzpatrick CR, Jones CD, Dangl JL. 2020. A single bacterial genus maintains root growth in a complex microbiome. Nature 587:103–108. doi: 10.1038/s41586-020-2778-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Jiang F, Chen L, Belimov AA, Shaposhnikov AI, Gong F, Meng X, Hartung W, Jeschke DW, Davies WJ, Dodd IC. 2012. Multiple impacts of the plant growth-promoting rhizobacterium Variovorax paradoxus 5C-2 on nutrient and ABA relations of Pisum sativum. J Exp Bot 63:6421–6430. doi: 10.1093/jxb/ers301 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Sun S-L, Yang W-L, Fang W-W, Zhao Y-X, Guo L, Dai Y-J. 2018. The plant growth-promoting Rhizobacterium Variovorax boronicumulans CGMCC 4969 regulates the level of indole-3-acetic acid synthesized from indole-3-acetonitrile. Appl Environ Microbiol 84:e00298-18. doi: 10.1128/AEM.00298-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Burghardt LT, Epstein B, Guhlin J, Nelson MS, Taylor MR, Young ND, Sadowsky MJ, Tiffin P. 2018. Select and resequence reveals relative fitness of bacteria in symbiotic and free-living environments. Proc Natl Acad Sci USA 115:2425–2430. doi: 10.1073/pnas.1714246115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Burghardt LT, Trujillo DI, Epstein B, Tiffin P, Young ND. 2020. A select and resequence approach reveals strain-specific effects of Medicago Nodule-Specific PLAT-domain genes. Plant Physiol 182:463–471. doi: 10.1104/pp.19.00831 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Bulgarelli D, Schlaeppi K, Spaepen S, Ver Loren van Themaat E, Schulze-Lefert P. 2013. Structure and functions of the bacterial microbiota of plants. Annu Rev Plant Biol 64:807–838. doi: 10.1146/annurev-arplant-050312-120106 [DOI] [PubMed] [Google Scholar]
  • 30. Sasse J, Martinoia E, Northen T. 2018. Feed your friends: do plant exudates shape the root microbiome? Trends Plant Sci 23:25–41. doi: 10.1016/j.tplants.2017.09.003 [DOI] [PubMed] [Google Scholar]
  • 31. Finkel OM, Salas-González I, Castrillo G, Conway JM, Law TF, Teixeira P, Wilson ED, Fitzpatrick CR, Jones CD, Dangl JL. 2020. A single bacterial genus maintains root growth in a complex microbiome. Nature 587:103–108. doi: 10.1038/s41586-020-2778-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Wang J, Carper DL, Burdick LH, Shrestha HK, Appidi MR, Abraham PE, Timm CM, Hettich RL, Pelletier DA, Doktycz MJ. 2021. Formation, characterization and modeling of emergent synthetic microbial communities. Comput Struct Biotechnol J 19:1917–1927. doi: 10.1016/j.csbj.2021.03.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Wang J, Appidi MR, Burdick LH, Abraham PE, Hettich RL, Pelletier DA, Doktycz MJ. 2024. Formation of a constructed microbial community in a nutrient-rich environment indicates bacterial interspecific competition. mSystems 9:e00006–24. doi: 10.1128/msystems.00006-24 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Griswold AR, Jameson-Lee M, Burne RA. 2006. Regulation and physiologic significance of the agmatine deiminase system of Streptococcus mutans UA159. J Bacteriol 188:834–841. doi: 10.1128/JB.188.3.834-841.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Iturriaga G, Suárez R, Nova-Franco B. 2009. Trehalose metabolism: from osmoprotection to signaling. Int J Mol Sci 10:3793–3810. doi: 10.3390/ijms10093793 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Vanaporn M, Titball RW. 2020. Trehalose and bacterial virulence. Virulence 11:1192–1202. doi: 10.1080/21505594.2020.1809326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Williams BJ, Du R-H, Calcutt MW, Abdolrasulnia R, Christman BW, Blackwell TS. 2010. Discovery of an operon that participates in agmatine metabolism and regulates biofilm formation in Pseudomonas aeruginosa. Mol Microbiol 76:104–119. doi: 10.1111/j.1365-2958.2010.07083.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Brigham LM, Bueno de Mesquita CP, Spasojevic MJ, Farrer EC, Porazinska DL, Smith JG, Schmidt SK, Suding KN. 2023. Drivers of bacterial and fungal root endophyte communities: understanding the relative influence of host plant, environment, and space. FEMS Microbiol Ecol 99:fiad034. doi: 10.1093/femsec/fiad034 [DOI] [PubMed] [Google Scholar]
  • 39. Khare E, Mishra J, Arora NK. 2018. Multifaceted interactions between endophytes and plant: developments and prospects. Front Microbiol 9:2732. doi: 10.3389/fmicb.2018.02732 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Shen SY, Fulthorpe R. 2015. Seasonal variation of bacterial endophytes in urban trees. Front Microbiol 6:427. doi: 10.3389/fmicb.2015.00427 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Anderson CT, Wallace IS, Somerville CR. 2012. Metabolic click-labeling with a fucose analog reveals pectin delivery, architecture, and dynamics in Arabidopsis cell walls. Proc Natl Acad Sci USA 109:1329–1334. doi: 10.1073/pnas.1120429109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Sinha Roy S, Mittra B, Sharma S, Das TK, Babu CR. 2002. Detection of root mucilage using an anti-fucose antibody. Ann Bot 89:293–299. doi: 10.1093/aob/mcf040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Zhang L, Paasch BC, Chen J, Day B, He SY. 2019. An important role of l-fucose biosynthesis and protein fucosylation genes in Arabidopsis immunity. New Phytol 222:981–994. doi: 10.1111/nph.15639 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Canarini A, Kaiser C, Merchant A, Richter A, Wanek W. 2019. Root exudation of primary metabolites: mechanisms and their roles in plant responses to environmental stimuli. Front Plant Sci 10:157. doi: 10.3389/fpls.2019.00157 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Chen I-M, Chu K, Palaniappan K, Ratner A, Huang J, Huntemann M, Hajek P, Ritter SJ, Webb C, Wu D, Varghese NJ, Reddy TBK, Mukherjee S, Ovchinnikova G, Nolan M, Seshadri R, Roux S, Visel A, Woyke T, Eloe-Fadrosh EA, Kyrpides NC, Ivanova NN. 2023. The IMG/M data management and analysis system v.7: content updates and new features. Nucleic Acids Res 51:D723–D732. doi: 10.1093/nar/gkac976 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Derelle R, Philippe H, Colbourne JK. 2020. Broccoli: combining phylogenetic and network analyses for orthology assignment. Mol Biol Evol 37:3389–3396. doi: 10.1093/molbev/msaa159 [DOI] [PubMed] [Google Scholar]
  • 47. Abascal F, Zardoya R, Telford MJ. 2010. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res 38:W7–13. doi: 10.1093/nar/gkq291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Steenwyk JL, Buida TJ, Li Y, Shen X-X, Rokas A. 2020. ClipKIT: a multiple sequence alignment trimming software for accurate phylogenomic inference. PLoS Biol 18:e3001007. doi: 10.1371/journal.pbio.3001007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. 2020. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol 37:1530–1534. doi: 10.1093/molbev/msaa015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. 2017. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods 14:587–589. doi: 10.1038/nmeth.4285 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. 2018. UFBoot2: Improving the ultrafast bootstrap approximation. Mol Biol Evol 35:518–522. doi: 10.1093/molbev/msx281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. 2018. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 9:5114. doi: 10.1038/s41467-018-07641-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, et al. 2018. KBase: The United States department of energy systems biology knowledgebase. Nat Biotechnol 36:566–569. doi: 10.1038/nbt.4163 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Reasoner DJ, Geldreich EE. 1985. A new medium for the enumeration and subculture of bacteria from potable water. Appl Environ Microbiol 49:1–7. doi: 10.1128/aem.49.1.1-7.1985 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Timm CM, Pelletier DA, Jawdy SS, Gunter LE, Henning JA, Engle N, Aufrecht J, Gee E, Nookaew I, Yang Z, Lu T-Y, Tschaplinski TJ, Doktycz MJ, Tuskan GA, Weston DJ. 2016. Two poplar-associated bacterial isolates induce additive favorable responses in a constructed plant-microbiome system. Front Plant Sci 7:497. doi: 10.3389/fpls.2016.00497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Utturkar SM, Cude WN, Robeson MS, Yang ZK, Klingeman DM, Land ML, Allman SL, Lu T-Y, Brown SD, Schadt CW, Podar M, Doktycz MJ, Pelletier DA. 2016. Enrichment of root endophytic bacteria from populus deltoides and single-cell-genomics analysis. Appl Environ Microbiol 82:5698–5708. doi: 10.1128/AEM.01285-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Didion JP, Martin M, Collins FS. 2017. Atropos: specific, sensitive, and speedy trimming of sequencing reads. PeerJ 5:e3720. doi: 10.7717/peerj.3720 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, et al. 2006. The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313:1596–1604. doi: 10.1126/science.1128691 [DOI] [PubMed] [Google Scholar]
  • 60. Sreedasyam A, Plott C, Hossain MS, Lovell JT, Grimwood J, Jenkins JW, Daum C, Barry K, Carlson J, Shu S, et al. 2023. JGI Plant Gene Atlas: an updateable transcriptome resource to improve functional gene descriptions across the plant kingdom. Nucleic Acids Res 51:8383–8401. doi: 10.1093/nar/gkad616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Liao Y, Smyth GK, Shi W. 2019. The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res 47:e47. doi: 10.1093/nar/gkz114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Robinson MD, McCarthy DJ, Smyth GK. 2010. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26:139–140. doi: 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Law CW, Chen Y, Shi W, Smyth GK. 2014. Voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol 15:1–17. doi: 10.1186/gb-2014-15-2-r29 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK. 2015. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43:e47. doi: 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. 2021. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol 38:5825–5829. doi: 10.1093/molbev/msab293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, Mende DR, Letunic I, Rattei T, Jensen LJ, von Mering C, Bork P. 2019. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 47:D309–D314. doi: 10.1093/nar/gky1085 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Hamad MA, Zajdowicz SL, Holmes RK, Voskuil MI. 2009. An allelic exchange system for compliant genetic manipulation of the select agents Burkholderia pseudomallei and Burkholderia mallei. Gene 430:123–131. doi: 10.1016/j.gene.2008.10.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Dehio C, Meyer M. 1997. Maintenance of broad-host-range incompatibility group P and group Q plasmids and transposition of Tn5 in Bartonella henselae following conjugal plasmid transfer from Escherichia coli. J Bacteriol 179:538–540. doi: 10.1128/jb.179.2.538-540.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Emms DM, Kelly S. 2019. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20:238. doi: 10.1186/s13059-019-1832-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Love MI, Huber W, Anders S. 2014. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15:550. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Paulson JN, Stine OC, Bravo HC, Pop M. 2013. Differential abundance analysis for microbial marker-gene surveys. Nat Methods 10:1200–1202. doi: 10.1038/nmeth.2658 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental figures. msystems.01605-25-s0001.docx.

Figures S1 to S5.

DOI: 10.1128/msystems.01605-25.SuF1
Supplemental tables. msystems.01605-25-s0002.xlsx.

Tables S1 to S4.

DOI: 10.1128/msystems.01605-25.SuF2

Data Availability Statement

Raw metagenomic sequencing reads have been deposited in the NCBI Sequence Read Archive under BioProject accession number PRJNA1322484. The R code for preprocessing, analysis, and visualization is available at doi: https://doi.org/10.5281/zenodo.17466836.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES