Skip to main content
The ISME Journal logoLink to The ISME Journal
. 2021 Sep 10;16(3):642–654. doi: 10.1038/s41396-021-01102-w

Transitional genomes and nutritional role reversals identified for dual symbionts of adelgids (Aphidoidea: Adelgidae)

Dustin T Dial 1,#, Kathryn M Weglarz 2,4,#, Akintunde O Aremu 1, Nathan P Havill 3, Taylor A Pearson 1, Gaelen R Burke 1,, Carol D von Dohlen 2,
PMCID: PMC8857208  PMID: 34508228

Abstract

Many plant-sap-feeding insects have maintained a single, obligate, nutritional symbiont over the long history of their lineage. This senior symbiont may be joined by one or more junior symbionts that compensate for gaps in function incurred through genome-degradative forces. Adelgids are sap-sucking insects that feed solely on conifer trees and follow complex life cycles in which the diet fluctuates in nutrient levels. Adelgids are unusual in that both senior and junior symbionts appear to have been replaced repeatedly over their evolutionary history. Genomes can provide clues to understanding symbiont replacements, but only the dual symbionts of hemlock adelgids have been examined thus far. Here, we sequence and compare genomes of four additional dual-symbiont pairs in adelgids. We show that these symbionts are nutritional partners originating from diverse bacterial lineages and exhibiting wide variation in general genome characteristics. Although dual symbionts cooperate to produce nutrients, the balance of contributions varies widely across pairs, and total genome contents reflect a range of ages and degrees of degradation. Most symbionts appear to be in transitional states of genome reduction. Our findings support a hypothesis of periodic symbiont turnover driven by fluctuating selection for nutritional provisioning related to gains and losses of complex life cycles in their hosts.

Subject terms: Microbial ecology, Evolution, Genomics

Introduction

Symbiotic associations have repeatedly spurred the diversification of eukaryotic lineages by endowing hosts with novel adaptive traits, unlocking previously unexploited ecological niches [1, 2]. Iconic examples include the intracellular bacterial symbionts of insects that provide essential nutrients to their hosts, notably, in hemipteran insects that feed on a nutritionally unbalanced diet of plant sap. Bacterial symbionts likely facilitated the exploitation of this niche by provisioning essential amino acids (EAAs) and vitamins lacking in plant sap [3].

In many insect groups, strict vertical transmission of symbionts between generations has promoted tight co-evolutionary relationships, but in the process has increased risk of an evolutionary “dead-end”. While symbiosis is adaptive to the consortium, long-term sequestration is costly to symbiont genomes [4]. As a result of greatly reduced effective population sizes, severe population bottlenecks in each generation, relaxed selective constraints, and deletional mutational biases, bacterial symbiont genomes are reduced to the core housekeeping genes and genes essential to the hosts’ survival [5]. Furthermore, strong genetic drift may override purifying selection to spur further losses of genes presumed important to the partnership [6]. When a host’s diet is enriched, relaxed selection on nutritional contributions may facilitate further symbiont gene deletions, locking the insect into its current niche and pushing the consortium further down the evolutionary “rabbit hole” of heritable symbiosis [4]. Escape from this “rabbit hole” is possible only when the host genome can compensate, or when a new symbiont joins the holobiont and supplements or replaces functions of the original one [7, 8].

In support of the symbiotic “rabbit hole”, ancient symbionts of several sap-feeding lineages, such as scales and mealybugs, psyllids, and auchenorrhynchans (cicadas, various hoppers), are mainly found in pairs or even multitudes (e.g., [914]). In such dual symbioses, typically a senior symbiont (defined as the older of two obligate nutritional cosymbionts [15]) is accompanied by one (or more) sequentially acquired junior symbiont(s) that completes degraded nutritional pathways. In most dual endosymbioses studied thus far, a senior symbiont provides the majority of genes in EAA metabolic pathways and the junior symbiont(s) supplies the remainder [10, 1619]. When symbionts occur in multiples, the array of degradative evolutionary forces act on all symbionts, resulting in potential symbiont replacement as genome degradation progresses. Most transitions represent a change in junior symbionts; replacement of senior symbionts is relatively rare, suggesting that time since acquisition is associated with stability [2025]. This process may be the result of established developmental dependencies and a long history of host adaptations to support eroding symbiont functions [4, 26, 27].

It is not completely understood why most host lineages acquire co-symbionts, while some do not, or why senior symbionts appear to reach a point of stability—and why that stability might break down in some lineages. Studying gains and losses of obligate symbionts in sap-feeding insects is challenging due to the infrequent occurrence of these events and confounding factors such as the diverse feeding habits of the insect hosts. The recently characterized symbionts of Adelgidae (Hemiptera: Aphidoidea) [15, 2833] offer an opportunity to examine patterns and potential processes of symbiont replacements in a well-defined lineage with similar biology. Comprising approximately 70 described species in five major lineages [34], adelgids feed solely on conifer trees (Pinaceae) and have complex life cycles that include yearly alternation between a primary and a secondary (or alternate) host [35, 36]. Half of their life cycle is spent on spruce (Picea spp.) as their primary host plant, where a gall is formed and nutrient-rich parenchyma cells are tapped; the other half is spent on an alternate conifer in one of five other genera, where most species tap relatively nutrient-poor phloem sap. Unlike most other sap-feeding insect lineages of similar or older ages, Adelgidae lack a universal senior symbiont: rather, each adelgid lineage hosts a unique pair of obligate symbionts (as determined from 16 S rRNA gene sequencing and microscopy) [15, 3032]. This diversity implies that both junior and senior symbionts have been recurrently replaced across the family [15, 30, 31, 33]. These dynamic changes in symbiont composition also align with historical acquisitions of alternate host-plant genera [15].

To date, the only complete, published genomes of adelgid symbionts are those of the hemlock woolly adelgid, Adelges tsugae, sampled from the invasive eastern North American population [28]. A. tsugae hosts two obligate symbionts: a senior symbiont, “Candidatus Annandia adelgestsuga”, and a junior symbiont, “Ca. Pseudomonas adelgestsugas”. Genomes for both of these organisms bear characteristics of long-term, obligate nutritional symbionts, such as highly reduced, AT-rich genomes missing many core housekeeping genes but retaining complementary genes in nutrient-synthesis pathways. However, in contrast to nutritional-symbiont partners of other sap feeders, symbionts of A. tsugae exhibit more balanced contributions to EAA pathways [28]. We previously proposed a hypothesis to explain the frequent replacements of symbionts in Adelgidae, which accounts for the unusually decreased contributions of the senior symbiont in A. tsugae [15]. This hypothesis proposes that historical fluctuations in dietary composition during the evolution of host-alternating life cycles may have subjected obligate symbionts to fluctuating selection, thus accelerating gene inactivation in nutrient pathways [28]. If fluctuating selection has been a major driver of symbiont turnover in adelgids, we would expect to find a similar pattern of accelerated degradation in nutrient provisioning pathways of senior symbionts in the other adelgid lineages.

Here, we characterize the genomes of dual symbionts from species of Adelgidae representing the pine, fir, Douglas fir, and larch lineages, and perform comparative analyses including the previously characterized A. tsugae from the hemlock lineage. We sought to determine whether these dual symbionts are also nutritional partners, as we have presumed, and whether provisioning profiles are consistent across the family. Thus, we investigated whether genomes of symbionts indicate cooperation in nutrient metabolic pathways, and whether the relatively balanced contributions observed between symbionts of A. tsugae are found in other lineages. We further tested whether genome characteristics support the previous hypothesis of symbiont ages and relationships, and whether the dynamics of inferred symbiont gains and losses in this lineage are reflected in their genome-degradation characteristics.

Methods

Material acquisition and genome sequencing

Samples selected for sequencing were species from four of the five major conifer-host-associated lineages of Adelgidae (Supplementary Table 1). Species were selected based on the availability of specimens. Samples consisted of pooled individuals from single galls, or from several insects from a single population collected from bark (see Supplementary Table 1). Data from A. tsugae [28] were incorporated to represent the fifth lineage from hemlock. The A. lariciatus and P. similis samples were extracted using the High Pure PCR template kit (Roche Diagnostics, Indianapolis), and treated with DNase-free RNAse (Roche Diagnostics, Indianapolis IN, USA). The A. piceae, A. kitamiensis, and A. cooleyi samples were extracted using a DNeasy Blood & Tissue Kit (Qiagen Inc., Germantown MD, USA). DNA concentration was quantified with a Qubit fluorometer (Thermo Fisher Scientific, Germantown MA, USA). All samples underwent library construction, and all except the A. cooleyi sample underwent Illumina (Illumina Corp., San Diego CA, USA) 150 × 150 paired-end sequencings on either a NextSeq or HiSeq 2500 machine. The A. cooleyi DNA was sequenced using Pacific Biosciences (PacBio) (Menlo Park CA, USA) sequencing, after size selection with the BluePippin system (Sage Science, Beverly, MA, USA). Data from A. kitamiensis were used only for estimation of genome-wide rates of synonymous (dS) substitutions in the symbionts.

Genome assembly

Raw Illumina reads were quality-trimmed and filtered with Trimmomatic Version 0.36 [37] and the FASTX-toolkit (http://hannonlab.cshl.edu/fastx_toolkit/). Total Illumina reads from P. similis, A. piceae, and A. kitamiensis were quality- and adapter-trimmed using the following parameters with Trimmomatic [37]: ILLUMINACLIP: TruSeq3-PE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36. A. lariciatus reads were filtered with the FASTX-toolkit such that a read was discarded if less than 90% of its bases had Phred quality scores of 30 or more. For all Illumina datasets but P. similis, overlapping reads were merged with the paired-end read mergeR (PEAR) [38]. Quality merged and unmerged reads were assembled de novo with SPAdes Version 3.12.0 [39]. The final assemblies for A. piceae, A. kitamiensis, and A. lariciatus symbionts were assembled with the -merged flag, while P. similis reads were left unmerged and were assembled with the -meta flag. All short-read assemblies were performed with k-mers 21, 33, 55, 77, 99, and 127. Assemblies were polished with Illumina reads with Pilon version 1.22 [40]. A. cooleyi PacBio reads were assembled and polished with Flye version 2.6 [41]. Symbiont reads were then extracted and isolated from the raw dataset by BLASTn searching against symbiont scaffolds. The resulting reads were reassembled and polished with Flye version 2.6.

Symbiont scaffolds were initially identified on the basis of relative coverage, GC content, and paired-end read mapping. Blobtools v1.01 [42] was used to bin symbiont scaffolds from metagenomic scaffolds into their respective families and to verify that all symbiont sequences were collected. Symbiont scaffolds from SPAdes were fed into SSPACE [43] to create sets of “super scaffolds.” For those that could not be computationally scaffolded or joined with SSPACE, PCR was performed to determine their order and orientation within the genome. Due to the presence of three identical rRNA operons, the “Ca. Vallotia lariciata” genome possessed a collapsed repeat. The orientation of “Ca. V. lariciata” scaffolds were determined by a combination of computational scaffolding, PCR, and shared synteny with “Ca. V. cooleyia” surrounding breaks. GapPadder [44] was used to extend the lengths of the contigs within the scaffolds. Finally, Pilon was used for misassembly detection and further gap closing.

Annotation

GC-skew was calculated with GenSkew v.1.0 (http://genskew.csb.univie.ac.at) and used to determine the origin of replication; the origin was designated at the region with the strongest signal where genomes had weak overall GC skew. Initial annotations were performed with the Prokka v1.14 [45] pipeline. Pseudogenes were approximated with Pseudofinder (https://github.com/filip-husnik/pseudofinder/) with default settings and the NCBI nr database (https://blast.ncbi.nlm.nih.gov/). Known bifunctional proteins with at least one intact functional domain were not flagged as pseudogenes. Genes annotated as hypothetical proteins by Prokka were searched against the NCBI RefSeq nonredundant protein (NR) database with BLASTp, and if a function could be assigned, the annotation was adjusted manually. All genomes were checked for insertion-sequence elements with the ISSaga2 web-based interface [46] and verified with BLASTp searches. Amino acid and vitamin-biosynthesis pathways were reconstructed using the BioCyc, EcoCyc, and KAAS databases [4749]. Sequences for host-support genes were collected from the AphidCyc database and searched against an A. tsugae transcriptome (GenBank accession PRJNA242203) and our metagenomic scaffolds to verify the presence. Clusters of orthologous groups (COGs) were determined using the online eggNOG-mapper tool (DIAMOND mapping mode and default choices for other settings) [50, 51]. Statistical tests for comparing means of genes belonging to COG categories for different symbiont types were performed with Analysis of Variance (ANOVA) and Tukey’s Honestly Significant Difference tests. Principal components analysis (PCA) was performed on the proportions of CDS belonging to each of the 26 COG categories relative to the total number of CDS in a given bacterial genome. All statistical analyses were performed with JMP (https://www.jmp.com/en_us/home.html).

Synteny analysis

Genome-wide synteny was examined between pairs of adelgid symbionts sharing close phylogenetic affiliation. An all-against-all BLAST of amino acid sequences (e-value cutoff = 1e-10) served as input for MCScanX to identify colinear blocks involving more than five genes (parameters: gap_penalty = 5) [52]. Orthovenn2 [53] was used to identify clusters of orthologous genes within symbiont pairs, and output diagrams were modified to reflect the number of genes shared within pairs and those unique to each species. Synteny plots were generated using VGSC 2.0 [54] and modified in Adobe Illustrator (Adobe Corp., San Jose, CA, USA).

Phylogenetics and molecular evolution

Our initial phylogenetic analysis was performed at the order level (Enterobacterales, 158 taxa and 70 orthologs) with genera representing each major proposed family: [55] Enterobacteriaceae, Erwiniaceae, Pectobacteriaceae, Yersiniaceae, Hafniaceae, Morganellaceae, and Budviciaceae. The phylogenetic placement of symbionts was inferred with amino acid (AA) sequences using a likelihood-based approach. The phyloSkeleton package [56] was used to collect taxa representing the backbone of the Enterobacterales tree. Sequences were downloaded from NCBI and orthologs were determined using a curated list of genomes with HMMER3 [57] (settings: e-value = 0.01, best-match-only) with the Bact109 panortholog gene set included in phyloSkeleton. Species were chosen such that at least one representative from each genus was selected. Additional taxa were included where adelgid symbionts were predicted to cluster based on previous analyses with 16 S rRNA sequences [15, 29, 31]. Three species from the Alphaproteobacteria and Betaproteobacteria were included as outgroups. Orthogroups that did not include target symbionts were excluded and paralogs were manually inspected and removed from the dataset. Orthologous sequences were aligned with Mafft v7.4 [58] (L-INS-I algorithm) and ambiguously aligned regions were trimmed with trimAL v1.4 [59] with the -automated1 option.

Phylogenetic analysis was initially performed with RAxML [60] at the CIPRES Science Gateway (https://www.phylo.org/) with the GTR + G model. However, this method is prone to topological errors due to compositional heterogeneity (e.g., inclusion of AT-rich symbiont genomes) and rapid rates of symbiont evolution. To mitigate this, we recoded our AA datasets by biochemical properties with the Dayhoff-6 recoding scheme and conducted our analysis with PhyloBayes MPI [61], a method that is known to be robust to long-branch-attraction artifacts [62]. The Enterobacterales dataset was run with two chains with the options -cat -gtr -dgam = 4. Convergence was tested with the bpcomp and tracecomp commands. The Enterobacterales chains did not converge according to bpcomp output even after 30,000 iterations and with 20% burn-in, with maxdiff = 1. However, parameter convergence was reached according to tracecomp statistics, with the maximum discrepancy < 0.3 and the minimum effective size > 50.

After initial placement of Enterobacterales symbionts at the order level, we subsampled the taxa from each clade within which an adelgid symbiont clustered, and conducted separate analyses with sets of orthologs common to all taxa identified by Orthofinder [63]. This subsampling allowed the analyses to reach Markov chain Monte Carlo convergence for each tree with higher support and similar topology. Taxa included in the Burkholderia tree were manually collected from NCBI. We also conducted a phylogenomic analysis of the Pseudomonas clade to place “Ca. Pseudomonas adelgestsuga”, using a recent phylogeny of Pseudomonadales [64] to select representatives of the major lineages for the alignment of orthologous proteins. Orthologous-protein clustering was performed with Orthofinder, and alignment, trimming, and tree construction were performed as above.

We additionally performed a phylogenetic analysis based on 16 S rRNA gene sequences to resolve relationships between the Serratia-like adelgid symbionts, Ecksteinia and Gillettellia. 16 S rRNA gene sequences from S. symbiotica were collected from NCBI using previous studies as references for taxon selection [65, 66]. Sequences were aligned with SSU-ALIGN (Nawrocki, 2009) and subsequently inspected in Geneious Prime (version 2019, Biomatters Ltd., Aukland, New Zealand). Ambiguously aligned and divergent sequences were deleted with GBlocks [67] with settings’-b5 = h’ to retain half of positions containing a gap. The resulting alignment was used as input into PhyloBayes with settings -cat -gtr and ran for approximately 40,000 generations per chain (4 chains) until convergence, as tested by bpcomp and tracecomp. Consensus trees were viewed with FigTree (http://tree.bio.ed.ac.uk/software/figtree).

To assess the extent of deviation of intergenic from genic GC content, we grouped intergenic sequences into bins greater than and less than 300 base pairs to differentiate between “normal” intergenic spacers and the large intergenic spacers contributing to the low coding densities of adelgid symbiont genomes. The GC contents of the binned intergenic sequences and CDS were calculated and plotted in R (R Core Team 2020) (https://www.r-project.org/). To determine whether the differences in the genome characteristics of specific symbionts are explained by differences in mutation rate, we estimated their genome-wide rates of synonymous (dS) substitutions using Pseudofinder with default settings. Violin plots were made in R using ggplot2 [68].

Results

Adelgidae symbionts exhibit wide variation in basic genome characteristics

The complete genomes of symbionts from single adelgid species representing four of the five major lineages of adelgids (pine, fir, Douglas fir, and larch) were sequenced and assembled into circular chromosomes to investigate their patterns of genome evolution. We recovered genomes of the two expected symbionts, as characterized previously [2932], from each adelgid lineage sampled: “Ca. Annandia pinicola” and “Ca. Hartigia pinicola” from pine (hereafter, Annandia pinicola and Hartigia), “Ca. Ecksteinia adelgidicola” and “Ca. Steffania adelgidicola” from true fir (hereafter, Ecksteinia and Steffania), “Ca. Vallotia cooleyia” and “Ca. Gillettellia cooleyia” from Douglas fir (hereafter, Vallotia cooleyia and Gillettellia), and “Ca. Vallotia lariciata” and “Ca. Profftia lariciata” from larch (hereafter, Vallotia lariciata and Profftia). To these, we added data from symbionts of A. tsugae representing the hemlock lineage, ‘Ca. Annandia adelgestsuga’ and “Ca. Pseudomonas adelgestsugas” (hereafter, Annandia adelgestsuga and Pseudomonas) [28] to deduce evolutionary patterns and processes across all of Adelgidae. Adelgid symbiont main chromosomes ranged in size from 0.34 Mb to 2.03 Mb, GC contents ranged from 17.8 to 45.9%, numbers of coding sequences (CDS) and pseudogenes varied from 313 to 985 and 12 to 200, respectively, and coding densities varied from 87.5 to 34.2% (Fig. 1), reflecting a dynamic history of genome evolution and symbiont replacement throughout the adelgid family.

Fig. 1. Statistics for adelgid endosymbiont genomes (main chromosomes) sequenced from each major Adelgidae lineage.

Fig. 1

A. Relationships of adelgid species representing the five conifer-host-associated lineages, B Chromosome sizes of symbionts (with names indicated at the base of the figure). Circles representing symbiont chromosomes are scaled to indicate size and colored to indicate bacterial lineage (see Fig. 2), C Numbers of predicted open-reading frames (ORFs). The proportion of hypothetical proteins composing the proteome is shown in green, D Numbers of predicted pseudogenes, E The proportion of nucleotides in each symbiont chromosome encoding genes (coding density), F The percentage of G or C nucleotides in symbiont chromosomes.

We also recovered putative plasmid sequences in each dataset during the assembly and binning process. For datasets from A. cooleyi and A. lariciatus, plasmids were assigned to the Vallotia genomes (pAcVc for A. cooleyi and pAlVl for A. lariciatus) due to an abundance of genes that shared the highest sequence similarity with relatives of Vallotia (see below). The pAcVc and pAlVl plasmids were 61 and 67 kbp in size, possessed 38.2% and 41.8% GC content, and had 28 and 36 CDS (12 and 4 pseudogenes), respectively. Six additional plasmids recovered for the A. piceae and P. similis datasets (named pAp1 through −3 and pPs1 through −3) could not be assigned to symbionts because their genes shared identity with diverse bacteria (Supplementary Table 2). However, from A. piceae, the GC contents of two plasmids (pAp2 and pAp3) were more similar to Ecksteinia while the other (pAp1) was more similar to Steffania. These plasmids generally encoded genes involved in replication, recombination, metabolism, translation, transport, protein folding, and gene transfer.

Adelgidae symbionts originate from diverse bacterial lineages

Phylogenomic analysis of the Enterobacterales placed adelgid symbionts within diverse and well-supported lineages (Fig. 2A). Within Erwiniaceae, the two Annandia species, along with ‘Ca. Purcelliella pentastirinorum’ and “Ca. Stammera capleta” formed a clade sister to Buchnera spp. (Fig. 2B). Steffania clustered within the Sodalis clade in Pectobacteriaceae (Fig. 2C). Ecksteinia and Gillettellia grouped within the Serratia symbiotica clade, but were not sister taxa as found in a previous 16 S rRNA gene analysis [32]. Instead, they were separated by Serratia symbiotica from the giant willow aphid Tuberolachnus salignus (STs) (Fig. 2D). Greater taxon sampling of the S. symbiotica clade in the 16S rRNA gene phylogeny placed Ecksteinia and Gillettellia as sister taxa, but with low support (Fig. 2H). Hartigia clustered within the Providencia clade (Fig. 2E), a genus of free-living bacteria and insect-associated symbionts, as sister to “Ca. Providencia siddallii”, a symbiont of glossiphoniid leeches that provisions B vitamins to its host [69]. Profftia fell within Hafniaceae (Fig. 2F), a group of facultatively aerobic bacteria commonly isolated from the gastrointestinal tract of humans and animals [70].

Fig. 2. Phylogenetic relationships of adelgid endosymbionts within each major Adelgidae lineage.

Fig. 2

A Multi-gene Bayesian phylogeny of the Enterobacterales with 158 taxa inferred from 70 orthologs recoded under the Dayhoff-6 scheme. Colors indicate clades of taxa at the family level to which adelgid symbionts belong, BF Maximum likelihood (shown) and Bayesian phylogenies of individual clades in which adelgid symbionts clustered in (A) using orthogroups where all taxa were represented, G Multigene maximum likelihood (shown) and Bayesian phylogeny of the Burkholderiales. H. Bayesian 16 S rRNA gene phylogeny of the Serratia symbiotica clade; Serratia symbiotica from aphid species are denoted by aphid species names. Adelgid symbiont names are shown in red. Support values are listed as bootstrap replicates supporting nodes for maximum likelihood trees (ML) and percentage posterior probabilities for Bayesian inference phylogenies (BI). Bayesian support values are from analyses conducted on Dayhoff-6-recoded datasets, except for C, where sequences were left uncoded.

Phylogenomic analysis of the Burkholderiales placed the two betaproteobacterial Vallotia species as clustering with two endosymbionts of the fungus Rhizopus microsporus, Mycetohabitans endofungoum and Paraburkholderia rhizoxinica (Fig. 2G). Previous work hypothesized that Vallotia was derived from these fungal symbionts, although only one species, Vallotia tarda from the larch lineage, was included in the analysis [71]. Our two additional Vallotia genomes, representing the larch and Douglas-fir lineages, clustered within the Mycetohabitans clade, further strengthening this hypothesis. Phylogenomic analysis of the Pseudomonas (Pseudomonadales) representatives revealed that the closest relatives to P. adelgestsugas were P. edaphica and P. salomonii within the P. fluorescens clade, the largest and most diverse Pseudomonas group [64] (Supplementary Fig. 1). Together, these results show that while several symbionts of major adelgid lineages arose through independent acquisitions, others arose through introductions in a common ancestor of multiple host lineages.

Pairs of symbionts with phylogenetically affiliated progenitors vary in the degree of similarity in genomic content and architecture

Analysis of synteny between related symbiont pairs revealed varying levels of conservation in gene order. Synteny between the two Annandia species was highly conserved, with the majority of both genomes retaining the same gene content and order (Fig. 3). Synteny between the two Vallotia species was lower; they shared five large conserved blocks but also showed some rearrangements and inversions. Despite both being derived from within a lineage of Serratia-like symbionts, Ecksteinia and Gillettellia genomes differed dramatically in gene order, genome size, and numbers of shared CDS. Annandia pinicola and Annandia adelgestsugas share nearly their entire genetic repertoire, while the Vallotia genomes each possess more unique genes. Many unique genes in the Vallotia genomes are hypothetical (Fig. 1C), but the average GC content of these genes is more similar to non-hypothetical CDS than to intergenic spacers, implying that they are experiencing some degree of selective constraint. The similarities between the Annandia and Vallotia pairs are indicative of single-acquisition events, while the genomic differences between Ecksteinia and Gillettellia are suggestive, but not conclusive, of independent origins.

Fig. 3. Genome structure and shared genomic content of related adelgid symbionts.

Fig. 3

A Genome synteny comparisons of adelgid symbionts with similar ancestors. Each line between two chromosomes links a pair of colinear genes found in syntenous blocks of a minimum of five genes. Bars representing genomes are scaled within synteny plots of each pairwise comparison. The Annandia genomes are perfectly colinear, the Vallotia genomes exhibit several rearrangements, and Ecksteinia and Gillettellia exhibit little synteny. B Venn diagrams displaying shared and unshared protein-coding genes between each pair. Circle overlaps represent the numbers of orthologs shared between species, and numbers outside overlaps show unshared genes (paralogs or singletons). Circles are sized to approximate the number of genes in each genome.

Adelgidae symbionts cooperate to produce all essential amino acids and certain vitamins, but contributions vary across symbiont pairs

We found evidence of nutritional complementation in each set of cosymbionts for each adelgid species. With probable contributions from the host [72, 73], each consortium possesses genes to complete pathways for all essential amino acids (EAA). While the lysine pathway of A. lariciatus symbionts is missing two genes, identical gene losses have occurred in endosymbionts of related insects [73, 74], suggesting this pathway may be supplemented by unknown host or symbiont enzymes. Altogether, our data suggest a complex history of gene retention and loss in nutritional pathways. Fig. 4 shows the presence and absence of genes for canonical EAA biosynthesis pathways and key precursors (homoserine is required for threonine and methionine and chorismate is required for phenylalanine and tryptophan). In many cases, one of the two symbionts contributes most or all genes for the biosynthesis of a given EAA. For example, arginine, threonine, isoleucine, valine, leucine, and histidine are often produced entirely or mostly by one symbiont. In other cases, the biosynthesis of an EAA is almost always achieved cooperatively by the dual symbionts, for example, with the exception of the symbionts of P. similis, the tryptophan pathway has been divided such that one symbiont performs the first two rate-limiting steps with trpEG and the other symbiont performs the following steps with trpDCAB. The complement of genes in other EAA pathways is less consistent. Cosymbionts of P. similis, A. tsugae, and A. cooleyi are fully (or nearly fully) redundant in the lysine pathway, while only Steffania and Vallotia contribute lysine genes in A. piceae and A. lariciatus, respectively. Cosymbionts of A. tsugae, A. piceae, and A. cooleyi possess fully redundant chorismate-biosynthesis pathways, while in P. similis and A. lariciatus, only one of the two symbionts is capable of synthesizing chorismate. The contributions of A. piceae, A. cooleyi, A. lariciatus and P. similis co-symbionts to EAA-biosynthesis are far less evenly distributed than in A. tsugae. In all symbiont pairs but those of A. tsugae, one symbiont encodes the majority of EAA biosynthesis genes. Gillettellia has very few EAA biosynthetic capabilities that are not encoded by its Vallotia partner; the only relevant genes it possesses are trpEG, dapD, serC, and lysC/thrA for tryptophan, lysine, and homoserine. Profftia cannot produce any EAA on its own; this symbiont encodes two genes for tryptophan biosynthesis (trpEG), is the sole contributor of chorismate, and cooperates with its Vallotia partner to make phenylalanine. Ecksteinia contributes the least to EAA biosynthesis, encoding only two genes for tryptophan biosynthesis (trpEG) and genes for chorismate and phenylalanine that are redundant with its partner. Beyond amino acid provisioning, all symbionts retain at least some genes to produce vitamins and co-factors, but these pathways are largely incomplete (Supplementary Fig. 2). Intriguingly, pseudogenized thiamine genes in a plasmid recovered from our A. piceae dataset share the greatest sequence similarity with thiamine genes of “Ca. Hamiltonella defensa” and “Ca. Erwinia haradaeae” symbionts of Lachninae aphids (Supplementary Table 2). This may imply a common origin for these genes and that they are horizontally transferred between symbiont lineages relatively often [75]. Moreover, each Vallotia plasmid contains argG and tyrB (encoding components of the arginine and phenylalanine EAA pathways, respectively), which may contribute to the maintenance of this plasmid in the genome. The plasmid of Vallotia cooleyia contains functional thiC and thiD (encoding components of the thiamine pathway), while these genes have been pseudogenized in the plasmid of Vallotia lariciata. Overall, while adelgids of the hemlock lineage hosts symbionts that are fairly balanced in their nutritional contributions, adelgids in the fir, Douglas fir, larch, and pine lineages rely on one symbiont far more heavily than the other.

Fig. 4. Nutritional complementation in the production of essential amino acids and precursors in cosymbionts from each adelgid lineage.

Fig. 4

Genes encoding enzymes involved in biosynthetic pathways are shown as column names. Standard abbreviations for genes are used. Gene presence or absence in adelgid symbiont main chromosomes and plasmids is shown, along with contributions from host (adelgid) genes when genes with similar functions are missing from symbiont genomes. Genes are indicated as redundant when there exists more than one gene or gene copy that can complete the pathway.

Coding and noncoding content of Adelgidae symbiont genomes reflects a range of ages and stages of degradation

A notable feature in Adelgidae is the concentration of symbionts with genomes containing large intergenic spacers (IGS). In ancient obligate symbionts and many other bacteria, typically there is a tight correlation between the genome size and number of protein-coding sequences in bacterial species, i.e., genomes are gene-dense with only short spacer regions (Fig. 5). The few genomes with lower-than-typical coding densities correspond to recently derived symbionts from disparate lineages. While both Annandia and Vallotia symbionts fall within the expected correlation, the other adelgid symbionts have larger genomes than predicted given their numbers of protein-coding genes (Fig. 5). We detected no insertion sequences in these genomes; thus, the large genome sizes relative to the number of coding sequences in adelgid symbionts are not due to the proliferation of repetitive sequences or mobile elements. These low-coding densities, ranging from 34.2 to 50.8% (Supplementary Table 3), are indicative of symbionts in an intermediate stage of genome degradation, where the larger intergenic regions represent sequences that were once intact genes but have not yet been purged from the genome [76]. Annandia and Vallotia genomes are more typical of the high-coding densities observed in many ancient obligate symbionts [7678].

Fig. 5. Relationship between genome size and total number of CDS in genomes of adelgid symbionts and other representative bacteria.

Fig. 5

The names of symbionts with an unusual genome size relative to numbers of CDS (coding density) are indicated.

To gain further insight into the relative degree of genome degradation of adelgid symbionts, we determined the proportion of genes belonging to each COG category and made comparisons to other bacteria in various stages of genome degradation (Fig. 6; Supplementary Fig. 3) [76]. Genomes of Annandia, the hypothesized ancestral symbiont of Adelgidae, were most like obligate symbionts, which typically retain a large proportion of genes in translation (category J) and lose many poorly characterized genes (categories S and X) compared with free-living bacteria and facultative symbionts. Ecksteinia also experienced greater retention in category J relative to all other symbionts, except Annandia, with Profftia having the (marginally) next highest. The two Annandia species, Ecksteinia, and Profftia, experienced the greatest proportional reductions in poorly characterized genes, with the remaining adelgid symbionts appearing most like facultative or free-living symbionts in this respect. Genes underlying central cellular processes, as compiled in Moran and Bennett (2014) [79] and Bennett et al. (2014) [80], showed varying degrees of retention (Supplementary Fig. 4). Both Annandia species and Ecksteinia showed the greatest degree of gene loss in DNA-replication initiation, cell division, phospholipid and fatty acid synthesis, and peptidoglycan synthesis. Overall, these comparisons suggest that, relative to other adelgid symbionts, Annandia, Ecksteinia, and Profftia have patterns of gene retention most similar to bacteria in the obligate symbiont category.

Fig. 6. Classification of protein-coding sequences into general COG functional categories. Uncategorized sequences were assigned to a custom category X.

Fig. 6

Taxa were grouped into free-living, facultative, obligate–transitional (>0.7 Mbp), obligate (<0.7 Mbp), and tiny–obligate (<0.2 Mbp) genome categories. Adelgid symbionts are shown in red text. Free-living and facultative bacteria related to adelgid symbionts were chosen for comparison.

We calculated the GC contents of all the CDS and IGS separately for each symbiont genome (Fig. 7). We divided IGS into those that are small in size (representing IGS typical of bacteria with normal average coding densities) and large in size, which presumably represent “fossils” of genes that have accumulated sequence changes but not enough large deletions to reduce the genome to normal coding density. The Annandia genome was typical of many obligate symbionts with very few large IGS. For the remaining symbionts, the GC peaks for large intergenic sequences differed from those of the CDS, with differences most pronounced in Profftia and Ecksteinia and least pronounced for the two Vallotia species. Furthermore, we searched for homology within the IGS to quantify the degree of IGS divergence from ancestral genes. The extent of detectable homology to known genes in the IGSs is reflected in the numbers of pseudogenes flagged as “no predicted ORF” by Pseudofinder due to BLASTx hits in intergenic regions (Supplementary Table 3). The two Annandia species, Ecksteinia, Profftia, and Hartigia appear to have the least amount of homology in their IGS relative to other symbionts. We found that Vallotia and Profftia have similar dS values (Supplementary Fig. 5), suggesting that Profftia’s more extreme departure of GC content in CDS vs. IGS and lack of intergenic homology is not due to differences in mutation rate. The high relative AT content in IGS and only trace homology to known genes are features expected of symbionts in relatively more advanced stages of genome reduction as a consequence of longer periods of internment [8183].

Fig. 7. Distributions of GC percentages in short intergenic sequences (<300 bp, red), long intergenic sequences (>300 bp, pink), and CDS (blue).

Fig. 7

Differences between CDS and long intergenic GC percentages are shown between lines indicating distribution means.

Discussion

Previous phylogenetic studies of rRNA gene sequences have revealed a high degree of diversity and a dynamic evolutionary history of symbiosis within the Adelgidae [15, 2933]. These studies identified unique pairs of symbionts in each of the five major adelgid lineages, which were consistent within each lineage. These bacteria were inferred to be nutritional partners on the basis that their hosts’ alternating, nutritionally unbalanced diet required supplementation, and that the symbiont genomes of A. tsugae from the hemlock lineage confirmed a pattern of nutritional supplementation and complementation [28]. The present study sought to address prevailing questions concerning the evolution of symbiosis within the Adelgidae, including (i) whether dual symbionts in the other adelgid lineages are nutritional and show similar provisioning patterns, (ii) whether the relationships previously proposed are supported by genomic data, and (iii) whether dynamic gains and losses of symbionts are correlated with genome-degradation characteristics. We found that adelgid symbionts are indeed obligate nutritional partners, but their genomes vary greatly in the degrees of degradation and patterns of nutritional cooperation, consistent with a dynamic history of gains and losses. This work raises intriguing questions regarding the underlying drivers of genome degradation and the role that host ecology plays in symbiont gene loss and turnover.

The Adelgidae possesses an unusually high concentration of symbionts in transitional states of genome degradation

Genomes of adelgid symbionts exhibit a broad spectrum of degradation, varying from tiny and compact, to larger and with low-coding densities typical of genomes in an intermediate stage of reduction. While some possess little-to-moderate AT bias (e.g., Steffania, Vallotia, Pseudomonas, and Gillettellia) and share COG distributions similar to other intermediately reduced genomes, other genomes with low-coding densities possess properties more similar to symbionts in an advanced state of genome reduction (e.g., Profftia and Ecksteinia). Typical of most ancient obligate symbionts, all adelgid symbionts have lost tRNA genes and contain only a single functional rRNA operon, except Vallotia, which has three identical rRNA operons. While examples of symbionts with abundant, large intergenic sequences exist (e.g., [84, 85]), we have found an unprecedented concentration of symbionts, from diverse lineages, with these IGS in the adelgids. The transition to endosymbiotic life is thought to be accompanied by wide-ranging gene inactivations caused by relaxed selection on genes redundant with the host and a reduction in effective population size [76]. While previous studies of a handful of transitioning symbionts have documented a delay between gene inactivation and subsequent deletion that is resolved over time, many adelgid symbionts are in a transitional state, despite varying in relative age and ancestry. These features are suggestive of high turnover rates, potential genome redundancy at the onset of symbiosis, and inefficient selection, and may occur convergently due to shared life-cycle characteristics of hosts.

Support for a hypothesis of symbiont gains, losses, and relationships

Within the framework of symbiont gains and losses proposed by Toenshoff et al. (2014) [31] and expanded upon by von Dohlen et al. (2017) ([15] Fig. 6), Annandia was posited as the ancestral symbiont of Adelgidae [28]. Annandia was replaced by the common ancestor of Ecksteinia and Gillettellia, and the latter was joined by the junior symbiont Vallotia. Gillettellia (senior) and Vallotia (junior) diversified with the Douglas-fir lineage. Vallotia displaced the Gillettellia/Ecksteinia ancestor as the senior symbiont before diversification of the larch lineage. Hartigia, Pseudomonas, Steffania, and Profftia were all hypothesized to have entered as junior symbionts before the diversification of each of their respective host lineages (pine, hemlock, true fir, and larch, respectively). A major goal of the present study was to test this scenario using information from symbiont genome data, and from phylogenetic analyses with broader taxonomic representation incorporating new data from GenBank.

As in our original hypothesis, our analyses clearly support the placement of Annandia as the ancestral symbiont of adelgids. The genomes of Annandia symbionts from pine and hemlock lineages are highly similar, and phylogenetic analyses place them as sister taxa with strong support. This ancestral symbiont was most likely acquired sometime in the late Cretaceous period in the Adelgidae stem lineage [15, 31]. The high level of synteny shared by these two symbionts indicates that they became stable before the diversification of their hosts, as is frequently seen in other ancient, obligate symbiont-host partnerships [21, 22]. Hartigia has a large genome with very-low-coding density, similar to Serratia junior symbionts in Cinara aphids [84], supporting its placement as the junior symbiont in the pine lineage. As the junior companion to Annandia adelgestsuga, Pseudomonas adelgestsugas is younger than Annandia, but most certainly acquired in the stem hemlock lineage [15, 32].

Previously, we hypothesized that Ecksteinia and Gillettellia shared a single progenitor in the ancestor of the fir, Douglas fir, and larch lineages that split ~65 million years ago [15, 30, 86]. This ancestral symbiont would have codiversified with its hosts in the fir and Douglas-fir lineages and been replaced in the larch lineage. Our current results appear to contradict this interpretation, while suggesting a different evolutionary scenario. Our phylogenomic analyses showed with strong support that, although closely related, Gillettellia and Ecksteinia were not sister taxa. It is conceivable that this topology was an artifact of signal-confounding long-branch attraction; however, our methods were designed to account for this. The topology also might have resulted from reduced taxon sampling imposed by the limited genomes available for comparison, a possibility supported by the sister relationship of Ecksteinia and Gillettellia in our species-rich 16 S rRNA gene tree (albeit with low support). Other evidence to support independent acquisitions of these symbionts includes that Ecksteinia and Gillettellia share little synteny relative to the Annandia or Vallotia pairs; moreover, they differ greatly in genome size, GC content, and coding capacity. We cannot completely discount that Ecksteinia and Gillettellia are sister taxa that diverged rapidly before the loss of mobile elements that allowed for lineage-specific rearrangements, as observed in S. symbiotica in aphids [87]. However, on the totality of current evidence, we argue that the most likely explanation is that these symbionts arose from independent acquisitions from different Serratia-like ancestors. Indeed, Serratia symbiotica and Sodalis-like bacteria are known to form obligate symbiotic relationships repeatedly in other systems (e.g., [88, 89]).

In contrast to our previously proposed hypothesis, our genomic data suggest that Profftia, not Vallotia, may be the oldest symbiont of the stem larch + Douglas-fir lineage. Profftia possesses more signatures of long-term sequestration than either Vallotia or Gillettellia with regard to coding capacity, AT bias, the extent of intergenic sequence degradation, and redundancy in nutritional pathway genes with its partner. Profftia possesses fewer coding sequences than Gillettellia or either Vallotia spp., and its average genomic GC content is more comparable to the clearly ancient Annandia and Ecksteinia. Long-term obligate symbionts have often lost most of their rRNA operons, with many only possessing one or two [90]. Profftia retains a single isolated 16 S rRNA gene and adjacent 23 S and 5 S rRNA genes, while Vallotia lariciata encodes three identical operons. Genes encoding the tricarboxylic acid cycle (TCA) are often lost in long-established symbionts and are present in facultative and more recently obligate symbionts [79, 91, 92]. Vallotia lariciata retains a nearly complete set of TCA cycle genes, while Profftia has retained few. Large intergenic regions representing once-intact genes are expected to accumulate neutral mutations at a clock-like rate, resulting in sequences that steadily become more AT rich over time compared with genic sequences that are constrained by purifying selection [8183]. Among all adelgid symbionts, the degree of departure of intergenic from genic GC is most extreme in Profftia and Ecksteinia. Moreover, Profftia has an average intergenic length similar to Gillettellia and approximately twice that of either Vallotia, but possesses less detectable homology in its intergenic sequences than Gillettellia or Vallotia spp. Redundancies in nutritional pathways are thought to be eliminated 30–60 million years after codependency develops [88]. Profftia and Vallotia lariciata share no EAA-pathway genes, whereas Vallotia cooleyia and Gillettellia have redundancies in homoserine, lysine, and chorismate, suggesting the former partnership is comparatively older. Our dS comparisons of Profftia and Vallotia suggest that these differences in genome characteristics are not explained by differences in substitution rates caused by differential loss of DNA-repair genes or replication times. Rather, with our existing data, these differences are best explained by the unequal lengths of time these bacteria have been evolving as obligate endosymbionts, with Profftia sequestered the longest. We note that factors other than age of association may influence the relative degree of symbiont genome degradation within an adelgid species. For example, differences in the severity of bottlenecks experienced by each symbiont during vertical transmission could change the strength of genetic drift affecting stochastic gene loss. Future work to explore this idea could quantify the titers of dual symbionts provisioned to eggs, as well as potential changes in symbiont numbers throughout the life cycle.

We propose the following scenario as the most parsimonious, given our genomic data (Fig. 8). Annandia was acquired deep within the stem lineage of the Adelgidae, analogous to the history of aphids and their Buchnera symbionts [93]. Hartigia and Pseudomonas were acquired as junior symbionts in the pine and hemlock lineages, respectively. Annandia was lost in the stem lineage of the fir, Douglas fir and larch lineages and Ecksteinia was acquired either before the divergence of the fir lineage or soon thereafter. We cannot say exactly when Ecksteinia was acquired, except that it was almost certainly before Steffania. Profftia was likely acquired in the stem lineage of the Douglas-fir and larch lineages before Vallotia, and was replaced by Gillettellia in the Douglas-fir lineage. In each case, we cannot say whether symbionts were ever hosted in triplicate during adelgid evolution.

Fig. 8. Possible scenarios of symbiont replacements during the evolution of Adelgidae.

Fig. 8

A Hypothesis proposed in von Dohlen et al.([15]). B Current hypothesis based on the results of this study. Two-letter codes and colored lines indicate acquisition and retention of specific symbionts. C Venn-like diagrams showing the relative contributions of each symbiont to synthesizing EAA and precursors (homoserine and chorismate). In each pair, hypothesized senior symbionts are represented by circles on the left and juniors on the right; age relationships are based on B. Numbers where circles overlap indicate EAA or precursors that symbionts cooperate to produce. In pathways where partial redundancies exist between symbionts, the symbiont contributing genes for the full pathway is counted as the sole contributor.

The history of symbiont replacements in adelgids is unusual among sap-feeding insects in three respects. First, adelgids have replaced both senior and junior symbionts multiple times over their comparatively short (<100 MY) and species-poor history. Other much older and more diverse lineages have generally coevolved with a single senior symbiont, e.g., Buchnera in aphids (160 + MY), Sulcia in Auchenorrhyncha (~270 MY), Evansia in Coleorrhyncha (~250 MY), Carsonella in psyllids (~250 MY), and Portiera in whiteflies (~100–200 MY) [23, 9497]. Second, both senior and junior symbionts of adelgids generally derive from unrelated bacterial lineages, whereas in other insects, junior symbionts often derive from the same or related bacterial lineages, e.g., Serratia in aphids; Sodalis-like symbionts in psyllids and mealybugs [19, 89, 98]. Third, adelgid dual symbionts vary in the proportion of nutrient provisioning undertaken by each partner, while in other insects the senior symbiont typically provides the majority [1719, 99, 100]. We recognize that while adelgids may stand out as unusual at present, future research may uncover similar replacement patterns in cooperative nutritional symbioses. This is likely the case in Coccoidea; symbionts in this group are diverse [101, 102] but the nutritional status of many is yet to be explored.

Atypical nutritional provisioning profiles are consistent with the idea that fluctuating selection on nutritional requirements is a driver of symbiont turnover

While partnered symbionts consistently possess complementary genes in nutrient biosynthesis pathways, we find that specific gene breakdown in these pathways does not follow a common pattern across adelgid lineages or between junior versus senior symbionts. Adelgid symbionts are clearly interdependent, requiring their partners (and hosts) to produce the metabolites necessary for survival of the insect–bacterial consortium. That no two adelgid symbiont pairs are identical in the particulars of how and what they provision to their insect host further supports that the process of individual gene loss is at least partially stochastic.

Losses in EAA-provisioning capabilities of symbionts are sometimes linked to trophic shifts of the insect host to more nutritious diets. One example includes the loss of Buchnera arginine-biosynthesis genes in aphids that feed inside galls [103]. Several aphid and leafhopper symbionts have lost specific EAA pathways associated with a shift to feeding on phloem sap rich in the same EAA [21, 104, 105]. In more extreme cases, trophic shifts are related to a loss of nutritional symbionts entirely, e.g., in certain leafhoppers feeding on nutrient-rich parenchyma [23]. These examples demonstrate that changes in an insect’s diet can alter selective forces acting to maintain nutrient provisioning by symbionts, or even the symbionts themselves, resulting in degradation of the symbiosis.

The unusually high turnover of obligate symbionts in Adelgidae might be explained by a history of fluctuations in dietary quality related to complex life-cycle evolution and gall formation [15]. Nutrient-partitioning strategies among the five symbiont pairs investigated here suggest that senior symbionts incur accelerated losses of nutritional capabilities as compared with senior symbionts in other sap-feeding insects, leading to frequent acquisitions of junior symbionts and losses of senior symbionts. Most other hemipteran insects studied in depth (reviewed in [79]) lack the complex life cycles of adelgids and thus do not experience regular and repeated bouts of fluctuations in plant-sap quality. They also exhibit stable relationships with senior symbionts that play consistent, majority roles in EAA provisioning. In contrast, in adelgids, the senior symbiont is the primary provider in only two lineages (pine and Douglas-fir), shares nearly equal responsibility in hemlock, and provides very few EAA genes in the fir and larch lineages. During periods of relaxed selection, when adelgids feed on nutrient-rich galls on spruce hosts, accelerated gene inactivations in the nutrient-provisioning pathways should occur indiscriminately between these dual symbionts, allowing either symbiont to reduce its nutrient-provisioning responsibilities or to acquire a partnership-ending mutation. Periods of increased selection, when adelgids feed on nutrient-poor phloem of alternate hosts, can occur yearly for populations that regularly alternate between spruce and other conifers, or can last for up to thousands of years for populations on alternate hosts separated from spruces due to glacial cycles or invasions to new environments [36, 106]. Thus, the varied nutrient provisioning contributions of adelgid symbionts may be a consequence of historical and contemporary fluctuations in host dietary quality that spurs symbiont genome decay and turnover [15, 28].

Supplementary information

Acknowledgements

We thank Thomas Crowley and Cody Bills for laboratory assistance, and Jim Corrigan for adelgid collections. We would like to acknowledge the Arnold Arboretum of Harvard University for providing financial support through a Deland Award to KMW. KMW was also supported by a grant from the Lewis and Clark Fund for Exploration and Field Research from the American Philosophical Society, and a Doctoral Dissertation Improvement Grant from the National Science Foundation to CDvD and KMW (DEB-1601802). Research was further supported by National Science Foundation grants to CDvD (DEB-1655182) and GRB (DEB-1655177), the Utah Agricultural Experiment Station (approved as journal paper #9463), and the University of Georgia’s Research Foundation, Office for the Vice President for Research, and Agricultural Experiment Station. The support and resources from the Georgia Advanced Computing Research Center at the University of Georgia are also gratefully acknowledged.

Author contributions

CDvD and GRB designed the study and directed the research; KMW and DTD performed the research; TAP and AOA helped analyze data; NPH contributed specimens; KMW, DTD, CDvD, and GRB wrote the paper; all authors reviewed the paper.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Dustin T. Dial, Kathryn M. Weglarz.

Contributor Information

Gaelen R. Burke, Email: grburke@uga.edu

Carol D. von Dohlen, Email: carol.vondohlen@usu.edu

Supplementary information

The online version contains supplementary material available at 10.1038/s41396-021-01102-w.

References

  • 1.Szathmáry E, Smith JM. The major evolutionary transitions. Nature. 1995;374:227–32. doi: 10.1038/374227a0. [DOI] [PubMed] [Google Scholar]
  • 2.West SA, Fisher RM, Gardner A, Kiers ET. Major evolutionary transitions in individuality. Proc Natl Acad Sci USA. 2015;112:10112–9. doi: 10.1073/pnas.1421402112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Moran NA. The coevolution of bacterial endosymbionts and phloem-feeding insects. Ann Mo Bot Gard. 2001;88:35–44. [Google Scholar]
  • 4.Bennett GM, Moran NA. Heritable symbiosis: the advantages and perils of an evolutionary rabbit hole. Proc Natl Acad Sci USA. 2015;112:10169–76. doi: 10.1073/pnas.1421388112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gil R, Sabater-Munoz B, Latorre A, Silva FJ, Moya A. Extreme genome reduction in Buchnera spp.: toward the minimal genome needed for symbiotic life. Proc Natl Acad Sci USA. 2002;99:4454–8. doi: 10.1073/pnas.062067299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tamames J, Gil R, Latorre A, Pereto J, Silva FJ, Moya A. The frontier between cell and organelle: genome analysis of Candidatus Carsonella ruddii. BMC Evol Biol. 2007;7:181. doi: 10.1186/1471-2148-7-181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Husnik F, Nikoh N, Koga R, Ross L, Duncan RP, Fujie M, et al. Horizontal gene transfer from diverse bacteria to an insect genome enables a tripartite nested mealybug symbiosis. Cell. 2013;153:1567–78. doi: 10.1016/j.cell.2013.05.040. [DOI] [PubMed] [Google Scholar]
  • 8.Wilson ACC, Duncan RP. Signatures of host/symbiont genome coevolution in insect nutritional endosymbioses. Proc Natl Acad Sci USA. 2015;112:10255–61. doi: 10.1073/pnas.1423305112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.von Dohlen CD, Kohler S, Alsop ST, McManus WR. Mealybug β-proteobacterial endosymbionts contain γ-proteobacterial symbionts. Nature. 2001;412:433–6. doi: 10.1038/35086563. [DOI] [PubMed] [Google Scholar]
  • 10.McCutcheon JP, McDonald BR, Moran NA. Convergent evolution of metabolic roles in bacterial co-symbionts of insects. Proc Natl Acad Sci USA. 2009;106:15394–9. doi: 10.1073/pnas.0906424106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gatehouse LN, Sutherland P, Forgie SA, Kaji R, Christeller JT. Molecular and histological characterization of primary (Betaproteobacteria) and secondary (Gammaproteobacteria) endosymbionts of three mealybug species. Appl Environ Microbiol. 2012;78:1187–97. doi: 10.1128/AEM.06340-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bennett GM, Moran NA. Small, smaller, smallest: the origins and evolution of ancient dual symbioses in a phloem-feeding insect. Genome Biol Evol. 2013;5:1675–88. doi: 10.1093/gbe/evt118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Bressan A, Mulligan KL. Localization and morphological variation of three bacteriome-inhabiting symbionts within a planthopper of the genus Oliarus (Hemiptera: Cixiidae): Bacteriome-inhabiting symbionts in Oliarus filicicola. Environ Microbiol Rep. 2013;5:499–505. doi: 10.1111/1758-2229.12051. [DOI] [PubMed] [Google Scholar]
  • 14.Bennett GM, Mao M. Comparative genomics of a quadripartite symbiosis in a planthopper host reveals the origins and rearranged nutritional responsibilities of anciently diverged bacterial lineages. Environ Microbiol. 2018;20:4461–72. doi: 10.1111/1462-2920.14367. [DOI] [PubMed] [Google Scholar]
  • 15.von Dohlen CD, Spaulding U, Patch KB, Weglarz KM, Foottit RG, Havill NP, et al. Dynamic acquisition and loss of dual-obligate symbionts in the plant-sap-feeding Adelgidae (Hemiptera: Sternorrhyncha: Aphidoidea) Front Microbiol. 2017;8:1037. doi: 10.3389/fmicb.2017.01037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Mao M, Yang X, Poff K, Bennett G. Comparative genomics of the dual-obligate symbionts from the treehopper, Entylia carinata (Hemiptera: Membracidae), provide insight into the origins and evolution of an ancient symbiosis. Genome Biol Evol. 2017;9:1803–15. doi: 10.1093/gbe/evx134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.McCutcheon JP, Moran NA. Functional convergence in reduced genomes of bacterial symbionts spanning 200 my of evolution. Genome Biol Evol. 2010;2:708–18. doi: 10.1093/gbe/evq055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.McCutcheon JP, von Dohlen CD. An interdependent metabolic patchwork in the nested symbiosis of mealybugs. Curr Biol. 2011;21:1366–72. doi: 10.1016/j.cub.2011.06.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sloan DB, Moran NA. Genome reduction and co-evolution between the primary and secondary bacterial symbionts of psyllids. Mol Biol Evol. 2012;29:3781–92. doi: 10.1093/molbev/mss180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hall AAG, Morrow JL, Fromont C, Steinbauer MJ, Taylor GS, Johnson SN, et al. Codivergence of the primary bacterial endosymbiont of psyllids versus host switches and replacement of their secondary bacterial endosymbionts. Environ Microbiol. 2016;18:2591–603. doi: 10.1111/1462-2920.13351. [DOI] [PubMed] [Google Scholar]
  • 21.Tamas I, Klasson L, Canbäck B, Näslund AK, Eriksson A-S, Wernegreen JJ, et al. 50 million years of genomic stasis in endosymbiotic bacteria. Science. 2002;296:2376–9. doi: 10.1126/science.1071278. [DOI] [PubMed] [Google Scholar]
  • 22.Shigenobu S, Watanabe H, Hattori M, Sakaki Y, Ishikawa H. Genome sequence of the endocellular bacterial symbiont of aphids Buchnera sp. APS. Nature. 2000;407:81–6. doi: 10.1038/35024074. [DOI] [PubMed] [Google Scholar]
  • 23.Moran NA, Tran P, Gerardo NM. Symbiosis and insect diversification: an ancient symbiont of sap-feeding insects from the bacterial phylum Bacteroidetes. Appl Environ Microbiol. 2005;71:8802–10. doi: 10.1128/AEM.71.12.8802-8810.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Gruwell ME, Hardy NB, Gullan PJ, Dittmar K. Evolutionary relationships among primary endosymbionts of the mealybug subfamily Phenacoccinae (Hemiptera: Coccoidea: Pseudococcidae) Appl Environ Microbiol. 2010;76:7521–5. doi: 10.1128/AEM.01354-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Koga R, Moran NA. Swapping symbionts in spittlebugs: evolutionary replacement of a reduced genome symbiont. ISME J. 2014;8:1237–46. doi: 10.1038/ismej.2013.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Mao M, Bennett GM. Symbiont replacements reset the co-evolutionary relationship between insects and their heritable bacteria. ISME J. 2020;14:1384–95. doi: 10.1038/s41396-020-0616-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Braendle C, Miura T, Bickel R, Shingleton AW, Kambhampati S, Stern DL. Developmental origin and evolution of bacteriocytes in the aphid–Buchnera symbiosis. PLoS Biol. 2003;1:e21. doi: 10.1371/journal.pbio.0000021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Weglarz KM, Havill NP, Burke GR, von Dohlen CD. Partnering with a pest: genomes of hemlock woolly adelgid symbionts reveal atypical nutritional provisioning patterns in dual-obligate bacteria. Genome Biol Evol. 2018;10:1607–21. doi: 10.1093/gbe/evy114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Toenshoff ER, Penz T, Narzt T, Collingro A, Schmitz-Esser S, Pfeiffer S, et al. Bacteriocyte-associated gammaproteobacterial symbionts of the Adelges nordmannianae/piceae complex (Hemiptera: Adelgidae) ISME J. 2012;6:384–96. doi: 10.1038/ismej.2011.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Toenshoff ER, Gruber D, Horn M. Co-evolution and symbiont replacement shaped the symbiosis between adelgids (Hemiptera: Adelgidae) and their bacterial symbionts. Environ Microbiol. 2012;14:1284–95. doi: 10.1111/j.1462-2920.2012.02712.x. [DOI] [PubMed] [Google Scholar]
  • 31.Toenshoff ER, Szabó G, Gruber D, Horn M. The pine bark adelgid, Pineus strobi, contains two novel bacteriocyte-associated gammaproteobacterial symbionts. Appl Environ Microbiol. 2014;80:878–85. doi: 10.1128/AEM.03310-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.von Dohlen CD, Spaulding U, Shields K, Havill NP, Rosa C, Hoover K. Diversity of proteobacterial endosymbionts in hemlock woolly adelgid (Adelges tsugae) (Hemiptera: Adelgidae) from its native and introduced range. Environ Microbiol. 2013;15:2043–62. doi: 10.1111/1462-2920.12102. [DOI] [PubMed] [Google Scholar]
  • 33.Havelka J, Danilov J, Rakauskas R. Relationships between aphid species of the family Adelgidae (Hemiptera Adelgoidea) and their endosymbiotic bacteria: a case study in Lithuania. Bull Insectology. 2021;74:1–10. [Google Scholar]
  • 34.Favret C, Havill NP, Miller GL, Sano M, Victor B. Catalog of the adelgids of the world (Hemiptera, Adelgidae) Zookeys. 2015;534:35–54. doi: 10.3897/zookeys.534.6456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Blackman RL, Eastop VF Aphids on the world’s trees: an identification and information guide. 1994. CAB International.
  • 36.Havill NP, Foottit RG. Biology and evolution of Adelgidae. Ann Rev Ento. 2007;52:325–49. doi: 10.1146/annurev.ento.52.110405.091303. [DOI] [PubMed] [Google Scholar]
  • 37.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: a fast and accurate Illumina paired-end read mergeR. Bioinformatics. 2014;30:614–20. doi: 10.1093/bioinformatics/btt593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comp Bio. 2012;19:455–77. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Kolmogorov M, Yuan J, Lin Y, Pevzner PA. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019;37:540–6. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
  • 42.Laetsch DR, Blaxter ML. BlobTools: Interrogation of genome assemblies. F1000Research. 2017;6:1287. [Google Scholar]
  • 43.Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 2011;27:578–9. doi: 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
  • 44.Chu C, Li X, Wu Y. GAPPadder: a sensitive approach for closing gaps on draft genomes with short sequence reads. BMC Genomics. 2019;20:426. doi: 10.1186/s12864-019-5703-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–9. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 46.Varani AM, Siguier P, Gourbeyre E, Charneau V, Chandler M. ISsaga is an ensemble of web-based methods for high throughput identification and semi-automatic annotation of insertion sequences in prokaryotic genomes. Genome Biol. 2011;12:R30. doi: 10.1186/gb-2011-12-3-r30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Karp PD, Billington R, Caspi R, Fulcher CA, Latendresse M, Kothari A, et al. The BioCyc collection of microbial genomes and metabolic pathways. Brief Bioinform. 2019;20:1085–93. doi: 10.1093/bib/bbx085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Karp PD, Ong WK, Paley S, Billington R, Caspi R, Fulcher C, et al. The EcoCyc database. EcoSal Plus. 2018;8:10.1128. doi: 10.1128/ecosalplus.esp-0006-2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35:W182–5. doi: 10.1093/nar/gkm321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, von Mering C, et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol Biol Evol. 2017;34:2115–22. doi: 10.1093/molbev/msx148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Tatusov RL, Galperin MY, Natale DA, Koonin EV. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000;28:33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Wang Y, Tang H, DeBarry JD, Tan X, Li J, Wang X, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40:e49–e49. doi: 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Xu L, Dong Z, Fang L, Luo Y, Wei Z, Guo H, et al. OrthoVenn2: a web server for whole-genome comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res. 2019;47:W52–W58. doi: 10.1093/nar/gkz333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Xu Y, Bi C, Wu G, Wei S, Dai X, Yin T, et al. VGSC: a web-based vector graph toolkit of genome synteny and collinearity. Biomed Res Int. 2016;2016:7823429. doi: 10.1155/2016/7823429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Adeolu M, Alnajar S, Naushad S, S Gupta R. Genome-based phylogeny and taxonomy of the ‘Enterobacteriales’: proposal for Enterobacterales ord. nov. divided into the families Enterobacteriaceae, Erwiniaceae fam. nov., Pectobacteriaceae fam. nov., Yersiniaceae fam. nov., Hafniaceae fam. nov., Morganellaceae fam. nov., and Budviciaceae fam. nov. Int J Syst Evol Microbiol. 2016;66:5575–99. doi: 10.1099/ijsem.0.001485. [DOI] [PubMed] [Google Scholar]
  • 56.Guy L. phyloSkeleton: taxon selection, data retrieval and marker identification for phylogenomics. Bioinformatics. 2017;33:1230–2. doi: 10.1093/bioinformatics/btw824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Eddy SR. Accelerated profile HMM searches. PLoS Comput Biol. 2011;7:e1002195. doi: 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Lartillot N, Rodrigue N, Stubbs D, Richer J. PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment. Syst Biol. 2013;62:611–5. doi: 10.1093/sysbio/syt022. [DOI] [PubMed] [Google Scholar]
  • 62.Husník F, Chrudimský T, Hypša V. Multiple origins of endosymbiosis within the Enterobacteriaceae (γ-Proteobacteria): convergence of complex phylogenetic approaches. BMC Biology. 2011;9:1–17. doi: 10.1186/1741-7007-9-87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20:238. doi: 10.1186/s13059-019-1832-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Hesse C, Schulz F, Bull CT, Shaffer BT, Yan Q, Shapiro N, et al. Genome-based evolutionary history of Pseudomonas spp. Environ Microbiol. 2018;20:2142–59. doi: 10.1111/1462-2920.14130. [DOI] [PubMed] [Google Scholar]
  • 65.Burke GR, Normark BB, Favret C, Moran NA. Evolution and diversity of facultative symbionts from the aphid subfamily Lachninae. Appl Environ Microbiol. 2009;75:5328–35. doi: 10.1128/AEM.00717-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Manzano‐Marín A, Szabó G, Simon J, Horn M, Latorre A. Happens in the best of subfamilies: establishment and repeated replacements of co‐obligate secondary endosymbionts within Lachninae aphids: co-obligate endosymbiont dynamics in the Lachninae. Environ Microbiol. 2017;19:393–408. doi: 10.1111/1462-2920.13633. [DOI] [PubMed] [Google Scholar]
  • 67.Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000;17:540–52. doi: 10.1093/oxfordjournals.molbev.a026334. [DOI] [PubMed] [Google Scholar]
  • 68.ggplot2. Create elegant data visualisations using the grammar of graphics. https://ggplot2.tidyverse.org/. Accessed Apr 2021.
  • 69.Manzano-Marín A, Oceguera-Figueroa A, Latorre A, Jiménez-García LF, Moya A. Solving a bloody mess: B-vitamin independent metabolic convergence among gammaproteobacterial obligate endosymbionts from blood-feeding arthropods and the leech Haementeria officinalis. Genome Biol Evol. 2015;7:2871–84. doi: 10.1093/gbe/evv188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Janda JM, Abbott SL. The genus Hafnia: from soup to nuts. Clin Microbiol Rev. 2006;19:12–18. doi: 10.1128/CMR.19.1.12-28.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Szabó G, Schulz F, Manzano-Marín A, Toenshoff ER, Horn M Evolutionary recent dual obligatory symbiosis among adelgids indicates a transition between fungus and insect associated lifestyles. bioRxiv. 2020; e-pub ahead of print 16 October 2020; 10.1101/2020.10.16.342642. [DOI] [PMC free article] [PubMed]
  • 72.Wilson ACC, Ashton PD, Calevro F, Charles H, Colella S, Febvay G, et al. Genomic insight into the amino acid relations of the pea aphid, Acyrthosiphon pisum, with its symbiotic bacterium Buchnera aphidicola. Insect Mol Biol. 2010;19:249–58. doi: 10.1111/j.1365-2583.2009.00942.x. [DOI] [PubMed] [Google Scholar]
  • 73.Sloan DB, Nakabachi A, Richards S, Qu J, Murali SC, Gibbs RA, et al. Parallel histories of horizontal gene transfer facilitated extreme reduction of endosymbiont genomes in sap-feeding insects. Mol Biol Evol. 2014;31:857–71. doi: 10.1093/molbev/msu004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Hansen AK, Moran NA. The impact of microbial symbionts on host plant utilization by herbivorous insects. Mol Ecol. 2014;23:1473–96. doi: 10.1111/mec.12421. [DOI] [PubMed] [Google Scholar]
  • 75.Manzano-Marín A, Coeur d’acier A, Clamens A-L, Orvain C, Cruaud C, Barbe V, et al. Serial horizontal transfer of vitamin-biosynthetic genes enables the establishment of new nutritional symbionts in aphids’ di-symbiotic systems. ISME J. 2020;14:259–73. doi: 10.1038/s41396-019-0533-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Lo W-S, Huang Y-Y, Kuo C-H. Winding paths to simplicity: genome evolution in facultative insect symbionts. FEMS Microbiol Rev. 2016;40:855–74. doi: 10.1093/femsre/fuw028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Toh H, Weiss BL, Perkin SAH, Yamashita A, Oshima K, Hattori M, et al. Massive genome erosion and functional adaptations provide insights into the symbiotic lifestyle of Sodalis glossinidius in the tsetse host. Genome Res. 2006;16:149–56. doi: 10.1101/gr.4106106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Cole ST, Eiglmeier K, Parkhill J, James KD, Thomson NR, Wheeler PR, et al. Massive gene decay in the leprosy bacillus. Nature. 2001;409:1007–11. doi: 10.1038/35059006. [DOI] [PubMed] [Google Scholar]
  • 79.Moran NA, Bennett GM. The tiniest tiny genomes. Annu Rev Microbiol. 2014;68:195–215. doi: 10.1146/annurev-micro-091213-112901. [DOI] [PubMed] [Google Scholar]
  • 80.Bennett GM, McCutcheon JP, MacDonald BR, Romanovicz D, Moran NA. Differential genome evolution between companion symbionts in an insect-bacterial symbiosis. mBio. 2014;5:e01697–14. doi: 10.1128/mBio.01697-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Degnan PH, Ochman H, Moran NA. Sequence conservation and functional constraint on intergenic spacers in reduced genomes of the obligate symbiont Buchnera. PLoS Genet. 2011;7:e1002252. doi: 10.1371/journal.pgen.1002252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Van Leuven JT, Meister RC, Simon C, McCutcheon JP. Sympatric speciation in a bacterial endosymbiont results in two genomes with the functionality of one. Cell. 2014;158:1270–80. doi: 10.1016/j.cell.2014.07.047. [DOI] [PubMed] [Google Scholar]
  • 83.Gomez-Valero L. The evolutionary fate of nonfunctional DNA in the bacterial endosymbiont Buchnera aphidicola. Mol Biol Evol. 2004;21:2172–81. doi: 10.1093/molbev/msh232. [DOI] [PubMed] [Google Scholar]
  • 84.Manzano-Marín A, Coeur d’acier A, Clamens A-L, Orvain C, Cruaud C, Barbe V, et al. A freeloader? The highly eroded yet large genome of the Serratia symbiotica symbiont of Cinara strobi. Genome Biol Evol. 2018;10:2178–89. doi: 10.1093/gbe/evy173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Santos-Garcia D, Silva FJ, Morin S, Dettner K, Kuechler SM. The all-rounder Sodalis: a new bacteriome-associated endosymbiont of the lygaeoid bug Henestaris halophilus (Heteroptera: Henestarinae) and a critical examination of its evolution. Genome Biol Evol. 2017;9:2893–910. doi: 10.1093/gbe/evx202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Havill NP, Foottit RG, von Dohlen CD. Evolution of host specialization in the Adelgidae (Insecta: Hemiptera) inferred from molecular phylogenetics. Mol Phylogenet. 2007;44:357–70. doi: 10.1016/j.ympev.2006.11.008. [DOI] [PubMed] [Google Scholar]
  • 87.Manzano-Marín A, Latorre A. Snapshots of a shrinking partner: genome reduction in Serratia symbiotica. Sci Rep. 2016;6:32590. doi: 10.1038/srep32590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Monnin D, Jackson R, Kiers ET, Bunker M, Ellers J, Henry LM. Parallel evolution in the integration of a co-obligate aphid symbiosis. Curr Biol. 2020;30:1949–57. doi: 10.1016/j.cub.2020.03.011. [DOI] [PubMed] [Google Scholar]
  • 89.Husnik F, McCutcheon JP. Repeated replacement of an intrabacterial symbiont in the tripartite nested mealybug symbiosis. Proc Natl Acad Sci USA. 2016;113:e5416–24. doi: 10.1073/pnas.1603910113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Moran NA, McCutcheon JP, Nakabachi A. Genomics and evolution of heritable bacterial symbionts. Annu Rev Genet. 2008;42:165–90. doi: 10.1146/annurev.genet.41.110306.130119. [DOI] [PubMed] [Google Scholar]
  • 91.Degnan PH, Leonardo TE, Cass BN, Hurwitz B, Stern D, Gibbs RA, et al. Dynamics of genome evolution in facultative symbionts of aphids. Environ Microbiol. 2010;12:2060–9. doi: 10.1111/j.1462-2920.2009.02085.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Burke GR, Moran NA. Massive genomic decay in Serratia symbiotica, a recently evolved symbiont of aphids. Genome Biol Evol. 2011;3:195–208. doi: 10.1093/gbe/evr002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Munson MA, Baumann P, Clark MA, Baumann L, Moran NA, Voegtlin DJ, et al. Evidence for the establishment of aphid-eubacterium endosymbiosis in an ancestor of four aphid families. J Bacteriol. 1991;173:6321–4. doi: 10.1128/jb.173.20.6321-6324.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Moran NA, Munson MA, Baumann P, Ishikawa H. A molecular clock in endosymbiotic bacteria is calibrated using the insect hosts. Proc R Soc B. 1993;253:167–71. [Google Scholar]
  • 95.Kuechler SM, Gibbs G, Burckhardt D, Dettner K, Hartung V. Diversity of bacterial endosymbionts and bacteria-host co-evolution in Gondwanan relict moss bugs (Hemiptera: Coleorrhyncha: Peloridiidae) Environ Microbiol. 2013;15:2031–42. doi: 10.1111/1462-2920.12101. [DOI] [PubMed] [Google Scholar]
  • 96.Thao ML, Moran NA, Abbot P, Brennan EB, Burckhardt DH, Baumann P. Cospeciation of psyllids and their primary prokaryotic endosymbionts. Appl Environ Microbiol. 2000;66:2898–905. doi: 10.1128/aem.66.7.2898-2905.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Thao ML, Baumann P. Evolutionary relationships of primary prokaryotic endosymbionts of whiteflies and their hosts. Appl Environ Microbiol. 2004;70:3401–6. doi: 10.1128/AEM.70.6.3401-3406.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Meseguer AS, Manzano-Marín A, Coeur d’Acier A, Clamens AL, Godefroid M, Jousselin E. Buchnera has changed flatmate but the repeated replacement of co-obligate symbionts is not associated with the ecological expansions of their aphid hosts. Mol Ecol. 2017;26:2363–78. doi: 10.1111/mec.13910. [DOI] [PubMed] [Google Scholar]
  • 99.McCutcheon JP, Moran NA. Parallel genomic evolution and metabolic interdependence in an ancient symbiosis. Proc Natl Acad Sci USA. 2007;104:19392–7. doi: 10.1073/pnas.0708855104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Rao Q, Rollat-Farnier PA, Zhu DT, Santos-Garcia D, Silva FJ, Moya A, et al. Genome reduction and potential metabolic complementation of the dual endosymbionts in the whitefly Bemisia tabaci. BMC Genomics. 2015;16:226. doi: 10.1186/s12864-015-1379-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Rosenblueth M, Sayavedra L, Sámano-Sánchez H, Roth A, Martínez-Romero E. Evolutionary relationships of flavobacterial and enterobacterial endosymbionts with their scale insect hosts (Hemiptera: Coccoidea) J Evol Biol. 2012;25:2357–68. doi: 10.1111/j.1420-9101.2012.02611.x. [DOI] [PubMed] [Google Scholar]
  • 102.Michalik K, Szklarzewicz T, Kalandyk-Kołodziejczyk M, Jankowska W, Michalik A. Bacteria belonging to the genus Burkholderia are obligatory symbionts of the eriococcids Acanthococcus aceris Signoret, 1875 and Gossyparia spuria (Modeer, 1778) (Insecta, Hemiptera, Coccoidea) Arthropod Struct Dev. 2016;45:265–72. doi: 10.1016/j.asd.2016.04.002. [DOI] [PubMed] [Google Scholar]
  • 103.Van Ham RC, Kamerbeek J, Palacios C, Rausell C, Abascal F, Bastolla U, et al. Reductive genome evolution in Buchnera aphidicola. Proc Natl Acad Sci USA. 2003;100:581–6. doi: 10.1073/pnas.0235981100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Vogel KJ, Moran NA. Effect of host genotype on symbiont titer in the aphid-Buchnera symbiosis. Insects. 2011;2:423–34. doi: 10.3390/insects2030423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Bennett GM, McCutcheon JP, McDonald BR, Moran NA. Lineage-specific patterns of genome deterioration in obligate symbionts of sharpshooter leafhoppers. Genome Biol Evol. 2015;8:296–301. doi: 10.1093/gbe/evv159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Havill NP, Griffin BP, Andersen JC, Foottit RG, Justesen MJ, Caccone A, et al. Species delimitation and invasion history of the balsam woolly adelgid, Adelges (Dreyfusia) piceae (Hemiptera: Aphidoidea: Adelgidae), species complex. Syst Entomol. 2021;46:186–204. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from The ISME Journal are provided here courtesy of Oxford University Press

RESOURCES