Skip to main content
The Journal of Infectious Diseases logoLink to The Journal of Infectious Diseases
. 2024 Aug 14;230(Suppl 1):S51–S61. doi: 10.1093/infdis/jiae256

Analysis of the Borreliaceae Pangenome Reveals a Distinct Genomic Architecture Conserved Across Phylogenetic Scales

Jacob E Lemieux 1,
PMCID: PMC12102487  PMID: 39140725

Abstract

The family Borreliaceae contains arthropod-borne spirochetes that cause two widespread human diseases, Lyme disease and relapsing fever. Lyme disease is a subacute, progressive illness with variable stage and tissue manifestations. Relapsing fever is an acute febrile illness with prominent bacteremia that may recur and disseminate, particularly to the nervous system. Clinical heterogeneity is a hallmark of both diseases. While human clinical manifestations are influenced by a wide variety of factors, including immune status and host genetic susceptibility, there is evidence that Borreliaceae microbial factors influence the clinical manifestations of human disease caused by this family of spirochetes. Despite these associations, the spirochete genes that influence the severity and manifestations of human disease are, for the most part, unknown. Recent work has identified lineage-specific expansions of lipoproteome-rich accessory genome elements in virulent clones of Borrelia burgdorferi. Using publicly available genome assemblies, it is shown that all Borreliaceae lineages for which sufficient sequence data are available harbor a similar pattern of strongly structured, lineage-specific expansions in their accessory genomes, particularly among lipoproteins, and that this pattern holds across phylogenetic scales including genera, species, and genotypes. The relationships among pangenome elements suggest that infrequent episodes of marked genomic change followed by clonal expansion in geographically and enzootically structured populations may account for the unique lineage structure of Borreliaceae. This analysis informs future genotype–phenotype studies among Borreliaceae and lays a foundation for studies of individual gene function guided by phylogenetic patterns of conservation, diversification, gain, and/or loss.

Keywords: Lyme disease, Spirochetes, Genomics, Relapsing fever, Borreliaceae


Spirochetes of the family Borreliaceae cause two major clinical syndromes: relapsing fever (RF) and Lyme disease (LD). RF is an acute, febrile disease with prominent and often recurrent spirochetemia. High densities of spirochetes are often found in the bloodstream during RF [1]. LD, in contrast, is a subacute, progressive illness with variable, organ-specific manifestations, a consequence of the tropism of the spirochete to various host tissues [2]. LD and RF also have different primary tissue tropisms: RF spirochetes are found at the highest abundance in the bloodstream [3], whereas most LD spirochetes are found at the highest concentrations in tissues and occur only rarely, and at low levels, in the bloodstream [2].

Relapsing febrile syndromes have been recognized since antiquity, but it wasn't until the studies of Obermeier in 1873 that spirochetes were recognized as a cause of this syndrome (reviewed in [4]). The spirochetes described by Obermeier were initially called Spirochaeta obermeieri and later named Borrelia (see references in [4]). LD was described more than a century later. Although reports of erythema chronicum migrans were known in the European literature [5], it was not until the 1970s that Steere and coworkers [6, 7] systematically described the syndrome that later came to be known as LD. The causative spirochete was isolated by Burgdorfer and coworkers in 1982 [8] and subsequently named Borrelia burgdorferi in his honor.

The clinical divergence between LD and RF is associated with a deep phylogenetic split between RF and LD spirochetes. In 2003, a third monophyletic clade within Borreliaceae was identified and found to contain spirochetes that infect reptiles, particularly Varanus lizards and other vertebrates [9–11]. These reptile-associated (REP) Borrelia are not known to cause human disease but given their recent recognition, human pathogenicity and many other details of these taxa are not fully understood. Subsequent characterization of RF and LD spirochetes revealed new lineages that differed in their genetic content and phenotype. Many RF species have been described and delineated on the basis of their geographic distribution (“Old World” vs “New World”) and/or host associations (soft-tick vs hard-tick RF agents) [4]. After the initial description of B burgdorferi, extensive diversity among B burgdorferi spirochetes was recognized by several authors [12, 13] using newly developed serological reagents. With the recognition of these genospecies came early evidence suggesting that infection by different members of the species complex was correlated with geography and distinct clinical manifestations [14], foreshadowing that the complex clinical syndrome of LD was due, in part, to extensive microbial genetic diversity. These findings were later confirmed and extended (reviewed in [15]). Genetic lineages within B burgdorferi have been referred to as genospecies, genomic species, or subspecies, and the overall complex of LD-causing Borreliaceae is referred to as B burgdorferi sensu lato (Bbsl) or Borreliella. (Recent phylogenetic analysis supports separation of RF and LD agents into separate genera [16]. Adeolu and Gupta [16] proposed revising the taxonomic assignments, designating Bbsl as Borreliella while retaining Borrelia for RF spirochetes based on historical priority. Although the proposed taxonomic revision is controversial [17–19], this article follows NCBI and uses Borrelia to refer to RF spirochetes and Borreliella to refer to LD spirochetes. The article parenthetically adds Bbsl for clarity and because these terms by GenBank have also been criticized [20] and are not universally adopted by taxonomic databases. The goal is clarity and consistency with the data source, rather than to take a position on the nomenclature.) All Borreliella/Bbsl spirochetes cause LD, but the clinical manifestations of LD vary based on the infecting genotype. Borrelia burgdorferi is strongly associated with Lyme arthritis (LA) [21]; Borrelia garinii and Borrelia bavariensis are strongly associated with Lyme neuroborreliosis [22–24], and Borrelia afzelii is associated with acrodermatitis chronica atrophicans [22–24] and Borrelial lymphocytoma [25, 26].

Extending the paradigm that phenotypic differences within LD are linked to genetic differences, lineages within a given genospecies, B burgdorferi sensu stricto (hereafter referred to as B burgdorferi) were identified, classified by various typing schemes—including serotyping, ribosomal spacer type (RST), and outer surface protein C (OspC) type [27–29]—and found to differ in their phenotypes. At the genotype level, which is best studied for B burgdorferi, OspC type A/RST 1 genotypes of B burgdorferi disseminate in humans at higher rates [30–33] and cause higher rates of symptoms [34], greater inflammation [34, 35], and more severe LA [34].

Thus, the divergence of clinical disease in humans correlates with genetic divergence across phylogenetic scales in Borreliaceae. The largest phenotypic differences are observed between RF and LD spirochetes; lesser but still notable clinical variation is observed between species within a genus, and more subtle yet meaningful variation occurs at the genotype level.

Despite strong evidence that microbial genetic factors influence clinical manifestations, the specific genes that contribute to these phenotypic differences are incompletely understood. Many genes lack annotated functions; in other cases, where some or all functional activities are well-characterized, including systems for antigenic variation, serum resistance, and tissue adhesion (reviewed in [36–38]), gaps remain in understanding how these functions work together to influence disease manifestations. This is in part due to the complexity of Borreliaceae genomes, which contain numerous plasmids that are challenging to sequence and annotate [39, 40], and the relative rarity of human clinical isolates of LD and RF, which require specialized culture conditions not typically performed in routine clinical practice [41].

Identifying genes that contribute to distinct clinical manifestations of LD and RF remains an important scientific goal because of its potential to reveal fundamental insight into Borreliaceae pathogenesis and open new opportunities for therapy and prevention. There are opportunities to do so using comparative phylogenetic approaches, in which the genetic content of related lineages is cataloged and compared, as well as microbial genome-wide association study approaches, in which genetic elements are tested for their association with a phenotype of interest. Recent work has used these approaches to identify the genetic elements associated with disseminated LD [33], but the genetic and phenotypic heterogeneity of Borreliaceae spirochetes means that these and related approaches are in their infancy. The concept of a pangenome, representing a “core” of genes shared by all isolates along with a “shell” or “accessory” genome of genes present in some isolates, was introduced in 2005 [42] and has become one of the most widely used methods in comparative bacterial genomics. Understanding pangenome relationships among strains and lineages can extend comparative genomic and genotype–phenotype studies to Borreliaceae linages; prioritize individual genes for functional follow-up; and foster an understanding of genomic diversity and evolution within the family. Here, publicly available genome sequence data are used to construct pangenomes and characterize lineage-specific genomic events from across Borreliaceae.

MATERIALS AND METHODS

Borreliaceae genome assemblies and associated metadata were downloaded from the National Center for Biotechnology Information (NCBI). Assemblies were filtered to be >1 Mb (excluding chromosome-only assemblies) and <2 Mb. Assemblies were annotated using Prokka [43]. Pangenomes were constructed using Roary [44], with a minimum BLAST threshold of 80% homology and without splitting of paralogous groups, and PIRATE [45] using default parameters. Trees were constructed using FastTree [46] and IQ-TREE2 [47] with 1000 ultrafast bootstrap replicates. The pipeline was executed using Snakemake [48]. Lipoproteins were annotated using SpLip [49]. The resulting pangenomes, phylogenetic trees, and metadata files were analyzed and plotted using R [50], ggplot [51], and ggtree [52]. The genomic position of pangenome homology groups was annotated by aligning pangenome elements against Borreliaceae reference genomes using minimap2 [53]. Because chromosome sequences are easier to assemble than plasmid sequences in Borreliaceae [39, 54], and therefore presumably more accurate if assemblies are unfinished, the following approach was used: If pangenome elements aligned to the chromosome sequences, they were annotated as provisionally chromosomal. If they aligned to reference sequences but not to chromosome sequences, they were annotated as provisionally plasmid-encoded. Lineage-specific associations of individual pangenome elements were identified using logistic regression to predict the lineage based on the pangenome homology group, as implemented in R using the glm() command.

Ethical Approval

The analysis of microbial genome sequences from deidentified patient data was approved by the Massachusetts General Hospital Institutional Review Board under protocol 2019P001846.

RESULTS

To characterize the pangenome relationships among Borreliaceae, all publicly available Borreliaceae assemblies were downloaded from NCBI GenBank. The assemblies were annotated using Prokka [43], pangenomes were constructed by efficient all-versus-all BLAST implemented in Roary [44], and maximum-likelihood phylogenetic trees consisting of core genome sequences were constructed using FastTree [46] and/or IQ-tree2 [47]. As of 26 December 2023, NCBI GenBank contained 702 Borreliaceae genome assemblies from 20 Borreliella/Bbsl taxa and 18 Borrelia taxa from both the RF group and the “reptilian” (REP) group [55]. The core genome tree constructed from these assemblies is shown in Figure 1, with overrepresented taxa downsampled to a maximum of 25. Consistent with prior phylogenetic analyses, the tree reveals a well-supported, deep division into 2 monophyletic branches consisting of Borrelia spp and Borreliella spp/Bbsl spirochetes. Borrelia spp contain 2 monophyletic sister clades, RF spirochetes, and the REP group.

Figure 1.

Figure 1.

Core genome phylogenetic relationships among Borreliaceae. Radial tree of Borreliaceae with tips annotated by species. Nodes with bootstrap support >90% are colored with a dot, demonstrating robust phylogenetic support for the genus- and species-specific divisions on the tree. The clade of Lyme disease (LD)–causing spirochetes, Borreliella/Borrelia burgdorferi sensu lato, has been highlighted in red; relapsing fever (RF)–causing spirochetes and the reptile-associated group are shaded in blue and orange, respectively. Old World and New World subgroupings of RF spirochetes are shown with a dotted line. Within the LD spirochetes, the clades associated with distinct manifestations are shown. The scale bar denotes nucleotide substitutions per site. Abbreviations: ACA, acrodermatitis chronicum atrophicans; LA, Lyme arthritis; LD, Lyme disease; LNB, Lyme neuroborreliosis; REP, reptile-associated group; RF, relapsing fever.

The Borreliaceae pangenome contains 299 core open reading frames (ORFs) at a protein BLAST homology threshold of 80% (Figure 2). The noncore (accessory) pangenome among Borreliaceae contains 5684 ORF homology groups, although this estimate is probably inflated due to fragmented assemblies [56]. The accessory genome is strongly structured by lineage, with divisions in the accessory genome following the core genome phylogeny. The deep phylogenetic division between Borrelia spp and Borreliella spp/Bbsl is mirrored in the divergence of the accessory genomes. Borrelia spp and Borreliella spp/Bbsl possess additional genus- and species-specific core genomes that are readily apparent in Figure 2. Annotated pangenomes can be used to define genus- and species-specific pangenomes and lipoproteomes (Supplementary File 1). This analysis was repeated using PIRATE [45], which allows for the construction of pangenomes across multiple homology values, confirming that this accessory genome structure was not an artifact of a single homology threshold.

Figure 2.

Figure 2.

Accessory genome content among Borreliaceae is strongly structured by lineage. Patterns of open reading frame presence are shown according to a phylogenetic tree produced from core genome sequences. A matrix showing the presence or absence of pangenome homology groups in each strain is shown at right. Pangenome homology groups that are present are shaded in solid color, and groups that are absent are uncolored. Each row corresponds to an individual strain, and each column corresponds to individual homology groups. The rows are ordered according to a phylogenetic tree, with tips annotated by the Borreliaceae genus and species. Relevant taxa are also labeled. As in Figure 1, the Borreliaceae that cause Lyme disease (Borreliella/Borrelia burgdorferi sensu lato) are shaded and labeled in red; the relapsing fever and reptile-associated groups are shaded and labeled in blue and orange, respectively. The columns are clustered with hierarchical clustering to reflect similarity in the patterns of gene presence among lineages. The tree scale bar denotes nucleotide substitutions per site. Abbreviations: Bbsl, Borrelia burgdorferi sensu lato; LD, Lyme disease; ORF, open reading frame; REP, reptile-associated group; RF, relapsing fever.

The structure of accessory genome elements consists of a lineage-specific “block” pattern in which groupings of accessory genome presence are almost perfectly correlated with lineage-defining subdivisions in the core genome tree (Figure 2). This pattern closely resembles the pattern observed for the B burgdorferi pangenome [33] but is present across phylogenetic scales. Within the Borreliella spp/Bbsl, there are lineage-specific groups of accessory loci, for example among B burgdorferi (B burgdorferi), B garinii, B bavariensis, and B afzelii genomes. Borrelia spp similarly possess lineage-specific changes such that there are tightly structured core genomes for Old World RF agents (including Borrelia crocidurae and Borrelia duttonii), hard-tick RF agents (Borrelia miyamotoi), and New World RF agents (including Borrelia turicatae, Borrelia parkeri, and Borrelia hermsii). Within sublineages, a similar pattern occurs. For example, B miyamotoi also shows evidence of distinct genotypes, with subsets of B miyamotoi diverging based on geographic origin [57]. Russian and Asian B miyamotoi isolates possess a slightly enlarged pangenome and a distinct block of genetic elements compared to those from Western Europe and North America, each of which forms a distinct clade (Supplementary Figure 1). Among B burgdorferi sublineages, OspC type A/RST1 genotypes possess a lineage-core group of ORFs [33].

Lipoproteins are particularly important for Borreliaceae immune evasion, tissue tropism, and other aspects of vertebrate pathogenesis [2, 58]. To characterize the relationships among the Borreliaceae pan-lipoproteome, lipoproteins were annotated using SpLip [49] and analyzed the phylogenetic patterns of lipoprotein-annotated homology groups (Figure 3). The strong lineage-specific pattern of the pangenome (Figure 2) is reproduced almost exactly in the lipoproteome (Figure 3), except that all lipoproteins are lineage-variable at the homology thresholds used.

Figure 3.

Figure 3.

The Borreliaceae lipoproteome is strongly structured by lineage. Patterns of open reading frames annotated as putative lipoproteins, as classified by SpLip [49], are shown according to a phylogenetic tree produced from core genome sequences. A matrix showing the presence or absence of probable lipoproteins in each strain is shown. Each row corresponds to an individual strain and each column corresponds to a homology group classified as a probable lipoprotein. The rows are ordered according to a phylogenetic tree, with tips annotated by the Borreliaceae genus and species. Relevant taxa are also labeled. As in Figure 1, the Borreliaceae that cause Lyme disease (Borreliella/Borrelia burgdorferi sensu lato) are shaded and labeled in red; the relapsing fever and reptile-associated groups are shaded and labeled in blue and orange, respectively. The columns are clustered with hierarchical clustering to reflect similarity in the patterns of gene presence among lineages. The tree scale bar denotes nucleotide substitutions per site. Abbreviations: Bbsl, Borrelia burgdorferi sensu lato; LD, Lyme disease; ORF, open reading frame; REP, reptile-associated group; RF, relapsing fever.

The spatial organization of homology groups within Borreliaceae genomes was then assessed. Because many assemblies are incomplete due to the difficulty of generating complete assemblies through short-read sequencing [39, 40, 59], a provisional analysis of draft genome organization was performed by aligning a representative of each pangenome homology group to well-annotated reference genomes. Lipoproteins are preferentially encoded on plasmids throughout Borreliaceae (Figure 4). There was a strong association between lipoprotein status and chromosomal versus plasmid genomic location (Fisher exact test, P < 2.2 × 10−16); a similar association was observed when lipoprotein status was evaluated using 2 categories (not a lipoprotein vs probable lipoprotein) or 3 categories (not a lipoprotein, possible lipoprotein, or probable lipoprotein). Core genome elements were almost exclusively encoded on the chromosome (Supplementary Figure 2). Accessory genome elements are preferentially encoded by plasmids (Supplementary Figure 3), including all Borreliella accessory genome elements and many Borrelia accessory elements. One notable difference between Borrelia spp and Borreliella spp/Bbsl is that in Borrelia spp portions of the species-specific accessory genome are encoded on the chromosome (Supplementary Figure 2), whereas in Borreliella/Bbsl nearly all species-specific accessory genome elements are extra-chromosomal (Supplementary Figure 3). This suggests that the chromosome in Borrelia spp has evolved to carry lineage-specific accessory gene content, whereas, in Borreliella spp/Bbsl, lineage-specific variation in gene content is almost exclusively encoded on plasmids. Thus, the pattern of lineages defined by plasmid-encoded accessory elements enriched in lipoproteins is present but less absolute in Borrelia compared to Borreliella/Bbsl. Together, these analyses enable the identification of lineage-specific lipoproteomes (Supplementary Figures 4 and 5), an approach that can highlight more subtle lineage-specific genomic patterns such as the unique sequences associated with OspC type A/RST1 genotypes among B burgdorferi (Supplementary Figure 4B and 4C) [33].

Figure 4.

Figure 4.

Borreliaceae lipoproteins are preferentially encoded on plasmids. The genomic location of pangenome elements is shown. Counts are stratified by whether they are classified by SpLip [49] as probable lipoproteins, possible lipoproteins, or not lipoproteins. Test of association by Fisher exact test, P < 2.2 × 10−16.

By summing the number of homology groups in an individual genome, pangenome analysis also provides an estimate of the size of the coding genome and lipoproteome. Borrelia genomes have slightly more ORF homology groups than Borreliella/Bbsl genomes (Figure 5A), an effect that is driven by soft-tick Borrelia RF spirochetes (Figure 5B), particularly among lipoproteins (Figure 5C and 5D). Differences are seen among arthropod vectors. The hard-tick RF spirochete B miyamotoi has fewer ORF homology groups and probable lipoproteins than soft-tick RF spirochetes. Despite being an RF spirochete, its number of ORFs and probable lipoproteins are similar to LD spirochetes. Among LD Borreliella/Bbsl, B garinii and B bavariensis have fewer ORF homology groups than B burgdorferi; however, there is marked variation in genome size within B burgdorferi (Figure 5) [33] and other species (Figure 5B and 5D).

Figure 5.

Figure 5.

Borrelia pangenomes contain an expanded number of open reading frames (ORFs) and putative lipoproteins. A, The number of ORF homology groups for each genome assessed, grouped by genus. B, The number of ORF homology groups for each genome assessed, grouped by species. C, The number of ORF homology groups annotated as probable lipoproteins for each genome assessed, grouped by genus. D, The number of ORF groups annotated as probable lipoproteins for each genome assessed, grouped by species. P values report the results of a Wilcoxon rank-sum test with a 2-sided alternative. Abbreviations: **P < .01; ****P < .0001.

DISCUSSION

The growing quantity of genomic data in publicly available databanks provides unique opportunities to characterize the evolutionary and genomic relationships among Borreliaceae. Such characterizations can help prioritize individual genes for further investigation and provide a basis for genome-wide studies of genotype-phenotype correlations. Recent work has shown that lineages of B burgdorferi contain groups of strongly linked loci accessory elements, particularly surface lipoproteins encoded on plasmids, and these blocks of loci are linked to clinical manifestations [33]. Using published assemblies of Borreliaceae, it is shown here that this pattern extends across phylogenetic scales and appears to be a general organizing principle of the Borreliaceae pangenome. The large number of genomes included from across the family is a strength of this analysis and builds on previous reports [33, 60], providing a panoramic view of Borreliaceae genome diversity. Limitations include a convenience sample with a lack of standardization among sequencing, assembly, and submission procedures; incomplete and fragmented assemblies that may inflate the number of pangenome elements [56] and do not easily permit an analysis of associated genomic context; and a focus on variation at the level of homology clusters rather than single genetic variants.

The strong block structure of accessory genome elements across phylogenetic scales is remarkable. The monophyletic nature of these blocks implies that most have a single common ancestor and that the set of genes included in the block has been stable over evolutionary time. This pattern suggests that extant variation in Borreliaceae is the consequence of a relatively small number of profound ancestral changes involving multiple genetic elements, and that these marked changes occurred simultaneously or in short succession. The abrupt genesis of major genomic changes is consistent with a “punctuated equilibrium” model [61] of evolution. This pattern would explain the strong statistical linkage between physically unlinked markers, for example, OspC and the B burgdorferi ribosomal spacer [62].

It is notable that the patterns hold for major and minor lineages across scales. The fractal nature of this pattern may provide clues into the underlying mechanisms that result in this pattern. Fractals occur when the same laws apply recursively across scales [63]. The process of lineage generation may be an example of such a recursive law and suggests the following mechanism: The creation of a new genotype begins with a rare genomic event that involves the gain, loss, and/or exchange of multiple genes, probably in an arthropod or vertebrate host infected with multiple genotypes and/or different species. Such a lineage may clonally expand in a new niche, leading to geographically and/or ecologically structured populations. Over time, these populations accumulate further genetic diversity and eventually become differentiated enough to be classified as a species or genus. Along the way, sublineages form recursively by the same mechanism.

Although this mechanism raises questions about how Borreliaceae could survive abrupt genetic shifts, complex genomic rearrangements are increasingly recognized in other contexts, especially in cancer, where the phenomenon is termed chromothripsis [64–66]. The unique genome organization of Borreliaceae [39, 67]—in which conserved functions are encoded by a single chromosome and genes that mediate host–pathogen interactions, particularly surface lipoproteins encoded on plasmids—probably facilitates organism survival following an episode of dramatic change by allowing loss, gain, or exchange of multiple genes on plasmids while insulating the core metabolic or housekeeping functions of the spirochete on the chromosome and certain conserved plasmids. Other mechanisms may also predispose to complex genetic rearrangements: Both RF and LD Borreliaceae genomes contain sophisticated machinery for inducing gene conversation to support antigenic variation [36, 68–70]. Regulation of this process is paramount to genome integrity. Genomic instability involving multiple linked loci is readily observable in B burgdorferi, in which routine in vitro culture is associated with spontaneous plasmid loss [71, 72] and double-stranded breaks appear to result in plasmid loss in vitro [73]. B burgdorferi also possesses a bacteriophage [74–76], raising the possibility that bacteriophage-mediated lateral transfer of multiple loci has contributed to the evolutionary history of Borreliaceae.

The genetic structure of Borreliaceae informs the search for microbial loci associated with human disease in several ways. Lineage-level associations aggregate multiple genes’ worth of effects, potentially increasing the effect size of the genotype–phenotype link, and the consistent genetic profile among strains within a given lineage reduces the variance of these effects because strains are nearly isogenic. Thus, lineage-level associations should be relatively easy to identify, which may be why several are known. On the other hand, locus-level effects will be more difficult to identify because individual loci are rarely observed alone. Large samples will be required, and statistical techniques, such as conditioning on lineage-level effects, may help. Experimental methods in appropriate disease models involving reverse genetics will likely be necessary to assign or exclude causal roles for individual genes. Functional genomic approaches capable of screening multiple loci, such as TnSeq [77], CRISPRi using dCas9 [78, 79], and gain-of-function screens [80, 81], have recently been developed and are well-suited to the task of identifying experimentally important loci from among a larger set of candidates. Lineage-level associations greatly narrow the list of candidates for functional characterization and reverse genetic methods.

In summary, this analysis of Borreliaceae pangenomes reveals a conserved genomic architecture in which lineages are defined by correlated blocks of accessory genome elements, primarily encoded on plasmids and enriched for lipoproteins, a pattern that holds across phylogenetic scales. This unusual fractal genetic structure provides clues into the origin and evolution of Borreliaceae lineages and establishes a basis for reductionist studies of gene function informed by family-wide phylogenetic relationships.

Supplementary Data

Supplementary materials are available at The Journal of Infectious Diseases online (http://jid.oxfordjournals.org/). Supplementary materials consist of data provided by the author that are published to benefit the reader. The posted materials are not copyedited. The contents of all supplementary data are the sole responsibility of the authors. Questions or messages regarding errors should be addressed to the author.

Supplementary Material

jiae256_Supplementary_Data

Notes

Acknowledgments. The author gratefully acknowledges all of the sequence submitters and contributors who deposited Borreliaceae assemblies in GenBank (Supplemental File 2), as well as Klemen Strle, Ira Schwartz, and Allen Steere for helpful discussions.

Code availability. Analysis code is available at github.org/jacoblemieux/panborreliaceae.

Financial support. This work was supported by the National Institutes of Health (award number K99/R00AI148604).

Supplement sponsorship. This article appears as part of the supplement “Lyme Disease,” sponsored by the Lyme Disease Program at Massachusetts General Hospital and a generous gift from the Morse family.

Potential conflicts of interest. Author certifies no potential conflicts of interest.

The author has submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.

References

  • 1. Barbour  AG. Relapsing fever, In: Tick-borne diseases of humans. Washington, DC: ASM Press, 2014:268–91. [Google Scholar]
  • 2. Steere  AC, Strle F, Wormser GP, et al.  Lyme borreliosis. Nat Rev Dis Primers 2016; 2:16090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Lopez  J, Hovius JW, Bergström S. Pathogenesis of relapsing fever. Curr Issues Mol Biol 2021; 42:519–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Cutler  SJ. Relapsing fever—a forgotten disease revealed. J Appl Microbiol 2010; 108:1115–22. [DOI] [PubMed] [Google Scholar]
  • 5. Afzelius  A. Erythema chronicum migrans. Acta Derm Venereol 1921; 2:120–5. [PubMed] [Google Scholar]
  • 6. Steere  AC, Malawista SE, Hardin JA, Ruddy S, Askenase W, Andiman WA. Erythema chronicum migrans and Lyme arthritis. The enlarging clinical spectrum. Ann Intern Med 1977; 86:685–98. [DOI] [PubMed] [Google Scholar]
  • 7. Steere  AC, Malawista SE, Snydman DR, Andiman WA. Cluster of arthritis in children and adults in Lyme, Connecticut. Arthritis Rheum 1976; 20:824. [DOI] [PubMed] [Google Scholar]
  • 8. Burgdorfer  W, Barbour AG, Hayes SF, Benach JL, Grunwaldt E, Davis JP. Lyme disease—a tick-borne spirochetosis? Science 1982; 216:1317–9. [DOI] [PubMed] [Google Scholar]
  • 9. Güner  ES, Hashimoto N, Kadosaka T, Imai Y, Masuzawa T. A novel, fast-growing Borrelia sp. isolated from the hard tick Hyalomma aegyptium in Turkey. Microbiology 2003; 149:2539–44. [DOI] [PubMed] [Google Scholar]
  • 10. Kalmár  Z, Cozma V, Sprong H, et al.  Transstadial transmission of Borrelia turcica in Hyalomma aegyptium ticks. PLoS One 2015; 10:e0115520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Gofton  AW, Popa-Baez A, Takano A, et al.  Characterisation and comparative genomics of three new Varanus-associated Borrelia spp. from Indonesia and Australia. Parasit Vectors 2023; 16:317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Barbour  AG, Heiland RA, Howe TR. Heterogeneity of major proteins in Lyme disease borreliae: a molecular analysis of North American and European isolates. J Infect Dis 1985; 152:478–84. [DOI] [PubMed] [Google Scholar]
  • 13. Wilske  B, Preac-Mursic V, Schierz G. Antigenic heterogeneity of European Borrelia burgdorferi strains isolated from patients and ticks. Lancet 1985; 1:1099. [DOI] [PubMed] [Google Scholar]
  • 14. Stanek  G, Wewalka G, Groh V, Neumann R, Kristoferitsch W. Differences between Lyme disease and European arthropod-borne Borrelia infections. Lancet 1985; 1:401. [DOI] [PubMed] [Google Scholar]
  • 15. Stanek  G, Reiter M. The expanding Lyme Borrelia complex—clinical significance of genomic species? Clin Microbiol Infect 2011; 17:487–93. [DOI] [PubMed] [Google Scholar]
  • 16. Adeolu  M, Gupta RS. A phylogenomic and molecular marker based proposal for the division of the genus Borrelia into two genera: the emended genus Borrelia containing only the members of the relapsing fever Borrelia, and the genus Borreliella gen. nov. containing the members of the Lyme disease Borrelia (Borrelia burgdorferi sensu lato complex). Antonie Van Leeuwenhoek 2014; 105:1049–72. [DOI] [PubMed] [Google Scholar]
  • 17. Margos  G, Castillo-Ramirez S, Cutler S, et al.  Rejection of the name Borreliella and all proposed species comb. nov. placed therein. Int J Syst Evol Microbiol 2020; 70:3577–81. [DOI] [PubMed] [Google Scholar]
  • 18. Gupta  RS. Distinction between Borrelia and Borreliella is more robustly supported by molecular and phenotypic characteristics than all other neighbouring prokaryotic genera: Response to Margos' et al. ‘The genus Borrelia reloaded’. PLoS One 2019; 14:e0221397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Stevenson  B, Fingerle V, Wormser GP, Margos G. Public health and patient safety concerns merit retention of Lyme borreliosis–associated spirochetes within the genus Borrelia, and rejection of the genus novum Borreliella. Ticks Tick Borne Dis 2019; 10:1–4. [DOI] [PubMed] [Google Scholar]
  • 20. Margos  G, Wormser GP, Schwartz I, et al.  Evidence of taxonomic bias in public databases: the example of the genus Borrelia. Ticks Tick Borne Dis 2022; 13:101994. [DOI] [PubMed] [Google Scholar]
  • 21. Grillon  A, Scherlinger M, Boyer P-H, et al.  Characteristics and clinical outcomes after treatment of a national cohort of PCR-positive Lyme arthritis. Semin Arthritis Rheum 2018; 48:1105–12. [DOI] [PubMed] [Google Scholar]
  • 22. Coipan  EC, Jahfari S, Fonville M, et al.  Imbalanced presence of Borrelia burgdorferi s.l. multilocus sequence types in clinical manifestations of Lyme borreliosis. Infect Genet Evol 2016; 42:66–76. [DOI] [PubMed] [Google Scholar]
  • 23. van Dam  AP, Kuiper H, Vos K, et al.  Different genospecies of Borrelia burgdorferi are associated with distinct clinical manifestations of Lyme borreliosis. Clin Infect Dis 1993; 17:708–17. [DOI] [PubMed] [Google Scholar]
  • 24. Balmelli  T, Piffaretti JC. Association between different clinical manifestations of Lyme disease and different species of Borrelia burgdorferi sensu lato. Res Microbiol 1995; 146:329–40. [DOI] [PubMed] [Google Scholar]
  • 25. Maraspin  V, Nahtigal Klevišar M, Ružić-Sabljić E, Lusa L, Strle F. Borrelial lymphocytoma in adult patients. Clin Infect Dis 2016; 63:914–21. [DOI] [PubMed] [Google Scholar]
  • 26. Lenormand  C, Jaulhac B, De Martino S, Barthel C, Lipsker D. Species of Borrelia burgdorferi complex that cause borrelial lymphocytoma in France. Br J Dermatol 2009; 161:174–6. [DOI] [PubMed] [Google Scholar]
  • 27. Postic  D, Assous MV, Grimont PA, Baranton G. Diversity of Borrelia burgdorferi sensu lato evidenced by restriction fragment length polymorphism of rrf (5S)-rrl (23S) intergenic spacer amplicons. Int J Syst Bacteriol 1994; 44:743–52. [DOI] [PubMed] [Google Scholar]
  • 28. Wilske  B, Busch U, Fingerle V, et al.  Immunological and molecular variability of OspA and OspC. Implications for Borrelia vaccine development. Infection 1996; 24:208–12. [DOI] [PubMed] [Google Scholar]
  • 29. Wilske  B, Preac-Mursic V, Göbel UB, et al.  An OspA serotyping system for Borrelia burgdorferi based on reactivity with monoclonal antibodies and OspA sequence analysis. J Clin Microbiol 1993; 31:340–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Wormser  GP, Liveris D, Nowakowski J, et al.  Association of specific subtypes of Borrelia burgdorferi with hematogenous dissemination in early Lyme disease. J Infect Dis 1999; 180:720–5. [DOI] [PubMed] [Google Scholar]
  • 31. Wormser  GP, Brisson D, Liveris D, et al.  Borrelia burgdorferi genotype predicts the capacity for hematogenous dissemination during early Lyme disease. J Infect Dis 2008; 198:1358–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Jones  KL, Glickstein LJ, Damle N, Sikand VK, McHugh G, Steere AC. Borrelia burgdorferi genetic markers and disseminated disease in patients with early Lyme disease. J Clin Microbiol 2006; 44:4407–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Lemieux  JE, Huang W, Hill N, et al.  Whole genome sequencing of human Borrelia burgdorferi isolates reveals linked blocks of accessory genome elements located on plasmids and associated with human dissemination. PLoS Pathog 2023; 19:e1011243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Strle  K, Jones KL, Drouin EE, Li X, Steere AC. Borrelia burgdorferi RST1 (OspC type A) genotype is associated with greater inflammation and more severe Lyme disease. Am J Pathol 2011; 178:2726–39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Strle  K, Shin JJ, Glickstein LJ, Steere AC. Association of a Toll-like receptor 1 polymorphism with heightened Th1 inflammatory responses and antibiotic-refractory Lyme arthritis. Arthritis Rheum 2012; 64:1497–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Norris  SJ. Antigenic variation with a twist—the Borrelia story. Mol Microbiol 2006; 60:1319–22. [DOI] [PubMed] [Google Scholar]
  • 37. Skare  JT, Garcia BL. Complement evasion by Lyme disease spirochetes. Trends Microbiol 2020; 28:889–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Caine  JA, Coburn J. Multifunctional and redundant roles of Borrelia burgdorferi outer surface proteins in tissue adhesion, colonization, and complement evasion. Front Immunol 2016; 7:442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Casjens  S, Palmer N, van Vugt R, et al.  A bacterial genome in flux: the twelve linear and nine circular extrachromosomal DNAs in an infectious isolate of the Lyme disease spirochete Borrelia burgdorferi. Mol Microbiol 2000; 35:490–516. [DOI] [PubMed] [Google Scholar]
  • 40. Margos  G, Hepner S, Mang C, et al.  Lost in plasmids: next generation sequencing and the complex genome of the tick-borne pathogen Borrelia burgdorferi. BMC Genomics 2017; 18:422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Barbour  AG, Hayes SF. Biology of Borrelia species. Microbiol Rev 1986; 50:381–400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Tettelin  H, Masignani V, Cieslewicz MJ, et al.  Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial ‘pan-genome.’  Proc Natl Acad Sci U S A  2005; 102:13950–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Seemann  T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 2014; 30:2068–9. [DOI] [PubMed] [Google Scholar]
  • 44. Page  AJ, Cummins CA, Hunt M, et al.  Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 2015; 31:3691–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Bayliss  SC, Thorpe HA, Coyle NM, Sheppard SK, Feil EJ. PIRATE: a fast and scalable pangenomics toolbox for clustering diverged orthologues in bacteria. Gigascience 2019; 8:giz119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Price  MN, Dehal PS, Arkin AP. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol 2009; 26:1641–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Minh  BQ, Schmidt HA, Chernomor O, et al.  IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol 2020; 37:1530–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Köster  J, Rahmann S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 2012; 28:2520–2. [DOI] [PubMed] [Google Scholar]
  • 49. Setubal  JC, Reis M, Matsunaga J, Haake DA. Lipoprotein computational prediction in spirochaetal genomes. Microbiology 2006; 152:113–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Ihaka  R, Gentleman R. R: a language for data analysis and graphics. J Comput Graph Stat 1996; 5:299–314. [Google Scholar]
  • 51. Wickham  H, Averick M, Bryan J, et al.  Welcome to the Tidyverse. J Open Source Software 2019; 4:1686. [Google Scholar]
  • 52. Yu  G, Smith DK, Zhu H, Guan Y, Lam TT. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol 2017; 8:28–36. [Google Scholar]
  • 53. Li  H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 2018; 34:3094–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Kingry  LC, Replogle A, Batra D, et al.  Toward a complete North American Borrelia miyamotoi genome. Genome Announc 2017; 5:e01557-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Barbour  AG, Gupta RS. The family Borreliaceae (Spirochaetales), a diverse group in two genera of tick-borne spirochetes of mammals, birds, and reptiles. J Med Entomol 2021; 58:1513–24. [DOI] [PubMed] [Google Scholar]
  • 56. Tonkin-Hill  G, MacAlasdair N, Ruis C, et al.  Producing polished prokaryotic pangenomes with the Panaroo pipeline. Genome Biol 2020; 21:180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Cosson  J-F, Michelet L, Chotte J, et al.  Genetic characterization of the human relapsing fever spirochete Borrelia miyamotoi in vectors and animal reservoirs of Lyme disease spirochetes in France. Parasit Vectors 2014; 7:233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Haake  DA. Spirochaetal lipoproteins and pathogenesis. Microbiology 2000; 146(Pt 7):1491–504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Hepner  S, Kuleshov K, Tooming-Kunderud A, et al.  A high fidelity approach to assembling the complex Borrelia genome. BMC Genomics 2023; 24:401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Mongodin  EF, Casjens SR, Bruno JF, et al.  Inter- and intra-specific pan-genomes of Borrelia burgdorferi sensu lato: genome stability and adaptive radiation. BMC Genomics 2013; 14:693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Gould  SJ, Eldredge N. Punctuated equilibria: an alternative to phyletic gradualism. In: Schopf TJM, ed. Models in paleobiology. San Francisco, CA: Freeman, Cooper & Co, 1972:82–115. [Google Scholar]
  • 62. Wang  IN, Dykhuizen DE, Qiu W, Dunn JJ, Bosler EM, Luft BJ. Genetic diversity of ospC in a local population of Borrelia burgdorferi sensu stricto. Genetics 1999; 151:15–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Mandelbrot  BB. The fractal geometry of nature. 2nd ed. New York: Times Books, 1982. [Google Scholar]
  • 64. Stephens  PJ, Greenman CD, Fu B, et al.  Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell 2011; 144:27–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Fredga  K, Hall B. A complex familial translocation involving chromosomes 5, 9 and 13. Cytogenetics 1970; 9:294–306. [DOI] [PubMed] [Google Scholar]
  • 66. Shoshani  O, Brunner SF, Yaeger R, et al.  Chromothripsis drives the evolution of gene amplification in cancer. Nature 2021; 591:137–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Fraser  CM, Casjens S, Huang WM, et al.  Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi. Nature 1997; 390:580–6. [DOI] [PubMed] [Google Scholar]
  • 68. Meier  JT, Simon MI, Barbour AG. Antigenic variation is associated with DNA rearrangements in a relapsing fever Borrelia. Cell 1985; 41:403–9. [DOI] [PubMed] [Google Scholar]
  • 69. Zhang  JR, Hardham JM, Barbour AG, Norris SJ. Antigenic variation in Lyme disease borreliae by promiscuous recombination of VMP-like sequence cassettes. Cell 1997; 89:275–85. [DOI] [PubMed] [Google Scholar]
  • 70. Plasterk  RH, Simon MI, Barbour AG. Transposition of structural genes to an expression sequence on a linear plasmid causes antigenic variation in the bacterium Borrelia hermsii. Nature 1985; 318:257–63. [DOI] [PubMed] [Google Scholar]
  • 71. Barbour  AG. Plasmid analysis of Borrelia burgdorferi, the Lyme disease agent. J Clin Microbiol 1988; 26:475–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Grimm  D, Elias AF, Tilly K, Rosa PA. Plasmid stability during in vitro propagation of Borrelia burgdorferi assessed at a clonal level. Infect Immun 2003; 71:3138–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Takacs  CN, Nakajima Y, Haber JE, Jacobs-Wagner C. Cas9-mediated endogenous plasmid loss in Borrelia burgdorferi. PLoS One 2022; 17:e0278151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Eggers  CH, Samuels DS. Molecular evidence for a new bacteriophage of Borrelia burgdorferi. J Bacteriol 1999; 181:7308–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Eggers  CH, Kimmel BJ, Bono JL, Elias AF, Rosa P, Samuels DS. Transduction by ϕBB-1, a bacteriophage of Borrelia burgdorferi. J Bacteriol 2001; 183:4771–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Faith  DR, Kinnersley M, Brooks DM, et al.  Characterization and genomic analysis of the Lyme disease spirochete bacteriophage ϕBB-1. PLoS Pathog 2024; 20:e1012122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Troy  EB, Lin T, Gao L, et al.  Understanding barriers to Borrelia burgdorferi dissemination during infection using massively parallel sequencing. Infect Immun 2013; 81:2347–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Takacs  CN, Scott M, Chang Y, et al.  A CRISPR interference platform for selective downregulation of gene expression in Borrelia burgdorferi. Appl Environ Microbiol 2021; 87:e02519-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Murphy  BT, Wiepen JJ, He H, et al.  Inducible CRISPRi-based operon silencing and selective in trans gene complementation in Borrelia burgdorferi. J Bacteriol 2023; 205:e0046822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Dowdell  AS, Murphy MD, Azodi C, et al.  Comprehensive spatial analysis of the Borrelia burgdorferi lipoproteome reveals a compartmentalization bias toward the bacterial surface. J Bacteriol 2017; 199:e00658-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Pereira  MJ, Wager B, Garrigues RJ, et al.  Lipoproteome screening of the Lyme disease agent identifies inhibitors of antibody-mediated complement killing. Proc Natl Acad Sci U S A 2022; 119:e2117770119. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

jiae256_Supplementary_Data

Articles from The Journal of Infectious Diseases are provided here courtesy of Oxford University Press

RESOURCES