Skip to main content
PLOS Pathogens logoLink to PLOS Pathogens
. 2024 Aug 12;20(8):e1012400. doi: 10.1371/journal.ppat.1012400

Genomic and functional determinants of host spectrum in Group B Streptococcus

Chiara Crestani 1,¤a,*, Taya L Forde 1,¤b, John Bell 2, Samantha J Lycett 3, Laura M A Oliveira 4, Tatiana C A Pinto 4, Claudia G Cobo-Ángel 5,¤c, Alejandro Ceballos-Márquez 5, Nguyen N Phuoc 6, Wanna Sirimanapong 7, Swaine L Chen 8,9, Dorota Jamrozy 10, Stephen D Bentley 10, Michael Fontaine 2,¤d, Ruth N Zadoks 1,2,11
Editor: Lakshmi Rajagopal12
PMCID: PMC11341095  PMID: 39133742

Abstract

Group B Streptococcus (GBS) is a major human and animal pathogen that threatens public health and food security. Spill-over and spill-back between host species is possible due to adaptation and amplification of GBS in new niches but the evolutionary and functional mechanisms underpinning those phenomena are poorly known. Based on analysis of 1,254 curated genomes from all major GBS host species and six continents, we found that the global GBS population comprises host-generalist, host-adapted and host-restricted sublineages, which are found across host groups, preferentially within one host group, or exclusively within one host group, respectively, and show distinct levels of recombination. Strikingly, the association of GBS genomes with the three major host groups (humans, cattle, fish) is driven by a single accessory gene cluster per host, regardless of sublineage or the breadth of host spectrum. Moreover, those gene clusters are shared with other streptococcal species occupying the same niche and are functionally relevant for host tropism. Our findings demonstrate (1) the heterogeneity of genome plasticity within a bacterial species of public health importance, enabling the identification of high-risk clones; (2) the contribution of inter-species gene transmission to the evolution of GBS; and (3) the importance of considering the role of animal hosts, and the accessory gene pool associated with their microbiota, in the evolution of multi-host bacterial pathogens. Collectively, these phenomena may explain the adaptation and clonal expansion of GBS in animal reservoirs and the risk of spill-over and spill-back between animals and humans.

Author summary

Group B Streptococcus (GBS) is a bacterium that represents a health and food security threat in three major host groups: humans (in particular neonates), bovines, and fishes. However, the genomic mechanisms driving adaptation to these hosts remain unclear. Here, we use powerful statistical approaches to compare genomes of GBS from around the world. We found that GBS’ ability to exchange genes is a good indicator of how well it can adapt to different hosts. Additionally, three groups of genes appear crucial for GBS to infect either humans, cows, or fishes, regardless of the bacterial strain. These gene groups are also found in other similar bacteria that live in the same hosts or environments. Where a functional role is already known for two gene groups in humans and bovines, respectively, it is shown here for the first time for the third gene group in fishes. Our findings demonstrate the importance of considering the role of animals in the evolution of multi-host bacterial pathogens like GBS. Gene exchange between bacteria infecting multiple hosts represents a high threat to human health, as high-risk types of GBS could adapt and expand in animals, which could then act as reservoirs for highly pathogenic human infections.

1 Introduction

Group B streptococci (GBS) were first described by Rebecca Lancefield, who isolated them from “certified milk” and milk of a cow with mastitis [1], giving the only member of GBS its scientific name, Streptococcus agalactiae. Since then, GBS has become a well-recognized human pathogen, primarily affecting neonates and their mothers [2, 3]. It is also increasingly a cause of disease in non-pregnant adults, both as opportunistic infection in immunocompromised individuals [4] and as foodborne disease in adults without co-morbidities [57]. The public health burden posed by GBS, particularly on maternal and child health in low—and middle-income countries, is such that the World Health Organisation has identified the development of GBS vaccines for maternal immunization as a priority [8, 9]. Like group A streptococci (GAS, Streptococcus pyogenes) [10], GBS not only causes a spectrum of diseases but can also be carried asymptomatically. In contrast to GAS, GBS is primarily carried in the gastrointestinal and urogenital tract and, possibly more important from an evolutionary perspective, it also has a wide range of animal hosts.

GBS continues to be an important cause of bovine mastitis around the world, with negative impacts on milk quality and quantity, cow health, and farmers’ livelihoods [11]. It was largely eliminated from northern Europe, but re-emerged in the 21st century [1214]. GBS is also a major pathogen of tilapia, the world’s third most farmed fish species (FAO, 2022). It was first described in poikilothermic species (fishes, frogs) in the 1980s [15] and subsequently emerged as a major fish pathogen during the global expansion and intensification of fish farming [7, 16].

Multi-host bacterial pathogens can adapt to host species using diverse mechanisms. These range from point mutations [17] to the acquisition—through horizontal gene transfer (HGT)—of accessory genome content that provides a survival advantage in the context of a particular host, e.g., the ability to evade the host immune system or the acquisition of new metabolic pathways in response to the availability of particular nutrients [18, 19]. For example, a transposon carrying scpB, encoding a C5a-peptidase (a surface-associated serine protease) [20], which enables invasion of epithelial cells, has been associated with human GBS [21, 22], whereas nutritional adaptation is exemplified by the acquisition of a mobile genetic element (MGE) encoding a lactose-fermenting pathway, which appears to be critical for successful colonisation of the bovine mammary gland [23]. Considering the high genome plasticity of GBS [24], and the emergence of new clades in animal host species in the last hundred years [7, 13, 25], there is a need to gain a better understanding of mechanisms driving host adaptation, including HGT, not least to ensure that the emergence of new variants can be monitored during the anticipated introduction of human GBS vaccination (WHO, 2021).

Here, we analyse the GBS population using 1,254 genomes, rigorously selected to represent diversity in lineages, host species, and geographic origins, to gain insight into the role of genome plasticity and the accessory genome in GBS host adaptation. To facilitate this analysis, we propose nomenclature based on clonal groups (CG) and sublineages (SL), as recently introduced for other multi-host pathogens, and demonstrate that they comprise host generalists and host specialists lineages, whereby the latter can have a preferred or dominant host species (hereafter termed “host adapted”) or be limited to one host species or host group (hereafter termed “host restricted”). Using genome-wide association studies (GWAS), we show that human-, bovine-, and fish-adaptation of GBS are associated with C5a-peptidase (scpB), the lactose operon (Lac.2) and an accessory gene cluster known as Locus 3, respectively, regardless of GBS lineage, and that those three gene clusters seem to be largely necessary and sufficient to explain host adaptation. Through in vivo challenge experiments, we provide functional evidence for the critical role of Locus 3 in fish-associated GBS. This has important repercussions on public health, such as the adaptation and clonal expansion of the hypervirulent clone ST283 in fish, which was integral to the unexpected GBS foodborne-outbreak in Singapore in 2015, and to the current widespread dissemination of this clone in South-East Asia [7, 26, 27].

2 Materials and methods

2.1 Ethics statement

Animal experiments were in compliance with the ARRIVE guidelines and were carried out out in accordance with the U.K. Animals (Scientific Procedures) Act, 1986 and associated guidelines, and with the EU Directive 2010/63/EU for animal experiments. Challenge studies were conducted by a contract research organisation specialized in aquatic animal health (Ictyopharma, France). Studies were performed to Good Laboratory Practice standards, approved by an internal Ethics Committee and the French Ministry of Research, and audited by an independent Quality Assurance partner.

2.2 Genome selection

The aim of this study was to explore genomic mechanisms of host adaptation in GBS at the population level, and in particular across its sublineages. To this end, we applied in-depth comparative genomic approaches, such as GWAS, to a comprehensive global GBS dataset with a well-balanced proportion of human and animal isolates, and therefore a low bias towards human GBS. A detailed description of all materials and methods can be found in Section A in S1 Appendix.

First, we aimed to collate a dataset representative of the broad diversity of the GBS population in terms of clonal complexes, hosts of origin, geographical locations, and temporal range (n = 2,437) (S1 Table). Metadata were curated from the literature (March 2020) and whole genome sequence data were obtained either from public repositories or self-generated. Using Pandas v1.1.3 [28], we de-duplicated the initial dataset for clones that were over-represented based on a series of metadata variables and further filtered based on genome assembly quality (see section A.2 of the S1 Appendix). The resulting de-duplicated, quality-filtered dataset comprised 1,254 genomes.

2.3 Core genome analyses

IQ-TREE v.2.0.6 [29] was used to estimate a core genome phylogeny with a GTR model, from a recombination-free alignment file obtained with snippy v4.4.5 (https://github.com/tseemann/snippy) and gubbins v3.2.0 [30], using NGBS128 as reference. On the same alignment file, fastbaps v1.0.8 [31] was used to define genomic clusters.

A generalised linear model was run in RStudio v2022.07.01, R v4.2.0 (2022–04-22), on the output from gubbins, after having mapped the internal nodes and the leaves to the corresponding CG, to test association between the host-specialisation level (generalists, adapted, restricted) and the number of nucleotide bases identified in recombination blocks.

2.4 Accessory genome analyses

Accessory gene distances were calculated with GraPPLE (https://github.com/JDHarlingLee/GraPPLE) using the Jaccard similarity index from a gene presence/absence matrix; the latter was generated with panaroo v1.2.0 [32] from gff files annotated with Prokka v1.14.5 [33]. The resulting file was visualised with Graphia v2.2 [34].

2.5 Genome-wide association studies

Scoary v1.6.16 [35] was run to detect gene-enrichment in GBS from the three major host groups from the panaroo-generated presence/absence gene matrix. The host groups (human, bovine, and the poikilotherm group including fishes and frogs) were defined as binary phenotypes, with 1 as belonging to the host, and 0 as not belonging to the host; GBS genomes from host groups other than these three were always categorised as 0, including those originating from dead fish sampled at markets for which a possible contamination due to human handling could not be excluded.

The pyseer suite v1.3.3 [36] was used to assess unitig association with a linear mixed model on the same host groups as above.

2.6 In vivo assessment of the role of Locus 3 in fish infection

Challenge studies on Nile tilapia (Oreochromis niloticus) were carried out with ST7 and ST283 knock-out mutants of Locus 3 (ΔLocus3). Strains, plasmids and oligonucleotide primers used for mutagenesis are described in S1 Appendix (Table C.1 and Table C.2). In preparation for challenge, tilapia were transferred into 100 L experimental tanks (four replicates per GBS strain, two negative control replicates) under the same ecological conditions. For challenge experiments, ten fish from each experimental tank were challenged by intraperitoneal (IP) injection with 0.1 mL of phosphate buffered saline (PBS; negative control) or 0.1 mL of 105 CFU/mL of GBS in PBS. For contact challenge experiments, IP-challenged tilapia were co-habited with 40 healthy tilapia per tank.

Mortality of IP-challenged and contact-challenged fish was monitored for 21 days. Moribund fish were euthanized and included in mortality counts. Kidney and brain samples were taken from up to 10 fish per tank and used to check for presence of GBS (bacterial culture) and Locus 3 genes (PCR assays).

Kaplan-Meier survival analysis [37] was conducted to compare survival of knock-out mutants and their isogenic wild type strains. A statistical Log-rank test was conducted [38], using Microsoft Excel software, to compare survival curves of tilapia challenged by wild type vs knock-out mutants.

2.7 Time-scaled phylogenies of two clones of public health interest: CG283 and SL23

BEAST v2.6.6 [39] was used to estimate the time of emergence of two clones of public health interest, CG283 and SL23. The best model parameters chosen were: GTR+G4 with a strict clock and a constant coalescent population size. Xml files were run in triplicate with 200 million generations and sampling frequency of 10,000 (burn-in 10%). Resulting trees from the three replicate runs were combined with LogCombiner, and the final tree files were obtained with TreeAnnotator and visualised with FigTree v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) and Microreact [40].

The Maximum Clade Credibility trees from the two BEAST datasets were used together with gene presence/absence for scpB, Lac.2, and Locus 3 in a discrete traits analyses, as well as with host. To infer the gene presence/absences at the ancestral nodes in the trees, the ancestral character estimation function (ace) within R-package ape [41] was used with a discrete asymmetric model (all rates different, ARD).

3 Results and discussion

3.1 The global GBS population is composed of host-generalist and host-specialist lineages

The GBS global population comprises: i) host generalists, defined here as having no more than 80% of isolates originating from a single host species (human, bovine, camel) or host group (poikilothermic species, including fishes and frogs; for simplicity, this host group is referred to as “fish” throughout the manuscript), and host-specialist lineages, defined here as having more than 80% of isolates originating from a single host or host group, and further subdivided into ii) host-adapted lineages, which have a clear host predilection, defined here as having more than 80% but less than 98% of isolates originating from a single host or host group; and iii) host-restricted lineages, whose occurrence is almost exclusively associated (≥98%) or completely restricted to a single host species or host group (Fig 1A). The cut-offs were chosen by plotting in a histogram the prevalence of the dominant host within each CG (e.g., CG23: dominant host bovine, prevalence 64.78%; CG22: dominant host human, prevalence 100%) (Fig 1B) (see Section A.3 in S1 Appendix).

Fig 1. Diagram illustrating host specialism levels in Group B Streptococcus (GBS) clonal groups (CG).

Fig 1

A) Host generalist lineages show extensive between-lineage homologous recombination and the three host-associated accessory gene clusters (scpB, Lac.2, Locus 3) are primarily found in isolates from their associated host, while they are lacking from isolates from other hosts. Host restricted lineages can be associated with reductive evolution (e.g., gene loss of function and genome reduction as in CG260, CG261 and CG552), they carry host-associated genes (e.g., CG260, CG261 and CG552 all carry Locus 3) and they either show absence of recombination (CG260, CG261 and CG552) or recombination limited within CG (e.g., CG61 and CG91). Host-adapted CG are primarily associated with one host and they show all the characteristics of the host restricted lineages except for genome reduction and pseudogenisation. B) Thresholds used for the categorisation of CG in the three levels of host specialism; the prevalence of genomes within each CG associated with the dominant host species (x-axis) was plotted to identify cut-offs, where the y-axis represents the number of CG corresponding to a given host prevalence.

Of 15 sub-populations that were identified by hierarchical Bayesian clustering, largely in alignment with the topology of the core gene phylogeny (Fig 2A), five were host generalists, i.e. sublineage (SL) 1, SL23, SL103, SL283, and SL314; four were host-adapted, i.e. SL17, SL19, SL26 and SL130 (all with a predilection for the human host); and the remaining six were host restricted, i.e. SL22 (human), SL61, SL91 (bovine), SL552 (fish), SL609 and SL612 (camel). In this nomenclature, sublineages are named after the most common sequence type (ST) in each subpopulation based on 7-gene multi-locus sequence typing (MLST) nomenclature (Fig C.1 in S1 Appendix). Within each SL, clonal groups (CG, n = 23) were defined based on phylogenetic sub-clusters using the same nomenclature principle as for SL (Fig C.1 in S1 Appendix). Among CG, the same cut-offs used for SL identified nine host generalist CG, six host-adapted CG, and eight host-restricted CG (Figs 1 and 2B). Two generalist SL (SL1, SL283) comprised three generalist CG each (CG1, CG459, and CG817; CG6, CG7, and CG283, respectively), and one host restricted SL comprised three restricted CG (SL552, including CG260, CG261, and CG552). However, SL23, which was classified as host generalist, encompassed the host generalist CG23 as well as the host-adapted CG24, which is primarily found in humans. The two CG are associated with different capsular types: the human-adapted CG24 primarily comprises isolates from capsular type Ia, whereas CG23 is mostly associated with capsular type III, except for a human-associated subclade of type Ia and II isolates (for more detail, please refer to our Microreact project at microreact.org).

Fig 2. Population structure and pangenome of Group B Streptococcus (GBS).

Fig 2

A) Maximum-likelihood phylogenetic tree of 1,254 GBS genomes; leaf colours indicate host of origin, and external strips show sublineage (SL), clonal group (CG) and homologous recombination; tree was rooted on an out-group of five reference genomes from Streptococcus pyogenes (hidden); B) Prevalence of host species within each CG; C) Correlation of number of genes and assembly size; SL552 shows a marked pseudogenisation and reduced genome size; D) Correlation of average number of recombination bases and recombination blocks of each CG, as well as recombination to mutation (r/m) rate; E) Recombination observed in the three categories of host-specialism; the difference between groups is statistically significant (p-value<0.0001).

The existence of host-generalist and host-specialist lineages within populations of multi-host bacterial pathogens is well-described, for example in Salmonella enterica [42], Campylobacter jejuni [43, 44], and Staphylococcus aureus [18, 19]. These observations have been associated with genomic phenomena that may lead to host-restriction, typically in the case of pseudogenisation and/or gene loss, as described for SL61 [45] and SL552 (Fig 2C) [46], or to host jumps, which provide access to a new accessory gene pool, with subsequent host-adaptation, which is often a result of acquisition of accessory genes that provide an adaptive advantage to the new ecological niche.

3.2 Host-specialism among GBS lineages is associated with different levels of genome plasticity

Recombination can also play a role in host-adaptation [47]. We observed marked differences in recombination between clonal groups of GBS, ranging from large recombination blocks in, e.g., host generalists CG283 and CG817 to absence of recombination in, e.g., host-restricted CG552, CG260, CG261 (Fig 2A and 2D). Host-generalist lineages appear more subject to recombination than host-specialists, as illustrated by the number of nucleotide bases present within recombination blocks, for which the difference is statistically significant (t-value generalists = 107.56, adapted = -15.42, restricted = -28.12, p-value<0.0001) (Fig 2E).

Our results are indicative of a higher genome plasticity of host-generalist lineages compared to host-specialists, in particular to host restricted lineages, and suggest that the ability to uptake and retain foreign DNA, including accessory genes that could provide a survival advantage in new niches and hosts, is superior in host-generalists (e.g., SL283). The lack of shared recombination between some lineages (e.g., SL1, SL19 and SL283 vs others; SL23 vs others; SL91 vs others, SL61 vs others), coupled with results from analysis of the accessory gene set (section 3.3), suggests the existence of some barriers to genetic exchange (i.e., HGT) within GBS. These barriers are unlikely to be ecological (e.g., segregation [48]), as at least some lineages co-exist in the different host populations and in the same geographical areas. As an example, GBS isolates beloning to ST1 (part of SL1/CG1) were reported in both humans and cattle in Colombia [49] and in several Northern European countries [50, 51], where ST23 (part of the SL23) was also detected in both hosts. Rather, these are more likely mechanistic barriers that can act as a defense against the uptake of foreign DNA, such as restriction-modification systems (RMS), CRISPR, or antiphage systems, as described by Mourkas and colleagues [48].

3.3 The accessory genome in GBS is lineage-associated

To explore the role of the accessory genome content in host adaptation, we used network analysis of accessory gene distances. This shows distinct clusters of accessory genome content that align with SL (and therefore with CG) rather than host species (Fig 3), noting that SL and host species are inherently fully concordant for host-restricted lineages, such as SL552 or SL91. These observations suggest that the accessory gene set as an ensemble depends more on the sublineage of origin of the isolates than on their ecological niche in GBS. This is in contrast to what is reported in S. aureus by Richardson and colleagues [19], who showed an association between host species and the whole set of accessory genes. These results, coupled with previous knowledge on the association of MGE encoding host-associated virulence factors or metabolic pathways (e.g., scpB in humans [20, 52] and Lac.2 in bovines [23, 51, 53]), led us to hypothesise that a limited number of acquired genes might drive host-adaptation in GBS.

Fig 3. Accessory gene distance network of 1,254 Group B Streptococcus (GBS) genomes.

Fig 3

A) Major host groups (human, bovine, fish and camel) are overlaid to the nodes; B) Sublineages (SL) defined by fastbaps and renamed based on an inheritance principle from 7-gene MLST nomenclature are shown. The two panels show association of accessory gene clusters with SL, and a lack of clustering based on host species, unless when this is a direct result of host-specific lineages (e.g., SL552, SL91, SL61).

3.4 Specific gene clusters drive host-adaptation in GBS

The hypothesis that a specific subset of genes drives host association was tested with GWAS using two approaches, one pangenome-based and the other unitig-based. Both analyses identified three accessory gene clusters which were strongly associated with humans or animal hosts, as detailed below.

3.4.1 GBS in humans: The scpB transposon

The pangenome-based approach identified the scpB transposon as significantly positively associated with the human host (p-value for scpB was 1.26x10-131; all p-values reported in this paper for pan-GWAS analyses were corrected with the Benjamini-Hochberg method) (S2 Table), and similar results were obtained with the unitig-based approach (Fig 4A and C.2A in S1 Appendix). The scpB gene is a transposon-encoded virulence factor that is known for its higher prevalence among human isolates [21, 52]. It has been shown both through in vitro and in vivo approaches how this gene interacts negatively with the human host immune system (cleaving the C5a complement component) [54, 55] and how it contributes to GBS cellular adhesion and invasion by binding to fibronectin [56, 57]. Additionally, it has been associated with the ability to colonise or infect the human host in other human pathogenic streptococci (Group A Streptococcus, Streptococcus dysgalactiae subsp. equisimilis and Streptococcus canis) [20, 52]. In vitro work suggests that scpB likely plays no role in bovine GBS infections [20], even when it is carried by GBS cattle isolates. Its role in fish infections has not been assessed. Interestingly, in our dataset, scpB in GBS from fish was uniquely found in a lineage shared with humans (SL283), which could explain why this is the only lineage from fish that shows evidence for zoonotic transmission (see section 3.6.1).

Fig 4. Host-associated accessory gene clusters in Group B Streptococcus (GBS).

Fig 4

A) Manhattan plot of human-associated unitigs mapped to reference genome NZ_CP019979 (only genes/elements detected as significant by both GWAS methods are here annotated); B) Manhattan plot of bovine-associated unitigs mapped to reference genome CP008813; C) Circular maximum-likelihood phylogenetic tree of 1,254 GBS genomes. Leaf colours indicate host of origin, whereas the three external strips show presence/absence of the three main host-associated accessory gene clusters (human-scpB, bovine-Lac.2, fish-Locus 3, respectively).

3.4.2 GBS in cattle: The lactose operon Lac.2

In bovine GBS, significant genes with both GWAS approaches corresponded to a 9 to 11-gene cluster, the Lac.2 operon, in particular its genes lacEG (Scoary p-value 4.69x10-168; Fig 4B and C.2B in S1 Appendix, and S3 Table). In GBS, a Lac.2-positive genetic background is associated with phenotypic lactose fermentation, which has been observed primarily in cattle GBS isolates with in vitro and transcriptomic approaches [51, 53]. In cattle, GBS only causes infection of the mammary gland, which is rich in lactose (milk sugar). The acquisition of metabolic pathways (here: Lac.2) in response to nutrient availability (here: lactose) is a known driver of niche adaptation [43, 47, 58], and fermentation of lactose via lactose operons is known to promote growth in lactose-rich environments such as the bovine udder not only in GBS, but in other mastitis-causing Gram-positive (Streptococcus uberis, Streptococcus dysgalactiae subsp. dysgalactiae [23, 53]) and Gram-negative organisms (Klebsiella pneumoniae [58]), which strengthens the argument that acquisition of lactose-fermenting genes drives niche adaptation to the bovine mammary gland.

3.4.3 GBS in fish: Locus 3

In fish GBS, the unitig-based approach was unsatisfactory, which was likely due to a strong population structure effect, as genomes from this host group are found only in two SL (SL552 and SL283). With the pangenome-based approach, genomes from fish GBS appeared to be enriched for genes belonging to a cluster known as Locus 3 (SE 99.5%; SP 72.6%; BH p-value 9.62x10-85) (S4 Table). Locus 3, which is described as fish-associated [59], was present in 99.5% of fish and frog GBS genomes and only in a minority of non-fish assemblies, especially in the generalist SL283, a SL primarily shared between humans and fish. Unlike for scpB in humans and Lac.2 in bovines, there was no prior evidence for a functional role of Locus 3 in fish (see section 3.5). However, we detected a conserved segment of this element in all available genomes (n = 212, NCBI, June 2024) from another important agent of fish streptococcosis, Streptococcus iniae (Fig C.3 in S1 Appendix) [60], highlighting its association with the aquatic niche and its possible impact in streptococcal adaptation to fish.

The significant association of the three accessory gene clusters with the three main GBS host groups across SL and CG (Fig 4C) as well as with other streptococci pathogenic to the same host species, suggests that they are each major drivers of adaptation to the distinct host niches in GBS lineages, and possibly more broadly in the genus Streptococcus. As mentioned above, functional evidence for the role of scpB in human infections and of Lac.2 in bovine mastitis already exists (sections 3.4.1 and 3.4.2). This was not the case for Locus 3, which is why we decided to proceed with in vivo challenge experiments in fish after mutagenesis experiments.

3.5 Functional evidence of the role of locus 3 in piscine GBS

Locus 3 is a 17-gene cluster inserted between a GNAT family N-acetyltransferase and a class I SAM-dependent methyltransferase, which includes, among others, genes for carbohydrate transport and metabolism [59] (Fig 5A). To test the functional relevance of this genomic determinant of fish-association of GBS, Nile tilapia (Oreochromis niloticus) were challenged with a multi-gene knock-out mutant (ΔLocus3) and wildtype (WT) isolates of two highly virulent fish-associated GBS strains (ST7 and ST283). Animals were either challenged intraperitoneally (IP) or through cohabitation with the IP-challenged fish. For both challenge routes and both GBS strains, a significant reduction in mortality was observed in groups challenged with the ΔLocus3 mutant relative to WT (Fig 5B–5E and Table C.3 in S1 Appendix). The observed attenuation of GBS after removal of Locus 3 provides the first functional evidence of its importance in GBS infection of fish.

Fig 5. Locus 3 gene cluster and survival curves of Nile tilapia (Oreochromis niloticus).

Fig 5

A) Locus 3 gene cluster organisation and associated gene functions; B) Tilapia challenged intraperitoneally (IP) with Group B Streptococcus (GBS) Sequence Type (ST) 7 wild type (WT) or its isogenic mutant (ΔLocus3); C) Tilapia challenged with GBS ST7 (WT) or its isogenic mutant (ΔLocus3) through cohabitation with IP-challenged fish; D) Tilapia challenged IP with GBS ST283 (WT) or its isogenic mutant (ΔLocus3); E) Tilapia challenged with GBS ST283 (WT) or its isogenic mutant (ΔLocus3) through cohabitation with IP-challenged fish. All experiments (B-E) included a negative control (mock challenge with phosphate buffered saline). Below survival curves, number at risk has been indicated for each timepoint.

3.6 The impact of host-associated gene clusters on public health

3.6.1 CG283, a generalist clade associated with foodborne infection

CG283 (Fig 6A) is widespread in fishes in the South-East Asian region [7] and it is carried asymptomatically by close to 3% of the human population in some areas [61]. In 2015, it was recognized as the causative agent of the first ever reported outbreak of GBS foodborne disease. All CG283 isolates in our study carried the fish-associated Locus 3, regardless of whether they originated from fish or humans, which is probably due to the fact that most human ST283 originated from people infected through fish consumption [26]. Only a subset of ST283 isolates carries scpB, as previously observed in Vietnam and Thailand [62]. Traits analysis predicted scpB to be present in the common ancestor of CG283 isolates, and to have been lost multiple times, with two major events corresponding to two branches (the first around 1992, and the second around 2003) (Fig 6A). The presence of scpB in the ancestral node of CG283 suggests a probable human origin for this clade, which would have acquired Locus 3 concomitantly to the expansion of aquaculture (1980s), illustrating how initial spill-over from humans to animals can be followed by host-adaptation and subsequent amplification in the new host, a phenomenon also described in, e.g., S. aureus [18]. We hypothesise that the exchange of accessory genetic material between human and fish GBS is favoured by frequent direct or indirect contact between humans and fish in South-East Asia, e.g., through recycling of human waste as fish feed, floating fish farms or fish handling and consumption. A CG283 sub-clade mostly carrying scpB (Fig 6A) is associated with human infections, and it includes genomes from GBS isolated during the 2015 Singaporean foodborne outbreak that was linked with the consumption of contaminated raw fish [7, 26], an outbreak that resulted from spill-back of the now fish-adapted CG283 into humans. In this sub-clade, however, scpB appears to have been lost on multiple occasions. The second major loss of scpB, which happened around 2003 (Fig 6A), appears to have taken place in Vietnam, and was followed by expansion of a CG283 sub-clade primarily associated with fish. The emergence of CG283 in Brazil, estimated around 2012 but whose first available isolates date to 2015, likely resulted from an introduction of the Vietnamese sub-clade, as exemplified by the Brazilian isolates clustering within the Vietnamese ones (Fig 6A). So far, very few human infections with GBS ST283 have been reported from Vietnam [7], and no human infections associated with CG283 have been reported in South America. This is probably not a result of differences in eating habits compared to the Asian population, as a percentage as high as 43.45% of the Brazilian population is reported to consume raw or under-cooked seafood [63]. Rather, absence of reported CG283 infections in humans could be due to the absence of scpB. The evolution of CG283 illustrates how ongoing surveillance of GBS across host species is needed to detect emergence of new clades as well as the acquisition and loss of accessory genome content that informs on the hazard to public health and food security.

Fig 6. Time-scaled phylogenies of two clones of public health interest in Group B Streptococcus (GBS).

Fig 6

Branch colour shows host-jumps, leaf colour the host of isolation, and external strips show presence-absence of genes and features of interest. A) Clonal group (CG) 283 is predicted to have originated around 1986 from humans. Its population shows diversification in multiple sub-clades over the years. Genomes associated with human cases from the 2015 outbreak of foodborne infection in Singapore are part of a sub-clade in which most of the genomes carry the human-associated transposon bearing scpB. In recent years, CG283 has been described in Brazil (first isolates available are from 2015); this introduction likely happened from Vietnam, and it is associated with a scpB-negative genotype. The two main scpB loss events are indicated with arrows on the phylogeny; B) Sublineage (SL) 23 likely originated in humans at the beginning of the 18th century. It is described as a host-generalist clonal complex (CC). However, when assessing host prevalence within its two CG (CG23 and CG24), as well as the distribution of human and bovine-associated accessory gene clusters (scpB and Lac.2, respectively) the two CG show distinct patterns of host-tropism. CG24 appears as human-adapted, whereas isolates of CG23, whose common ancestor is predicted to have acquired Lac.2 in the second half of the 1800s (red arrow), show a tendency towards bovine-specialism, but retain the ability to cause human infections (scpB is highly conserved).

3.6.2 SL23, a historical host specialist that gave rise to a host generalist

SL23, which was referred to as CC23 in previous literature, is an old GBS sublineage, with emergence estimated to have occurred in the middle of the 18th century in our study and by others [25]. It has long been considered a host-generalist, but there are two CG within SL23, with distinct patterns in terms of host tropism. The first one, CG24, is a host-adapted CG that primarily causes disease in humans (Fig 2), with occasional host jumps to cattle. These host jumps are associated with sporadic acquisitions of Lac.2 (Fig 6B). Reflecting its human host association, scpB is strongly conserved in this CG, with the exception of one bovine isolate. The second CG in SL23 is CG23, which is a host-generalist clade that primarily affects humans and cattle, with additional reports in, e.g., companion animals, aquatic mammals and reptiles [64]. Mirroring the situation for scpB in CG24, Lac.2 is highly conserved in CG23, with the exception of one human isolate (Fig 6B), explaining its ability to infect the bovine mammary gland. As in CG283, scpB has been lost multiple times independently from CG23, suggesting that this virulence factor is lost easily in the absence of selective pressure from the human host. Considering that the phylogeny suggests bidirectional interspecies transmission of CG23 (Fig 6B) ongoing monitoring of CG23 infections in both host species would be needed to gauge risk of bovine GBS to public health.

4 Conclusions

Multi-host bacterial pathogens often comprise lineages that show distinct patterns of host-tropism, which can vary from host generalism, such as CC130 and CC398 in S. aureus [19], to host restriction, such as for the serovars Gallinarum and Pullorum of S. enterica subsp. enterica in poultry [42]. A multitude of interacting ecological (e.g., physical segregation), genomic (e.g., ability of a genome to uptake DNA, homologous and non-homologous recombinations, gene loss and pseudogenisation) and adaptive mechanisms can affect host-range [48]. Genome plasticity and HGT within GBS have been described as major driving forces in the evolution of GBS [24]. Here, we show that those forces may differ between lineages, with higher genome plasticity in host-generalists than in host specialists.

As in other bacterial pathogens [19, 48], adaptive mechanisms that confer a fitness advantage within the context of a particular niche play a crucial role in GBS. Despite the diversity of lineages associated with each major host category, and the diversity of host categories associated with many major lineages, we have identified that three key accessory gene clusters seem largely necessary and sufficient to explain host adaptation. Two of those clusters, i.e., scpB in human GBS [20] and Lac.2 in bovine GBS [23], were known to be genomic and functionally relevant markers of host association, and we provide functional evidence for the critical role of Locus 3 in fish infections in different genetic backgrounds of GBS. Remarkably, these three elements are not only present in GBS but are also found in other streptococcal species that share ecological niches with GBS (e.g., scpB in the human or zoonotic pathogens Group A Streptococcus, Streptococcus dysgalactiae subsp. equisimilis and Streptococcus canis, Lac.2 in the bovine mastitis pathogens S. dysgalactiae subsp. dysgalactiae and S. uberis, and Locus 3 in the fish pathogen S. iniae), suggesting their broader importance within the genus Streptococcus in each of these hosts or niches.

Through analysis of two multihost clades (CG283 and SL23), we show the impact of acquisition or loss of these three elements on host-adaptation and expansion following host-jumps, and illustrate the public health relevance of such genomic mechanisms for GBS. We propose that, by focusing genomic surveillance on clones with high potential for host-jumps (i.e., those with high levels of recombination, such as SL283) and their acquisition of host-associated accessory gene clusters, strategies and interventions to reduce the risk of interspecies transmission could be improved. In addition, genomic surveillance of GBS across host species will be critical when human GBS vaccines are rolled out, to monitor HGT of vaccine targets such as the capsule, a phenomenon already observed in GBS [65, 66]. Selective pressure induced by vaccination could favour emergence of new serotypes, as also observed for Streptococcus pneumonaie [67], but with a need to consider both human and animal hosts to understand and monitor the evolution of GBS.

Supporting information

Supplementary data files for this article include:

S1 Appendix. Supplementary data file.

Detailed materials and methods, additional results, and supplementary figures and tables.

(PDF)

ppat.1012400.s001.pdf (15.6MB, pdf)
S1 Table. Dataset and results file.

Dataset of genomes used for this study, with accompanying metadata, accession numbers, and main results of genomic analyses.

(XLSX)

ppat.1012400.s002.xlsx (326.2KB, xlsx)
S2 Table. Scoary results.

Scoary results for human-enriched genes.

(XLSX)

ppat.1012400.s003.xlsx (192.5KB, xlsx)
S3 Table. Scoary results.

Scoary results for bovine-enriched genes.

(XLSX)

ppat.1012400.s004.xlsx (219.5KB, xlsx)
S4 Table. Scoary results.

Scoary results for fish-enriched genes.

(XLSX)

ppat.1012400.s005.xlsx (170.2KB, xlsx)

Acknowledgments

We would like to thank John Lees for bioinformatic support for pyseer. We would also like to thank Marta Mansos-Lourenco, Carla Parada-Rodrigues and Sylvain Brisse for constructive feedback on the manuscript.

Data Availability

All raw or assembled Illumina sequence data is available from the European Nucleotide Archive (ENA) or from the National Center for Biotechnology Information (NCBI), with accession numbers provided in S1 Table. Phylogenetic analysis of the 1,254 Group B Streptococcus genomes, together with all metadata, is available at the project URL https://microreact.org/project/vkEcchaHsSJa6sLhSGvmHF-group-b-streptococcus-host-adaptation-2023 within Microreact.

Funding Statement

RNZ and MF received funding from a Technology Strategy Board, Agri-Tech Catalyst - Early Stage Feasibility grant (Ref 132168), for funding the mutagenesis and in vivo studies. SDB received funding from the Bill & Melinda Gates Foundation (Opportunity INV-010426) for the JUNO Project, which funded sequencing of part of the isolates included in this study. DJ and in part CC salaries were provided by the JUNO project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Lancefield R. C. A serological differentiation of human and other groups of hemolytic streptococci. Journal of Experimental Medicine. 1933; 57(4):571–595 doi: 10.1084/jem.57.4.571 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Hall J., Adams N. H., Bartlett L., et al. Maternal disease with group B Streptococcus and serotype distribution worldwide: systematic review and meta-analyses. Clinical Infectious Diseases. 2017; 65(suppl 2):S112–S124. doi: 10.1093/cid/cix660 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Madrid L., Seale A. C., Kohli-Lynch M., et al. Infant group B streptococcal disease incidence and serotypes worldwide: systematic review and meta-analyses. Clinical Infectious Diseases. 2017; 65(suppl 2):S160–S172. doi: 10.1093/cid/cix656 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Navarro-Torné A., Curcio D., Moïsi J. C., et al. Burden of invasive group B Streptococcus disease in non-pregnant adults: A systematic review and meta-analysis. PLoS ONE. 2021; 16(9):e0258030. doi: 10.1371/journal.pone.0258030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Luangraj M., Hiestand J., Rasphone O., et al. Invasive Streptococcus agalactiae ST283 infection after fish consumption in two sisters, Lao PDR. Wellcome Open Research. 2022; 7:148. doi: 10.12688/wellcomeopenres.17804.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Zohari Z., Barkham T., Mohamad Maswan N., et al. Fish-associated Streptococcus agalactiae ST283: first human cases reported from Malaysia. Journal of Medical Microbiology. 2023; 72(6):001729. doi: 10.1099/jmm.0.001729 [DOI] [PubMed] [Google Scholar]
  • 7. Barkham T., Zadoks R. N., Azmai M. N. A., et al. One hypervirulent clone, sequence type 283, accounts for a large proportion of invasive Streptococcus agalactiae isolated from humans and diseased tilapia in Southeast Asia. PLoS Neglected Tropical Diseases. 2019; 13(6):e0007421. doi: 10.1371/journal.pntd.0007421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Vekemans J., Crofts J., Baker C. J., et al. The role of immune correlates of protection on the pathway to licensure, policy decision and use of group B Streptococcus vaccines for maternal immunization: considerations from World Health Organization consultations. Vaccine. 2019; 37(24):3190–3198. doi: 10.1016/j.vaccine.2019.04.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.World Health Organization (WHO), et al. WHO preferred product characteristics for group B Streptococcus vaccines. World Health Organization; 2017.
  • 10. Hung T. Y., Phuong L. K., Grobler A., et al. Antibiotics to eradicate Streptococcus pyogenes pharyngeal carriage in asymptomatic children and adults: a systematic review. Journal of Infection. 2024;. doi: 10.1016/j.jinf.2024.01.003 [DOI] [PubMed] [Google Scholar]
  • 11. Krishnamoorthy P., Suresh K. P., Jayamma K. S., et al. An understanding of the global status of major bacterial pathogens of milk concerning bovine mastitis: A systematic review and meta-analysis (scientometrics). Pathogens. 2021; 10(5):545. doi: 10.3390/pathogens10050545 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Jørgensen H., Nordstoga A., Sviland S., et al. Streptococcus agalactiae in the environment of bovine dairy herds—rewriting the textbooks? Veterinary Microbiology. 2016; 184:64–72. doi: 10.1016/j.vetmic.2015.12.014 [DOI] [PubMed] [Google Scholar]
  • 13. Crestani C., Forde T. L., Lycett S. J., et al. The fall and rise of group B Streptococcus in dairy cattle: reintroduction due to human-to-cattle host jumps? Microbial Genomics. 2021; 7(9):000648. doi: 10.1099/mgen.0.000648 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Katholm J., Bennedsgaard T., Koskinen M., et al. Quality of bulk tank milk samples from Danish dairy herds based on real-time polymerase chain reaction identification of mastitis pathogens. Journal of Dairy Science. 2012; 95(10):5702–5708. doi: 10.3168/jds.2011-5307 [DOI] [PubMed] [Google Scholar]
  • 15. Amborski R., Snider T. 3rd, Thune R., et al. A non-hemolytic, group B Streptococcus infection of cultured bullfrogs, Rana catesbeiana, in Brazil. Journal of Wildlife Diseases. 1983; 19(3):180–184. doi: 10.7589/0090-3558-19.3.180 [DOI] [PubMed] [Google Scholar]
  • 16. Kawasaki M., Delamare-Deboutteville J., Bowater R. O., et al. Microevolution of Streptococcus agalactiae ST-261 from Australia indicates dissemination via imported tilapia and ongoing adaptation to marine hosts or environment. Applied and Environmental Microbiology. 2018; 84(16):e00859–18. doi: 10.1128/AEM.00859-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Viana D., Comos M., McAdam P. R., et al. A single natural nucleotide mutation alters bacterial pathogen host tropism. Nature Genetics. 2015; 47(4):361–366. doi: 10.1038/ng.3219 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Lowder B. V., Guinane C. M., Zakour N. L. B., et al. Recent human-to-poultry host jump, adaptation, and pandemic spread of Staphylococcus aureus. Proceedings of the National Academy of Sciences. 2009; 106(46):19545–19550. doi: 10.1073/pnas.0909285106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Richardson E. J., Bacigalupe R., Harrison E. M., et al. Gene exchange drives the ecological success of a multi-host bacterial pathogen. Nature Ecology and Evolution. 2018; 2(9):1468. doi: 10.1038/s41559-018-0617-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Gleich-Theurer U., Aymanns S., Haas G., et al. Human serum induces streptococcal c5a peptidase expression. Infection and Immunity. 2009; 77(9):3817–3825. doi: 10.1128/IAI.00826-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Sørensen U. B. S., Poulsen K., Ghezzo C., et al. Emergence and global dissemination of host-specific Streptococcus agalactiae clones. MBio. 2010; 1(3):e00178–10. doi: 10.1128/mBio.00178-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Shabayek S., Spellerberg B. Group B streptococcal colonization, molecular characteristics, and epidemiology. Frontiers in Microbiology. 2018; 9:437. doi: 10.3389/fmicb.2018.00437 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Richards V. P., Lang P., Bitar P. D. P., et al. Comparative genomics and the role of lateral gene transfer in the evolution of bovine adapted Streptococcus agalactiae. Infection, Genetics and Evolution. 2011; 11(6):1263–1275. doi: 10.1016/j.meegid.2011.04.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Richards V. P., Velsko I. M., Alam T., et al. Population gene introgression and high genome plasticity for the zoonotic pathogen Streptococcus agalactiae. Molecular Biology and Evolution. 2019; 36(11):2572–2590. doi: 10.1093/molbev/msz169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Da Cunha V., Davies M. R., Douarre P. E., et al. Streptococcus agalactiae clones infecting humans were selected and fixed through the extensive use of tetracycline. Nature Communications. 2014; 5:ncomms5544. doi: 10.1038/ncomms5544 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Kalimuddin S., Chen S. L., Lim C. T., et al. 2015 Epidemic of severe Streptococcus agalactiae sequence type 283 infections in Singapore associated with the consumption of raw freshwater fish: a detailed analysis of clinical, epidemiological, and bacterial sequencing data. Clinical Infectious Diseases. 2017; 64(suppl 2):S145–S152. doi: 10.1093/cid/cix021 [DOI] [PubMed] [Google Scholar]
  • 27. Tan S., Lin Y., Foo K., et al. Group B Streptococcus serotype III sequence type 283 bacteremia associated with consumption of raw fish, Singapore. Emerging Infectious Diseases. 2016; 22(11):1970. doi: 10.3201/eid2211.160210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.The Pandas Development Team. Pandas; 2020.
  • 29. Nguyen L. T., Schmidt H. A., Von Haeseler A., et al. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution. 2015; 32(1):268–274. doi: 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Croucher N. J., Page A. J., Connor T. R., et al. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Research. 2014; 43(3):e15–e15. doi: 10.1093/nar/gku1196 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Tonkin-Hill G., Lees J. A., Bentley S. D., et al. Fast hierarchical Bayesian analysis of population structure. Nucleic Acids Research. 2019; 47(11):5539–5549. doi: 10.1093/nar/gkz361 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Tonkin-Hill G., MacAlasdair N., Ruis C., et al. Producing polished prokaryotic pangenomes with the Panaroo pipeline. Genome Biology. 2020; 21(1):1–21. doi: 10.1186/s13059-020-02090-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014; 30(14):2068–2069. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
  • 34. Freeman T. C., Horsewell S., Patir A., et al. Graphia: A platform for the graph-based visualisation and analysis of high dimensional data. PLoS Computational Biology. 2022; 18(7):e1010310. doi: 10.1371/journal.pcbi.1010310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Brynildsrud O., Bohlin J., Scheffer L., et al. Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary. Genome Biology. 2016; 17(1):1–9. doi: 10.1186/s13059-016-1108-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Lees J. A., Galardini M., Bentley S. D., et al. Pyseer: a comprehensive tool for microbial pangenome-wide association studies. Bioinformatics. 2018; 34(24):4310–4312. doi: 10.1093/bioinformatics/bty539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. May W. L. Kaplan-Meier Survival Analysis. In: Encyclopedia of Cancer. Chapter Kaplan-Meier Survival Analysis. Springer, New York; 2009. p. 1590–1593. [Google Scholar]
  • 38. Kleinbaum D. G., Klein M. Survival Analysis: A Self-Learning Text. Springer, New York; 2012. [Google Scholar]
  • 39. Bouckaert R., Heled J., Kühnert D., et al. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Computational Biology. 2014; 10(4):e1003537. doi: 10.1371/journal.pcbi.1003537 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Argimón S., Abudahab K., Goater R. J., et al. Microreact: visualizing and sharing data for genomic epidemiology and phylogeography. Microbial Genomics. 2016; 2(11):e000093. doi: 10.1099/mgen.0.000093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Paradis E., Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019; 35(3):526–528. doi: 10.1093/bioinformatics/bty633 [DOI] [PubMed] [Google Scholar]
  • 42. Langridge G. C., Fookes M., Connor T. R., et al. Patterns of genome evolution that have accompanied host adaptation in Salmonella. Proceedings of the National Academy of Sciences. 2015; 112(3):863–868. doi: 10.1073/pnas.1416707112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Sheppard S. K., Didelot X., Meric G., et al. Genome-wide association study identifies vitamin B5 biosynthesis as a host specificity factor in Campylobacter. Proceedings of the National Academy of Sciences. 2013; 110(29):11923–11927. doi: 10.1073/pnas.1305559110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Sheppard S. K., Cheng L., Méric G., et al. Cryptic ecology among host generalist Campylobacter jejuni in domestic animals. Molecular Ecology. 2014; 23(10):2442–2451. doi: 10.1111/mec.12742 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Almeida A., Alves-Barroco C., Sauvage E., et al. Persistence of a dominant bovine lineage of group B Streptococcus reveals genomic signatures of host adaptation. Environmental Microbiology. 2016; 18(11):4216–4229. doi: 10.1111/1462-2920.13550 [DOI] [PubMed] [Google Scholar]
  • 46. Rosinski-Chupin I., Sauvage E., Mairey B., et al. Reductive evolution in Streptococcus agalactiae and the emergence of a host adapted lineage. BMC Genomics. 2013; 14(1):252. doi: 10.1186/1471-2164-14-252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Sheppard S. K., Guttman D. S., Fitzgerald J. R. Population genomics of bacterial host adaptation. Nature Reviews Genetics. 2018; 19(9):549–565. doi: 10.1038/s41576-018-0032-z [DOI] [PubMed] [Google Scholar]
  • 48. Mourkas E., Yahara K., Bayliss S. C., et al. Host ecology regulates interspecies recombination in bacteria of the genus Campylobacter. eLife. 2022; 11:e73552. doi: 10.7554/eLife.73552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Cobo-Ángel C. G., Jaramillo-Jaramillo A. S., Palacio-Aguilera M., et al. Potential group B Streptococcus interspecies transmission between cattle and people in Colombian dairy farms. Scientific Reports. 2019; 9(1):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Sørensen U. B. S., Klaas I. C., Boes J., et al. The distribution of clones of Streptococcus agalactiae(group B streptococci) among herdspersons and dairy cows demonstrates lack of host specificity for some lineages. Veterinary Microbiology. 2019; 235:71–79. doi: 10.1016/j.vetmic.2019.06.008 [DOI] [PubMed] [Google Scholar]
  • 51. Lyhs U., Kulkas L., Katholm J., et al. Streptococcus agalactiae serotype IV in humans and cattle, northern Europe. Emerging Infectious Diseases. 2016; 22(12):2097. doi: 10.3201/eid2212.151447 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Franken C., Haase G., Brandt C., et al. Horizontal gene transfer and host specificity of beta-haemolytic streptococci: the role of a putative composite transposon containing scpB and lmb. Molecular Microbiology. 2001; 41(4):925–935. doi: 10.1046/j.1365-2958.2001.02563.x [DOI] [PubMed] [Google Scholar]
  • 53. Richards V. P., Choi S. C., Bitar P. D. P., et al. Transcriptomic and genomic evidence for Streptococcus agalactiae adaptation to the bovine environment. BMC Genomics. 2013; 14(1):920. doi: 10.1186/1471-2164-14-920 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Bohnsack J., Mollison K., Buko A., et al. Group B streptococci inactivate complement component c5a by enzymic cleavage at the c-terminus. Biochemical Journal. 1991; 273(3):635–640. doi: 10.1042/bj2730635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Bohnsack J. F., Widjaja K., Ghazizadeh S., et al. A role for c5 and c5a-ase in the acute neutrophil response to group B streptococcal infections. Journal of Infectious Diseases. 1997; 175(4):847–855. doi: 10.1086/513981 [DOI] [PubMed] [Google Scholar]
  • 56. Beckmann C., Waggoner J. D., Harris T. O., et al. Identification of novel adhesins from group B streptococci by use of phage display reveals that c5a peptidase mediates fibronectin binding. Infection and Immunity. 2002; 70(6):2869–2876. doi: 10.1128/IAI.70.6.2869-2876.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Cheng Q., Stafslien D., Purushothaman S. S., et al. The group B streptococcal c5a peptidase is both a specific protease and an invasin. Infection and Immunity. 2002; 70(5):2408–2413. doi: 10.1128/IAI.70.5.2408-2413.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Holt K. E., Wertheim H., Zadoks R. N., et al. Genomic analysis of diversity, population structure, virulence, and antimicrobial resistance in Klebsiella pneumoniae, an urgent threat to public health. Proceedings of the National Academy of Sciences. 2015; 112(27):E3574–E3581. doi: 10.1073/pnas.1501049112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Delannoy C. M., Zadoks R. N., Crumlish M., et al. Genomic comparison of virulent and non-virulent Streptococcus agalactiae in fish. Journal of Fish Diseases. 2016; 39(1):13–29. doi: 10.1111/jfd.12319 [DOI] [PubMed] [Google Scholar]
  • 60. Agnew W., Barnes A. C. Streptococcus iniae: an aquatic pathogen of global veterinary significance and a challenging candidate for reliable vaccination. Veterinary Microbiology. 2007; 122(1-2):1–15. doi: 10.1016/j.vetmic.2007.03.002 [DOI] [PubMed] [Google Scholar]
  • 61. Barkham T., Tang W. Y., Wang Y. C., et al. Human fecal carriage of Streptococcus agalactiae Sequence Type 283, Thailand. Emerging Infectious Diseases. 2023; 29(8):1627–1629. doi: 10.3201/eid2908.230098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Sirimanapong W., Phuoc N. N., Crestani C., et al. Geographical, temporal and host-species distribution of potentially human-pathogenic Group B Streptococcus in aquaculture species in Southeast Asia. Pathogens. 2023; 12(4):525. doi: 10.3390/pathogens12040525 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Baptista R. C., Rodrigues H., Sant’Ana A. S. Consumption, knowledge, and food safety practices of Brazilian seafood consumers. Food Research International. 2020; 132:109084. doi: 10.1016/j.foodres.2020.109084 [DOI] [PubMed] [Google Scholar]
  • 64. Delannoy C. M., Crumlish M., Fontaine M. C., et al. Human Streptococcus agalactiae strains in aquatic mammals and fish. BMC Microbiology. 2013; 13(1):1–9. doi: 10.1186/1471-2180-13-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Bellais S., Six A., Fouet A., et al. Capsular switching in group B Streptococcus CC17 hypervirulent clone: a future challenge for polysaccharide vaccine development. The Journal of Infectious Diseases. 2012; 206(11):1745–1752. doi: 10.1093/infdis/jis605 [DOI] [PubMed] [Google Scholar]
  • 66. Neemuchwala A., Teatero S., Athey T. B., et al. Capsular switching and other large-scale recombination events in invasive sequence type 1 group B Streptococcus. Emerging Infectious Diseases. 2016; 22(11):1941. doi: 10.3201//eid2211.152064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Brueggemann A. B., Pai R., Crook D. W., et al. Vaccine escape recombinants emerge after pneumococcal vaccination in the United States. PLoS Pathogens. 2007; 3(11):e168. doi: 10.1371/journal.ppat.0030168 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Michael Wessels, Lakshmi Rajagopal

12 Jun 2024

Dear Dr Crestani,

Thank you very much for submitting your manuscript "Genomic and functional determinants of host spectrum in Group B Streptococcus" for consideration at PLOS Pathogens. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a revised version that takes into account the reviewers' comments. Your revised manuscript may be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Lakshmi Rajagopal, Ph.D

Academic Editor

PLOS Pathogens

Michael Wessels

Section Editor

PLOS Pathogens

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: Chiara Crestani and colleagues described the global GBS population comprises host-generalist, host-adapted and host-restricted sublineages, as they named, through bioinformatics analysis by downloading the whole genome sequence data from public repositories or self-generated. This is an interesting idea.

However, the manuscript seemed to lack novelty. The evidence in support of the authors’ statement is insufficient. Furthermore, we think the manuscript was not well-organized.

For example, In line 2.5. In vivo assessment of the role of Locus 3 in fish infection.

Why the Lac.2 operon (GBS in cattle) and the scpB transposon (GBS in humans) were not selected for in-vivo experiment. The author should explain this in the introduction.

Reviewer #2: In this work, Crestani et al., analyze a robust set of 1,254 high-quality GBS genomes to identify potential drivers of host specificity. They identify three functional groups that describe host specificity – host generalists, host-adapted, and host restricted and assign GBS clonal groups (CGs) within these classes to the host species. They then identify genetic drivers that contribute to host-adaptation reiterating the association of scpB acquisition with human interactions and the Lac.2 operon with cattle. Interestingly they identify a novel genetic locus they termed Locus 3 that is associated with adaptation to fish hosts, and they show loss of Locus 3 significantly decreases GBS virulence in two distinct strain backgrounds and animal models. The findings presented confirm previous claims about scpB and Lac.2 while also contributing novel findings for GBS evolution in the context of fish hosts.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: (No Response)

Reviewer #2: (No Response)

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: (No Response)

Reviewer #2: - Assuming mobile genetic element but ‘MGE’ should be defined in Ln 30. Spelled out in Ln 208

- Explanation of how 80% was selected as the cutoff for host generalist v host specialists would be helpful.

- Fig 2B could benefit from grouping the CGs by host generalist, host-adapted, and host restricted rather than numberical order across the X-axis

- Ln 168-169: Are the authors able to indicate where SL61 falls on Fig. 2C?

- Could the authors add labels or comment on the enriched human-associated unitigs that fall between 0.50 and 0.75 in the reference genome position in Fig 4A, especially the peak ~0.6.

- It is difficult to determine where the supplemental data citations are referencing. The tables are not cited in the text. In Table S2, what do the colors indicate? Are there corresponding legends to the supplemental data?

- The key for Fig. 6A says ‘Viet Nam’

- Ln 304-308: Are the colors representing Brazil and Vietnam inverted in this figure? The text cites that the emergence of the CG283 strains in Brazil occurred around 2015 but the figure looks as if this is the Vietnam strain emergence. And vice versa for the 2003 scpB loss that, according to the figure, appears to be isolated from strains in Brazil, the text cites Vietnam.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here on PLOS Biology: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Decision Letter 1

Michael Wessels, Lakshmi Rajagopal

8 Jul 2024

Dear Dr Crestani,

We are pleased to inform you that your manuscript 'Genomic and functional determinants of host spectrum in Group B Streptococcus' has been provisionally accepted for publication in PLOS Pathogens.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Lakshmi Rajagopal, Ph.D

Academic Editor

PLOS Pathogens

Michael Wessels

Section Editor

PLOS Pathogens

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************************************************

Reviewer Comments (if any, and for reference):

Acceptance letter

Michael Wessels, Lakshmi Rajagopal

7 Aug 2024

Dear Dr Crestani,

We are delighted to inform you that your manuscript, "Genomic and functional determinants of host spectrum in Group B Streptococcus," has been formally accepted for publication in PLOS Pathogens.

We have now passed your article onto the PLOS Production Department who will complete the rest of the pre-publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Pearls, Reviews, Opinions, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript, if you opted to have an early version of your article, will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Supplementary data file.

    Detailed materials and methods, additional results, and supplementary figures and tables.

    (PDF)

    ppat.1012400.s001.pdf (15.6MB, pdf)
    S1 Table. Dataset and results file.

    Dataset of genomes used for this study, with accompanying metadata, accession numbers, and main results of genomic analyses.

    (XLSX)

    ppat.1012400.s002.xlsx (326.2KB, xlsx)
    S2 Table. Scoary results.

    Scoary results for human-enriched genes.

    (XLSX)

    ppat.1012400.s003.xlsx (192.5KB, xlsx)
    S3 Table. Scoary results.

    Scoary results for bovine-enriched genes.

    (XLSX)

    ppat.1012400.s004.xlsx (219.5KB, xlsx)
    S4 Table. Scoary results.

    Scoary results for fish-enriched genes.

    (XLSX)

    ppat.1012400.s005.xlsx (170.2KB, xlsx)
    Attachment

    Submitted filename: Resubmission_Letter_Plos_Pathogens.pdf

    ppat.1012400.s006.pdf (230.9KB, pdf)

    Data Availability Statement

    All raw or assembled Illumina sequence data is available from the European Nucleotide Archive (ENA) or from the National Center for Biotechnology Information (NCBI), with accession numbers provided in S1 Table. Phylogenetic analysis of the 1,254 Group B Streptococcus genomes, together with all metadata, is available at the project URL https://microreact.org/project/vkEcchaHsSJa6sLhSGvmHF-group-b-streptococcus-host-adaptation-2023 within Microreact.


    Articles from PLOS Pathogens are provided here courtesy of PLOS

    RESOURCES