Abstract
Streptococcus anginosus, part of the Streptococcus anginosus group (SAG), is a human commensal increasingly recognized as an opportunistic pathogen responsible for abscesses formation and infections, also invasive ones. Despite its growing clinical importance, the genetic determinants of its pathogenicity remain poorly understood. This study aimed to identify essential genes in S. anginosus 980/01, a bloodstream isolate, under nutrient-rich laboratory conditions using a transposon mutagenesis combined with Transposon-Directed Insertion Site Sequencing (TraDIS). A mutant library was generated using the ISS1 transposon delivered via the thermosensitive plasmid pGh9:ISS1. Following transposition, insertions were mapped using Illumina sequencing and subsequently analyzed. Essential genes were identified based on the absence of insertions and statistical filtering. The library exhibited 98% genome saturation with over 130,000 unique insertion sites. Among 1825 genes, 348 (19.1%) were essential, 1446 non-essential, and 30 non-conclusive. Comparative analyses were performed with S. pyogenes MGAS5005 and S. agalactiae A909. Similarly to the latter, essential genes were enriched in functions related to translation, transcription, and cell wall biosynthesis. However, 40 genes uniquely essential to S. anginosus 980/01 were identified, suggesting unique survival strategies in S. anginosus. This study presents the first genome-wide identification of essential genes for S. anginosus 980/01, highlighting conserved and unique essential genes. These findings provide a basis for understanding its physiology and key genetic determinants of bacterial viability, and may help to uncover the pathogenic potential of S. anginosus in future studies.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-025-18002-0.
Keywords: Streptococcus anginosus, Streptococcus anginosus group, Invasive, Essential genes, TraDIS
Subject terms: Microbiology, Bacteria, Microbial genetics
Introduction
Streptococcus anginosus, together with S. constellatus and S. intermedius, constitute the Streptococcus anginosus group (SAG) commonly found on mucosal membranes in the healthy human microbiota, particularly in the oral cavity, gastrointestinal tract, and urogenital tracts1. Despite this, SAG species are increasingly recognized as opportunistic pathogens, especially in immunocompromised or cystic fibrosis patients2–4. Among SAGs, S. anginosus is frequently isolated from abscesses, bloodstream infections, and polymicrobial infections, and is more commonly associated with invasive infections than other SAG members, also in individuals without any underlying diseases2,5–8.
Interestingly, S. anginosus strains exhibit variable hemolytic activity and can express different Lancefield antigens, complicating diagnosis and classification. These strains can also produce diverse virulence factors9,10. However, the molecular mechanisms underlying the S. anginosus pathogenesis remain poorly understood, with many of its genes still annotated as hypothetical7,11,12. To assess the essentiality and fitness of each gene within the bacterial genome, the most efficient strategies integrate the next-generation sequencing (NGS) technologies with large-scale transposon mutagenesis13–17. This approach enables the identification of the chromosomal sequences adjacent to transposon integration sites. However, genes required for bacterial growth and survival under specific experimental conditions, termed essential genes, cannot be disrupted. A saturated transposon mutant library contains a collection of viable mutants with a transposon inserted into every non-essential gene, while insertions into essential genes are lethal. Moreover, such a library can be challenged in specific conditions, such as human blood or antibiotics, to distinguish non-essential genes that are necessary for bacterial survival and growth in this particular environment.
For mutagenesis of Gram-positive bacteria, transposons are typically based either on mariner18–21 or ISS122–26, both utilizing a copy-out–paste-in mechanism. While the mariner-family transposons preferentially integrate at TA dinucleotide sequences27, the ISS1 does not exhibit such specificity22. Essential genes of several streptococcal species have been analysed in the ISS1 Transposon-Directed Insertion Site Sequencing (TraDIS), including S. uberis 0140J17, S. equi 404724, and S. agalactiae CNCTC 10/8428. Meanwhile, for S. pyogenes M1T1 544818, S. agalactiae A90929, and S. suis SC1930, the mariner-based Krmit or Himar1 have been used. The identification of essential genes provides promising targets for the development of novel antimicrobial therapies31.
In this study, we performed an ISS1 TraDIS approach to identify genes essential for the growth of S. anginosus 980/0111, an invasive strain isolated from sepsis, under nutrient-rich laboratory conditions. As the first genome-wide transposon mutagenesis study in S. anginosus, this analysis provides an initial framework for exploring its essential gene set. The TraDIS data for S. anginosus were finally compared to those of S. pyogenes and S. agalactiae to identify conserved essential genes and determine the unique set of genes essential to S. anginosus 980/01. The transposon-based methods, such as TraDIS, have become powerful tools for defining the genetic determinants of bacterial survival and virulence.
Materials and methods
The S. anginosus 980/01 strain is part of the Collection of the National Institute of Medicines, Warsaw, Poland11. It was isolated from the bloodstream of a 67-year-old patient in 2001 in Poland. The strain exhibits α-hemolytic activity and expresses the Lancefield group F antigen on its surface.
S. anginosus cultures
S. anginosus was grown on BHI agar (VWR chemicals) plates or in BHI broth (VWR chemicals) supplemented, if necessary, with erythromycin (Acros Organics) in a concentration of 5 × 10−3 g/L. Bacteria were grown at 37 °C, unless otherwise stated, in the presence of a CO2 generator, CO2Gen (Thermo Fisher Scientific, Waltham, USA). For extended storage, S. anginosus 980/01 was frozen in BHI with 15% glycerol at − 80 °C.
S. anginosus electrocompetent cells preparation
S. anginosus 980/01 was grown on BHI agar for 24 h. After that, the 25-fold diluted culture was grown in BHI to OD₆₆₀ ~ 0.3, harvested by centrifugation (5000 × g, 4 °C), washed three times with ice-cold 0.5 M sucrose, and resuspended in 0.5 M sucrose with 15% glycerol. Aliquots (50 μL) were stored at − 80 °C.
Transformation of S. anginosus with the pGh9:ISS1 plasmid
The pGh9:ISS1 plasmid22 with thermosensitive replication was used as a donor of the ISS1 transposon. Electrocompetent S. anginosus 980/01 cells were transformed with 50 ng of pGh9:ISS1 DNA via electroporation (2.5 kV/cm, 200 Ω, 25 μF, 5 ms pulse; Bio-Rad Gene Pulser). Cells were recovered in BHI at 28 °C for 4 h; after 1 h of incubation, erythromycin in a sublethal concentration was added to induce resistance expression. Transformants were selected on BHI with erythromycin (BHIE) at 28 °C after 48 h-incubation.
The S. anginosus 980/01 mutant library construction
A single colony of S. anginosus 980/01 carrying pGh9:ISS1 was grown in 10 mL BHIE at 28 °C for 24 h, then heat-shocked at 40 °C for 2.5 h to stop plasmid replication and induce ISS1 transposition. Transposants were selected after overnight growth on 100 BHIE plates (~ 6500 colonies/plate) at 40 °C, in an atmosphere with 5% CO2. Colonies were harvested into BHI with 15% glycerol and stored at − 80 °C.
PCR detection of pGh9:ISS1
To confirm the presence of pGh9:ISS1 in selected clones, PCR was performed using DreamTaq DNA polymerase and dNTPs (Thermo Fisher Scientific), with FwISS1 and RvISS1 primers (Supplemental File S1). The reaction conditions: initial denaturation at 95 °C for 3 min; 30 cycles of 95 °C for 30 s (denaturation), 50 °C for 30 s (annealing), and 72 °C for 4 min (extension); followed by a final extension at 72 °C for 7 min. PCR products were analyzed by agarose gel electrophoresis.
Identification of ISS1 insertion sites in mutants
To verify the ISS1 transposon insertion site in an individual mutant, genomic DNA was extracted and digested with FastDigest HindIII (Thermo Fisher Scientific). The resulting fragments were ligated using T4 DNA Ligase (Thermo Fisher Scientific), and served as templates for PCR amplification with DreamTaq DNA Polymerase (Thermo Fisher Scientific) using primers FwSAGISS1 and RvSAGISS1 (Supplemental File S1). The PCR products were separated by agarose gel electrophoresis, the bands were excised, purified (MicroElute Gel Extraction Kit, OMEGA Bio-tek, USA), and Sanger sequenced using FwSAGISS1 (detailed results are provided in Supplemental File S1).
ISS1 transposon mutant libraries preparation for Illumina sequencing
The portion of ISS1 transposon mutant library was regrown to OD660 = 0.3 in BHIE, spread onto 25 BHIE plates each, pooled, and harvested prior DNA extraction. Genomic DNA was isolated from the bacterial mutant pools using the SDS/Phenol method as described previously32,33. DNA quality control was performed by measuring the absorbance at 260/230, template concentration was determined using the Qubit fluorimeter (Thermo Fisher Scientific). DNA integrity was analyzed by 0.8% agarose gel electrophoresis.
The ISS1 transposon mutant libraries were constructed according to the protocol (Transposon insertion sequencing (Tn-seq) library preparation protocol—includes Unique Molecular Identifiers (UMI) for PCR duplicate removal: https://www.protocols.io/view/transposon-insertion-sequencing-tn-seq-library-pre-rm7vzn6d5vx1/v1) with minor modifications. In brief, DNA was mechanically sheared using Covaris (Covaris, MA, USA) into 300–500 bp fragment sizes, followed by end repair and TA-adaptor ligation using NEB Ultra II End Repair and Ligation Modules (New England Biolabs, Beverly, USA). Ligation reaction was purified by Ampure XP magnetic beads (Beckman Coulter, Brea, USA) and library restriction digest was performed using SmaI (Thermo Fisher Scientific) for 2 h at 25 °C to cleave the pGh9:ISS1 plasmid 33 bp upstream of the sequence encoding ISS1 to minimise the amount of TnSeq reads mapping to plasmid. The digested library was purified using Ampure XP beads, and the amount of DNA recovered was quantified using the Qubit dsDNA HS assay kit (Thermo Fisher Scientific) according to the manufacturer’s instructions. One hundred nanograms of library DNA was PCR amplified for 20 cycles according to the NEBNext Ultra II DNA library prep kit protocol. Library amplification utilized the specific ISS1 primer containing Nextera XT 5’ overhang (P5) with barcode index sequence and a TA–adaptor compatible unique indexing Nextera XT PCR (P7) primer per library, which facilitated the attachment of the final product to the sequencing flow cell (primer sequences are provided in Supplemental File S1). The libraries were quality-checked using the KAPA Library Quantification kit (KAPA-Roche, Basel, Switzerland), pooled in equimolar ratio, and sequenced on a NextSeq 550 instrument using the NextSeq HighOutput reagent v2.5 (150 cycle) chemistry kit (Illumina, San Diego, USA).
TnSeq data analysis
Raw demultiplexed fastq files were analysed using the TnSeq UMI scripts (https://github.com/nppalani/TnSeq/ and https://github.com/apredeus/TRADIS) modified to handle ISS1 transposon sequence containing data. Initially, the single command pipeline script, fastqtoreadcount_umi.sh, was run. The pipeline performed UMI-based PCR duplication removal, filtered and removed reads according to the transposon tag. After tag removal, the remaining reads were mapped to the S. anginosus 980/01 reference genome. Finally, raw read counts per insertion position output files were generated. Transposon insertions were viewed in Integrative Genomics Viewer34. TnSeq data analysis was further performed using Transit v.3.2.335. Gene essentiality was calculated using the Tn5Gaps method (https://transit.readthedocs.io/en/latest/method_tn5gaps.html), designed for transposons with random genome insertion. To estimate gene essentiality, the method identifies the longest uninterrupted regions within genes lacking insertions. The ISS1 insertion index was calculated for each gene as the number of unique insertion sites divided by the gene length (in base pairs). Based on this index, genes were categorized as essential, non-essential, or non-conclusive. For genes classified as essential, statistical significance was assessed using a threshold of p < 0.05, following correction for multiple testing using the false discovery rate (FDR) method to reduce the likelihood of false positives.
Metabolic pathway reconstruction and KEGG categorization
The metabolic pathways of S. anginosus 980/01 were reconstructed using the KEGG Mapper tool provided by the Kyoto Encyclopedia of Genes and Genomes (KEGG) (https://www.kegg.jp)36,37. Genome annotation files for S. anginosus 980/01 were first uploaded to BlastKOALA to assign KEGG Orthology (KO) identifiers to predicted protein-coding genes. Default parameters were used for KO assignment, including the “Prokaryotes” taxonomic group setting. The resulting KO-annotated gene sets were visualized and mapped to metabolic pathways using KEGG Mapper’s “Reconstruct Pathway” tool. Metabolic reconstructions were compared with annotated reference genomes of S. pyogenes MGAS5005 and S. agalactiae A909, both obtained from the KEGG database. Differences in core metabolism were identified based on the presence or absence of pathway-specific enzymes.
Ortholog identification and essential gene comparison
To identify orthologous genes and assess conservation of essentiality across species, we used OrthoFinder v3.0.1b1 (https://github.com/davidemms/OrthoFinder)38. Protein sequences from S. anginosus 980/01, S. pyogenes MGAS5005, and S. agalactiae A909 were used as input. OrthoFinder was run with default parameters, which include the use of DIAMOND for sequence similarity searches and MCL clustering for orthogroup inference. Only orthologs found in one-to-one orthogroups were considered for essentiality comparisons to avoid ambiguity due to paralogy.
KEGG pathway enrichment analysis
This analysis was performed to identify biological processes overrepresented among essential genes identified by TnSeq. Protein annotations for the complete genome were obtained using eggNOG-mapper (v2.1), (http://eggnog-mapper.embl.de/) and KEGG Orthology (KO) identifiers and pathway memberships (https://www.genome.jp/kegg/kegg2.html)37 were extracted. Gene identifiers were matched using 5-digit locus suffixes. Each gene was labelled as essential or non-essential. For each KEGG pathway, a 2 × 2 contingency table was constructed to compare the number of essential and non-essential genes present or absent in the pathway. Significance of enrichment was assessed using Fisher’s exact test, followed by Bonferroni correction for multiple testing. Pathways with adjusted p-values below 0.05 were considered significantly enriched.
Phylogenetic analysis of S. anginosus 980/01
Phylogenetic analysis of S. anginosus 980/01 was conducted using genome sequences from 76 S. anginosus isolates published by Prasad et al.39 in Table S2. These included both complete and draft genome assemblies representative of the species’ genomic diversity. A reference-free phylogenetic analysis was performed using kSNP440, which detects core single nucleotide polymorphisms (SNPs) across genome sequences without requiring alignment or a reference genome. A maximum-likelihood phylogenetic tree was inferred from the core SNP matrix and visualized using the Microreact (https://microreact.org/)41, an interactive platform for exploring phylogenetic data alongside associated metadata.
Results
Genome structure and mobile genetic elements of S. anginosus 980/01
The S. anginosus 980/01 genomic DNA was previously sequenced using both short-reads of Illumina and the long-reads of Oxford Nanopore technology to obtain the complete physical map; the sequencing of this strain is a part of the larger project and will be described elsewhere. It consists of 1,883,767 bps chromosome, with no plasmids. The sequence is deposited in GenBank under accession number CP183189, and as BioProject No.PRJNA1228600. The mean %G + C content is 39%. Genome annotation using DFAST v.1.2.18 software (https://github.com/nigyta/dfast_core) revealed that the genome contains 1825 genes coding for proteins, of which 415 are of unknown function. No antimicrobial resistance genes were detected using ResFinder v.4.3.3 (https://orbit.dtu.dk/en/publications/resfinder-an-open-online-resource-for-identification-of-antimicro). Nine putative streptococcal virulence factors were identified using ABRicate v.1.0.1 (https://github.com/tseemann/abricate) with the Virulence Factor Database (https://www.mgc.ac.cn/VFs/), retaining only hits with > 80% both sequence identity and target coverage (Supplemental File S1). Detection of mobile genetic elements revealed: a/ an intact single prophage, 38.9 kb in size (position 591,476–630,426), detected using Phastest (https://phastest.ca/), b/ two putative integrative and conjugative elements (ICE) as well as c/ a single putative integrative and mobilizable element (IME); ICEs and IME were identified using both ICEfinder (https://bioinfo-mml.sjtu.edu.cn/ICEfinder/ICEfinder.html) and ICEscreen v1.3.3 (https://icescreen.migale.inrae.fr/) (Supplemental File S1).
Phylogenetic analysis of S. anginosus 980/01
To investigate the genomic relationship of S. anginosus strain 980/01, we constructed a core-SNP phylogenetic tree based on the genomic sequences of 76 S. anginosus strains39. These included representatives of two subspecies (S. anginosus subsp. anginosus and S. anginosus subsp. whileyi), as well as two proposed genomosubspecies (S. anginosus genomosubsp. AJ1 and S. anginosus genomosubsp. vellorensis), as previously defined by multilocus sequence analysis (MLSA) (see Supplemental File S1, worksheet tab “Strains for core-SNP tree”).
In the resulting phylogenetic tree (Fig. 1), strain 980/01 clustered within the S. anginosus subsp. anginosus clade, grouping closely with 11 other strains and showing more distant relationships to an additional 25 strains within the same subspecies. This clustering indicates that S. anginosus 980/01 shares a recent common ancestor with these 11 strains and belongs to a well-supported sub-lineage of S. anginosus subsp. anginosus. Therefore, we conclude that S. anginosus 980/01 is a representative member of the this subspecies and may serve as a suitable reference strain for comparative studies involving this sub-lineage.
Fig. 1.
Maximum-likelihood phylogenetic tree of S. anginosus 980/01 and related S. anginosus isolates based on core genome single-nucleotide polymorphisms (SNPs). Coloured boxes indicate the S. anginosus subspecies groups. SNPs were identified using kSNP4. All included strains are listed in Supplemental File S1.
The S. anginosus 980/01 mutant library construction
To construct the S. anginosus 980/01 transposon mutant library, the pGh9:ISS1 plasmid of thermosensitive replication origin was used as a donor of the ISS1 transposon. To enable ISS1 transposition, the bacterial culture was incubated at a restrictive temperature as described in Materials and Methods section. Due to replicative ISS1 transposition into the random chromosomal site, the plasmid was incorporated between two duplicated ISS1 transposon sequences (Fig. 2). The resulting library contained 5.3 × 109 clones, with the ISS1 transposition frequency of 0.29% (calculated as CFU on BHIE versus BHI).
Fig. 2.
Schematic representation of ISS1 transposition into the S. anginosus 980/01 chromosome. The S.anginosus chromosome is shown as a thick black line, while pGh9:ISS1 as a thin black line. Arrows indicate gene localisation.
The S. anginosus 980/01 mutant library validation
The ISS1 integration sites in 30 randomly chosen erythromycin-resistant clones from the ISS1 S. anginosus 980/01 mutant library were analysed. The sequencing results revealed that in 28 out of 30 clones, ISS1 was integrated into unique sites, whereas in two mutants (6.7%), ISS1 was detected at redundant genomic locations. Nevertheless, the library predominantly comprises unique clones, with 93.3% showing distinct integration sites (Table 1).
Table 1.
Transposon insertion sites in 30 mutants
| No of mutant | Insertion site (bp) | Encoded function |
|---|---|---|
| 1 | 51909 | Phosphoenolpyruvate-dihydroxyacetone phosphotransferase (EC 2.7.1.121), subunit DhaM; DHA-specific IIA component |
| 2 | 61149 | Tagatose 1,5-bisphosphate aldolase (EC 4.1.2.40) |
| 3 | 62441 | Holliday junction DNA helicase RuvB |
| 4 | 112778 | Copper chaperone |
| 5 | 250131 | Glutamyl aminopeptidase (EC 3.4.11.7) |
| 6 | 326163 | Translation initiation factor 2 |
| 7 | 327112 | Phosphate transport system regulatory protein PhoU |
| 8 | 335194 | hypothetical protein |
| 9 | 339133 | Glycyl-tRNA synthetase beta chain (EC 6.1.1.14) |
| 10 | 467565 | Permease/hypothetical protein |
| 11* | 486888 | Cystathionine gamma-synthase (EC 2.5.1.48) |
| 12* | 486888 | Cystathionine gamma-synthase (EC 2.5.1.48) |
| 13 | 522974 | Methionine ABC transporter ATP-binding protein |
| 14 | 556391 | thioesterase family protein |
| 15 | 711094 | PTS system, cellobiose-specific IIC component (EC 2.7.1.69) |
| 16 | 921484 | hypothetical protein |
| 17 | 845413 | Phosphate ABC transporter, periplasmic phosphate-binding protein PstS |
| 18 | 968611 | Maltose O-acetyltransferase (EC 2.3.1.79) |
| 19 | 1061362 | Late competence protein ComEC, DNA transport |
| 20 | 986233 | Signal recognition particle associated protein |
| 21 | 1254429 | Lipoteichoic acid synthase LtaS Type IIc |
| 22 | 1322410 | Fibronectin/fibrinogen-binding protein |
| 23 | 1420899 | Cationic amino acid transporter—APC Superfamily |
| 24 | 1502700 | Hydrolase (HAD superfamily) |
| 25* | 1536265 | contains glycosyl transferase family 2 region |
| 26* | 1536265 | contains glycosyl transferase family 2 region |
| 27 | 1537722 | hypothetical protein |
| 28 | 1577977 | Mobile element protein |
| 29 | 1654917 | PTS system, cellobiose-specific IIA component (EC 2.7.1.69) |
| 30 | 1680876 | Para-aminobenzoate synthase, amidotransferase component (EC 2.6.1.85) |
The numbers of the redundant mutants are marked with an asterisk (*).
The S. anginosus 980/01 mutant library characteristics
For sequencing, three independent replicates of the ISS1 transposon mutant library (BHI_A, BHI_B, and BHI_C) were regrown and prepared for DNA extraction. After sequencing, each of them was analyzed using the Transit software with the Tn5gaps method35.
Analysis revealed 132,000–175,000 single insertion sites of ISS1 in the genome, on average every 10.7–14.3 nt. This corresponds to a library saturation of 98% (1,796 disrupted genes of 1,825 annotated), with an average of 64.7 insertions per disrupted gene and a mean of 2,718 sequencing reads per gene (Fig. 3, Supplemental Files S1 and S2). The ISS1 insertion index calculated for each gene was a base to consider a given gene essential, non-essential, and non-conclusive (Supplemental File S1). For genes classified as essential in S. anginosus 980/01 statistical significance was assessed using a false discovery rate (FDR) correction, with a significance threshold of p < 0.05.
Fig. 3.
Location of all ISS1 transposon insertion sites mapped to the S. anginosus 980/01 genome. The outer rings in red and cyan show coding sequences on the (+) and (−) strands, respectively. The innermost black histogram displays the location of each transposon insertion with the height of each bar representing the detection frequency of each insertion. Genome coordinates are shown on the outermost ring.
Based on these parameters, 348 of 1825 (19.1%) of the S. anginosus 980/01 genes were classified as essential, 1446 (79.2%) were non-essential, while 30 (1.7%) were non-conclusive. The functional categories analysis of essential genes is presented in Fig. 3. The main category encompasses genes involved in translation, transcription, and cell wall biogenesis.
Reconstruction of metabolic pathways
Based on the whole-genome analysis of S. anginosus 980/01, metabolic pathways were reconstructed using the KEGG Mapper of Kyoto Encyclopedia of Genes and Genomes36,37. They were compared with data for S. pyogenes M1T1 544842 and S. agalactiae A90943, other human pathogenic streptococci. Like S. anginosus 980/01, these strains were originally isolated from the blood of patients with sepsis. Notably, the study on S. pyogenes M1T1 5448 relied on the genome annotation of S. pyogenes MGAS5005, as the genome sequence of M1T1 5448 had not yet been published.
The comparison of KEGG data indicates that elements involved in DNA replication and repair, transcription, and translation, as well as the major metabolic pathways, are similar to those of S. pyogenes MGAS5005 and S. agalactiae A909. So, for ATP production, S. anginosus 980/01 relies on fermentation and substrate phosphorylation.
In predicted carbohydrate metabolism, due to the presence of specific genes, S. anginosus 980/01 can carry out the citrate cycle reactions facilitated by citrate synthase, aconitate hydratase, and isocitrate dehydrogenase. Those genes are absent in S. agalactiae A909 and S. pyogenes MGAS5005, but present in e.g., S. suis and S. mutans44. However, these genes are not essential for S. anginosus 980/01 under the tested growth conditions.
Comparison of essential gene sets of S. anginosus 980/01 with S. pyogenes and S. agalactiae
To identify conserved and species-specific essential genes, we compared the S. anginosus 980/01 dataset with published essential gene sets from other human pathogenic streptococci, S. pyogenes M1T1 544842 and S. agalactiae A90943. Like S. anginosus 980/01, these strains were originally isolated from the blood of patients with sepsis.
The analysis was conducted using data provided by Le Breton et al. and Hooven et al.18,29. S. pyogenes, and S. agalactiae as well as S. anginosus 980/01 were cultivated at 37ºC, under nutrient-rich growth conditions. S. pyogenes was grown in Todd-Hewitt broth supplemented with 0.2% yeast extract18, while S. anginosus 980/01—in Brain Heart Infusion broth. In contrast, S. agalactiae was grown on Tryptic Soy Agar29, a comparatively less nutrient-rich medium than the other two.
Genes from S. pyogenes and S. agalactiae were categorized by the authors as essential, critical, non-essential, or not-defined/inconclusive. Orthologs were identified using OrthoFinder and comparisons were made between: a/S. anginosus 980/01 and S. pyogenes MGAS5005, b/ S. anginosus 980/01 and S. agalactiae A909, and c/ S. pyogenes MGAS5005 and S. agalactiae A909.
The essentiality classifications of each orthologous pair were compared, excluding genes with the essentiality unknown or inconclusive. The results are summarized in Fig. 4a and Supplemental File S1.
Fig. 4.
Functional and comparative analysis of essential genes in S. anginosus and related streptococci. (a/) Functional classification of 348 essential genes, including 40 strain-specific, in S. anginosus 980/01. (b/) Comparison of essential genes identified in S. anginosus 980/01, S. pyogenes MGAS5005, and S. agalactiae A909. The Venn diagram illustrates shared and unique essential genes. The bottom-most value reflect numbers of genes conserved across all three species. (c/) Top enriched KEGG pathways among essential genes identified by TnSeq data analysis. Bars represent the number of essential genes mapped to each KEGG pathway. Only the top 5 pathways (based on adjusted p-value < 0.05) are shown. Enrichment significance was calculated using Fisher’s exact test with Bonferroni correction.
The proportion of essential genes in S. anginosus (19%) is comparable to that in S. pyogenes MGAS5005 (298 essential genes of 1866; 16%)18 and S. agalactiae A909 (317 of 2136; 15%)29. Among the genes classified as essential in all three species, 178 (53.5%) were shared; between S. anginosus and S. pyogenes it was 59.9%, and 62.4% between S. anginosus and S. agalactiae (Fig. 4b). These 178 genes essential for 3 strains were enriched in categories such as translation (53 genes), cell wall biogenesis (19), lipid transport and metabolism (16), replication and amino acid transport and metabolism (13 each), carbohydrate transport and metabolism (12), coenzyme transport and metabolism (9), transcription (8), and energy production and conversion (6). Other functional groups included posttranslational modification, protein turnover, and chaperones, inorganic ion transport and metabolism, intracellular trafficking and nucleotide transport and metabolism (5 genes each), cell division, secretion, and vesicular transport (4 each), as well as hypothetical proteins (3) and signal transduction mechanisms (2) (Fig. 4c; Supplemental File S1, worksheet tab “TnSeq KEGG enrichment”).
Interestingly, 8 genes encoding glycolytic pathway enzymes were essential in all three species. One gene encoding 2,3-bisphosphoglycerate-dependent phosphoglycerate mutase was classified as non-conclusive in S. pyogenes MGAS500518. The essential genes include eno and gapA, encoding enolase and glyceraldehyde 3-phosphate dehydrogenase, respectively. These enzymes not only have metabolic roles but also act as moonlighting proteins that bind host plasminogen and contribute to virulence in other streptococci45,46. The subcellular localisation of these enzymes in S. anginosus 980/01 will be investigated further.
Although the bulk of genes essential under the tested laboratory conditions are common for the three strains, we identified 40 essential genes unique to S. anginosus 980/01. Most of these genes encode transcription–related proteins and hypothetical proteins with unknown function (Fig. 4a, Supplemental File S3). However, this group also includes:
sodA, encoding superoxide dismutase, a key enzyme in the oxygen defence system,
ssaC encoding the substrate-binding component of a manganese ABC transporter (SsaABC), essential for manganese acquisition, oxidative stress resistance, and full virulence in streptococci47,48,
purB, encoding adenylosuccinate lyase, catalyzing key reactions in the de novo purine biosynthesis pathway, is indispensable for nucleotide production and cellular proliferation,
clpC and clpX, encoding ATPase subunits of the Clp protease complex, as well as mecA, encoding an adaptor enabling ClpC activity, and ctsR, a regulator of the clp genes; all genes shown to be involved in adaptive response in B. subtilis49,50.
ldh, encoding L-lactate dehydrogenase, which catalyzes the conversion of pyruvate to lactate while regenerating NAD⁺, a critical step for maintaining glycolytic flux under anaerobic or microaerophilic conditions.
Discussion
This study provides the genome-wide map of essential genes in S. anginosus 980/01, an invasive strain isolated from sepsis, with the use of high–density transposon mutagenesis (TraDIS) approach. Due to considerable genetic and phenotypic heterogeneity observed among S. anginosus strains8,10, there is a clear need for molecular characterization to better understand both conserved and strain-specific requirements of this opportunistic pathogen. Despite growing recognition of the clinical relevance of the S. anginosus and other members of the S. anginosus group, functional genomic data for this species remain limited. Our findings contribute to filling this gap. The phylogenetic placement of strain 980/01 within the sub-lineage of S. anginosus subsp. anginosus group supports the relevance of its essential gene set. These genes likely represent conserved functions across related isolates, highlighting their potential broader significance.
We identified 348 genes essential for growth under laboratory nutrient-rich, CO2-enriched aerobic conditions. Many of these genes are conserved across S. pyogenes MGAS500518 and S. agalactiae A90929, reflecting core metabolic and cellular processes essential to streptococcal viability. Although compositional differences between media used in these studies may influence the essentiality of individual genes, the observed overlap in core essential genes indicates conservation of fundamental cellular functions. Notably, 40 genes were uniquely essential in S. anginosus 980/01, highlighting species-specific physiological traits and potential therapeutic targets.
Among these, sodA, encoding a sole superoxide dismutase in S. anginosus, was identified. This correlates with the need for a CO2-enriched atmosphere to support colony formation, suggesting limited oxidative stress tolerance. A similar dependence on sodA has been observed in S. thermophilus, where its disruption leads to oxygen hypersensitivity which can be rescued by manganese ions51.
Also uniquely essential was ssaC, encoding the substrate-binding component of the SsaACB, the manganese transporter. Manganese is a key cofactor for enzymes involved in oxidative stress defence, including superoxide dismutase, however, it plays a role beyond catalytic functions (for review52). The essentiality of ssaC under nutrient-rich conditions highlights low manganese bioavailability, necessitating high-affinity uptake systems51.
Genes involved in protein quality control, clpC and clpX encoding ATPases of the Clp protease complex, and together with mecA and a putative ctsR gene were also found to be essential in S. anginosus 980/01, and dispensable in S. pyogenes MGAS5005 and S. agalactiae A909. In S. pneumoniae R6, clpX, but not clpC, is also essential53. In many bacteria, impairment of the Clp protease has pleiotropic effects on cell wall composition or virulence (for review54). Notably, in B. subtilis, all four genes are involved in the heat shock response49,50. Given that the ISS1-transposon mutant library in this study was generated at 40 °C, a temperature that may trigger proteotoxic stress, the observed essentiality could reflect a condition-dependent requirement for these genes.
We also found purB, encoding adenylosuccinate lyase, to be essential. This indicates a dependence on de novo purine synthesis, which may reflect adaptation to purine-limited environments such as mucosal surfaces or abscesses55. Notably, S. agalactiae A909 encodes two purB homologs, potentially providing functional redundancy and explaining its non-essentiality in that species. Given the limited data on purine availability in such niches, purB may represent a metabolic vulnerability worth further investigation.
Finally, ldh, encoding L-lactate dehydrogenase, was also uniquely essential in S. anginosus 980/01, reflecting its reliance on homolactic fermentation for NAD⁺ regeneration. In contrast, S. pyogenes and S. agalactiae, can employ alternative fermentation pathway as mixed-acids fermentation44.
In summary, this study identifies essential genes in S. anginosus 980/01, providing insight into its metabolic constraints and potential vulnerabilities. These findings enhance our understanding of S. anginosus physiology, particularly within S. anginosus subsp anginosus group, and establish the basis for future research on its pathogenic potential. Comparative analysis across multiple bloodstream isolates could help identify conserved targets for specific drug discovery.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
We thank the Collection of Mikrobank2 from the National Institute of Medicines (Warsaw, Poland) for sharing the strain. We extend our gratitude to Prof. Izabela Sitkiewicz (SGGW, Warsaw, Poland) for sharing the nucleotide sequence ahead of its official release.
Author contributions
AK performed the research, analyzed the data, and wrote the manuscript; KŻ performed the research; JG analyzed the data, and wrote the manuscript; RG analyzed the data; IKZ supervised the project, analyzed the data, and wrote the manuscript.
Funding
The study was funded by the National Science Centre, Poland (grant number 2018/29/B/NZ6/00624).
Data availability
The *S. anginosus* 980/01 genome sequence is available under GenBank accession number: CP183189. The datasets presented in this study have been deposited in the BioProject database under accession number PRJNA1228600.
Declarations
Competing interests
The authors declare no competing interests.
Ethics approval
The strain was collected by the National Institute of Medicines (Warsaw, Poland) as continuous surveillance in accordance with the World Health Medical Association 1966 Declaration of Helsinki and the EU rules of Good Clinical Practice.
Footnotes
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Whiley, R. A., Beighton, D., Winstanley, T. G., Fraser, H. Y. & Hardie, J. M. Streptococcusintermedius, Streptococcus constellatus, and Streptococcusanginosus (the Streptococcus milleri group): Association with different body sites and clinical infections. J. Clin. Microbiol.30, 243–244 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Clarridge, J. E., Attorri, S., Musher, D. M., Hebert, J. & Dunbar, S. Streptococcus intermedius, Streptococcus constellatus, and Streptococcusanginosus (“Streptococcus milleri group”) are of different clinical importance and are not equally associated with abscess. Clin. Infect. Dis.32, 1511–1515 (2001). [DOI] [PubMed] [Google Scholar]
- 3.Parkins, M. D., Sibley, C. D., Surette, M. G. & Rabin, H. R. The Streptococcus milleri group: An unrecognized cause of disease in cystic fibrosis—A case series and literature review. Pediatr. Pulmonol.43, 490–497 (2008). [DOI] [PubMed] [Google Scholar]
- 4.Simone, G. et al. Streptococcusanginosus group disseminated infection: Case report and literature review. Infez. Med.20, 145–154 (2012). [PubMed] [Google Scholar]
- 5.Fisher, L. E. & Russell, R. R. B. The isolation and characterization of milleri group streptococci from dental periapical abscesses. J. Dent. Res.72, 1191–1193 (1993). [DOI] [PubMed] [Google Scholar]
- 6.Junckerstorff, R. K., Robinson, J. O. & Murray, R. J. Invasive Streptococcusanginosus group infection-does the species predict the outcome?. Int. J. Infect. Dis.18, 38–40 (2014). [DOI] [PubMed] [Google Scholar]
- 7.Plainvert, C. et al. Microbiological epidemiology of invasive infections due to non-beta-hemolytic streptococci, France. Microbiol. Spectr.10.1128/spectrum.00160-23 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pilarczyk-Zurek, M., Sitkiewicz, I. & Koziel, J. The clinical view on Streptococcusanginosus group: Opportunistic pathogens coming out of hiding. Front. Microbiol.10.3389/FMICB.2022.956677 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Asam, D. & Spellerberg, B. Molecular pathogenicity of Streptococcusanginosus. Mol. Oral Microbiol.29, 145–155 (2014). [DOI] [PubMed] [Google Scholar]
- 10.Kuryłek, A., Stasiak, M. & Kern-Zdanowicz, I. Virulence factors of Streptococcus anginosus: A molecular perspective. Front. Microbiol.10.3389/fmicb.2022.1025136 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Obszanska, K. et al. Streptococcusanginosus (milleri) group strains isolated in Poland (1996–2012) and their antibiotic resistance patterns. Polish J. Microbiol.65, 33–41 (2016). [DOI] [PubMed] [Google Scholar]
- 12.Baracco, G. J. Infections caused by group C and G Streptococcus (Streptococcus dysgalactiae subsp. equisimilis and Others): Epidemiological and clinical aspects. Microbiol. Spectr.10.1128/microbiolspec.gpp3-0016-2018 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Langridge, G. C. et al. Simultaneous assay of every Salmonella Typhi gene using one million transposon mutants. Genome Res.19, 2308–2316 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.van Opijnen, T., Bodi, K. L. & Camilli, A. Tn-seq: High-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat. Methods6, 767–772 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gawronski, J. D., Wong, S. M. S., Giannoukos, G., Ward, D. V. & Akerley, B. J. Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for Haemophilus genes required in the lung. Proc. Natl. Acad. Sci. U. S. A.106, 16422–16427 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Goodman, A. L. et al. Identifying genetic determinants needed to establish a human gut symbiont in its habitat. Cell Host Microbe6, 279–289 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Blanchard, A. M., Egan, S. A., Emes, R. D., Warry, A. & Leigh, J. A. PIMMS (pragmatic insertional mutation mapping system) laboratory methodology a readily accessible tool for identification of essential genes in Streptococcus. Front. Microbiol.7, 1645. 10.3389/fmicb.2016.01645 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Le Breton, Y. et al. Essential genes in the core genome of the human pathogen Streptococcuspyogenes. Sci. Rep.10.1038/srep09838 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Burghout, P. et al. Streptococcus pneumoniae folate biosynthesis responds to environmental CO2 levels. J. Bacteriol.195, 1573–1582 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hava, D. L. & Camilli, A. Large-scale identification of serotype 4 Streptococcus pneumoniae virulence factors. Mol. Microbiol.45, 1389–1406 (2002). [PMC free article] [PubMed] [Google Scholar]
- 21.Le Breton, Y. et al. Genome-wide identification of genes required for fitness of Group A Streptococcus in human blood. Infect. Immun.81, 862–875 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Maguin, E., Prévost, H., Ehrlich, S. D. & Gruss, A. Efficient insertional mutagenesis in lactococci and other gram-positive bacteria. J. Bacteriol.178, 931–935 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Spellerberg, B. et al. Identification of genetic determinants for the hemolytic activity of Streptococcus agalactiae by ISS1 transposition. J. Bacteriol.181, 3212–3219 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Charbonneau, A. R. L. et al. Defining the ABC of gene essentiality in streptococci. BMC Genomics18, 1–11 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Gury, J., Barthelmebs, L. & Cavin, J. F. Random transposon mutagenesis of Lactobacillusplantarum by using the pGh9:ISS1 vector to clone genes involved in the regulation of phenolic acid metabolism. Arch. Microbiol.182, 337–345 (2004). [DOI] [PubMed] [Google Scholar]
- 26.Thibessard, A., Fernandez, A., Gintz, B., Decaris, B. & Leblond-Bourget, N. Transposition of pGh9:ISS1 is random and efficient in Streptococcusthermophilus CNRZ368. Can. J. Microbiol.48, 473–478 (2002). [DOI] [PubMed] [Google Scholar]
- 27.Rubin, E. J. et al. In vivo transposition of mariner-based elements in enteric bacteria and mycobacteria. Proc. Natl. Acad. Sci.96, 1645–1650 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhu, L. et al. Genome-wide assessment of Streptococcusagalactiae genes required for survival in human whole blood and plasma. Infect. Immun.10.1128/IAI.00357-20 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Hooven, T. A. et al. The essential genome of Streptococcusagalactiae. BMC Genomics17, 1–12 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhang, Y. et al. Identification of essential genes by transposon insertion sequencing and genome-scale metabolic model construction in Streptococcussuis. Microbiol. Spectr.10.1128/spectrum.02791-24 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Osterman, A. L. & Gerdes, S. Y. Comparative approach to analysis of gene essentiality. Methods Mol. Biol.416, 459–466 (2007). [DOI] [PubMed] [Google Scholar]
- 32.Wilson, K. Preparation of genomic DNA from bacteria. In Current Protocols in Molecular Biology, Chapter 2 (eds Ausubel, F. M., Bent, R., Kingston, R. E. et al.) 210–212 (Wiley, 1987). 10.1002/0471142727.MB0204S56. [DOI] [PubMed] [Google Scholar]
- 33.Nowak, R. M. et al. Hybrid de novo whole-genome assembly and annotation of the model tapeworm Hymenolepisdiminuta. Sci. Data 2019 616, 1–14 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol.29, 24–26 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.DeJesus, M. A., Ambadipudi, C., Baker, R., Sassetti, C. & Ioerger, T. R. TRANSIT: A software tool for Himar1 TnSeq analysis. PLOS Comput. Biol.11, e1004401 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kanehisa, M., Furumichi, M., Sato, Y., Matsuura, Y. & Ishiguro-Watanabe, M. KEGG: Biological systems database as a model of the real world. Nucleic Acids Res.53, D672–D677 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res.28, 27–30 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Emms, D. M. & Kelly, S. OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol.10.1186/s13059-015-0721-2 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Prasad, A. et al. Comparative genomic study of Streptococcusanginosus reveals distinct group of urinary strains. mSphere10.1128/msphere.00687-22 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gardner, S. N. & Hall, B. G. When whole-genome alignments just won’t work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes. PLoS ONE8, e81760 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Argimón, S. et al. Microreact: Visualizing and sharing data for genomic epidemiology and phylogeography. Microb. Genomics2, e000093 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Chatellier, S. et al. Genetic relatedness and superantigen expression in group A streptococcus serotype M1 isolates from patients with severe and nonsevere invasive diseases. Infect. Immun.68, 3523–3534 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Tettelin, H. et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial ‘pan-genome’. Proc. Natl. Acad. Sci. U. S. A.102, 13950–13955 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Willenborg, J. & Goethe, R. Metabolic traits of pathogenic streptococci. FEBS Lett.590, 3905–3919 (2016). [DOI] [PubMed] [Google Scholar]
- 45.Bergmann, S., Rohde, M., Chhatwal, G. S. & Hammerschmidt, S. α-Enolase of Streptococcuspneumoniae is a plasmin(ogen)-binding protein displayed on the bacterial cell surface. Mol. Microbiol.40, 1273–1287 (2001). [DOI] [PubMed] [Google Scholar]
- 46.Pancholi, V. & Fischetti, V. A. A major surface protein on group a streptococci is a glyceraldehyde-3-phosphate-dehydrogenase with multiple binding activity. J. Exp. Med.176, 415–426 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Puccio, T. et al. Manganese transport by Streptococcussanguinis in acidic conditions and its impact on growth in vitro and in vivo. Mol. Microbiol.117, 375–393 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Murgas, C. J. et al. Intracellular Metal Speciation in Streptococcussanguinis establishes SsaACB as critical for redox maintenance. ACS Infect. Dis.6, 1906–1921 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Derre, I., Rapoport, G. & Msadek, T. The CtsR regulator of stress response is active as a dimer and specifically degraded in vivo at 37°C. Mol. Microbiol.38, 335–347 (2000). [DOI] [PubMed] [Google Scholar]
- 50.Krüger, E., Zühlke, D., Witt, E., Ludwig, H. & Hecker, M. Clp-mediated proteolysis in Gram-positive bacteria is autoregulated by the stability of a repressor. EMBO J.20, 852–863 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Andrus, J. M., Bowen, S. W., Klaenhammer, T. R. & Hassan, H. M. Molecular characterization and functional analysis of the manganese-containing superoxide dismutase gene (sodA) from Streptococcusthermophilus AO54. Arch. Biochem. Biophys.420, 103–113 (2003). [DOI] [PubMed] [Google Scholar]
- 52.Bosma, E. F., Rau, M. H., Van Gijtenbeek, L. A. & Siedler, S. Regulation and distinct physiological roles of manganese in bacteria. FEMS Microbiol. Rev.45, 1–16 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Robertson, G. T., Ng, W. L., Gilmour, R. & Winkler, M. E. Essentiality of clpX, but not clpP, clpL, clpC, or clpE, in Streptococcuspneumoniae R6. J. Bacteriol.185, 2961–2966 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Aljghami, M. E., Barghash, M. M., Majaesic, E., Bhandari, V. & Houry, W. A. Cellular functions of the ClpP protease impacting bacterial virulence. Front. Mol. Biosci.9, 1054408. 10.3389/fmolb.2022.1054408 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Barnes, V. M. et al. Acceleration of purine degradation by periodontal diseases. J. Dent. Res.88, 851–855 (2009). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The *S. anginosus* 980/01 genome sequence is available under GenBank accession number: CP183189. The datasets presented in this study have been deposited in the BioProject database under accession number PRJNA1228600.




