Skip to main content
PHAGE: Therapy, Applications, and Research logoLink to PHAGE: Therapy, Applications, and Research
. 2023 Dec 14;4(4):165–172. doi: 10.1089/phage.2023.0021

Bacillus subtilis Phages Related to SIOphi from Desert Soils of the Southwest United States

Leigh H Magness 1, Véronique A Delesalle 1,, Albert C Vill 2, Madison S Strine 3, Brianne E Chaudhry 4, Katherine B Lichty 5, Alexandra A Guffey 6, Jenna M DeCurzio 7, Greg P Krukonis 8,
PMCID: PMC11932521  PMID: 40134792

Abstract

Background:

Phages impact Bacillus subtilis microbial dynamics, but our understanding of the diversity of phages that can lyse this model organism is limited.

Materials and Methods:

Phages were isolated from soil collected from two Southwest United States desert sites on wild B. subtilis strains. Phage genomes were assembled and bioinformatically characterized, and phage host range was assessed.

Results:

Four myoviruses with high nucleotide and amino acid similarity to each other (>97%) and to three phages in GenBank (>82%), including SIOphi, were isolated. These phages have double-stranded DNA genomes (153,882–156,577 bp), low GC content, and 256–270 putative protein coding genes (28–30% with predicted functions). Comparative genomics revealed differences in putative lysis and replication genes.

Conclusions:

Comparative approaches provide insight into phage evolution, identifying unique genes shared within phage clusters. Better characterization of Bacillus phages will aid in linking genetic differences among phages (e.g., lysin genes) to phenotype (e.g., host range).

Keywords: Bacillus subtilis, Southwest US deserts, functional annotation, comparative genomics, gene conservation

Introduction

Bacteriophages (phages) are abundant in all ecological communities and are important drivers of ecological and evolutionary processes.1–8 Phages are diverse, with a wide range of genome sizes and a high frequency of novel genes due to the diversity of their bacterial hosts, as well as the speed of bacteria and phage evolution.2,9,10 Despite the importance of Bacillus species to microbial communities,11–13 their impact on plant community structure,14 and the role of Bacillus subtilis as a model organism,9,15,16 only 408 sequenced phages (as deposited in NCBI in July 2023) have been isolated on various species of this genus. This is a small number in contrast to the more than 2000 sequenced phages isolated on Mycobacterium smegmatis, largely due to the efforts of the SEA-PHAGES program.17 Our research is focused on describing phages specifically isolated on wild strains of B. subtilis. Here, we describe four such phages that form a distinct subcluster, sharing high genetic similarity to only three previously described phages.9

Materials and Methods

Approximately 1 g of soil from samples collected either at Tumamoc Hill in Tucson, AZ, or Death Valley National Park (Table 1) was added to 20 mL LB broth and incubated at 37°C, shaking at 250 rpm for 4 h, then filtered (0.22 μm). Filtrates were plated either on B. subtilis wild strain T89-30 or TG115, strains isolated by Istock et al.18,19 and kept as spore stocks in our lab. Two individual plaques were isolated from each soil sample and were single-plaque purified three times on lawns made from spores of the isolation host. High titer lysates were prepared by flooding with LB broth multiple webbed plates. Lysates were filtered (0.22 μm) and DNA was extracted using the Promega Wizard DNA Clean-Up System. A high-titer lysate of one of our isolated phages was used for visualization on a Philips CM100 transmission electron microscope at an accelerating voltage of 100 kV by staining with 2% phosphotungstic acid.

Table 1.

Basis Genomic Characteristics of Four Novel Bacillus subtilis Phages and Their Close Relatives

Phage Isolation strain Genome size (bp) DTR (bp) %GC No. of ORFs Collecting site and date GPS coordinates Accession number
000TH008 T89-30 155,246 3014 38.6 266 Tumamoc Hill, AZ, September 2011 32.2180N 111.0036W MW419084
000TH009 TG115 155,191 3012 38.6 266 Tumamoc Hill, AZ, September 2011 32.2180N 111.0036W MW419085
015DV002 TG115 153,882 2802 38.7 256 Death Valley NP, CA, May 2014 36.1485N 116.7702W MW419086
015DV004 TG115 156,577 3080 38.6 270 Death Valley NP, CA, May 2014 36.1485N 116.7702W MW419087
SIOphi Bacillus subtilis 53 146,698 N/A 39.0 206 N/A N/A KC699836
vB_BspH_Mawwa Bacillus sp.54 149,014 N/A 39.0 230 Springville, UT N/A MW749002
vB_BspH_TimeGriffin Bacillus sp.54 148,525 N/A 39.1 235 Provo, UT N/A MW749007

Genome size includes the DTR, whose length is also given. ORF for either known or putative gene; NP; N/A based on information associated with accession for these phages. Samples from Death Valley were collected under the permit DEVA-2014-SCI-0026. In text, we refer to the two phages isolated from Death Valley as DV phages and the two phages isolated from Tumamoc Hill as TH phages.

DTR, direct terminal repeat; N/A, not available; NP, National Park; ORF, open reading frame.

DNA samples were sent to North Carolina State University's Genomic Science Laboratory for sequencing on Illumina MiSeq platform (v3 150 SE flow cell) from libraries prepared with Illumina Truseq Nano DNA library prep kit. We aligned and assembled reads using GS de novo assembler v2.920 and verified quality with Consed v29.21 Each genome assembled as a single contig with >1000 × coverage and consensus sequences were of high quality with no additional sequencing necessary. Genome ends were determined with PhageTerm.22

Finished sequences were imported into DNAMaster v5.22.2223 for annotation. Putative genes were called based on both Glimmer v3.0 and GeneMark v2.5 algorithms.24,25 Putative protein functions were predicted using BLAST v2.1226 (if E-value below 10−5) and HHpred (if probability <85%, coverage >50%, and E-value <10−5).27 The absence of tRNA genes was confirmed with Aragorn.28 Phage lifecycle was confirmed by analyzing annotations with PhageLeads29 and BACPHLIP.30 Default settings were used in all programs. All genome annotations have been submitted to NCBI (accession numbers in Table 1).

A Phamerator database31 was constructed with 312 Bacillus phages with complete sequences available on NCBI as of May 1, 2022 (Bacillus v5; https://phamerator.org).32,33 This database groups phages into clusters and subclusters and was used to generate genome maps illustrating relationships between phages based on both amino acid (aa) and nucleotide similarity. As part of the Phamerator clustering protocol, gene products are first grouped into phage protein families, or “phams,” using the PhaMMseqs pipeline; then phages that share 35% or more of their phams are classified as belonging to the same cluster.32,33

Additional genomic comparisons were done by (1) calculating Average Nucleotide Identity (ANI) and Average Amino acid Identity (AAI) using algorithms from the enve-omics lab,34 (2) generating genome dotplots with Gepard,35 (3) conducting BLASTn (nr/nt database) or BLASTp (nr database) searches (performed July 2023), either against the complete database or the database for tailed phages (taxid: 10699, 10662 and 10744), using default parameters, or (4) aligning aa sequences of homologous genes using ClustalW within MEGA11.36 Alignment figures were made with ESPript3.0.37

We tested our phages' ability to lyse seven B. subtilis strains: two lab strains obtained from the Bacillus Genetic Stock Center—the model strains 168 (BGSCID 1A1) and W23SR (BGSCID 2A3)—and five wild strains in our lab collection originally from Istock et al.18,19 A microtiter plate-based assay38 was set up with initial titered concentrations of 90.91 × 106 PFU/mL and 90.91 × 107 CFU/mL for a multiplicity of infection of 1:10. Two hundred twenty microliters of LB2 broth with bacteria and phage was added to wells of a 96-well plate with each phage-bacteria combination replicated five times. Wells with only bacteria served as controls. Plates were incubated for 20 h at 37°C auto mixing for 10 s every 5 min with absorbance reading at 600 nm every 30 min. For a conservative estimate of host range, we scored a phage as impacting bacterial growth if at least four out of five growth curves for bacteria with phage were below all the growth curves for bacteria alone.

Results

Basic genomic characteristics

Four B. subtilis phages were isolated from soil samples from Tumamoc Hill in Tucson, AZ, or Death Valley National Park (Table 1). These phages were isolated on wild bacterial strains of B. subtilis. Their double-stranded DNA genomes ranged from 153,882 to 156,577 bp, including direct terminal repeat (DTR) ends. They have 256–270 putative protein coding genes, no tRNA genes, and a GC content (38.6–38.7%) lower than typical of their B. subtilis host, which is usually around 43%.39 These phages produced clear plaques on their isolation host, suggesting they are lytic, a conclusion supported by annotations of their genomes and by genomic analyses using the programs PhageLeads29 and BACPHLIP.30

As is typical of myoviruses, these phages have a long contractile tail (Fig. 1) and a genome greater than 125 Kb.2,40 Gene order in myoviruses is less conserved than in siphoviruses or podoviruses; however, two groupings of structural genes can still be identified (Fig. 2; Supplementary Fig. S1).40,41 The first includes the terminase, portal, and head proteins, while the second encodes the baseplate and tail proteins.40 Overall, a function could be assigned to less than 30% of the putative genes (Fig. 2; Supplementary Table S1). These phages show high nucleotide and amino acid similarity to each other—greater than 97% for all pairwise comparisons (Table 2; Supplementary Fig. S2).

FIG. 1.

FIG. 1.

Representative brightfield TEM image of 015DV002 stained with 2% phosphotungstic acid. Image was taken at 64,000 × magnification using a Philips CM100 TEM. TEM, transmission electron microscope.

FIG. 2.

FIG. 2.

Genome map of 015DV004, chosen because it is the longest genome with the most genes among our four W7 phages. The ruler shows genome length (in kilobases) with the forward genes shown above the ruler. Function or putative functions are listed above genes for genes on the forward strand and below genes for those genes on the reverse strand (see also Supplementary Table S1). Map was created using Phamerator.31

Table 2.

Average Nucleotide Identity (Above the Diagonal) and Average Amino Acid Identity (Below the Diagonal) for Our Seven Novel Phages and SSP1, with Each Phage Compared to the Other Phages in Its Cluster

  000TH008 000TH009 015DV002 015DV004 SIOphi vB_BspH_Mawwa vB_BspH_TimeGriffin
000TH008 100 98 98 87 87 87
000TH009 100 98 98 87 87 87
015DV002 97 97 98 87 87 87
015DV004 97 97 98 87 87 87
SIOphi 82 82 82 82 97 98
vB_BspH_Mawwa 82 83 83 82 95 98
vB_BspH_TimeGriffin 82 83 83 83 96 97

Data presented as % and calculated using algorithms from Rodriguez-R and Konstantinidis.34

These phages belong to subcluster W7

Based on the best BLASTn searches (see below), as well as BLASTp searches that were conducted during our annotations, we identified only Bacillus phages as the best matches with respect to genome or protein similarity; therefore, our subsequent analyses focused only on Bacillus phages. A BLASTn search using 015DV002 identified three Bacillus phages with significant nucleotide similarity to our phages: SIOphi (query coverage/identity 71%/91.3%), vB_BspH_Mawwa (74%/88.1%), and vB_BspH_TimeGriffin (71%/88.4%). The next closest match is to Bacillus phage SBSphiJ5 with only 11% query. Based on Phamerator clustering,31,32 these seven phages belong to the W7 subcluster (referred to as J2 in an older clustering analysis with a database of 83 phages9). These W7 phages show at least 87% nucleotide or 82% aa similarity for all pairwise comparisons (Table 2; Supplementary Fig. S2).

Based on pham assignment, there are 21 genes that are unique to these seven phages (i.e., not found in any other Bacillus phages included in our Phamerator database). All these genes are small (range 129–505 bp) and could not be assigned a putative function. In addition, there are 28 genes only found in our W7 phages, 6 genes only found in the 2 TH phages, and 9 genes only found in the 2 DV phages. The majority of these genes also have no known function.

Comparative genomics of our W7 phages

The genes with functions that are conserved across the four W7 phages are genes involved with DNA metabolism, cell lysis, and virion structure, which is expected as these are well-conserved genes in Bacillus phages.9 However, there are some differences worth noting, particularly for genes involved in phage attachment to their host and host lysis.

As is typical of phage lysins, the N-terminus is the enzymatically active domain, while the C-terminus contains the cell wall binding domain.42 For the lysin of our four W7 phages, the C-terminus shows high amino acid conservation and contains a domain of unknown function (DUF3597; pfam12200; Supplementary Fig. S3). In contrast, the N-terminus differs between 000TH008 (N-acetylmuramoyl-l-alanine amidase domain) and the other three phages (000TH009, 015DV002, and 015DV004; l-alanyl-d-glutamate peptidase domain), resulting in the lysins being classified in different phams. This suggests that these two lysins may differ in how they cleave the peptides that link the polysaccharide layers of the peptidoglycan cell wall.43 The lysin in 000TH008 belongs to a pham found in 21 other phages in our database, all members of the W cluster, including SIOphi, vB_BspH_Mawwa, and vB_BspH_TimeGriffin, while the other lysin is only found in our other three phages. The l-alanyl-d-glutamate peptidase domain belongs to the M15 metallopeptidase family and, outside of our database, the best DELTA-BLASTp (Domain Enhanced Lookup Time Accelerated BLAST) match is to a metallopeptidase in Edaphobacillus lindanitolerans (query coverage 95%; identity 41.6%, E-value = 1 × e−92).

Another genomic region of expected conservation is the section that encodes the baseplate and tail proteins.40 These genes must coevolve with each other to maintain necessary interactions.44 This high conservation is observed across the tail proteins of our W7 phages, except for one tail protein in 015DV004 (gp 178; Supplementary Fig. S4). Amino acid and nucleotide alignments of the tail protein homologs (Supplementary Fig. S4) reveal three regions: an N-terminus with high amino acid conservation not reflected in the nucleotide sequence, a middle region with both high amino acid and nucleotide conservation, and a C-terminus with less aa and nucleotide conservation. The N-terminus and middle region of higher conservation corresponds to a siphovirus protein of unknown function (DUF859, pfam 05895).

All four W7 phages encode the alpha and beta subunits of a ribonucleoside-diphosphate reductase (RNR), which has been observed to be the most highly conserved Bacillus gene product.9 However, in 015DV002, the alpha subunit is split into two genes (genes 131 and 133) by an HNH endonuclease (gene 132) (Supplementary Fig. S1; Supplementary Table S1). This phage also possesses an additional homing endonuclease (gene 136) that is not found in our other three W7 phages. Homing endonucleases are typically found as stand-alone genes in phage genomes but can also be inserted as self-splicing introns or inteins into coding genes.45,46 In particular, in Aeromonas phage Aeh1, the alpha subunit (nrdA) is also interrupted by a HNH family homing endonuclease which did not inactivate RNR function; the spliced alpha subunits could still assemble with each other and with the beta subunit.46

A number of genes with specific functions are only called in 000TH008 and 000TH009. These include MqsA-like antitoxin, rubredoxin reductase, HNH restriction endonuclease, and uracil DNA glycosylase. In contrast, a gene that encodes a glycosyl transferase is only called in the two DV phages. These differences are interesting and remaining to be explained. In particular, the MqsA-like antitoxin is found in bacteria and prophages and is part of a toxin/antitoxin (TA) system that provides stability to plasmids and plays a role in biofilm formation and phage inhibition.47 Thus, this gene may be useful in competition with other phages and was likely incorporated in the 000TH phages by horizontal gene transfer.

Repeated sequences in the W7 phages

Based on Phamerator maps, there are many repeated sequences between these genomes (Supplementary Fig. S1). These repeated sequences appear primarily in the DTR regions and the early region of the genome (first 50 genes). Most of these repeats appear to be intergenic, though a few cases of intragenomic repeats appear in genes that encode the minor capsid protein (gene 165 in 015DV004), a hypothetical protein (gene 196 015DV004), and the lysin mentioned above. Intergenic repeat sequences have been observed in Gordonia phages of cluster DJ (Personal communication; Pollenz, 2023), but little is known about their function.

Host range

We tested these phages against seven B. subtilis strains in our collection, including lab strain 168 and W23 (Supplementary Table S2). The DV phages had limited ability to lyse 168, while the TH phages could not lyse this strain; none of these phages could lyse strain W23. In contrast, all four phages could lyse all of the wild strains that we tested them against to varying degrees. For example, the impact of 015DV004 on the growth of T89-05 was smaller compared to the other three phages which showed more pronounced effects on bacteria growth (Supplementary Table S2). Notably, this is the phage with the less-conserved tail protein (gp 178) discussed above.

Discussion

Myoviruses typically have large genomes (>125 Kb) and have been studied for their ability to use multiple infection and lysis strategies for replication.48 The phages described here represent a novel subcluster of Bacillus phages with high nucleotide and amino acid similarity among themselves, and with the genomes of three other Bacillus phages. The number of unique genes without an identifiable putative function in these phage is consistent with previous studies, as dsDNA phage genomes are known to be a reservoir of putative coding genes of unknown function.49 These unique genes of unknown function help to delineate clusters and subclusters of phages, potentially elucidating evolutionary boundaries.9 It is also possible that discovery and description of additional phages may blur these boundaries.50 In addition, comparisons between multiple related phages help us better understand the host networks through which horizontal gene transfer occurs. Our study points to two such possibilities: the acquisition of the lysin l-alanyl-d-glutamate peptidase domain and the MqsA-like antitoxin in these phages.

A major impetus of phage genomic description is to link genetic differences among phages (e.g., lysin genes) to phenotype (e.g., host range). This is not an easy task. First, all methods for assessing host range have limitations.51 For example, plaques observed with spot testing can be caused by lysis from without. Second, we need a better understanding of the diversity of bacterial hosts that phages interact with in soil communities, so that phages can be tested against relevant bacterial hosts rather than lab strains they have not evolved with. Our four phages could either not lyse lab strains 168 and W23 or had limited ability to do so. Lab-propagated bacterial strains are known to undergo genome streamlining, and it is therefore possible that the bacterial genes required for productive phage infection were lost, as such genes may not be adaptive in a homogenous lab environment.52 Finally, phage research needs to further develop quantitative measures, in addition to qualitative (yes/no) measures, of phenotypic traits such as infectivity or host range. These will be necessary to understand when and how differences among gene complement and gene sequence affect phage fitness.

Acknowledgments

We thank past and present members of the Phages Rock lab at Gettysburg College. We are especially grateful to Steve Cresawn for developing the Phamerator program and to Steve Cresawn and Lawrence (Owen) Abad for help building our Bacillus Phamerator database.

Authors' Contributions

Conceptualization, V.A.D. and G.P.K.; Methodology, V.A.D., G.P.K.; Validation, V.A.D., G.P.K., and L.H.M.; Formal analysis, V.A.D., G.P.K., and L.H.M.; Investigation, V.A.D., G.P.K., L.H.M., A.C.V., B.E.C., K.B.L., M.S.S., A.A.G., and J.M.D.; Resources, V.A.D. and G.P.K.; Data curation, V.A.D., G.P.K., L.H.M; A.C.V., B.E.C., K.B.L., M.S.S., A.A.G., and J.M.D.; Writing—original draft preparation, V.A.D., G.P.K., and L.H.M.; Writing—review and editing, V.A.D., G.P.K., L.H.M., A.C.V., M.S.S., B.E.C., J.M.D., and A.A.G.; Visualization, V.A.D., G.P.K., and L.H.M.; supervision, V.A.D. and G.P.K.; Project administration, V.A.D. and G.P.K; Funding acquisition, V.A.D. and G.P.K.

Author Disclosure Statement

No competing financial interests exist.

Funding Information

This research was supported by a grant to Gettysburg College from the Howard Hughes Medical Institute through the Precollege and Undergraduate Science Education Program (Grant No. 52007540) and by Research and Professional Development grants from Gettysburg College to V.A.D. and G.P.K.

Supplementary Figure S1
Supplementary Figure S2
Supplementary Figure S3
Supplementary Figure S4
Supplementary Table S1
Supplementary Table S2

References

  • 1. Chanal A, Chapon V, Benzerara K, et al. The desert of Tataouine: An extreme environment that hosts a wide diversity of microorganisms and radiotolerant bacteria. Environ Microbiol 2006;8(3):514–525; doi: 10.1111/j.1462-2920.2005.00921.x [DOI] [PubMed] [Google Scholar]
  • 2. Oliveira H, Melo LDR, Santos SB, et al. Molecular aspects and comparative genomics of bacteriophage endolysins. J Virol 2013;87(8):4558–4570; doi: 10.1128/JVI.03277-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Prestel E, Regeard C, Salamitou S, et al. The bacteria and bacteriophages from a Mesquite Flats site of the Death Valley desert. Antonie Van Leeuwenhoek 2013;103(6):1329–1341; doi: 10.1007/s10482-013-9914-4 [DOI] [PubMed] [Google Scholar]
  • 4. Prestel E, Salamitou S, DuBow MS. An examination of the bacteriophages and bacteria of the Namib desert. J Microbiol 2008;46(4):364–372; doi: 10.1007/s12275-008-0007-4 [DOI] [PubMed] [Google Scholar]
  • 5. Roberts MS, Cohan FM. Recombination and migration rates in natural populations of Bacillus subtilis and Bacillus mojavensis. Evolution 1995;49(6):1081–1094; doi: 10.1111/j.1558-5646.1995.tb04435.x [DOI] [PubMed] [Google Scholar]
  • 6. Koskella B, Brockhurst MA. Bacteria–phage coevolution as a driver of ecological and evolutionary processes in microbial communities. FEMS Microbiol Rev 2014;38(5):916–931; doi: 10.1111/1574-6976.12072 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Penadés JR, Chen J, Quiles-Puchalt N, et al. Bacteriophage-mediated spread of bacterial virulence genes. Curr Opin Microbiol 2015;23:171–178; doi: 10.1016/j.mib.2014.11.019 [DOI] [PubMed] [Google Scholar]
  • 8. Scanlan PD. Bacteria–Bacteriophage coevolution in the human gut: Implications for microbial diversity and functionality. Trends Microbiol 2017;25(8):614–623; doi: 10.1016/j.tim.2017.02.012 [DOI] [PubMed] [Google Scholar]
  • 9. Grose JH, Jensen GL, Burnett SH, et al. Erratum to: Genomic comparison of 93 Bacillus phages reveals 12 clusters, 14 singletons and remarkable diversity. BMC Genomics 2014;15(1):1184; doi: 10.1186/1471-2164-15-1184 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Bibby K. Improved bacteriophage genome data is necessary for integrating viral and bacterial ecology. Microb Ecol 2014;67(2):242–244; doi: 10.1007/s00248-013-0325-x [DOI] [PubMed] [Google Scholar]
  • 11. Li X, Wang T, Fu B, et al. Improvement of aquaculture water quality by mixed Bacillus and its effects on microbial community structure. Environ Sci Pollut Res 2022;29(46):69731–69742; doi: 10.1007/s11356-022-20608-0 [DOI] [PubMed] [Google Scholar]
  • 12. Radhakrishnan R, Hashem A, Abd_Allah EF. Bacillus: A Biological Tool for Crop Improvement through bio-molecular changes in adverse environments. Front Physiol 2017;8:667; doi: 10.3389/fphys.2017.00667 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Li G, Zheng X, Zhu Y, et al. Bacillus symbiont drives alterations in intestinal microbiota and circulating metabolites of lepidopteran host. Environ Microbiol 2022;24(9):4049–4064; doi: 10.1111/1462-2920.15934 [DOI] [PubMed] [Google Scholar]
  • 14. Earl AM, Losick R, Kolter R. Ecology and genomics of Bacillus subtilis. Trends Microbiol 2008;16(6):269–275; doi: 10.1016/j.tim.2008.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Errington J, van der Aart LT. Microbe Profile: Bacillus subtilis: Model organism for cellular development, and industrial workhorse. Microbiology (Reading) 2020;166(5):425–427; doi: 10.1099/mic.0.000922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Stork DA, Squyres GR, Kuru E, et al. Designing efficient genetic code expansion in Bacillus subtilis to gain biological insights. Nat Commun 2021;12(1):5429; doi: 10.1038/s41467-021-25691-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. The Actinobacteriophage Database. Available from: http://bacillus.phagesdb.org [Last accessed: April 8, 2023].
  • 18. Duncan KE, Ferguson N, Kimura K, et al. Fine-scale genetic and phenotypic structure in natural populations of Bacillus subtilis and Bacillus licheniformis: Implications for bacterial evolution and speciation. Evolution 1994;48(6):2002–2025; doi: 10.1111/j.1558-5646.1994.tb02229.x [DOI] [PubMed] [Google Scholar]
  • 19. Istock CA, Ferguson N, Istock NL, et al. Geographical diversity of genomic lineages in Bacillus subtilis (Ehrenberg) Cohn sensu lato. Org Divers Evol 2001;1(3):179–191; doi: 10.1078/1439-6092-00017 [DOI] [Google Scholar]
  • 20. Russell DA. Sequencing, assembling, and finishing complete bacteriophage genomes. Methods Mol Biol 2018;1681:109–125; doi: 10.1007/978-1-4939-7343-9_9 [DOI] [PubMed] [Google Scholar]
  • 21. Gordon D, Green P. Consed: A graphical editor for next-generation sequencing. Bioinformatics 2013;29(22):2936–2937; doi: 10.1093/bioinformatics/btt515 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Garneau JR, Depardieu F, Fortier LC, et al. PhageTerm: A tool for fast and accurate determination of phage termini and packaging mechanism using next-generation sequencing data. Sci Rep 2017;7(1):8292; doi: 10.1038/s41598-017-07910-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Lawrence J. DNAMaster. Available from: http://cobamide2.bio.pitt.edu/ [Last accessed: July 1, 2022].
  • 24. Delcher AL, Harmon D, Kasif S, et al. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 1999;27(23):4636–4641; doi: 10.1093/nar/27.23.4636 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Lukashin AV, Borodovsky M. GeneMark.hmm: New solutions for gene finding. Nucleic Acids Res 1998;26(4):1107–1115; doi: 10.1093/nar/26.4.1107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Boratyn GM, Schäffer AA, Agarwala R, et al. Domain enhanced lookup time accelerated BLAST. Biol Direct 2012;7:12; doi: 10.1186/1745-6150-7-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 2005;33(Web Server issue):W244–W248; doi: 10.1093/nar/gki408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 2004;32(1):11–16; doi: 10.1093/nar/gkh152 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Yukgehnaish K, Rajandas H, Parimannan S, et al. PhageLeads: Rapid assessment of phage therapeutic suitability using an ensemble machine learning approach. Viruses 2022;14(2):342; doi: 10.3390/v14020342 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Hockenberry AJ, Wilke CO. BACPHLIP: Predicting bacteriophage lifestyle from conserved protein domains. PeerJ 2021;9:e11396; doi: 10.7717/peerj.11396 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Cresawn SG, Bogel M, Day N, et al. Phamerator: A bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 2011;12:395; doi: 10.1186/1471-2105-12-395 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Gauthier CH, Cresawn SG, Hatfull GF. PhaMMseqs: A new pipeline for constructing phage gene phamilies using MMseqs2. G3 (Bethesda) 2022;12(11):jkac233; doi: 10.1093/g3journal/jkac233 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Mavrich TN, Gauthier C, Abad L, et al. pdm_utils: A SEA-PHAGES MySQL phage database management toolkit. Bioinformatics 2021;37(16):2464–2466; doi: 10.1093/bioinformatics/btaa983 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Rodriguez-R LM, Konstantinidis KT. The enveomics collection: A toolbox for specialized analyses of microbial genomes and metagenomes. PeerJ Preprints 4:e1900v1; doi: 10.7287/peerj.preprints.1900v1 [DOI] [Google Scholar]
  • 35. Krumsiek J, Arnold R, Rattei T. Gepard: A rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 2007;23(8):1026–1028; doi: 10.1093/bioinformatics/btm039 [DOI] [PubMed] [Google Scholar]
  • 36. Tamura K, Stecher G, Kumar S. MEGA11: Molecular evolutionary genetics analysis version 11. Mol Biol Evol 2021;38(7):3022–3027; doi: 10.1093/molbev/msab120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Robert X, Gouet P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 2014. Jul;42(Web Server issue):W320–W324; doi: 10.1093/nar/gku316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Xie Y, Wahab L, Gill JJ. Development and validation of a microtiter plate-based assay for determination of bacteriophage host range and virulence. Viruses 2018;10(4):189; doi: 10.3390/v10040189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Kunst F, Ogasawara N, Moszer I, et al. The complete genome sequence of the Gram-positive bacterium Bacillus subtilis. Nature 1997;390(6657):249–256; doi: 10.1038/36786 [DOI] [PubMed] [Google Scholar]
  • 40. Comeau AM, Tremblay D, Moineau S, et al. Phage morphology recapitulates phylogeny: The comparative genomics of a new group of myoviruses. PLoS One 2012;7(7):e40102; doi: 10.1371/journal.pone.0040102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Hatfull GF. Bacteriophage genomics. Curr Opin Microbiol 2008;11(5):447–453; doi: 10.1016/j.mib.2008.09.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Payne KM, Hatfull GF. Mycobacteriophage endolysins: Diverse and modular enzymes with multiple catalytic activities. PLoS One 2012;7(3):e34052; doi: 10.1371/journal.pone.0034052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Kerff F, Petrella S, Mercier F, et al. Specific structural features of the N-acetylmuramoyl-L-alanine amidase AmiD from Escherichia coli and mechanistic implications for enzymes of this family. J Mol Biol 2010;397(1):249–259; doi: 10.1016/j.jmb.2009.12.038 [DOI] [PubMed] [Google Scholar]
  • 44. Hatfull GF, Hendrix RW. Bacteriophages and their genomes. Curr Opin Virol 2011;1(4):298–303; doi: 10.1016/j.coviro.2011.06.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Edgell DR, Gibb EA, Belfort M. Mobile DNA elements in T4 and related phages. Virol J 2010;7:290; doi: 10.1186/1743-422X-7-290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Friedrich NC, Torrents E, Gibb EA, et al. Insertion of a homing endonuclease creates a genes-in-pieces ribonucleotide reductase that retains function. Proc Natl Acad Sci U S A 2007;104(15):6176–6181; doi: 10.1073/pnas.0609915104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Sun C, Guo Y, Tang K, et al. MqsR/MqsA toxin/antitoxin system regulates persistence and biofilm formation in Pseudomonas putida KT2440. Front Microbiol 2017;8:840; doi: 10.3389/fmicb.2017.00840 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Skliros D, Karpouzis E, Kalloniati C, et al. Comparative genomic analysis of dwarf Vibrio myoviruses defines a conserved gene cluster for successful phage infection. Arch Virol 2022;167(2):501–516; doi: 10.1007/s00705-021-05340-3 [DOI] [PubMed] [Google Scholar]
  • 49. Kristensen DM, Cai X, Mushegian A. Evolutionarily conserved orthologous families in phages are relatively rare in their prokaryotic hosts. J Bacteriol 2011;193(8):1806–1814; doi: 10.1128/JB.01311-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Pope WH, Bowman CA, Russell DA, et al. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity. Elife 2015;4:e06416; doi: 10.7554/eLife.06416 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Hyman P, Abedon ST. Bacteriophage host range and bacterial resistance. Adv Appl Microbiol 2010;70:217–248; doi: 10.1016/S0065-2164(10)70007-1 [DOI] [PubMed] [Google Scholar]
  • 52. Vernyik V, Karcagi I, Tímár E, et al. Exploring the fitness benefits of genome reduction in Escherichia coli by a selection-driven approach. Sci Rep 2020;10(1):7345; doi: 10.1038/s41598-020-64074-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Krasowska A, Biegalska A, Augustyniak D, et al. Isolation and characterization of phages infecting Bacillus subtilis. Biomed Res Int 2015;2015:179597; doi: 10.1155/2015/179597 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Arbon JR. Characterization of Bacteriophage Targeting Bacillus licheniformis in Milk Processes and Thermal Stability of Bacteriophage During HTST Pasteurization. Theses and Dissertations; 2021; p. 9350. Available from: https://scholarsarchive.byu.edu/etd/9350 [Last accessed: April 8, 2023].

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure S1
Supplementary Figure S2
Supplementary Figure S3
Supplementary Figure S4
Supplementary Table S1
Supplementary Table S2

Articles from PHAGE: Therapy, Applications, and Research are provided here courtesy of Mary Ann Liebert, Inc.

RESOURCES