Abstract
Several species from the Alicyclobacillus genus have received much attention from the food and beverages industries. Their presence has been co-related with spoilage events of acidic food matrices, namely fruit juices and other fruit-based products, the majority attributed to Alicyclobacillus acidoterrestris. In this work, a combination of short and long reads enabled the assembly of the complete genome of A. acidoterrestris DSM 3922T, perfecting the draft genome already available (AURB00000000), and revealing the presence of one chromosome (4,222,202 bp; GC content 52.3%) as well as one plasmid (124,737 bp; GC content 46.6%). From the 4,288 genes identified, 4,004 sequences were attributed to coding sequences with proteins, with more than 80% being functionally annotated. This allowed the identification of metabolic pathways and networks and the interpretation of high-level functions with significant reliability. Furthermore, the additional genes of interest related to spore germination, off-flavor production, namely the vdc cluster, and CRISPR arrays, were identified. More importantly, this is the first complete and closed genome sequence for a taint-producing Alicyclobacillus species and thus represents a valuable reference for further comparative and functional genomic studies.
Keywords: Alicyclobacillus acidoterrestris, spoilage bacteria, taints and off-flavors, complete genome, genome sequencing, assembly, and annotation, function prediction
Introduction
During the last decades, different Alicyclobacillus (ACB) strains have challenged a variety of commercial activities, including the dairy, bakery, distilling, and beverage industries, since their presence in acidic food matrices, namely fruit juices and other fruit-based products, has been correlated with spoilage events (Merle and Montville 2014). In addition, certain ACB strains have been described as producing metabolites, such as phenolic compounds, with the ability to modify the organoleptic properties of the food matrices they contaminate (Gocmen et al. 2005). These off-flavors and odors, described as antiseptic, smoky, and medicinal-like, were shown to be produced by strains belonging to certain ACB species, where Alicyclobacillus acidoterrestris is the one associated with the majority of the reported spoilage events (Gocmen et al. 2005; Niwa 2005; Danyluk et al. 2011; Zhang et al. 2013; Roth et al. 2021). Thus, A. acidoterrestris has been the focus of most studies related to ACB spoilage control by exploring different strategies for the detection and growth inhibition of these bacteria in food matrices (Bahçeci and Acar 2007; Sokołowska et al. 2012; Bevilacqua et al. 2013; Porębska et al. 2017; Hu et al. 2020). To continue devising strategies in a targeted way, a thorough characterization of A. acidoterrestris, both genetic and phenotypic, is crucial to enhance the knowledge regarding microbial behavior, adaptation, unique features, and, ultimately, control.
Phenotypic traits of the type strain A. acidoterrestris DSM 3922T (= ATCC 49025 = LMG 16906) have been widely explored. This gram-positive aerobic bacterium can grow in acidic environments (pH 2.5 – pH 6.0) and at a wide range of temperatures (25°C – 60°C). It can produce subterminal to terminal oval spores, with ω-alicyclic fatty acids in the composition of its membrane, which are distinctive fatty acids present across the majority of ACB species (Wisotzkey et al. 1992; Goto et al. 2003). This type strain can metabolize various carbon sources and in presence of vanillin, vanillic acid, or other aromatic compounds, it can produce guaiacol, 2,6-dichlorophenol, and 2,6-dibromophenol, which are the compounds identified as the leading cause of consumer complaints of spoiled food products (Silva and Gibbs 2001; Cai et al. 2015).
The genome of A. acidoterrestris started being extensively explored after 2013 when Shemesh et al. (2013) presented a draft genome for the type strain, exclusively obtained from short reads. This genome comprises 207 contigs, where most of the predicted annotations correspond to hypothetical proteins (89.7% of the identified protein-coding sequences). Relevant regions such as the 16S rRNA region and genes involved in guaiacol production, namely the vdcC gene, had already been elucidated and used for phylogenetic analysis and the prediction of off-flavor production, respectively (da Costa et al. 2009; Wang, Yue, et al. 2021).
Despite that, a comprehensive elucidation of the complete A. acidoterrestris genomic traits may disclose novel key features with relevance for spoilage control (e.g. spore germination, additional off-flavors producing enzymes) or biotechnological applications (e.g. thermostable enzymes) (Correa-llantén et al. 2014; Murphy et al. 2020). Bearing this in mind, the aim of this work is to: (1) provide the first complete genome of a taint-producing ACB strain by solving the unassembled draft genome of A. acidoterrestris DSM 3922T through sequencing and assembly of long and short reads; (2) functionally annotate most coding sequences, including the ones newly revealed; and (3) present a global functional analysis highlighting distinct features of interest (e.g. off-flavor production-related genes) that may be of great relevance to comparative and functional genomic analysis within this spoilage-associated bacterial group.
Methods
Strain origin and growth conditions
Alicyclobacillus acidoterrestris DSM 3922T, obtained from Deutsche Sammlung von Mikroorganismen und Zellkulturen (DSMZ, Germany), was used in this study. When needed, the strain was cultured either in BAT broth (Scharlau, Spain) or BAT agar (Scharlau, Spain), both adjusted to pH 4.
To obtain adequate bacterial growth to perform the DNA extraction steps, A. acidoterrestris DSM 3922T was cultivated from a pure culture grown on BAT agar plates. Isolated colonies were selected and subsequently inoculated in BAT broth. For convenience, the liquid culture was grown for 16 h at 37°C under orbital shaking conditions. A volume of 20 mL of bacterial growth was centrifuged at 4°C, 2,000 g, for 10 min, and the pellet was washed twice in 0.9% w/v NaCl (Panreac, USA). Pelleted bacterial cells were resuspended in RNA/DNA Shield reagent (Zymo Research, USA) to obtain an OD600 nm of 11, corresponding approximately to 109 CFU/mL of vegetative bacteria. The bacterial cell suspension was sent to MicrobesNG (http://www.microbesng.com), which performed the DNA extraction, library preparation, and sequencing as described in the sections below.
DNA extraction
The previously obtained bacterial suspension, 5 – 40 µL, was incubated with 120 µL of Tris-EDTA buffer containing lysozyme and RNase A (ITW Reagents, Barcelona, Spain), both at final concentrations of 0.1 mg/mL, for 25 min at 37°C. Proteinase K (VWR Chemicals, Ohio, USA) and SDS (Sigma-Aldrich, Missouri, USA) were added at final concentrations of 0.1 mg/mL and 0.5% v/v, respectively, and incubated for 5 min at 65°C. Genomic DNA was purified using an equal volume of Solid Phase Reversible Immobilization beads (Beckman Coulter, USA) and finally, resuspended in EB buffer (Qiagen, Germany). DNA was quantified with the Quant-iT dsDNA HS kit (ThermoFisher Scientific, USA) assay in an Eppendorf AF2200 plate reader (Eppendorf UK Ltd, United Kingdom).
Library preparation and sequencing
Short-read genomic DNA libraries were prepared using the Nextera XT Library Prep Kit (Illumina, San Diego, USA) following the manufacturer’s protocol with modifications: (1) input DNA was doubled and (2) PCR elongation time was increased to 45 s. DNA quantification and library preparation were carried out on a Hamilton Microlab STAR automated liquid handling system (Hamilton Bonaduz AG, Switzerland). The Kapa Biosystems Library Quantification Kit for Illumina (Roche, Switzerland) was used to quantify pooled libraries that were sequenced using the NovaSeq 6000 (Illumina, USA), using a 250 bp paired-end protocol.
Long-read genomic DNA libraries were prepared with the Oxford Nanopore SQK-LSK109 kit with Native Barcoding EXP-NBD196 (ONT, United Kingdom) using 400 – 500 ng of DNA. The barcoded sample was loaded in an FLO-MIN106 (R.9.4.1) flow cell in a GridION system (ONT, United Kingdom).
Genome assembly
After an initial trimming of Illumina adapters from the totality of raw short reads, the reads were quality and length filtered using Trimmomatic (v0.39) (Bolger et al. 2014) with a sliding window of 5, a quality cut-off of Q30, and a length cut-off of 50 bp. The Oxford Nanopore long reads were quality and length filtered using Filtlong (v0.2.1) (https://github.com/rrwick/Filtlong) with a quality cut-off of 90% and length cut-off of 1,000 bp.
De novo assembly was performed using Unicycler (v0.4.0) (Wick et al. 2017) applying a hybrid assembly with filtered Illumina short reads and Oxford Nanopore long reads. In the same way, Flye (v2.9) (Kolmogorov et al. 2019) was also used for de novo genome assembly by exclusively using long reads. Results retrieved from both assemblers were evaluated using Geneious Prime 2022.0.1 (https://www.geneious.com), and discrepancies were manually curated by comparing with the obtained reads and the available contigs of the previously published draft genome (Shemesh et al. 2013). Bandage (v0.8.1) (Wick et al. 2015) was used to visualize the assembly graphs when needed.
Genome annotation and analysis
The complete assembled genome sequence obtained was submitted to NCBI. The annotation was performed using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (version 2021-11-29 build5742), which combines ab initio gene prediction algorithms with homology-based methods, enabling an automatic annotation of chromosomes and plasmids (Tatusova et al. 2016; Haft et al. 2018; Li et al. 2021).
The annotated genome was analyzed using different databases to explore genomic features of A. acidoterrestris DSM 3922T. In this case, BLASTKOALA, KEGG, UniProt, and COG databases were used to characterize individual gene functions, categorize proteins with common functions, and identify different metabolic pathways (Kanehisa et al. 2016; Galperin et al. 2019; Cantalapiedra et al. 2021). In addition, the antiSMASH database was also used (Blin et al. 2017) to annotate biosynthetic gene clusters. Furthermore, CGView Server (Grant and Stothard 2008) was used to generate the graphical map of the complete and circular genome, allowing the visualization of sequence features calculated directly from the primary sequence (coding sequences, GC content, GC skew, RNA sequences, and repeat regions).
Results and discussion
De novo complete genome assembly and corroboration
In this work, Illumina and Oxford Nanopore were the platforms selected to obtain sequencing data to perform a hybrid assembly of short and long reads and obtain the complete genome sequence of A. acidoterrestris DSM 3922T. This combination of both reads was previously shown to display promising and reliable results regarding the complete assembly of complex bacterial genomes (De Maio et al. 2019). Long reads, obtained from the Oxford Nanopore platform, provide the required scaffolding and genome structure information. The addition of short reads, retrieved from the Illumina platform, allows a detailed assembly at the local scale, correcting the inherently error-prone long reads (Ruan et al. 2020). The integration of both quality and length filtered datasets (Supplementary Table 1) enabled the reliable, complete, and high coverage sequencing of A. acidoterrestris DSM 3922T genome, with short reads (50 – 251 bp) displaying an average genomic coverage of 70× that supplemented the average coverage of 36× obtained from long reads (1,000 – 119,497 bp). As a result, the complete genome of this strain was revealed to be composed of one long chromosome with 4,222,202 bp and one plasmid with 124,737 bp (Fig. 1, Table 1).
Fig. 1.
Graphical circular maps of A. acidoterrestris DSM 3922T chromosome and its plasmid pDSM3922. From the outside to the inside: coding sequencing (CDS) regions identified on the forward strand, CDSs identified on the reverse strand, GC content , and GC skew. All genomic features identified, namely RNA genes (tRNA, rRNA, tmRNA, and ncRNA), regulatory elements, and repeat regions associated with CRISPR arrays, on the forward and reverse strands (except CDSs) were labeled with clockwise and counterclockwise arrows, respectively.
Table 1.
Genomic features of A. acidoterrestris DSM 3922T.
| Attribute | Draft genome (AURB00000000) |
Complete genome |
||||||
|---|---|---|---|---|---|---|---|---|
| Original contigs | Reannotated contigs | Reannotated and merged contigs |
||||||
| Total | Contigs that map to |
Total | Chromosome (CP080467) | pDSM3922 (CP080468) | ||||
| Chromosome | pDSM3922 | |||||||
| N° of contigs | 207 | 207 | 175 | 166 | 9 | 2 | 1 | 1 |
| Genome size (bp) | 4,063,548 | 4,063,548 | 4,062,183 | 3,941,656 | 120,527 | 4,346,939 | 4,222,202 | 124,737 |
| G + C content (%) | 52.2 | 52.2 | 52.2 | 52.4 | 46.5 | 52.1 | 52.3 | 46.6 |
| Total genes | 4,240 | 4,087 | 4,077 | 3,964 | 113 | 4,288 | 4,174 | 114 |
| Pseudogenes | 4 | 87 | 89 | 89 | – | 109 | 105 | 4 |
| Total CDSs | 4,103 | 3,949 | 3,939 | 3,826 | 113 | 4,113 | 3,999 | 114 |
| CDSs with protein | 3,883 | 3,862 | 3,850 | 3,737 | 113 | 4,004 | 3,894 | 110 |
| Hypothetical proteins | 3,483 | 690 | 682 | 629 | 53 | 678 | 627 | 51 |
| rRNAs | 2 | 2 | 2 | 2 | – | 39 | 13–13–13 (5S–16S–23S) | – |
| ncRNAs | 4 | 4 | 4 | 4 | – | 4 | 4 | – |
| tmRNAs | 1 | 1 | 1 | 1 | – | 1 | 1 | – |
| Regulatory | – | 23 | 23 | 23 | – | 38 | 38 | – |
| tRNA | 131 | 131 | 131 | 131 | – | 131 | 131 | – |
| CRISPR arrays | 1 | 1 | 1 | 1 | – | 1 | 1 | – |
| Transposases | 15 | 71 | 71 | 63 | 8 | 203 | 191 | 12 |
Annotations were obtained using PGAP version 2021-11-29 build5742 (GenBank accession number indicated between brackets).
The de novo assembled genome sequence of A. acidoterrestris DSM 3922T was analyzed against the previously available draft genome (Shemesh et al. 2013) to corroborate the nucleotide assembly obtained and evaluate the knowledge increment provided by this study. For that, the contigs from the draft genome were mapped to the de novo assembly using Minimap2 (Li, 2018). The mapped draft contigs that overlapped were merged and manually trimmed, which reduced the number of draft contigs from the initial 207 to 175 (Table 1, Supplementary Table 2). This work describes a plasmid in the A. acidoterrestris DSM 3922T genome for the first time. However, sequences that belong to plasmid pDSM3922 were already present in the draft genome. A total of 16 contigs from the draft genome, which could be further merged into 9 contigs (Table 1, Supplementary Table 2), mapped to this plasmid. Nonetheless, over 4 kbp from the plasmid are missing in the draft genome, which previously hindered its circularization and completion, unlike what is presented in this study. The remaining 166 merged contigs (originating from 191 contigs) mapped to the chromosome, revealing that more than 280 kbp were missing in the draft genome (Table 1, Supplementary Table 2). The hybrid assembly strategy performed in this work enabled not only the reconstitution of 2 circular genomic sequences (chromosome and plasmid) but also the addition of nucleotide regions that had not previously been elucidated in the draft genome and that are now fully accessible.
Any occasional nucleotide divergences observed between the merged contigs from the draft genome and the complete A. acidoterrestris DSM 3922T genome were corroborated by the high coverage provided by the short reads obtained. Therefore, the mapping, comparison, and corroboration performed fully sustain the structure and the sequence of the obtained assembly. These findings support that the application of long-read sequencing platforms can solve unassembled regions when using Illumina-only assemblies (Wick et al. 2017). Nonetheless, the manual curation of assembly results retrieved from different sequencing and assembly strategies should still be considered a practical and advantageous approach due to the challenges that next-generation sequencing platforms still encounter and the variety of available pipelines with alternative assembly algorithms (e.g. Unicyler, Flye, SPAdes) (Bankevich et al. 2012; De Maio et al. 2019).
Genome annotation
The annotation of the de novo assembled genome sequence of A. acidoterrestris DSM 3922T was performed with the PGAP pipeline (Tatusova et al. 2016) (Table 1). In this complete genome, a total of 4,288 genes were identified, from which 4,113 were recognized as coding sequences (CDSs) and 4,004 of them codifying for proteins. Besides the short- and long-read sequencings supporting the existence of a plasmid, the genome annotation confirms it. Homologs of rep and par genes, required for rolling circle plasmid replication and the control of plasmid partition, respectively (Thomas et al. 2017), were identified in plasmid pDSM3922, with genes parA, parB, parM, and a sequence that codes for a replication-relaxation family of proteins, being consequently annotated.
To thoroughly and consistently compare the annotations from the complete genome, all 207 contigs of the draft genome, as well as the 175 newly merged contigs, a reannotation of all contigs was performed with the same PGAP pipeline (Table 1). Regarding the draft genome, the reannotation updated the number of genes from 4,240 to 4,087 and the number of CDSs from 4,103 to 3,949 CDSs. With the merging of contigs, and the removal of overlapping sequences, the annotations were further updated, and the number of genes and CDSs reduced to 4,077 and 3,939, respectively. Moreover, a significant reduction (∼80%) of the amount of CDSs annotated as hypothetical proteins were observed between the draft genome contigs and the complete genome sequence, as it was now possible to functionally annotate the majority of sequences (3,312 functionally annotated CDSs with protein out of a total of 4,004, where 678 remain hypothetical proteins), considering the Protein Family Model collection (Tatusova et al. 2016).
The mapping of the merged contigs to the complete genome revealed that the gaps that remain, extending in length between 25 and 7,469 bp in both chromosome and plasmid sequences, mainly contain transposase/integrase sequences (Supplementary Table 3). The high abundance of mobile genetic elements present along the complete genome (224 CDSs including transposases, integrases, and recombinases) suggests an increased plasticity and adaptability, similar to what was previously observed for the Alicyclobacillus acidocaldarius DSM 466T type strain (Mavromatis et al. 2010). These remaining gaps also accommodate additional genes that were absent in the draft genome contigs, including genes related to spore germination (e.g. spore germination protein Ger(x)C), genome replication (e.g. DNA polymerase IV), or gene expression regulation (e.g. helix-turn-helix domain-containing proteins). In addition, 12 repetitions of rRNA genes (Supplementary Table 3) were also identified, which can be relevant for phylogenetic analysis as it was previously demonstrated that variations in multiple copies of the 16S rRNA genes of bacteria could support a proper species identification, namely within the Firmicutes phylum (Ibal et al. 2019).
Further functional analyses of the predicted 4,004 CDSs with protein enabled the attribution of KEGG orthology (KO) identifiers (Kanehisa et al. 2016) to about 78% of sequences, and the classification of 86% of sequences within functional categories of Clusters of Orthologous Groups of proteins (COGs) (Supplementary Tables 4–6) (Galperin et al. 2019). KEGG annotation server Blast KEGG Orthology And Links Annotation (BlastKOALA) enabled the identification of KO identifiers co-related with all the 22 functional categories defined on the database, including energy and lipid metabolism, environmental information processing, metabolism of cofactors and vitamins, and metabolism of terpenoids and polyketides (Supplementary Table 5) (Kanehisa et al. 2016). As expected, most annotated proteins were correlated with essential cellular functions such as genetic information processing, signaling and cellular processes, and carbohydrate and amino acid metabolism (Supplementary Table 5). Using the COG database, it was possible to identify protein-coding sequences from 19 different functional categories (Fig. 2, Supplementary Table 6), covering all COG classes: (1) cellular processes and signaling, (2) information storage and processing, (3) metabolism, and (4) poorly characterized. Categories correlated to essential functions as genome replication, recombination, and repair (L), transcription (K), translation, including ribosome structure and biogenesis (J), amino acid transport and metabolism (I), and carbohydrate metabolism and transport (G) were predominant, corresponding to ∼60% of identified proteins, which was expected considering previous studies (Konstantinidis and Tiedje 2004; Datta et al. 2020). Other relevant functional categories, including protein sequences related to the A. acidoterrestris DSM 3922T ability to resist extreme environments (e.g. multidrug resistance efflux pumps, type II secretion system proteins), were also identified. Coding sequences involved in sporulation and germination mechanisms, which are key processes for ACB stress resistance ability, were further explored. In this work, over 70 different sequences encoding for germination and spores’ production-related proteins were annotated (Supplementary Table 6). These sequences include cotJ operon proteins (CotJA and CotJB), required for the normal formation of the inner layers of the spores’ coat (Henriques et al. 1995), a variety of Spo proteins, correlated with all 5 key stages of sporulation, and transcriptional regulators as GerE, that direct the transcription of several genes (e.g. cotX) responsible for structural components of the protein coat encasing mature spore (Riley et al. 2020).
Fig. 2.

Classification of the clusters of orthologous groups of proteins (COGs) by functional categories of A. acidoterrestris DSM 3922T. Data retrieved from eggNOG-mapper (Huerta-Cepas et al. 2019; Cantalapiedra et al. 2021) corresponding to the analysis of the chromosome and plasmid pDSM3922. One-letter abbreviations were used for the functional categories: C, energy production and conversion; D, cell division and chromosome partitioning; E, amino acid metabolism and transport; F, nucleotide metabolism and transport; G, carbohydrate metabolism and transport; H, coenzyme metabolism; I, lipid metabolism; J, translation, including ribosome structure and biogenesis; K, transcription; L, replication, recombination and repair; M, cell wall structure and biogenesis and outer membrane; N, secretion, motility and chemotaxis; O, molecular chaperones and related functions; P, inorganic ion transport and metabolism; Q, secondary metabolites biosynthesis, transport, and catabolism; S, function unknown; T, signal transduction; U, intracellular trafficking, secretion, and vesicular transport; V, defence mechanisms.
The genetic information contained in the plasmid could play an essential role in A. acidoterrestris DSM 3922T adaptation to different environments. Several transcription regulators were annotated, namely, the transition state regulator AbrB previously described in Bacillus spp. as a key factor in cell survival by regulating spore formation, competence, and biofilm development (Sullivan et al. 2008). In addition, proteins involved in SOS response and DNA repair, type II secretion system proteins, and various transporters were annotated in pDSM3922 (F and U functional categories) and identified as features of interest for cell resistance. Genes related to sporulation and germination processes that were found in the chromosome were also identified in the plasmid sequence. Two spore germination proteins, one Ger(x)C family protein and a transition-state regulator (AbrB/MazE/SpoVT family of DNA-binding domain-containing proteins) were annotated, suggesting that A. acidoterrestris DSM 3922T plasmid may benefit these spore-forming bacteria during different physiological stages, probably through compensation or increase in proteins translated from the chromosome (Eijlander et al. 2016).
These results support that a detailed analysis of COG categories allows not only a detailed functional annotation of sequenced genomes but can also give invaluable information for further comparative studies of related organisms (e.g. prediction of novel functional systems and alternative forms of enzymes, identification of missing and/or undetected genes, comparison of organisms by COG functional categories) (Galperin et al. 2019).
Taints and off-flavors production
Taint-producing ACB isolates, like A. acidoterrestris DSM 3922T, are known for producing different phenolic compounds, namely guaiacol, 2,6-dichlorophenol, and 2,6-dibromophenol, that can lead to the spoilage of food products (Siegmund and Pöllinger-Zierler 2006; Sokołowska et al. 2013). Most studies related to the production of these off-flavors highlight guaiacol production, from either vanillin or vanillic acid, as the most significant spoilage concern. This compound has been identified as the most frequently produced off-flavor by ACB isolates and is usually present at higher concentrations than either halophenols (Gocmen et al. 2005). The metabolic pathways leading to guaiacol production have already been thoroughly described. Vanillin is first oxidized to vanillic acid by a vanillin dehydrogenase (Vdh) and then nonoxidatively decarboxylated to guaiacol by a vanillic acid decarboxylase (Cai et al. 2015; Gallage and Møller 2015). Recently, Wang, Liang, et al. (2021) presented transcriptomic and proteomic data from a different strain, A. acidoterrestris DSM 3923, which supports this mechanism of guaiacol production.
This work made possible the identification in the complete genome sequence of A. acidoterrestris DSM 3922T of several key genetic traits from the guaiacol production metabolic pathway, adding valuable information to the already available draft genome annotations. The first step in that conversion, when starting from vanillin, is catalyzed by the enzyme Vdh (Gallage and Møller 2015). The vanillin dehydrogenase coding gene has not been found in A. acidoterrestris genomes, but a putative vdh was recently proposed considering protein sequence identity with Bacillus subtilis strain 168 vdhT (sequence NP_388616.1) (Borriss et al. 2018; Luong et al. 2021).
In this work, all 9 annotated coding sequences belonging to the aldehyde dehydrogenase family of proteins were further explored to evaluate which could be responsible for the conversion of vanillin to vanillic acid. For that, protein sequences were analyzed with BLASTp (McGinnis and Madden 2004), to search for closely related homologs from the same phylum (Firmicutes) but belonging to different genera (Supplementary Table 7). To assist the prediction of proteins relevant for guaiacol production, and considering the previous study of Luong et al. (2021), vanillin dehydrogenase of B. subtilis 168 strain and other highly similar proteins (>99.6% pairwise identity; E-value = 0) were aligned to all 9 A. acidoterrestris DSM 3922T aldehyde dehydrogenases. Multiple sequence alignment using the MAFFT program (Katoh and Standley 2013) suggested that the aldehyde dehydrogenase annotated with the locus tag K1I37_11200 of A. acidoterrestris genome can play a relevant role in vanillin conversion to guaiacol, being responsible for an intermediate conversion to vanillic acid, by presenting the highest identity (48%) with other known vanillin dehydrogenases (Supplementary Table 8).
The following conversion step, catalyzed by a vanillic acid decarboxylase, has been previously described as being coded by the vdcBCD cluster (Luong et al. 2021). In the draft genome, the vdcC gene, coding for subunit C of this decarboxylase, was already annotated in the forward strand of contig AURB01000113. However, the complete information of the genes coding for the remaining subunits (vdcB and vdcD) from this cluster was lacking and identified as hypothetical proteins. In this work, vdcB and vdcC were successfully identified by PGAP. The automatically attributed gene names were manually adjusted from the ubi family of genes (ubiX and ubiD) to the vdc family of genes (vdcB and vdcC, respectively) for consistency purposes with the most common nomenclature within annotated nucleotide sequences from ACB species (Álvarez-Rodríguez et al. 2003; Gulmezian et al. 2007; Dekowska et al. 2018; Połaska et al. 2021), therefore facilitating comparisons in future studies. Regarding vdcD, the annotation was confirmed by sequence identity with another vdc cluster sequence already available (accession number KX453673) (Dekowska et al. 2018) and added manually. Complementing what was previously described in the draft genome, a complete vdc gene cluster is now annotated in the forward strand of A. acidoterrestris DSM 3922T chromosome between positions 3,824,306 and 3,826,581 bp. It is composed of vdcB with 597 bp (199 aa), vdcC with 1,425 bp (475 aa), and vdcD with 231 bp (77 aa) (Fig. 3). Interestingly, homologs of vdcB (59% identity) and vdcC (41% identity) were found in another region of the chromosome, in the forward strand between 1,872,853 and 1,874,963 bp (Fig. 3). However, the vdcD gene is missing, this vdcC gene has a frameshift, and both genes found (vdcB and vdcC) are not in the same order as the proposed vdcBCD cluster, which questions the function of these homologs in contributing to guaiacol production and acting as an actual cluster (Fig. 3).
Fig. 3.
Vanillic acid decarboxylase-related genes annotated in the chromosome of A. acidoterrestris DSM 3922T: a) Complete vdc gene cluster; b) Incomplete vdc gene cluster, where “*” indicates the presence of a frameshift mutation.
Besides the complete genome of A. acidoterrestris DSM 3922T described here, only 2 other completely sequenced reference genomes from verified Alicyclobacillus species are publicly represented on the Genome resource page at NCBI. The 2 available reference genomes, A. acidocaldarius DSM 446 (Accession: GCF_000024285.1) (Mavromatis et al. 2010) and Alicyclobacillus mengaziensis S30H14 (Accession: GCF_017298635.1) (Jiang et al. 2022), were used to compare the taint production potential of these strains. Unsurprisingly, similar nucleotide sequences to the vdc gene cluster were not found in the A. acidocaldarius genome sequence, as this isolate is not described as a guaiacol producer (Van Luong et al. 2019). Currently, there is no description of the ability of A. mengaziensis to produce guaiacol (Jiang et al. 2022), but the absence of nucleotide sequences in the complete genome with similarity to genes from the vdc cluster suggests that this isolate is not a producer of that off-flavor either.
In the transcriptomic and proteomic data published by Wang, Liang, et al. (2021), additional genes with altered expression levels were identified as related to guaiacol production. The information provided by that work enabled the annotation of sequences surrounding the vdc gene cluster, previously identified as hypothetical proteins (Shemesh et al. 2013). These coding sequences have now their products identified and annotated in the complete genome. They include, downstream of the vdc gene cluster, a pyridoxamine 5′-phosphate oxidase family protein (locus tag K1I37_18815), identified as the second most highly upregulated gene during guaiacol production, and, upstream of the vdc gene cluster, the transcriptional regulator LysR (locus tag K1I37_18795), described as positively regulating the vdc gene cluster. In addition, the benzoate/H(+) symporter BenE (locus tag K1I37_18790), involved in guaiacol excretion from cells (Wang, Liang, et al. 2021), has also been identified. These sequences were compared with the hypothetical proteins previously annotated in the draft genome and only the benzoate/H(+) symporter BenE coding sequence was adjusted in size from 1,260 to 1,254 bp, losing 2 amino acids at the N-terminus (420 aa to 418 aa).
Lastly, no homologs of the genes involved in the pathways leading to vanillin production, from either ferulic acid or vanillyl alcohol (Nguyen et al. 2011; Graf et al. 2016), were identified in the genome. BLAST searches using phenolic acid decarboxylase (BSU_34400) (Borriss et al. 2018) and vanillyl-alcohol oxidase (PDIP_52730) (Marcet-Houben et al. 2012) reference sequences, responsible for the conversion of ferulic acid and vanillyl alcohol to vanillin, respectively, were absent from A. acidoterrestris DSM 3922T genome sequence.
In addition, protein-coding sequences related to the production of halophenols were also explored. It was possible to identify 13 sequences annotated in A. acidoterrestris DSM 3922T chromosome as alpha/beta hydrolases, 12 more than in the draft genome. Some might contribute to the degradation of aromatic compounds when considering their alignment with other hydrolases from Bacillus spp. (e.g. 51% identity with the putative hydrolase MhqD from B. subtilis strain 168, sequence NP_389837.1), but their specificity as bromoperoxidases is uncertain (Borriss et al. 2018). Compared with the extensive information available on guaiacol production by ACB isolates, it is evident that the mechanisms behind halophenols’ production are lacking and should be further explored. The availability of the complete genome sequence of A. acidoterrestris DSM 3922T described here can be helpful to advance the functional analysis of off-flavor production.
Biotechnological applications
The complete genome annotation performed in this work also provides information regarding A. acidoterrestris genomic features that might be of interest to medicine, food, and biotechnological applications. Enzymes that are currently relevant for distinct markets, such as pharmaceuticals, detergents and textiles, food and beverages, biofuels, and chemicals (Adrio and Demain 2014), were searched along A. acidoterrestris DSM 3922T genome. Coding sequences with promising protein products including 4 lipases, 22 esterases, 101 hydrolases, 4 peroxidases, 28 oxidases, and 267 transferases were identified. A superoxide dismutase (locus tag K1I37_07655) that might have the potential to be used as a food additive, similarly to the one produced by A. acidocaldarius previously studied by Dong et al. (2021), was also identified.
In the last decades, several enzymes from ACB species have been explored due to their increased stability and activity at different temperatures and pH values when compared to other commonly used enzymes. A. acidocaldarius, for instance, has been previously used as a thermoacidophilic model organism for molecular and biochemical studies of its enzymes (Di Lauro et al. 2006), and Alicyclobacillus acidiphilus was already investigated regarding its β-glucosidase potential to hydrolyze glucovanillin to vanillin and other flavor compounds commonly found in vanilla (Delgado et al. 2021), conserving its activity at higher operating temperatures. Acidophilic enzymes, such as the ones produced by ACB species, have numerous applications, particularly in the production of sugar syrups from starch, and bioremediation (Mesbah 2022). Undoubtedly, exploring all these extremophile-derived enzymes can reveal different alternatives for a broad range of medical, food, and biotechnological applications, and functional annotations like the one described in this work can provide valuable information so promising sequences can be identified, de novo synthesized, codon-optimized, cloned, and heterologously expressed to reach large scale production (Karan et al. 2012; Littlechild 2015; Mesbah 2022).
The presence of biosynthetic gene clusters was investigated using the antiSMASH database (Blin et al. 2017). This database enables the identification of gene clusters involved in secondary metabolite production, known as the primary source of bioactive compounds with interest for medical or biotechnological applications (Blin et al. 2017). As expected, and considering previously identified biosynthetic gene clusters in distinct ACB and Bacillus species (Hamdache et al. 2011; Ideno et al. 2018; Wang, Liang, et al. 2021), 16 regions of interest were identified in the chromosome of A. acidoterrestris DSM 3922T that are involved in the production of saccharides, fatty acids, terpenes, and beta-lactones (Supplementary Table 9). These identified clusters may be explored to efficiently produce metabolites relevant either for biotechnological, cosmetic, or medicinal applications (e.g. antioxidant and anti-inflammatory properties of terpenes) (Helfrich et al. 2019; Del Prado-Audelo et al. 2021; Catinella et al. 2022).
In addition, a repeated region associated with the CRISPR/Cas (clustered regularly interspaced short palindromic repeats and CRISPR-associated protein) system was also identified in the complete genome sequence of A. acidoterrestris DSM 3922T. This CRISPR array is preceded by 3 different Cas enzymes (Cas2, Cas1, and Cas12b) and can be found in other ACB species (Mavromatis et al. 2010). These enzymes have been explored as promising alternatives to CRISPR/Cas9 systems since they are derived from thermoacidophilic bacteria and can maintain their nuclease activity under extreme conditions. This increased thermal stability and robustness led different authors to explore ACB-derived Cas enzymes for plant genome editing, mammalian genome editing in human and mouse cells, and viral genome detection, thus presenting valuable features for different promising medical and biotechnological applications (Teng et al. 2018; Ali et al. 2020; Ming et al. 2020).
Safety-related features
Despite the ability of different ACB isolates to cause serious spoilage issues on different food matrices, no health and safety concerns have yet been related. In addition, previous studies, where A. acidoterrestris spores were directly injected into mice or spore-inoculated juices were fed to guinea pigs, revealed no pathogenic potential in the animal models tested, with no death or illness symptoms observed (Walls and Chuyate 2000). Nevertheless, genomic features of A. acidoterrestris DSM 3922T that could be linked to health risks or impact food safety were assessed in this study. Antibiotic resistance determinants and acquired resistance genes were explored using ResFinder, ResFinderFG, and KmerResistance databases (Clausen et al. 2016; Bortolaia et al. 2020). ResFinder and ResFinderFG analyses were performed testing all available parameters, including the minimum percentages for length and identity thresholds (20% and 30%, respectively), without identifying any acquired antibiotic resistance genes or determinants. Similarly, no acquired antibiotic resistance genes were identified using the KmerResistance database with different scoring methods, host databases, and minimum available percentages for identity and depth correlation thresholds (10% and 5%, respectively). Furthermore, the absence of any virulence genes related to toxin production was confirmed using all the available data from different species on the VirulenceFinder database (Tetzschner et al. 2020). These results support the characterization of A. acidoterrestris DSM 3922T as nonpathogenic bacteria that may cause issues in the quality of food products but are not a food safety hazard (Pornpukdeewattana et al. 2020).
Supplementary Material
Contributor Information
Inês Carvalho Leonardo, Food & Health Division, iBET, Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2781-901 Oeiras, Portugal; ITQB-NOVA, Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Avenida da República, 2780-157 Oeiras, Portugal.
Maria Teresa Barreto Crespo, Food & Health Division, iBET, Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2781-901 Oeiras, Portugal; ITQB-NOVA, Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Avenida da República, 2780-157 Oeiras, Portugal.
Frédéric Bustos Gaspar, Food & Health Division, iBET, Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2781-901 Oeiras, Portugal; ITQB-NOVA, Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Avenida da República, 2780-157 Oeiras, Portugal.
Data Availability
The data underlying this article are available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov and can be accessed with the accession numbers CP080467 and CP080468, assigned to the sequences of A. acidoterrestris DSM 3922T chromosome and plasmid pDSM3922, respectively. The associated BioProject, BioSample, and SRA numbers are PRJNA751022, SAMN20503121, and SRR17245873 for long reads and SRR17245874 for short reads, respectively.
Supplemental material is available at G3 online.
Funding
ICL acknowledges Fundação para a Ciência e a Tecnologia (FCT) for the financial support received through PhD scholarship 2020.08210.BD. Funding from INTERFACE Programme, through the Innovation, Technology and Circular Economy Fund (FITEC), is gratefully acknowledged, as well as iNOVA4Health—UIDB/04462/2020 and UIDP/04462/2020 and LS4FUTURE (LA/P/0087/2020), programs financially supported by FCT and Ministério da Ciência, Tecnologia e Ensino Superior. The authors also acknowledge the support of INCD funded by FCT and ERDF under project 01/SAICT/2016 n° 022153.
Conflicts of interest
None declared.
Literature cited
- Adrio J, Demain A.. Microbial enzymes: tools for biotechnological processes. Biomolecules. 2014;4(1):117–139. doi: 10.3390/biom4010117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ali Z, Aman R, Mahas A, Rao GS, Tehseen M, Marsic T, Salunke R, Subudhi AK, Hala SM, Hamdan SM, et al. iSCAN: an RT-LAMP-coupled CRISPR-Cas12 module for rapid, sensitive detection of SARS-CoV-2. Virus Res. 2020;288:198129. doi: 10.1016/j.virusres.2020.198129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Álvarez-Rodríguez ML, Belloch C, Villa M, Uruburu F, Larriba G, Coque JJR.. Degradation of vanillic acid and production of guaiacol by microorganisms isolated from cork samples. FEMS Microbiol Lett. 2003;220(1):49–55. doi: 10.1016/S0378-1097(03)00053-3. [DOI] [PubMed] [Google Scholar]
- Bahçeci KS, Acar J.. Modeling the combined effects of pH, temperature and ascorbic acid concentration on the heat resistance of Alicyclobacillus acidoterrestis. Int J Food Microbiol. 2007;120(3):266–273. doi: 10.1016/j.ijfoodmicro.2007.09.004. [DOI] [PubMed] [Google Scholar]
- Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bevilacqua A, Campaniello D, Speranza B, Sinigaglia M, Corbo MR.. Control of Alicyclobacillus acidoterrestris in apple juice by citrus extracts and a mild heat-treatment. Food Control. 2013;31(2):553–559. doi: 10.1016/j.foodcont.2012.12.014. [DOI] [Google Scholar]
- Blin K, Medema MH, Kottmann R, Lee SY, Weber T.. The antiSMASH database, a comprehensive database of microbial secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. 2017;45(D1):D555–D559. doi: 10.1093/nar/gkw960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolger AM, Lohse M, Usadel B.. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Borriss R, Danchin A, Harwood CR, Médigue C, Rocha EPC, Sekowska A, Vallenet D.. Bacillus subtilis, the model Gram-positive bacterium: 20 years of annotation refinement. Microb Biotechnol. 2018;11(1):3–17. doi: 10.1111/1751-7915.13043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bortolaia V, Kaas RS, Ruppe E, Roberts MC, Schwarz S, Cattoir V, Philippon A, Allesoe RL, Rebelo AR, Florensa AF, et al. ResFinder 4.0 for predictions of phenotypes from genotypes. J Antimicrob Chemother. 2020;75(12):3491–3500. doi: 10.1093/jac/dkaa345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai R, Yuan Y, Wang Z, Guo C, Liu B, Liu L, Wang Y, Yue T.. Precursors and metabolic pathway for guaiacol production by Alicyclobacillus acidoterrestris. Int J Food Microbiol. 2015;214:48–53. doi: 10.1016/j.ijfoodmicro.2015.07.028. [DOI] [PubMed] [Google Scholar]
- Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J.. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. BioRxiv. 2021;5. doi: 10.1101/2021.06.03.446934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Catinella G, Donzella S, Borgonovo G, Dallavalle S, Contente ML, Pinto A.. Efficient 2-step enzymatic cascade for the bioconversion of oleuropein into hydroxytyrosol. Antioxidants. 2022;11(2):260. doi: 10.3390/antiox11020260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clausen PTLC, Zankari E, Aarestrup FM, Lund O.. Benchmarking of methods for identification of antimicrobial resistance genes in bacterial whole genome data. J Antimicrob Chemother. 2016;71(9):2484–2488. doi: 10.1093/jac/dkw184. [DOI] [PubMed] [Google Scholar]
- Correa-llantén DN, Amenábar MJ, Muñoz PA, Monsalves MT, Castro ME, Blamey JM.. Alicyclobacillus sp. strain CC2, a thermo-acidophilic bacterium isolated from Deception Island (Antarctica) containing a thermostable superoxide dismutase enzyme. Adv Polar Sci. 2014;25(2):92–96. doi: 10.13679/j.advps.2014.2.00092. [DOI] [Google Scholar]
- da Costa MS, Rainey FA, Albuquerque L.. Genus I. Alicyclobacillus. In: Bergey’s Manual of Systematic Bacteriology. Vol 3. The Firmicutes. New York (NY: ): Springer; 2009. p. 229–243. [Google Scholar]
- Danyluk MD, Friedrich LM, Jouquand C, Goodrich-Schneider R, Parish ME, Rouseff R.. Prevalence, concentration, spoilage, and mitigation of Alicyclobacillus spp. in tropical and subtropical fruit juice concentrates. Food Microbiol. 2011;28(3):472–477. doi: 10.1016/j.fm.2010.10.008. [DOI] [PubMed] [Google Scholar]
- Datta S, Saha D, Chattopadhyay L, Majumdar B.. Genome comparison identifies different Bacillus species in a bast fibre-retting bacterial consortium and provides insights into pectin degrading genes. Sci Rep. 2020;10(1):1–15. doi: 10.1038/s41598-020-65228-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Maio N, Shaw LP, Hubbard A, George S, Sanderson ND, Swann J, Wick R, Oun MA, Stubberfield E, Hoosdally SJ, et al. ; on behalf of the REHAB consortium. Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes. Microbial Genomics. 2019;5(9):294–305. doi: 10.1099/mgen.0.000294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dekowska A, Niezgoda J, Sokołowska BS.. Genetic heterogeneity of Alicyclobacillus strains revealed by RFLP analysis of VDC region and rpoB gene. Biomed Res Int. 2018;2018:9608756. doi: 10.1155/2018/9608756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Del Prado-Audelo ML, Cortés H, Caballero-Florán IH, González-Torres M, Escutia-Guadarrama L, Bernal-Chávez SA, Giraldo-Gomez DM, Magaña JJ, Leyva-Gómez G.. Therapeutic applications of terpenes on inflammatory diseases. Front Pharmacol. 2021;12(8):704197. doi: 10.3389/fphar.2021.704197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delgado L, Heckmann CM, De Benedetti S, Nardini M, Gourlay LJ, Paradisi F.. Producing natural vanilla extract from green vanilla beans using a β-glucosidase from Alicyclobacillus acidiphilus. J Biotechnol. 2021;329:21–28. doi: 10.1016/j.jbiotec.2021.01.017. [DOI] [PubMed] [Google Scholar]
- Di Lauro B, , RossiM, , Moracci M.. Characterization of a beta-glycosidase from the thermoacidophilic bacterium Alicyclobacillus acidocaldarius. Extremophiles. 2006;10(4):301–310. doi: 10.1007/s00792-005-0500-1. [DOI] [PubMed] [Google Scholar]
- Dong X, Wang W, Li S, Han H, Lv P, Yang C.. Thermoacidophilic Alicyclobacillus superoxide dismutase: good candidate as additives in food and medicine. Front Microbiol. 2021;12(3):577001–577011. doi: 10.3389/fmicb.2021.577001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eijlander RT, Holsappel S, Jong A d, Ghosh A, Christie G, Kuipers OP.. SpoVT: from fine-tuning regulator in Bacillus subtilis to essential sporulation protein in Bacillus cereus. Front Microbiol. 2016;7(10):1607–1611. doi: 10.3389/fmicb.2016.01607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gallage NJ, Møller BL.. Vanillin-bioconversion and bioengineering of the most popular plant flavor and its de novo biosynthesis in the vanilla orchid. Mol Plant. 2015;8(1):40–57. doi: 10.1016/j.molp.2014.11.008. [DOI] [PubMed] [Google Scholar]
- Galperin MY, Kristensen DM, Makarova KS, Wolf YI, Koonin EV.. Microbial genome analysis: the COG approach. Brief Bioinform. 2019;20(4):1063–1070. doi: 10.1093/bib/bbx117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gocmen D, Elston A, Williams T, Parish M, Rouseff RL.. Identification of medicinal off-flavours generated by Alicyclobacillus species in orange juice using GC-olfactometry and GC-MS. Lett Appl Microbiol. 2005;40(3):172–177. doi: 10.1111/j.1472-765X.2004.01636.x. [DOI] [PubMed] [Google Scholar]
- Goto K, Mochida K, Asahara M, Suzuki M, Kasai H, Yokota A.. Alicyclobacillus pomorum sp. nov., a novel thermo-acidophilic, endospore-forming bacterium that does not possess ω-alicyclic fatty acids, and emended description of the genus Alicyclobacillus. Int J Syst Evol Microbiol. 2003;53(Pt 5):1537–1544. doi: 10.1099/ijs.0.02546-0. [DOI] [PubMed] [Google Scholar]
- Graf N, Wenzel M, Altenbuchner J.. Identification and characterization of the vanillin dehydrogenase YfmT in Bacillus subtilis 3NA. Appl Microbiol Biotechnol. 2016;100(8):3511–3521. doi: 10.1007/s00253-015-7197-6. [DOI] [PubMed] [Google Scholar]
- Grant JR, Stothard P.. The CGView Server: a comparative genomics tool for circular genomes. Nucleic Acids Res. 2008;36(Web Server):W181–W184. doi: 10.1093/nar/gkn179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gulmezian M, Hyman KR, Marbois BN, Clarke CF, Javor GT.. The role of UbiX in Escherichia coli coenzyme Q biosynthesis. Arch Biochem Biophys. 2007;467(2):144–153. doi: 10.1016/j.abb.2007.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haft DH, DiCuccio M, Badretdin A, Brover V, Chetvernin V, O'Neill K, Li W, Chitsaz F, Derbyshire MK, Gonzales NR, et al. RefSeq: an update on prokaryotic genome annotation and curation. Nucleic Acids Res. 2018;46(D1):D851–D860. doi: 10.1093/nar/gkx1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hamdache A, Lamarti A, Aleu J, Collado IG.. Non-peptide metabolites from the genus Bacillus. J Nat Prod. 2011;74(4):893–899. doi: 10.1021/np100853e. [DOI] [PubMed] [Google Scholar]
- Helfrich EJN, Lin GM, Voigt CA, Clardy J.. Bacterial terpene biosynthesis: challenges and opportunities for pathway engineering. Beilstein J Org Chem. 2019;15:2889–2906. doi: 10.3762/bjoc.15.283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henriques AO, Beall BW, Roland K, Moran CP.. Characterization of cotJ, a σ(E)-controlled operon affecting the polypeptide composition of the coat of Bacillus subtilis spores. J Bacteriol. 1995;177(12):3394–3406. doi: 10.1128/jb.177.12.3394-3406.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu X, Huang E, Barringer SA, Yousef AE.. Factors affecting Alicyclobacillus acidoterrestris growth and guaiacol production and controlling apple juice spoilage by lauric arginate and ϵ-polylysine. LWT. 2020;119(11):108883. doi: 10.1016/j.lwt.2019.108883. [DOI] [Google Scholar]
- Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, Mende DR, Letunic I, Rattei T, Jensen LJ, et al. EggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47(D1):D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ibal JC, Pham HQ, Park CE, Shin JH.. Information about variations in multiple copies of bacterial 16S rRNA genes may aid in species identification. PLoS One. 2019;14(2):e0212090. doi: 10.1371/journal.pone.0212090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ideno N, Umeyama S, Watanabe T, Nakajima M, Sato T, Hoshino T.. Alicyclobacillus acidocaldarius squalene-hopene cyclase: the critical role of steric bulk at Ala306 and the first enzymatic synthesis of epoxydammarane from 2,3-oxidosqualene. Chembiochem. 2018;19(17):1873–1886. doi: 10.1002/cbic.201800281. [DOI] [PubMed] [Google Scholar]
- Jiang Z, Wu D, Liang ZL, Li XT, Huang Y, Zhou N, Liu ZH, Zhang GJ, Jia Y, Yin HQ, et al. Alicyclobacillus curvatus sp. nov. and Alicyclobacillus mengziensis sp. nov., two acidophilic bacteria isolated from acid mine drainage. Int J Syst Evol Microbiol. 2022;72(3):1–8. doi: 10.1099/ijsem.0.005285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kanehisa M, Sato Y, Morishima K.. BlastKOALA and GhostKOALA : KEGG tools for functional characterization of genome and metagenome sequences. J Mol Biol. 2016;428(4):726–731. doi: 10.1016/j.jmb.2015.11.006. [DOI] [PubMed] [Google Scholar]
- Karan R, Capes MD, DasSarma S.. Function and biotechnology of extremophilic enzymes in low water activity. Aquat Biosyst. 2012;8(1):4. doi: 10.1186/2046-9063-8-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM.. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kolmogorov M, Yuan J, Lin Y, Pevzner PA.. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol. 2019;37(5):540–546. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
- Konstantinidis KT, Tiedje JM.. Trends between gene content and genome size in prokaryotic species with larger genomes. Proc Natl Acad Sci U S A. 2004;101(9):3160–3165. doi: 10.1073/pnas.0308653100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W, O'Neill KR, Haft DH, DiCuccio M, Chetvernin V, Badretdin A, Coulouris G, Chitsaz F, Derbyshire MK, Durkin AS, et al. RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation. Nucleic Acids Res. 2021;49(D1):D1020–D1028. doi: 10.1093/nar/gkaa1105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Littlechild JA. Enzymes from extreme environments and their industrial applications. Front Bioeng Biotechnol. 2015;3(10):161–169. doi: 10.3389/fbioe.2015.00161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luong TSV, Moir C, Bowman JP, Chandry PS.. Heat resistance and genomics of spoilage Alicyclobacillus spp. isolated from fruit juice and fruit-based beverages. Food Microbiol. 2021;94(3):103662. doi: 10.1016/j.fm.2020.103662. [DOI] [PubMed] [Google Scholar]
- Marcet-Houben M, Ballester AR, de la Fuente B, Harries E, Marcos JF, González-Candelas L, Gabaldón T.. Genome sequence of the necrotrophic fungus Penicillium digitatum, the main postharvest pathogen of citrus. BMC Genomics. 2012;13(1):646. doi: 10.1186/1471-2164-13-646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mavromatis K, Sikorski J, Lapidus A, del Rio TG, Copeland A, Tice H, Cheng JF, Lucas S, Chen F, Nolan M, et al. Complete genome sequence of Alicyclobacillus acidocaldarius type strain (104-IA T). Stand Genomic Sci. 2010;2(1):9–18. doi: 10.4056/sigs.591104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McGinnis S, Madden TL.. BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res. 2004;32(Web Server):W20–W25. doi: 10.1093/nar/gkh435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merle J, Montville TJ.. Alicyclobacillus acidoterrestris: the organism, the challenge, potential interventions. J Food Process Preserv. 2014;38(1):153–158. doi: 10.1111/j.1745-4549.2012.00758.x. [DOI] [Google Scholar]
- Mesbah NM. Industrial biotechnology based on enzymes from extreme environments. Front Bioeng Biotechnol. 2022;10(4):870083–870016. doi: 10.3389/fbioe.2022.870083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ming M, Ren Q, Pan C, He Y, Zhang Y, Liu S, Zhong Z, Wang J, Malzahn AA, Wu J, et al. CRISPR–Cas12b enables efficient plant genome engineering. Nat Plants. 2020;6(3):202–208. doi: 10.1038/s41477-020-0614-6. [DOI] [PubMed] [Google Scholar]
- Murphy J, Ryan MP, Walsh G.. Purification and characterization of a novel β-galactosidase from the thermoacidophile Alicyclobacillus vulcanalis. Appl Biochem Biotechnol. 2020;191(3):1190–1206. doi: 10.1007/s12010-020-03233-w. [DOI] [PubMed] [Google Scholar]
- Nguyen TKC, Tran NP, Cavin JF.. Genetic and biochemical analysis of padR-padC promoter interactions during the phenolic acid stress response in Bacillus subtilis 168. J Bacteriol. 2011;193(16):4180–4191. doi: 10.1128/JB.00385-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niwa M. Control of hazardous bacteria in acidic beverages by using a guaiacol detection kit (peroxidase method). Fruit Process. 2005;15:388–392. [Google Scholar]
- Połaska M, Dekowska A, Sokolowska B.. Isolation and identification of guaiacol-producing Alicyclobacillus fastidiosus strains from orchards in Poland. Acta Biochim Pol. 2021;68(2):301–307. doi: 10.18388/abp.2020_5574. [DOI] [PubMed] [Google Scholar]
- Porębska I, Sokołowska B, Skąpska S, Rzoska SJ.. Treatment with high hydrostatic pressure and supercritical carbon dioxide to control Alicyclobacillus acidoterrestris spores in apple juice. Food Control. 2017;73:24–30. doi: 10.1016/j.foodcont.2016.06.005. [DOI] [Google Scholar]
- Pornpukdeewattana S, Jindaprasert A, Massa S.. Alicyclobacillus spoilage and control—a review. Crit Rev Food Sci Nutr. 2020;60(1):108–122. doi: 10.1080/10408398.2018.1516190. [DOI] [PubMed] [Google Scholar]
- Riley EP, Schwarz C, Derman AI, Lopez-Garrido J.. Milestones in Bacillus subtilis sporulation research. Microb Cell. 2020;8(1):1–16. doi: 10.15698/MIC2021.01.739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roth K, Rana YS, Daeschel D, Kovac J, Worobo R, Snyder AB.. Alicyclobacillus mali sp. nov., Alicyclobacillus suci sp. nov. and Alicyclobacillus fructus sp. nov., thermoacidophilic sporeforming bacteria isolated from fruit beverages. Int J Syst Evol Microbiol. 2021;71(9):005016. doi: 10.1099/ijsem.0.005016. [DOI] [PubMed] [Google Scholar]
- Ruan Z, Wu J, Chen H, Draz MS, Xu J, He F.. Hybrid genome assembly and annotation of a pandrug-resistant Klebsiella pneumoniae strain using nanopore and illumina sequencing. Infect Drug Resist. 2020;13:199–206. doi: 10.2147/IDR.S240404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shemesh M, Pasvolsky R, Sela N, Green SJ, Zakin V.. Draft genome sequence of Alicyclobacillus acidoterrestris strain ATCC 49025. Genome Announc. 2013;1(5):e00638-13. doi: 10.1128/genomeA.e00638-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegmund B, Pöllinger-Zierler B.. Odor thresholds of microbially induced off-flavor compounds in apple juice. J Agric Food Chem. 2006;54(16):5984–5989. doi: 10.1021/jf060602n. [DOI] [PubMed] [Google Scholar]
- Silva FVM, Gibbs P.. Alicyclobacillus acidoterrestris spores in fruit products and design of pasteurization processes. Trends Food Sci Technol. 2001;12(2):68–74. doi: 10.1016/S0924-2244(01)00070-X. [DOI] [Google Scholar]
- Sokołowska B, Skąpska S, Fonberg-Broczek M, Niezgoda J, Chotkiewicz M, Dekowska A, Rzoska S.. The combined effect of high pressure and nisin or lysozyme on the inactivation of Alicyclobacillus acidoterrestris spores in apple juice. High Pressure Res. 2012;32(1):119–127. doi: 10.1080/08957959.2012.664642. [DOI] [Google Scholar]
- Sokołowska B, Skąpska S, Fonberg-Broczek M, Niezgoda J, Chotkiewicz M, Dekowska A, Rzoska SJ.. Factors influencing the inactivation of Alicyclobacillus acidoterrestris spores exposed to high hydrostatic pressure in apple juice. High Pressure Res. 2013;33(1):73–82. doi: 10.1080/08957959.2013.772170. [DOI] [Google Scholar]
- Sullivan DM, Bobay BG, Kojetin DJ, Thompson RJ, Rance M, Strauch MA, Cavanagh J 2009.. Insights into the nature of DNA binding of AbrB-like transcription factors. Structure. 2008;16(11):1702–1713. doi: 10.1016/j.str.2008.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki E, Zaslavsky L, Lomsadze A, Pruitt K, Borodovsky M, Ostell J.. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 2016;44(14):6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Teng F, Cui T, Feng G, Guo L, Xu K, Gao Q, Li T, Li J, Zhou Q, Li W.. Repurposing CRISPR-Cas12b for mammalian genome engineering. Cell Discov. 2018;4(1):39–83. doi: 10.1038/s41421-018-0069-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tetzschner A, Johnson J, Johnston B, Lund O, Scheutz F.. In silico genotyping of Escherichia coli isolates for extraintestinal virulence genes by use of whole-genome sequencing data. J Clin Microbiol. 2020;58(10):1–13. doi: 10.1128/JCM.01269-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas CM, Thomson NR, Cerdeño-Tárraga AM, Brown CJ, Top EM, Frost LS.. Annotation of plasmid genes. Plasmid. 2017;91(12):61–67. doi: 10.1016/j.plasmid.2017.03.006. [DOI] [PubMed] [Google Scholar]
- Van Luong TS, Moir CJ, Kaur M, Frank D, Bowman JP, Bradbury MI.. Diversity and guaiacol production of Alicyclobacillus spp. from fruit juice and fruit-based beverages. Int J Food Microbiol. 2019;311(8):108314. doi: 10.1016/j.ijfoodmicro.2019.108314. [DOI] [PubMed] [Google Scholar]
- Walls I, Chuyate R.. Spoilage of fruit juice by Alicyclobacillus acidoterrestris. Food Australia. 2000;52:286–288. [PubMed] [Google Scholar]
- Wang Z, Liang Y, Wang Q, Jia H, Yue T, Yuan Y, Gao Z, Cai R.. Integrated analysis of transcriptome and proteome for exploring the mechanism of guaiacol production by Alicyclobacillus acidoterrestris. Food Res Int. 2021;148:110621. doi: 10.1016/j.foodres.2021.110621. [DOI] [PubMed] [Google Scholar]
- Wang Z, Yue T, Yuan Y, Zhang Y, Gao Z, Cai R.. Targeting the vanillic acid decarboxylase gene for Alicyclobacillus acidoterrestris quantification and guaiacol assessment in apple juices using real time PCR. Int J Food Microbiol. 2021;338:109006. doi: 10.1016/j.ijfoodmicro.2020.109006. [DOI] [PubMed] [Google Scholar]
- Wick RR, Judd LM, Gorrie CL, Holt KE.. Completing bacterial genome assemblies with multiplex MinION sequencing. Microbial Genomics. 2017;3(10):1–7. doi: 10.1099/mgen.0.000132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wick RR, Schultz MB, Zobel J, Holt KE.. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015;31(20):3350–3352. doi: 10.1093/bioinformatics/btv383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wisotzkey JD, Jurtshuk P, Fox GE, Deinhard G, Poralla K.. Comparative sequence analyses on the 16S rRNA (rDNA) of Bacillus acidocaldarius, Bacillus acidoterrestris, and Bacillus cycloheptanicus and proposal for creation of a new genus, Alicyclobacillus gen. nov. Int J Syst Bacteriol. 1992;42(2):263–269. doi: 10.1099/00207713-42-2-263. [DOI] [PubMed] [Google Scholar]
- Zhang J, Yue T, Yuan Y.. Alicyclobacillus contamination in the production line of Kiwi products in China. PLoS One. 2013;8(7):e67704. doi: 10.1371/journal.pone.0067704. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data underlying this article are available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov and can be accessed with the accession numbers CP080467 and CP080468, assigned to the sequences of A. acidoterrestris DSM 3922T chromosome and plasmid pDSM3922, respectively. The associated BioProject, BioSample, and SRA numbers are PRJNA751022, SAMN20503121, and SRR17245873 for long reads and SRR17245874 for short reads, respectively.
Supplemental material is available at G3 online.


