Abstract
Oenococcus kitaharae is only the second member of the genus Oenococcus to be identified and is the closest relative of the industrially important wine bacterium Oenococcus oeni. To provide insight into this new species, the genome of the type strain of O. kitaharae, DSM 17330, was sequenced. Comparison of the sequenced genomes of both species show that the genome of O. kitaharae DSM 17330 contains many genes with predicted functions in cellular defence (bacteriocins, antimicrobials, restriction-modification systems and a CRISPR locus) which are lacking in O. oeni. The two genomes also appear to differentially encode several metabolic pathways associated with amino acid biosynthesis and carbohydrate utilization and which have direct phenotypic consequences. This would indicate that the two species have evolved different survival techniques to suit their particular environmental niches. O. oeni has adapted to survive in the harsh, but predictable, environment of wine that provides very few competitive species. However O. kitaharae appears to have adapted to a growth environment in which biological competition provides a significant selective pressure by accumulating biological defence molecules, such as bacteriocins and restriction-modification systems, throughout its genome.
Introduction
Oenococcus kitaharae is a lactic acid bacterium (LAB) that was recently isolated from composting distilled Shochu residue [1]. This species represents only the second member of the genus Oenococcus to be identified, with Oenococcus oeni, the founding member of this genus, being reclassified from Leuconostoc oenos by Dicks et al in 1995 [2]. Whereas little is known regarding the biology or ecology of O. kitaharae, O. oeni plays a pivotal role in the production of wine (its almost exclusive habitat) where it is responsible for performing malolactic fermentation (MLF) [3]. However, initial phenotypic comparisons would indicate that the environmental niche of O. kitaharae is very different to that of O. oeni. The two species display markedly different pH (6.0 to 6.8 versus 4.8, respectively) and temperature (30°C versus 22°C, respectively) optima and O. kitaharae is also incapable of growth in concentrations of ethanol routinely found in wine [1].
Given the importance of LAB to the food and beverage industries, it is not surprising that this group of bacteria has been the focus of extensive research, including several genome sequencing efforts. These have resulted in a broad phylogenetic genome sequencing survey of eight LAB genera (covering over 80 species) and, of particular relevance to the study of O. kitaharae, an intra-specific study of the genomes of three individual O. oeni isolates [4]–[6]. The results of these preliminary comparative genomic studies indicated that the LAB group harbours extensive genetic variation, such that even within single species such as O. oeni, coding potential can be over 10% different between any two strains [6].
In order to expand our understanding of the LAB group and to provide a point of comparison for understanding the genome dynamics of O. oeni, we have sequenced the genome of the O. kitaharae type strain DSM 17330 [1]. Comparisons between the Oenococcus genomes uncovered several genetic differences that have the potential to translate into important points of inter-specific phenotypic differentiation. These include several major metabolic differences such as the ability to ferment maltose, citrate and malate and the ability to synthesize specific amino acids such as L-arginine and L-histidine. In addition to these metabolic differences, the O. kitaharae genome also encodes many proteins involved in defence against both bacteriophage (restriction-modification and CRISPR) and other microorganisms (bacteriocins), and has had its genome populated by at least two conjugative transposons, which is in contrast to currently available genome sequences of O. oeni which lack the vast majority of these defence proteins. It therefore appears that the genome of O. kitaharae has been shaped by its need to survive in a competitive growth environment that is vastly different from that encountered by O. oeni, where environmental stresses provide the greatest challenge to growth and reproduction.
Results
The O. kitaharae DSM 17330 genome was assembled from 1×106 Illumina paired-end reads (500 bp spacing) into 17 contigs, comprising 12 single-copy contigs in addition to five contigs that were present in two copies each (with three of these having 100% identity). Through the application of paired-end information and the precise order of the unique sequences bounding these repeats, these 17 contigs were able to be manually arranged into two replicons (Fig. 1). The first of these is a 1.84 Mb circular chromosome which, due to the presence of a highly repetitive repeat cluster that is associated with the coding region of a serine-repeat protein, contains one assembly gap. The second replicon is predicted to be an 8.3 kb plasmid, which, on the basis of sequencing coverage, is predicted to be present in low copy number (∼2 copies per cell, data not shown).
Genome annotation using a combination of Glimmer [7] and RAST [8] identified a total of 1833 predicted open reading frames (ORFs), four rRNA genes (two copies of each of the large and small ribosomal subunits), which are identical in sequence to the previously published ribosomal sequences for O. kitaharae [1], and 44 tRNA genes (see Table S1).
Comparative genomics of O. kitaharae and LAB
While the genome size of O. kitaharae is similar to that of O. oeni PSU1 (1.84 Mb vs 1.78 Mb), initial estimates of DNA-DNA relatedness were only 25% to 30% [1]. By direct comparison of the available genome sequences of O. kitaharae and O. oeni, a similar picture emerged. As expected, the majority of the ORFs predicted in the O. kitaharae genome (62%) have their closest homolog in O. oeni supporting the classification of O. kitaharae DSM 17330 within the Oenococcus genus (Fig. 2A). Of the remaining 703 O. kitaharae ORFs, 487 have at least one match in the Genbank non-redundant protein database, with LAB such as Lactobacillus spp, Leuconostoc spp, Weissella spp and Lactococcus spp predominating (Fig. 2A), while the remainder display no recognizable homolog and either represent novel protein sequences or false positives of the ORF prediction methodology applied.
In order to compare the phylogenetic position of O. kitaharae as determined from 16S rDNA sequencing [1] with that based upon whole genome data, homologs of each of the predicted ORFs of O. kitaharae were sought from whole genome sequence annotations of thirteen strains of LAB representing the genera Enterococcus, Lactobacillus, Lactococcus, Leuconostoc, Oenococcus, Pediococcus and Weissella. A total of 561 of the O. kitaharae ORFs were shown to be conserved across all thirteen strains (BLAST e-value<10−20, minimum 50% coverage of the query protein). Of these, 95 were then selected based on high sequence conservation and lack of potential paralogous sequences (see Table S2 for a complete list of the protein sequences used). Each of the subsequent individual protein alignments produced from these conserved groups of ORFs were then concatenated and used to construct a single maximum-likelihood phylogeny (Fig. 2B). The result of this analysis was consistent with phylogenies based upon 16S rDNA [1], [9], [10] and positions O. kitaharae as a clear sister species to O. oeni with both Leuconostoc spp. and Weisella spp. being the next closest evolutionary relatives.
Chromosomal elements of “foreign” origin
The majority of the O. kitaharae genome is conserved with that of O. oeni with the exception of several large islands (Fig. 3). In all but a limited number of cases, these O. kitaharae-specific islands were also shown to lack identifiable homology with other species of LAB used in the phylogenetic constructions. In addition, the majority of these regions display a high probability of being acquired by horizontal gene transfer (HGT) [11] and would be expected to have entered the O. kitaharae genome via genetic elements such as bacteriophage or conjugative plasmids.
As previous investigations between strains of O. oeni have shown that non-conserved genomic islands can often be attributed to the differential presence of prophage elements [12], homology searches were used to identify classical prophage genes, such as those that encode conserved bacteriophage integrase and lyase proteins, in the genome of DSM 17330. While several O. kitaharae ORFs were found to be homologous to phage proteins (Table S1), each potential prophage region lacked the repertoire of proteins which would be expected for the presence of functional prophage elements [12]. In addition, in all but one case (the genomic region from 382388 bp–404717 bp), these genomic islands were not found downstream of tRNA genes as is observed for O. oeni phage, which use tRNA genes as attachment sites [12].
Whereas the non-conserved genomic islands in the O. kitaharae genome do not appear to encode active phage, two of these islands have the hallmarks of genomically-integrated conjugative transposons (Fig. 4). Conjugative transposons are DNA elements that combine features of bacteriophage, transposons and conjugative plasmids [13]. These elements are often found integrated into the genome and are able to transpose to new sites in the host genome via the formation of a circular intermediate, which can sometimes replicate as a plasmid. As the name suggests, conjugative transposons also encode the ability to be horizontally transferred to new cells (and across species boundaries) via the classical conjugation pathway [13]. As yet, there have been no documented cases of conjugative transposons found in the genome of any strain of O. oeni.
The two potential conjugative transposons of O. kitaharae share a conserved core of 12 proteins. These proteins are predicted to encode the machinery necessary for conjugation and are highly homologous to a 12 kb region of the conjugative plasmid pWCFS103 from Lactobacillus plantarum [14]. In addition to the conserved conjugative core, one transposon is predicted to encode transposase-based integrative functions and the ability to replicate as a plasmid via the RepA and RepB plasmid replication proteins, whereas the other appears to use bacteriophage-based proteins for integration into the genome.
Accompanying the replicative functions of the O. kitaharae conjugative transposons, each element is predicted to encode several proteins that can be broadly categorised as functioning as part of “cellular defence” from either foreign DNA or from competition imposed by other microorganisms. One conjugative transposon is predicted to encode two separate restriction-modification (RM) systems (one combined Type III enzyme, and one Type IIs RM pair) and the other contains at least five ORFs that are potentially involved in the production of the antibacterial compounds (Fig. 4). In addition to the gene content of these potentially transmissible elements, the O. kitaharae DSM 17330 genome encodes another twelve proteins with putative roles in bacteriocin/antibacterial manufacture, transport or detoxification, four proteins involved in DNA RM and one CRISPR pathway array which, in other bacterial species, has been shown to provide a memory-based immunity to bacteriophage infection and possibly to the transmission of plasmid DNA (reviewed in [15]) (Table 1). All of these cellular defence mechanisms are lacking conserved homologs in the sequenced strains of O. oeni, with the exception of a single Type III RM enzyme which is found specifically in O. oeni strain AWRIB429 [6].
Table 1. Genes from Oenococcus kitaharae predicted to be involved in cellular defence.
Function | Description | ORF(s) |
Bacteriocin production or immunity | Putative bacteriocin ABC transporter | OKIT_0291 |
Bacteriocin immunity-associated integral membrane protein | OKIT_0292 | |
Bacteriocin, lactococcin 972 family | OKIT_0293 | |
Bacitracin transport ATP-binding protein | OKIT_0298 | |
Putative blasticidin S deaminase 2C | OKIT_0304 | |
Putative bacteriocin transport accessory protein | OKIT_0665 | |
Streptolysin S biosynthesis protein B (SagB) | OKIT_0885 | |
Streptolysin S biosynthesis protein C (SagC) | OKIT_0886 | |
Streptolysin S biosynthesis protein D (SagD) | OKIT_0887 | |
Phenazine biosynthesis protein PhzF | OKIT_0912 | |
Lactococcin A immunity protein | OKIT_0790 | |
Nisin transport protein | OKIT_0796 | |
Bacitracin export permease protein | OKIT_1723 | |
Putative bacteriocin ABC transporter | OKIT_1725 | |
Restriction modification | Type III restriction enzyme2C res subunit:DEAD/DEAH box helicase2C N-terminal | OKIT_0515 |
Type IIs modification methyltransferase | OKIT_0538 | |
Type IIs restriction endonuclease | OKIT_0539 | |
Type I restriction-modification system2C restriction subunit R (EC 3.1.21.3);Ontology_term = KEGG_ENZYME:3.1.21.3 | OKIT_0971 | |
Type II restriction modification enzyme methyltransferase | OKIT_0974 | |
Type III restriction enzyme, restriction subunit | OKIT_0978 | |
5-methylcytosine-specific restriction enzymea | OKIT_1348 | |
CRISPR | CRISPR-associated protein2C SAG0894 family | OKIT_1269 |
CRISPR-associated protein Cas1 | OKIT_1270 | |
CRISPR-associated protein Cas2 | OKIT_1271 | |
CRISPR-associated Csn2 family protein | OKIT_1272 |
homolog found in O. oeni AWRIB429.
Phenotypic differences attributable to genomic variation
In the initial characterization of O. kitaharae [1] several phenotypic traits were noted that differentiate this new species from O. oeni. Comparative genomics reveals a basis for some of these known differences while also suggesting several additional points of phenotypic differentiation.
Sugar utilization
One of the defining biochemical differences between O. kitaharae and O. oeni that was noted in its original isolation was the ability of O. kitaharae to produce acid from maltose [1]. This trait is rare in O. oeni, which is formally classified as maltose negative [16], [17]. By comparing available whole-genome annotations for O. oeni with O. kitaharae DSM 17330 [8], it was possible to identify several genes associated with sugar utilization that are differentially present across the species (Table 2). Of these, at least four genes which are present in O. kitaharae, but absent in the O. oeni genomes, are predicted to be involved in the utilization of maltose, providing a direct genetic basis for this phenotype. In addition to genes predicted to be involved in the species-specific utilization of maltose, there are several ORFs predicted to be involved in the metabolism of trehalose, D-gluconate, D-ribose and fructose that are specifically present in O. kitaharae. While the assimilation of these sugars is often carried out by specific strains of O. oeni [17], this genotypic data agrees well with biochemical tests performed previously that indicated that O. kitaharae was able to utilize all of these various carbon sources [1], [17].
Table 2. Carbohydrate utilization genes displaying inter-species differences.
Species | RAST Pathway | Description | ORF(s) |
O. kitaharae | Fructooligosaccharide and Raffinose Utilization | MSM (multiple sugar metabolism) operon regulatory protein | OKIT_0495OKIT_0684 |
Sucrose-6-phosphate hydrolase (EC 3.2.1.26) | OKIT_0688 | ||
O. kitaharae | Maltose & Maltodextrin Utilization | Maltose O-acetyltransferase (EC 2.3.1.79) | OKIT_0692 |
Maltose/maltodextrin transport ATP-binding protein MalK (EC 3.6.3.19) | OKIT_0712 | ||
Neopullulanase (EC 3.2.1.135) | OKIT_0711 | ||
Pullulanase (EC 3.2.1.41) | OKIT_0709 | ||
O. kitaharae | D-ribose utilization | Ribose ABC transport system, ATP-binding protein RbsA (TC 3.A.1.2.1) | OKIT_0349 |
Ribose ABC transport system, periplasmic ribose-binding protein RbsB (TC 3.A.1.2.1) | OKIT_0347 | ||
Ribose ABC transport system, permease protein RbsC (TC 3.A.1.2.1) | OKIT_0348 | ||
O. kitaharae | Fructose utilization | PTS system, fructose-specific IIA component (EC 2.7.1.69) | OKIT_0249 |
PTS system, fructose-specific IIB component (EC 2.7.1.69) | OKIT_0248 | ||
PTS system, fructose-specific IIC component (EC 2.7.1.69) | OKIT_0250 | ||
O. oeni | COG3533 | Arabinose-proton symporter | fig|203123.5.peg.226a |
L-arabinose isomerase (EC 5.3.1.4) | fig|203123.5.peg.224b | ||
L-ribulose-5-phosphate 4-epimerase (EC 5.1.3.4) | fig|203123.5.peg.223b | ||
Putative glycosyl hydrolase of unknown function (DUF1680) | YP_809865.1c | ||
Ribulokinase (EC 2.7.1.16) | YP_809879.1c | ||
Transcriptional repressor of arabinoside utilization operon, GntR family | YP_809878.1c | ||
Xyloside transporter XynT | YP_810752.1c; fig|203123.5.peg.206a |
RAST protein ID (not annotated in O. oeni PSU-1 Genbank submission).
RAST protein ID (pseudogene in O. oeni PSU-1, full ORF present in other strains of O. oeni).
O. oeni Genbank protein ID from genome accession number NC_008528.1.
In addition to those genes that are specifically present in O. kitaharae DSM 17330, several were identified that were present only in strains of O. oeni and which are predicted to be involved in the uptake and metabolism of arabinose and xylose (Table 2). This is consistent with the inability of O. kitaharae to produce acid from either L-arabinose or D-xylose, two biochemical reactions that many strains of O. oeni, including those for which genome sequence are available often perform [1], [17].
Amino acid biosynthesis
Both O. oeni and O. kitaharae are fastidious microorganisms that require many exogenous vitamins and amino acids. However, it appears that the O. kitaharae genome encodes biosynthetic pathways for at least two amino acids, arginine and histidine, which are lacking in O. oeni [18].
The O. kitaharae DSM 17330 genome encodes the six genes necessary for the production of arginine from glutamate via the ornithine/carbamoyl-phosphate (CP) pathway, in addition to encoding a second set of carbamoyl-phosphate synthase (CPSase) proteins (Table 3). As CP is an important intermediate in both the arginine and pyrimidine biosynthetic pathways, many bacteria, such as Lactobacillus plantarum, contain two completely separate sets of CPSase proteins [19]. In this situation, one protein is encoded in an operon with genes involved in arginine biosynthesis and regulated by arginine, while a second gene is located in the pyrimidine biosynthetic operon and regulated by exogenous pyrimidines. O. kitaharae contains both sets of CPSase enzymes while the O. oeni genomic sequences are predicted to encode only the single pyrimidine-associated CPSase [19].
Table 3. Genes involved in arginine and histidine biosynthesis in O. kitaharae.
RAST Pathway | Description | ORF(s) |
Arginine biosynthesis | Acetylglutamate kinase (EC 2.7.2.8) | OKIT_0634 |
Acetylornithine aminotransferase (EC 2.6.1.11) | OKIT_0630 | |
Glutamate N-acetyltransferase (EC 2.3.1.35) | OKIT_0629 | |
N-acetyl-gamma-glutamyl-phosphate reductase (EC 1.2.1.38) | OKIT_0628 | |
N-acetylglutamate synthase (EC 2.3.1.1) | OKIT_0629 | |
Ornithine carbamoyltransferase (EC 2.1.3.3) | OKIT_0631 | |
Carbamoyl-phosphate synthase small chain (EC 6.3.5.5) | OKIT_0632 | |
Carbamoyl-phosphate synthase large chain (EC 6.3.5.5) | OKIT_0633 | |
Histidine biosynthesis | phosphoribosyl-AMP cyclohydrolase( EC:3.5.4.19 ) | OKIT_1691 |
Phosphoribosyl-ATP pyrophosphatase (EC 3.6.1.31) | OKIT_1690 | |
Imidazoleglycerol-phosphate dehydratase (EC 4.2.1.19) | OKIT_1695 | |
Imidazole glycerol phosphate synthase cyclase subunit (EC 4.1.3.-) | OKIT_1692 | |
Imidazole glycerol phosphate synthase amidotransferase subunit (EC 2.4.2.-) | OKIT_1694 | |
Histidinol-phosphate aminotransferase (EC 2.6.1.9) | OKIT_1689 | |
Histidinol-phosphatase (EC 3.1.3.15) | OKIT_1699 | |
Histidinol dehydrogenase (EC 1.1.1.23) | OKIT_1696 | |
ATP phosphoribosyltransferase regulatory subunit (EC 2.4.2.17) | OKIT_1698 | |
ATP phosphoribosyltransferase catalytic subunit (EC 2.4.2.17) | OKIT_1697 | |
Phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase (EC 5.3.1.16) | OKIT_1693 |
O. kitaharae also appears to encode all of the enzymes necessary for the synthesis of histidine from the pentose phosphate pathway intermediate 5-phosphoribosyl 1-pyrophosphate (PRPP) (Table 3). These 11 genes lie adjacent to each other in the O. kitaharae DSM 17330 genome while O. oeni genome sequences lack this entire complement of enzymes.
Organic acid metabolism
Many strains of O. oeni are capable of fermenting citrate when additional sources of fermentable carbon sources are also present [20], [21]. Several genes in the O. oeni PSU-1 genome have been identified previously that would provide this strain with the ability to convert citrate to pyruvate [4] (Table 4). These genes are absent from the O. kitaharae DSM 17330 genome leading to the prediction that, unlike O. oeni, this strain would lack the ability to ferment this organic acid.
Table 4. Genes involved in citrate utilization in O. oeni.
RAST Pathway | Description | ORF(s) |
Citrate Metabolism, Transport, and Regulation | 2-(5″-triphosphoribosyl)-3′-dephosphocoenzyme-A synthase (EC 2.7.8.25) | YP_810049.1a |
Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase (EC 2.7.7.61) | YP_810048.1a | |
Citrate lyase alpha chain (EC 4.1.3.6) | YP_810047.1a | |
Citrate lyase beta chain (EC 4.1.3.6) | YP_810046.1a | |
Citrate lyase gamma chain, acyl carrier protein (EC 4.1.3.6) | YP_810045.1a | |
Citrate lyase transcriptional regulator CitI | YP_810041.1a | |
Oxaloacetate decarboxylase involved in citrate fermentation (EC 4.1.1.3) | YP_810042.1a | |
[Citrate [pro-3S]-lyase] ligase (EC 6.2.1.22) | YP_810044.1a |
O. oeni Genbank protein ID from genome accession number NC_008528.1.
One of the key defining biochemical features that separates O. oeni from O. kitaharae is the ability to perform malolactic fermentation. Malolactic fermentation has been shown to require the action of three proteins, a malate permease, which transports malate into the cell, the malolactic enzyme, which is responsible for converting malic acid into lactic acid, and a regulatory protein for these two downstream genes [22]. Surprisingly, the O. kitaharae genome was shown to contain genes that are orthologous to those which encode all three of these activities in O. oeni (Fig. 5A). It was subsequently shown that, while the sequences of all three genes are present in the O. kitaharae genome, the gene encoding malolactic enzyme contained a nonsense mutation that would prematurely truncate the protein coding region (Fig. 5B). The alteration of a single base in this premature stop codon would be sufficient to restore the full-length malolactic enzyme coding region (Fig. 5B) that is highly conserved with malolactic enzymes from many bacteria (Fig. S1). Furthermore, the O. kitaharae gene was shown to have a low ratio of non-synonymous to synonymous mutations (dN/dS = 0.0123) when compared with its O. oeni homologue. This would indicate that there has been limited opportunity for the unconstrained accumulation of synonymous mutations in the two fragments of the malolactic enzyme coding region in O. kitaharae (as would be expected in a non-functional gene undergoing random drift). It is therefore likely that the conversion of the malolactic enzyme to a pseudogene is a very recent event in O. kitaharae and it may be possible to obtain a functional enzyme through reversion of the nonsense mutation or to find a functional malate pathway in strains of O. kitaharae other than DSM 17330.
Discussion
O. kitaharae and O. oeni comprise the only known members of their genus. Sequencing of the O. kitaharae DSM 17330 genome has provided important insights into the genetic diversity across this genus. These two species of Oenococcus appear to inhabit significantly different ecological niches, with O. oeni being found almost exclusively in the highly stressful environment of wine whereas O. kitaharae was isolated from a composting shochu residue of unknown nutrient composition. Accordingly, the two species have accumulated genetic adaptations that reflect different metabolic needs and environmental constraints.
Although little is known regarding the exact nutrient profile of the residue from which O. kitaharae was isolated, the average composition of the major wine metabolites are well known. Finished wine has little or no glucose, fructose or maltose but does contain significant quantities of arabinose and xylose [23], [24]. Many strains of O. oeni are capable of exploiting these carbohydrates, and contain genes whose biochemical functions are consistent with this ability, but, in all but a limited number of cases [16], [17], cannot utilize maltose. In comparison, O. kitaharae DSM 17330 lacks the genes required for the use of arabinose and xylose but has the ability to utilize the maltose that would be present in the feedstocks, such as barley, which are used in the production of shochu.
In addition, while little is known regarding the amino acid profile of the shochu residue in which O. kitaharae was isolated, it would be predicted that these feedstocks would generally be lower in arginine and histidine than wine (where they are often amongst the most prevalent amino acids [24]) given the presence of the biosynthetic pathways for both of these amino acids in O. kitaharae. Interestingly, both the histidine and arginine biosynthetic pathways display a scattered pattern of presence throughout the LAB phylogeny with only a limited number of species within a genus possessing these pathways (Fig. S2). It appears that there must be significant selective pressure working for and against these biosynthetic pathways in an environmentally-dependent manner across the LAB. For O. kitaharae, the evolutionary origins of both pathways are more consistent with loss of these enzymes followed by horizontal gain from a Lactobacillus-related species (Fig. S2).
Wine represents a harsh growth environment in which only a select few species of bacteria are capable of growth to significant levels [25]. O. oeni is therefore faced with little competition from other species of bacteria during its growth and its genome is almost devoid of proteins that are involved in defence against other bacteria or even to invasion by bacteriophage. In contrast, the O. kitaharae genome contains numerous proteins that potentially provide a selective advantage over other bacteria (bacteriocins/antimicrobials), to defend against attack by other species of bacteria (bacteriocin immunity proteins) and to also defend against invasion by foreign DNA, such as that introduced by bacteriophage (restriction-modification systems and the CRISPR element). Although the biological diversity of composting residue in which O. kitaharae was not formally evaluated, it can be assumed that there was sufficient microbiological competition for resources to justify the selective advantage for the presence of these defence compounds. This argument is further supported by the fact that at least two other novel species of LAB have been isolated from this environment in addition to many other species of LAB that have been shown to be present during the shochu production process [1], [26]–[29]. O. kitaharae has therefore evolved to compete in a mixed-species environment whereas O. oeni has adapted to a niche in which the extreme nature of the growth substrate has removed the majority of biological competition.
Whereas the applicability of O. kitaharae for use as an industrial species is yet to be determined, it could prove useful for the development of improved strains of its relative O. oeni. Despite the environment of wine providing protection from competition for O. oeni, it is still argued that many instances of failed malolactic fermentations are due to the action of bacteriophage on susceptible strains [30], [31]. If it were possible to move genes, such as those of the CRISPR array, from O. kitaharae into O. oeni via the conjugation machinery which is predicted to be present in O. kitaharae, this could provide a non-GM means of equipping industrial O. oeni strains with general resistance to bacteriophage infection. Likewise, if the genes involved in antimicrobial production can be transferred to O. oeni, these strains could limit potential negative impacts on wine quality due to the growth of spoilage bacteria such as Pediococcus spp, Lactobacillus spp and acetic acid bacteria, and may provide the means to reduce the amount of sulfite that is currently used for the microbial stabilization of wine [32]. The use of genes from O. kitaharae may therefore allow the production of strains of O. oeni which are not only able to thrive in the harshness of the wine environment, but are more resistant to potential biological competition from bacteriophage or other microorganisms.
Materials and Methods
DNA isolation and sequencing
O. kitaharae DSM 17330 was obtained from DSMZ (Germany) and was grown in modified MRS media (Amyl, Australia). Genomic DNA was isolated using standard phenol-chloroform extractions. Sequencing was performed on an Illumina GAIIx using 2×100 bp paired-end ends with an average library size of 500 bp (Ramacioitti Centre, NSW, Australia).
Genome assembly
A total of 990,000 reads (∼50-fold expected genome coverage) were randomly selected and assembled using MIRA (version 3.2.1). The MIRA output was imported into Seqman Pro (DNAstar, Madison, WI) for manual alignment and editing of the assembly. This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession AFVZ00000000. The version described in this paper is the first version, AFVZ01000000.
Genome Annotation
Gene predictions were made using Glimmer 3.02 [7]. Gene functional annotations were performed using the RAST server [8] and BLAST [33] with comparisons to the non-redundant Genbank database. Predictions of genomic regions likely to have been acquired by horizontal gene transfer were calculated using Alien Hunter [11]. dN/dS ratios were calculated using Pal2nal [34]. Circular genome plots were compiled using Artemis [35] and DNAplotter [36].
Comparisons to the various LAB genomes were performed using BLAST [33] and custom written scripts. For phylogenetic analysis, proteins used for the analysis were first screened to ensure that they were conserved (minimum 60% identity when compared to the homologous O. kitaharae protein) in all of the LAB genomes used in this study (See Table S2). Next, proteins which had potential paralogs (which could confound the phylogeny) were identified by assigning each protein to specific orthoMCL [37] clusters and then only retaining those groups of orthologs in which each protein was the only member of a particular orthoMCL group. Individual protein alignments were then performed on each set of homologous sequences using Muscle [38]. These individual alignments were then concatenated into a single large sequence for each strain which was used to construct a maximum-likelihood phylogenetic tree using PhyML [39].
Supporting Information
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: The Australian Wine Research Institute (AWRI), a member of the Wine Innovation Cluster in Adelaide, is supported by Australia's grapegrowers and winemakers through their investment body the Grape and Wine Research Development Corporation with matching funds from the Australian Government. Systems Biology research at the AWRI is performed using resources provided as part of the National Collaborative Research Infrastructure Strategy, an initiative of the Australian Government, in addition to funds from the South Australian State Government. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Endo A, Okada S. Oenococcus kitaharae sp. nov., a non-acidophilic and non-malolactic-fermenting oenococcus isolated from a composting distilled shochu residue. Int J Syst Evol Microbiol. 2006;56:2345–2348. doi: 10.1099/ijs.0.64288-0. [DOI] [PubMed] [Google Scholar]
- 2.Dicks LM, Dellaglio F, Collins MD. Proposal to reclassify Leuconostoc oenos as Oenococcus oeni [corrig.] gen. nov., comb. nov. Int J Syst Bacteriol. 1995;45:395–397. doi: 10.1099/00207713-45-2-395. [DOI] [PubMed] [Google Scholar]
- 3.Bartowsky EJ, Borneman AR. Genomic variations of Oenococcus oeni strains and the potential to impact on malolactic fermentation and aroma compounds in wine. Appl Microbiol Biotechnol. 2011;92:441–447. doi: 10.1007/s00253-011-3546-2. [DOI] [PubMed] [Google Scholar]
- 4.Mills DA, Rawsthorne H, Parker C, Tamir D, Makarova K. Genomic analysis of Oenococcus oeni PSU-1 and its relevance to winemaking. FEMS Microbiol Rev. 2005;29:465–475. doi: 10.1016/j.femsre.2005.04.011. [DOI] [PubMed] [Google Scholar]
- 5.Makarova K, Slesarev A, Wolf Y, Sorokin A, Mirkin B, et al. Comparative genomics of the lactic acid bacteria. Proc Natl Acad Sci U S A. 2006;103:15611–15616. doi: 10.1073/pnas.0607117103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Borneman AR, Bartowsky EJ, McCarthy J, Chambers PJ. Genotypic diversity in Oenococcus oeni by high-density microarray comparative genome hybridization and whole genome sequencing. Appl Microbiol Biotechnol. 2010;86:681–691. doi: 10.1007/s00253-009-2425-6. [DOI] [PubMed] [Google Scholar]
- 7.Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007;23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Chelo IM, Ze-Ze L, Tenreiro R. Genome diversity in the genera Fructobacillus, Leuconostoc and Weissella determined by physical and genetic mapping. Microbiology. 2007;156:420–430. doi: 10.1099/mic.0.028308-0. [DOI] [PubMed] [Google Scholar]
- 10.De Bruyne K, Camu N, De Vuyst L, Vandamme P. Weissella fabaria sp. nov., from a Ghanaian cocoa fermentation. Int J Syst Evol Microbiol. 2010;60:1999–2005. doi: 10.1099/ijs.0.019323-0. [DOI] [PubMed] [Google Scholar]
- 11.Vernikos GS, Parkhill J. Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. Bioinformatics. 2006;22:2196–2203. doi: 10.1093/bioinformatics/btl369. [DOI] [PubMed] [Google Scholar]
- 12.Sao-Jose C, Santos S, Nascimento J, Brito-Madurro AG, Parreira R, et al. Diversity in the lysis-integration region of oenophage genomes and evidence for multiple tRNA loci, as targets for prophage integration in Oenococcus oeni. Virology. 2004;325:82–95. doi: 10.1016/j.virol.2004.04.029. [DOI] [PubMed] [Google Scholar]
- 13.Salyers AA, Shoemaker NB, Stevens AM, Li LY. Conjugative transposons: an unusual and diverse set of integrated gene transfer elements. Microbiol Rev. 1995;59:579–590. doi: 10.1128/mr.59.4.579-590.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.van Kranenburg R, Golic N, Bongers R, Leer RJ, de Vos WM, et al. Functional analysis of three plasmids from Lactobacillus plantarum. Appl Environ Microbiol. 2005;71:1223–1230. doi: 10.1128/AEM.71.3.1223-1230.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Marraffini LA, Sontheimer EJ. CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea. Nat Rev Genet. 2010;11:181–190. doi: 10.1038/nrg2749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Davis CR, Wibowo D, Fleet GH, Lee TH. Properties of wine lactic-acid bacteria - their potential enological significance. Am J Enol Vitic. 1988;39:137–142. [Google Scholar]
- 17.Dicks LM, Holzapfel WH. Genus II. Oenococcus. In: Vos P, Garrity G, Jones D, Krieg NR, Ludwig W, et al., editors. Bergey's Manual of Systematic Bacteriology second edition. New York: Springer; 2009. pp. 635–642. [Google Scholar]
- 18.Terrade N, Mira de Orduna R. Determination of the essential nutrient requirements of wine-related bacteria from the genera Oenococcus and Lactobacillus. Int J Food Microbiol. 2009;133:8–13. doi: 10.1016/j.ijfoodmicro.2009.03.020. [DOI] [PubMed] [Google Scholar]
- 19.Bringel F, Frey L, Boivin S, Hubert JC. Arginine biosynthesis and regulation in Lactobacillus plantarum: the carA gene and the argCJBDF cluster are divergently transcribed. J Bacteriol. 1997;179:2697–2706. doi: 10.1128/jb.179.8.2697-2706.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Garvie EI. Leuconostoc oenos sp.nov. J Gen Microbiol. 1967;48:431–438. doi: 10.1099/00221287-48-3-431. [DOI] [PubMed] [Google Scholar]
- 21.Ramos A, Santos H. Citrate and Sugar Cofermentation in Leuconostoc oenos, a (sup13)C Nuclear Magnetic Resonance Study. Appl Environ Microbiol. 1996;62:2577–2585. doi: 10.1128/aem.62.7.2577-2585.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bartowsky EJ. Oenococcus oeni and malolactic fermentation - moving into the molecular arena. Aust J Grape Wine Res. 2005;11:174–187. [Google Scholar]
- 23.Henschke PA, Jiranek V. Yeasts - metabolism of nitrogen compounds. In: Fleet GH, editor. Wine microbiology and biotechnology. Chur, Switzerland: Harwood Academic Publishers; 1993. pp. 77–164. [Google Scholar]
- 24.Hernandez-Orte P, Cacho JF, Ferreira V. Relationship between varietal amino acid profile of grapes and wine aromatic composition. Experiments with model solutions and chemometric study. J Agric Food Chem. 2002;50:2891–2899. doi: 10.1021/jf011395o. [DOI] [PubMed] [Google Scholar]
- 25.Wibowo D, Eschenbruch R, Davis CR, Fleet GH, Lee TH. Occurrence and growth of lactic-acid bacteria in wine - a Review. Am J Enol Vitic. 1985;36:302–313. [Google Scholar]
- 26.Endo A, Okada S. Monitoring the lactic acid bacterial diversity during shochu fermentation by PCR-denaturing gradient gel electrophoresis. J Biosci Bioeng. 2005;99:216–221. doi: 10.1263/jbb.99.216. [DOI] [PubMed] [Google Scholar]
- 27.Endo A, Okada S. Lactobacillus satsumensis sp. nov., isolated from mashes of shochu, a traditional Japanese distilled spirit made from fermented rice and other starchy materials. Int J Syst Evol Microbiol. 2005;55:83–85. doi: 10.1099/ijs.0.63248-0. [DOI] [PubMed] [Google Scholar]
- 28.Endo A, Okada S. Lactobacillus composti sp. nov., a lactic acid bacterium isolated from a compost of distilled shochu residue. Int J Syst Evol Microbiol. 2007;57:870–872. doi: 10.1099/ijs.0.64743-0. [DOI] [PubMed] [Google Scholar]
- 29.Endo A, Okada S. Lactobacillus farraginis sp. nov. and Lactobacillus parafarraginis sp. nov., heterofermentative Lactobacilli isolated from a compost of distilled shochu residue. Int J Syst Evol Microbiol. 2007;57:708–712. doi: 10.1099/ijs.0.64618-0. [DOI] [PubMed] [Google Scholar]
- 30.Davis C, Silveira NF, Fleet GH. Occurrence and properties of bacteriophages of Leuconostoc oenos in Australian wines. Appl Environ Microbiol. 1985;50:872–876. doi: 10.1128/aem.50.4.872-876.1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Henick-Kling T, Lee TH, Nicholas DJD. Inhibition of bacterial growth and malolactic fermentation in wine by bacteriophage. J Appl Bacteriol. 1986;61:287–293. [Google Scholar]
- 32.Bartowsky EJ. Bacterial spoilage of wine and approaches to minimize it. Lett Appl Microbiol. 2009;48:149–156. doi: 10.1111/j.1472-765X.2008.02505.x. [DOI] [PubMed] [Google Scholar]
- 33.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609–612. doi: 10.1093/nar/gkl315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Carver T, Berriman M, Tivey A, Patel C, Bohme U, et al. Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics. 2008;24:2672–2676. doi: 10.1093/bioinformatics/btn529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Carver T, Thomson N, Bleasby A, Berriman M, Parkhill J. DNAPlotter: circular and linear interactive genome visualization. Bioinformatics. 2009;25:119–120. doi: 10.1093/bioinformatics/btn578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Chen F, Mackey AJ, Stoeckert CJ, Roos DS. OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res. 2006;34:D363–368. doi: 10.1093/nar/gkj123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.