Skip to main content
MicrobiologyOpen logoLink to MicrobiologyOpen
. 2019 Apr 11;8(9):e00840. doi: 10.1002/mbo3.840

High‐quality draft genome sequence of Gaiella occulta isolated from a 150 meter deep mineral water borehole and comparison with the genome sequences of other deep‐branching lineages of the phylum Actinobacteria

Rita Severino 1, Hugo J C Froufe 2, Cristina Barroso 1,2, Luciana Albuquerque 1, Alexandre Lobo‐da‐Cunha 3,4, Milton S da Costa 1, Conceição Egas 1,2,
PMCID: PMC6741124  PMID: 30977302

Abstract

Gaiella occulta strain F2‐233T (=CECT 7815 = LMG 26412), isolated from a 150 meter deep mineral water aquifer, was deemed a candidate for high‐quality draft genome sequencing because of the rare environment from which it was isolated. The draft genome sequence (QQZY00000000) of strain F2‐233T is composed of approximately 3 Mb, predicted 3,119 protein‐coding genes of which 2,545 were assigned putative functions. Genome analysis was done by comparison with the other deep‐branching Actinobacteria neighbors Rubrobacter radiotolerans, Solirubrobacter soli and Thermoleophilum album. The genes for the tricarboxylic acid cycle, gluconeogenesis and pentose phosphate pathway, were identified in G. occulta, R. radiotolerans, S. soli and T. album genomes. Genes of the Embden–Meyerhof–Parnas pathway and nitrate reduction were identified in G. occulta, R. radiotolerans and S. soli, but not in the T. album genome. Alkane degradation is precluded by genome analysis in G. occulta. Genes involved in myo‐inositol metabolism were found in both S. soli and G. occulta genomes. A Calvin–Benson–Bassham (CBB) cycle with a type I RuBisCO was identified in G. occulta genome, as well. However, experimental growth under several conditions was negative and CO2 fixation could not be proven in G. occulta.

Keywords: Actinobacteria, deep mineral water aquifer, Gaiellaocculta, Gaiellaceae, genome

1. INTRODUCTION

Bottled mineral waters originate from aquifers that are generally tapped from boreholes and piped to plants for blotting. According to European Union law mineral water cannot be disinfected to remove or decrease the number of microorganisms (Anonymous, 2009). However, the water can be treated to remove unstable elements such as iron, manganese, sulfur, and arsenic or to (re)introduce carbon dioxide. Therefore, bottled mineral water contains a complex microbiota that, presumably, originates from the source. Quality control procedures are mandatory to monitor the presence/absence of indicator bacteria for fecal contamination, as well as surface water infiltration and pathogenic bacteria such as Pseudomonas aeruginosa (Anonymous, 2009). Heterotrophic plate counts are also monitored to record alterations in the number of colony‐forming units (CFUs).

The majority of studies on the microbial diversity of bottled water have been performed on still natural mineral waters using culture‐dependent approaches (Guillot & Leclerc, 1993; Morais & da Costa, 1990; Vachée, Mossel, & Leclerc, 1997). Microbial abundances estimated by CFUs indicate that heterotrophic bacteria number is low at source (around 10 CFU/ml) but increases to about 104–105 CFU/ml during storage at room temperature (Croville, Cantet, & Saby, 2011; Morais & da Costa, 1990; Warburton, 1993). More recently culture‐dependent and culture‐independent techniques were used together to determine the microbial diversity and abundances present at source of one mineral water, to assess microbial stability of the source over a 1 year period, and to examine the microbial dynamics after bottling throughout 6 months of storage of the mineral water in factory produced plastic bottles (França, Lopéz‐Lopéz, Rosselló‐Móra, & da Costa, 2015). In all cases, communities were largely dominated by Bacteria affiliated with the Alphaproteobacteria, Betaproteobacteria, and Gammaproteobacteria. Several isolates representing new species were also characterized and described from the aquifer and the bottled mineral water (Albuquerque et al., 2011; França, Albuquerque, & da Costa, 2015; Franca, Albuquerque, Sanchez, Farelaira, & da Costa, 2017; Leandro, França, Nobre, Rainey, & da Costa, 2013; Leandro et al., 2012). Among them, Gaiella occulta is the sole representative of the family Gaiellaceae of the order Gaiellales within the deep‐branching lineages of the phylum Actinobacteria and was deemed a candidate for high‐quality genome sequencing given the rare environment from which it was isolated. The phylum Actinobacteria comprises several deeply branching lineages that consist of species of the orders Rubrobacterales, Solirubrobacterales, Thermoleophilales, and Gaiellales (Foesel, Geppert, Rohde, & Overmann, 2016). The order Gaiellales and the family Gaiellaceae comprises only the species G. occulta strain F2‐233T (Albuquerque et al., 2011).

1.1. Organism information

Gaiella occulta is a nonpigmented, Gram‐negative staining, nonmotile, aerobic, and chemoorganotrophic, with an optimal growth temperature of 35–37°C, optimum pH for growth between 6.5 and 7.5, and was isolated from a 150 meter deep water aquifer in Portugal (Albuquerque et al., 2011; Foesel et al., 2016). The strain forms short rod‐shaped cells 1.0–3.0 µm in length by 0.3–0.5 µm in width (by transmission elctro microscopy) (Figure 1). Strain F2‐233T can assimilate carbohydrates, organic acids, and amino acids. Nitrate is reduced to nitrite; long chain n‐alkanes are not used as carbon and energy source.

Figure 1.

Figure 1

Transmission electron microscopy of Gaiella occulta. (a) Low amplification of a group of cells in longitudinal section and cross section. (b) Dividing cell with central septum. (c) Higher magnification showing a thin peptidoglycan layer (arrow) and outer leaflet of the cell membrane (arrow head)

16S rRNA gene sequence comparisons show G. occulta to belong to the family Gaiellacea of the order Gaiellales which represents a deep branch lineage of the phylum Actinobacteria. The phylogenetic tree based on the 16S rRNA gene of strain F2‐233T in the deep branching taxa of the Actinobacteria shows G. occulta F2‐233T to form a distinct lineage separate from those of the orders Rubrobacterales, Thermoleophilales, and Solirubrobacterales (Figure 2).

Figure 2.

Figure 2

16S rRNA gene sequence phylogenetic tree. The position of Gaiella occulta within the radiation of the deep‐branching taxa of the Actinobacteria is shown. The scale bar represents two inferred substitutions per 100 nucleotides. The numbers at branching points of the neighbor‐joining tree represent bootstrap values from 1,000 replications

2. MATERIALS AND METHODS

2.1. Growth conditions and genomic DNA preparation

Strain F2‐233T was grown in 1 L Erlenmeyer flasks containing 300 ml of R2A medium (http://www.dsmz.de/microorganisms/medium/pdf/DSMZ_Medium830.pdf) at 37°C in a rotary water bath shaker until late exponential phase of growth for DNA extraction. To ascertain CO2 fixation and the enzymatic activity of RuBisCo, G. occulta was grown under aerobic and anaerobic conditions in sealed 50 ml serum ampules at 37°C containing 40 ml of a minimal medium previously described, supplemented with 0.02% yeast extract (Hahnke, Moosmann, Erb, & Strous, 2014). The medium contained (per liter) 0.5 g (NH4)2SO4, 0.5 g MgSO4.7H2O, 0.1 g CaCl2.2H2O, 6 g HEPES, 0.12 g K2HPO4, 0.04 g KH2PO4, and 1 ml trace element solution. Phosphate was added after sterilization. The trace element solution contained (per liter) 7.3 g Na2EDTA, 0.085 g CuCl2.2H2O, 0.72 g ZnSO4.7H2O, 2.5 g FeSO4.7H2O, 0.02 g MnCl2.4H2O, 0.242 g Na2MoO4.2H2O and 2 g NaHCO3. The headspace for anaerobic growth was flushed with N2 for 5 min. Formate, acetate and pyruvate were tested as sole carbon sources and mixtures of formate and acetate, or formate and pyruvate were used to improve growth. In order to test the utilization of hydrogen as electron acceptor, approximately 3 ml H2 was also added to anaerobic and aerobic ampules after sterilization and prior to inoculation. Cell density was measured at 610 nm.

Total genomic DNA was extracted following the method of Nielsen et  al. (Nielsen, Fritze, & Priest, 1995). Briefly, cells were lysed with a solution of lysozyme, guanidium thiocyanate, and sodium n‐lauryl sarcosine. DNA was extracted with chloroform:isoamyl alcohol (24:1, v:v), precipitated with isopropanol and washed with 70% ethanol, dried and resuspended in water. RNAse was included in the extraction process. DNA purity was assessed in a 1% agarose gel electrophoresis. DNA was quantified by fluorescence in the Invitrogen Qubit® 2.0 fluorometer (Thermo Fisher Scientific, Carlsbad, CA).

2.2. Genome sequencing and assembly

Genomic DNA was prepared with the Nextera XT DNA Library Preparation Kit (Illumina) and sequenced using the paired‐end (PE) 2x300 bp V3 kit on the MiSeq® (Illumina, San Diego) at the Next Generation Sequencing Unit, Center for Neuroscience and Cell Biology (CNC/Biocant, Cantanhede, Portugal). Sequenced reads were quality filtered with Trimmomatic (Bolger, Lohse, & Usadel, 2014) and assembled with SPAdes version 3.9.0 (Bankevich et al., 2012).

2.3. Genome annotation and analysis

The resulting contigs were annotated with PGP (Prokaryotic Genome Prediction) (Egas et al., 2014). Genome estimated completeness was verified with CheckM (version 1.0.7) based on lineage‐specific marker sets (Parks, Imelfort, Skennerton, Hugenholtz, & Tyson, 2015). Contamination was tested by CheckM for protein‐coding genes and RNAmmer version 1.2 (Lagesen et al., 2007) and Usearch61 (Edgar, 2010) (with the Greengenes database, version 13.8, ID threshold of 0.6 and an e‐value of 1e−6) for complete or partial 16S rRNA gene. The genome of strain F2‐233T was compared with the available genomes of organisms of deeply branching orders of the Actinobacteria, namely Rubrobacter radiotolerans RSPS‐4 (NZ_CP007514.1, NZ_CP007515.1, NZ_CP007516.1 and NZ_CP007517.1), Solirubrobacter soli DSM 22325T (NZ_AUIK00000000.1), and Thermoleophilum album strain ATCC 35263T (NZ_FNWJ00000000.1), with GET_HOMOLOGUES using BLASTP and OrthoMCL (Contreras‐Moreira & Vinuesa, 2013). Orthologous genes were annotated against the Kyoto Encyclopedia of Genes and Genomes (KEGG) and assigned to metabolic pathways (sequence similarity cutoff e‐value of 1e−5) using KOBAS 2.0 (Xie et al., 2011).

One partial genome sequence of a Gaiella sp. (LSTI01000000) appearing in the database was amplified from a single cell recovered from soil. The sequence contains 995,360 bp with 1,036 genes coding for 915 proteins the vast majority of which are annotated as hypothetical proteins. The organism is closely related to G. occulta (95%) based on the phylogenetic analysis of a truncated 16S rRNA gene with 892 bp. The G + C ratio of the DNA is 66.4%.

3. RESULTS AND DISCUSSION

3.1. Genome properties

The F2‐233T strain DNA sequence run generated 5,261,564 paired‐end reads of which 3,362,091 high‐quality reads remained after quality filtering. The average read length was of 197 bp. The de novo read assembly produced 34 contigs with an N50 size of 401,372 bp. The high‐quality draft assembled genome sequence consisted of 3,028,529 bp, with a sequencing depth of coverage of 520‐fold and a DNA G + C content of 71,65% (Table 1). The genome had a total of 3,167 genes, including 2,545 protein‐coding genes, 45 tRNA genes and three rRNA genes (a single copy of the genes 23S, 16S and 5S) (Table 1). CheckM estimated the genome to be near‐completion (97.84%) and the level of contamination to be extremely low (1.29%). No contamination was detected for 16S rRNA genes as tested by RNAmmer and Usearch61.

Table 1.

Genome and annotation statistics for Gaiella occulta F2‐233T

Attribute Value % of Total
Genome size (bp) 3,028,529 100
DNA coding (bp) 2,785,809 91.99
DNA G + C (bp) 2,169,941 71.65
DNA scaffolds 34  
Total genes 3,167 100
Protein coding genes 3,119 98.48
RNA genes 48 1.51
Genes with function prediction 2,545 81.60
Genes assigned to COGs 1,718 54.24
Genes with Pfam domains 2,502 79.00
CRISPR repeats 1  

The draft genome sequence of G. occulta F2‐233T(=CECT 7815T = LMG 26415T) has been deposited in the Short Read Archive (SRA) under the accession number SRR7537062 and the genome assembly under the accession number QQZY00000000.

3.2. Genome annotation

The draft genome comprised 2,545 genes with putative functions (~82% of total protein‐coding genes) and 1,718 genes allocated to the Clusters of orthologous grups (COG) functional categories (55% of total protein‐coding genes). The most abundant COG category was “Amino acid transport and metabolism” followed by “General function prediction only” and “Energy production and conversion” (Table 2).

Table 2.

Number of genes associated with general COG functional categories

Code Value % of Total Description
J 137 4.39 Translation, ribosomal structure and biogenesis
A 0 RNA processing and modification
K 150 4.81 Transcription
L 83 2.66 Replication, recombination and repair
B 1 0.03 Chromatin structure and dynamics
D 23 0.74 Cell cycle control, cell division, chromosome partitioning
V 19 0.61 Defense mechanisms
T 63 2.02 Signal transduction mechanisms
M 116 3.72 Cell wall/membrane biogenesis
N 36 1.15 Cell motility
U 43 1.38 Intracellular trafficking and secretion
O 65 2.08 Posttranslational modification, protein turnover, chaperones
C 170 5.45 Energy production and conversion
G 132 4.23 Carbohydrate transport and metabolism
E 263 8.43 Amino acid transport and metabolism
F 67 2.15 Nucleotide transport and metabolism
H 108 3.46 Coenzyme transport and metabolism
I 89 2.85 Lipid transport and metabolism
P 89 2.85 Inorganic ion transport and metabolism
Q 49 1.57 Secondary metabolites biosynthesis, transport and catabolism
R 240 7.69 General function prediction only
S 118 3.78 Function unknown
0 44.92 Not in COGs

3.3. Insights from the genome sequence

3.3.1. Central metabolism

Genes coding for the enzymes of the Embden–Meyerhof–Parnas pathway (EMP), namely ATP‐dependent 6‐phosphofructokinase (EC 2.7.1.11, Gocc_2786, Gocc_2787) and fructose‐bisphosphate aldolase (EC 4.1.2.13, Gocc_1033) were identified in the genome of G. occulta as well as in the genomes of R. radiotolerans and S. soli. However, a fructose‐bisphosphate aldolase gene was not identified in the genome sequence of T. album, predicting that the organism cannot channel hexoses through this pathway.

The genes coding for 2‐keto‐3‐deoxygluconate‐6‐phosphate aldolase (EC 4.1.2.14) and 6‐phosphogluconate dehydratase (EC 4.2.1.12), characteristic of the Entner‐Doudoroff pathway were not found in the four genomes, indicating that glycolysis does not proceed through this pathway. The gene coding for the enzyme fructose‐1,6‐bisphosphatase (EC 3.1.3.11, Gocc_1034), required for gluconeogenesis, is present in the genome of G. occulta and in the genomes of R. radiotolerans, S. soli, and T. album. All putative genes coding for the enzymes of the pentose phosphate pathway and tricarboxylic acid cycle were identified in the genomes of G. occulta, R. radiotolerans and S. soli and T. album.

Gaiella occulta and S. soli utilize myo‐inositol as single carbon and energy source for growth (Albuquerque et al., 2011; Kim et al., 2007). Several genes coding for enzymes involved in the metabolism of myo‐inositol, namely iolABCDEG, were identified in G. occulta and S. soli genomes (Yoshida et al., 2008). However, the gene iolJ that codes for 6‐phospho‐5‐dehydro‐2‐deoxy‐D‐gluconate aldolase (EC 4.1.2.29) and leads to the formation of dihydroxyacetone‐phosphate and malonate semialdehyde was not identified in the genomes of these two organisms. This gene was also not found in the genomes of other bacterial strains capable of myo‐inositol catabolism such as Clostridium perfringens, Legionella pneumophila, and Enterobacter aerogenes. It is possible that the iolJ gene is replaced by a yet annotated aldolase gene (Berman & Magasanik, 1966; Kawsar, Ohtani, Okumura, Hayashi, & Shimizu, 2004; Manske, Schell, & Hilbi, 2016).

The genome predicts a Calvin–Benson–Bassham (CBB) cycle with a type I RuBisCO (EC 4.1.1.39, Gocc_0241 and Gocc_0242) but lacks sedoheptulose‐bisphosphatase (EC 3.1.3.37) that leads to the synthesis of sedoheptulose‐phosphate. This cycle is also predicted in the genome of S. soli. In S. soli DSM 22325T the fructose‐1,6‐bisphosphatase/sedoheptulose‐1,7‐bisphosphatase gene is predicted to have a dual function, and this may also be the case for G. occulta. Other genes potentially involved in CO2 metabolism were identified in G. occulta, namely a formate dehydrogenase (EC 1.17.99.7, Gocc_1455), a putative hydrogenase (Gocc_1122 to Gocc_1127) and three carbon monoxide dehydrogenases (CODH, EC 1.2.5.3, Gocc_1071 to Gocc_1073, Gocc_1636 to Gocc_1638 and Gocc_2945 to Gocc_2947) (Shi et al., 2015). Several CODH genes were also identified in R. radiotolerans and S. soli. Growth was not observed under anaerobic conditions. Growth was observed under aerobic conditions in media containing acetate and pyruvate or combinations of formate plus acetate and formate plus pyruvate. The addition of hydrogen did not enhance growth under any condition examined. In all cases growth under aerobiosis was low and it was impossible to obtain enough cell mass to perform enzymatic assays for RuBisCO activity. We, therefore, predict but do not confirm CO2 fixation via the CBB cycle in this organism. A complete CBB cycle has also been identified in the genome of Thermus sp. NMX2.A1 although CO2 fixation through this pathway has not been confirmed in this organism (Müller et al., 2016).

A complete electron transport chain is predicted in G. occulta genome, namely NADH dehydrogenase (EC 1.6.5.3, Gocc_2267 to Gocc_2280), succinate dehydrogenase (EC 1.3.5.1, Gocc_1693 to Gocc_1695), cytochrome bc1 (EC 1.10.2.‐, Gocc_3013 to Gocc_3015), and cytochrome c oxidase (EC 1.9.3.1, Gocc_0001, Gocc_0002, Gocc_0004). Homologs of G. occulta NADH dehydrogenase, succinate dehydrogenase, cytochrome bc1 and cytochrome c oxidase genes were identified in the genomes of S. soli and T. album. Both G. occulta, S. soli, and T. album succinate dehydrogenase complex seems to lack the gene sdhD, which codes for the anchor subunit D, although the absence of this gene is not unusual (Horsefield, Iwata, & Byrne, 2004). In the genome of R. radiotolerans, we could not identify homologs for the following genes, NADH dehydrogenase nuoE, nuoF and nuoG, succinate dehydrogenase sdhC, and cytochrome bc1 cytochrome‐c subunit. The ATPase of the type strains examined here are of the common bacterial F‐type.

n‐Alkanes with 13–20 carbons in length are utilized for growth of T. album (Zarilla & Perry, 1984). Genes involved in alkane degradation were identified in the T. album genome, namely an AlkG2‐type rubredoxin (WP_093117361.1), a putative flavin‐containing monooxygenase (EC 1.14.13.8, WP_093115507.1), and NAD(P)/FAD‐dependent oxidoreductase (WP_093116100.1), which shares 47.8% aminoacid sequence identity with Acinetobacter sp. almA (A5H9N6), involved in the degradation of long‐chain n‐alkanes (Rojo, 2009; Smits, Witholt, & van Beilen, 2003; Van Beilen et al., 2002). Homologs of T. album alkane‐degradation genes were also identified in S. soli. Genes for the metabolism of alkanes were not identified in the genome of R. radiotolerans, where alkane utilization was not examined, or in G. occulta, that does not utilize alkanes as growth sources (Albuquerque et al., 2011).

Genes coding for the uptake and reduction of nitrate were identified in the genomes of G. occulta, R. radiotolerans, and S. soli, namely the MFS‐type nitrate/nitrite transporter (narK/nasA, Gocc_2854) and the respiratory narGHIJ nitrate reductase complex (EC 1.7.5.1, Gocc_2855 to Gocc_2859), although the reduction of nitrate to nitrite was not observed in S. soli (Kim et al., 2007). Genes coding for the two‐component system transduction pathway NarX/NarL identified in G. occulta (Gocc_1932 and Gocc_1933) were not identified in S. soli genome, but nitrate reductase expression does not seem to be altered by the absence of narX (Laub & Goulian, 2007; Sohaskey & Wayne, 2003). The sox genes, as well as other genes involved in sulfite oxidation/sulfate reduction, namely adenylylsulfate reductase (EC 1.8.99.2), dissimilatory sulfite reductase (EC 1.8.99.5) or sulfite dehydrogenase (cytochrome) (EC 1.8.2.1), were not identified in the G. occulta genome sequence or the other deeply branching Actinobacteria whose genome has been sequenced, precluding the utilization of reduced sulfur compounds as electron donors.

Gaiella occulta uses fructose, glucose, mannose, ribose, xylose, and myo‐inositol as single carbon source for growth (Albuquerque et al., 2011). The genes for ABC transport systems for ribose (Gocc_0079 to Gocc_0081; Gocc_2298 to Gocc_2300; Gocc_3068, Gocc_3069 and Gocc_3071), rhamnose (Gocc_0123 to Gocc_0125; Gocc_0231 to Gocc_0234), D‐xylose (Gocc_1293 to Gocc_1295), and raffinose/stachyose/melibiose (Gocc_1613 to Gocc_1616) were identified in G. occulta genome. Four unspecific sugar transport systems (Gocc_0123 to Gocc_0125, Gocc_0222 to Gocc_0224, Gocc_2321 to Gocc_2323 and Gocc_3042 to Gocc_3044), that may be involved in the transport of glucose, fructose, mannose, and myo‐inositol, were also identified. No PTS‐type transporters were identified in G. occulta genome. By in large, S. soli and R. radiotolerans have similar ABC transporters for sugars (Egas et al., 2014).

3.3.2. Stress response

Gaiella occulta genome sequence has the key enzymes for the main DNA repair mechanisms, except for the mismatch repair pathway. Genes mutS and mutL were not encountered, as in many Actinobacteria and Archaea (Castañeda‐García et al., 2017). A gene coding for the endonuclease NucS (Gocc_1770) was identified, suggesting G. occulta may use the noncanonical mismatch repair pathway described recently for Mycobacterium smegmatis and Streptomyces coelicolor (Castañeda‐García et al., 2017). Thermoleophilum album and S. soli may also use this alternative mismatch repair pathway as they also lack homologs of mutS and mutL and have a nucS homolog.

Rubrobacter xylanophilus and R. radiotolerans accumulate the compatible solutes mannosylglycerate, trehalose, and low levels of di‐myo‐inositol‐phosphate generally involved in osmotic adapation in (hyper)thermophilic organisms that in the Rubrobacter spp. are constitutively accumulated (Empadinhas et al., 2007; Nobre, Alarico, Fernandes, Empadinhas, & da Costa, 2008). Genes coding for enzymes involved in the synthesis of mannosylglycerate or di‐myo‐inositol‐phosphate were not identified in G. occulta. In R. radiotolerans, trehalose synthesis can proceed via four pathways namely TpS/TpP, TreS, TreT, and TreY/TreZ (Egas et al., 2014; Nobre et al., 2008). In G. occulta only the genes coding for TpS/TpP and TreS were detected in the genome (tpS, EC 2.4.1.15, Gocc_2203, tpP, EC 3.1.3.12, Gocc_2154, treS, EC 5.4.99.16, Gocc_0097). The genes tpS and tpP were also identified in the genomes of S. soli and T. album.

Several genes involved in reactive oxygen species (ROS) detoxification in R. radiotolerans were identified in the genome of G. occulta, namely, two genes encoding peroxiredoxins (EC 1.11.1.15, Gocc_0321, Gocc_2238), which reduce hydrogen peroxide to water, genes coding for thioredoxin TrxA (Gocc_1414), thioredoxin reductase TrxB (EC 1.8.1.9, Gocc_2333), and glutaredoxin GrxC (Gocc_0876), involved in redox balance, three genes coding for LysR transcriptional regulators (Gocc_1704, Gocc_1792 and Gocc_0289) and one ABC‐type Mn2+/Zn2+ transport system (Gocc_1485, Gocc_1486) (Egas et al., 2014). In T. album genome we could not identify lysR transcriptional regulators or ABC‐type Mn2+/Zn2+ transport system genes.

4. CONCLUSIONS

The genome of G. occulta, the sole representative of the order Gaiellales, was sequenced, analyzed and compared to the existing genomes of the most closely related actinobacterial deep‐branching species R. radiotolerans, S. soli, and T. album. The main objective was to compare the results of the genome sequence analysis with the phenotypic characteristics of these organisms. Despite the phylogenetic distances between these strains obtained by 16S rRNA gene sequence analysis and different phenotypic characteristics, genome sequence analysis showed that many characteristics were shared among these organisms. With the exception of T. album, the genome analysis of the three other strains were similar with respect to the metabolism of hexoses and the central metabolism in general. We conclude that T. album cannot use sugars for lack of transporters and the lack of a fructose‐bisphosphate aldolase gene for metabolism of sugars. The genome of G. occulta and S. soli appear to predict CO2 fixation via the CBB cycle, however, CO2 fixation could not be proven. The type strain of T.album has genes that predict the hydrolysis of n‐alkanes and is known to be able to use only these substrates for growth. S. soli also possesses genes for the degradation of n‐alkanes, but growth on these substrates were not examined in this organism. Gaiella occulta does not possess the homologs found in T. album and does not grow on n‐alkanes. The results obtained in this study indicate that all organisms appear to be strict chemoorganotrophs and, for the most part, corroborate the phenotypes of these strains.

CONFLICT OF INTERESTS

None declared.

AUTHORS CONTRIBUTION

L. A. grew the organism, and extracted DNA. C. B. sequenced G. occulta genome. H. J. C. F. assembled, annotated and compared the genome sequence to other genome sequences available. A. L.‐d.‐C. performed Transmission Electron Microscopy. R. S. conducted anaerobic growth experiments and analyzed metabolic pathways. R. S., C. E. and M. S. d. C. wrote the manuscript.

ETHICS STATEMENT

None required.

ACKNOWLEDGEMENTS

We would like to thank Fred Rainey, University of Alaska Anchorage USA, for the phylogenetic analysis. This work was supported by HORIZON 2020, under the project Metafluidics, ref. 685474. This work was also supported by FEDER funds through the Operational Programme Competitiveness Factors—COMPETE 2020 and national funds by FCT—Foundation for Science and Technology under the strategic project UID/NEU/04539/2013. H. J. C. F. was supported by the GenomePT project (POCI‐01‐0145‐FEDER‐022184), with funds from COMPETE 2020—Operational Programme for Competitiveness and Internationalisation (POCI), Lisboa, Portugal Regional Operational Programme (Lisboa2020), Algarve Portugal Regional Operational Programme (CRESC Algarve2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), and by Fundação para a Ciência e a Tecnologia (FCT).

Severino R, Froufe HJC, Barroso C, et al. High‐quality draft genome sequence of Gaiella occulta isolated from a 150 meter deep mineral water borehole and comparison with the genome sequences of other deep‐branching lineages of the phylum Actinobacteria . MicrobiologyOpen. 2019;8:e840 10.1002/mbo3.840

DATA ACCESSIBILITY

The genome sequence of G. occulta F2‐233T is publicly available in the SRA under the accession number SRR7537062 and the genome assembly under the accession number QQZY00000000.

REFERENCES

  1. Albuquerque, L. , França, L. , Rainey, F. A. , Schumann, P. , Nobre, M. F. , & da Costa, M. S. (2011). Gaiella occulta gen. nov., sp. nov., a novel representative of a deep branching phylogenetic lineage within the class Actinobacteria and proposal of Gaiellaceae fam. nov. and Gaiellales ord. nov. Systematic and Applied Microbiology, 34(8), 595–599. 10.1016/j.syapm.2011.07.001 [DOI] [PubMed] [Google Scholar]
  2. Anonymous . (2009). Directive 2009/54/EC of the European Parliament and of the Council of 18 June 2009 on the exploitation and marketing of natural mineral water. Official Journal of the European Union, L164, 45–58. [Google Scholar]
  3. Bankevich, A. , Nurk, S. , Antipov, D. , Gurevich, A. A. , Dvorkin, M. , Kulikov, A. S. , … Pevzner, P. A. (2012). SPAdes: A new genome assembly algorithm and its applications to single‐cell sequencing. Journal of Computational Biology, 19(5), 455–477. 10.1089/cmb.2012.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Berman, T. , & Magasanik, B. (1966). The pathway of myo‐inositol degradation in Aerobacter aerogenes . The Journal of Biological Chemistry, 241, 800–806. [PubMed] [Google Scholar]
  5. Bolger, A. M. , Lohse, M. , & Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics (Oxford, England), 30(15), 2114–2120. 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Castañeda‐García, A. , Prieto, A. I. , Rodríguez‐Beltrán, J. , Alonso, N. , Cantillon, D. , Costas, C. , … Blázquez, J. (2017). A non‐canonical mismatch repair pathway in prokaryotes. Nature Communications, 8, 14246 10.1038/ncomms14246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Contreras‐Moreira, B. , & Vinuesa, P. (2013). GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis. Applied and Environmental Microbiology, 79(24), 7696–7701. 10.1128/AEM.02411-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Croville, J. L. , Cantet, J. , & Saby, S. (2011). Water treatments In Dege N. (Ed.), Technology of bottled water (3rd ed., pp. 141–174). Wiley‐Blackwell. [Google Scholar]
  9. Edgar, R. C. (2010). Search and clustering orders of magnitude faster than BLAST. Bioinformatics, 26(19), 2460–2461. 10.1093/bioinformatics/btq461 [DOI] [PubMed] [Google Scholar]
  10. Egas, C. , Barroso, C. , Froufe, H. J. C. , Pacheco, J. , Albuquerque, L. , & da Costa, M. S. (2014). Complete genome sequence of the Radiation‐Resistant bacterium Rubrobacter radiotolerans RSPS‐4. Standards in Genomic Sciences, 9(3), 1062–1075. 10.4056/sigs.5661021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Empadinhas, N. , Mendes, V. , Simões, C. , Santos, M. S. , Mingote, A. , Lamosa, P. , … Costa, M. S. d. (2007). Organic solutes in Rubrobacter xylanophilus: The first example of di‐myo‐inositol‐phosphate in a thermophile. Extremophiles, 11(5), 667–673. 10.1007/s00792-007-0084-z [DOI] [PubMed] [Google Scholar]
  12. Foesel, B. U. , Geppert, A. , Rohde, M. , & Overmann, J. (2016). Parviterribacter kavangonensis gen. nov., sp. nov. and Parviterribacter multiflagellatus sp. nov., novel members of Parviterribacteraceae fam. nov. within the order Solirubrobacterales, and emended descriptions of the classes Thermoleophilia and Rubrobacteria and their orders and families. International Journal of Systematic and Evolutionary Microbiology, 66(2), 652–665. 10.1099/ijsem.0.000770 [DOI] [PubMed] [Google Scholar]
  13. França, L. , Albuquerque, L. , & da Costa, M. S. (2015). Cavicella subterranea gen. nov., sp. nov., isolated from a deep mineral‐water aquifer, and emended description of the species Perlucidibaca piscinae . International Journal of Systematic and Evolutionary Microbiology, 65(11), 3812–3817. 10.1099/ijsem.0.000493 [DOI] [PubMed] [Google Scholar]
  14. Franca, L. , Albuquerque, L. , Sanchez, C. , Farelaira, P. , & da Costa, M. S. (2017). Ampullimonas aquatilis gen. nov., sp. nov. isolated from bottled mineral water. International Journal of Systematic and Evolutionary Microbiology, 66, 1459–1465. 10.1099/ijsem.0.000903 [DOI] [PubMed] [Google Scholar]
  15. França, L. , Lopéz‐Lopéz, A. , Rosselló‐Móra, R. , & da Costa, M. S. (2015). Microbial diversity and dynamics of a groundwater and a still bottled natural mineral water. Environmental Microbiology, 17(3), 577–593. 10.1111/1462-2920.12430 [DOI] [PubMed] [Google Scholar]
  16. Guillot, E. , & Leclerc, H. (1993). Biological specificity of bottled natural mineral waters: Characterization by ribosomal ribonucleic acid gene restriction patterns. Journal of Applied Bacteriology, 75(3), 292–298. 10.1111/j.1365-2672.1993.tb02779.x [DOI] [PubMed] [Google Scholar]
  17. Hahnke, S. M. , Moosmann, P. , Erb, T. J. , & Strous, M. (2014). An improved medium for the anaerobic growth of Paracoccus denitrificans Pd1222. Frontiers in Microbiology, 5(18), 1–9. 10.3389/fmicb.2014.00018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Horsefield, R. , Iwata, S. , & Byrne, B. (2004). Complex II from a structural perspective. Current Protein & Peptide Science, 5(2), 107–118. 10.2174/1389203043486847 [DOI] [PubMed] [Google Scholar]
  19. Kawsar, H. I. , Ohtani, K. , Okumura, K. , Hayashi, H. , & Shimizu, T. (2004). Organization and transcriptional regulation of myo‐inositol operon in Clostridium perfringens. FEMS Microbiology Letters, 235(2), 289–295. 10.1016/j.femsle.2004.04.047 [DOI] [PubMed] [Google Scholar]
  20. Kim, M. K. , Na, J.‐R. , Lee, T.‐H. , Im, W.‐T. , Soung, N.‐K. , & Yang, D.‐C. (2007). Solirubrobacter soli sp. nov., isolated from soil of a ginseng field. International Journal of Systematic and Evolutionary Microbiology, 57, 1453–1455. 10.1099/ijs.0.64715-0 [DOI] [PubMed] [Google Scholar]
  21. Lagesen, K. , Hallin, P. , Rødland, E. A. , Stærfeldt, H. H. , Rognes, T. , & Ussery, D. W. (2007). RNAmmer: Consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Research, 35(9), 3100–3108. 10.1093/nar/gkm160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Laub, M. T. , & Goulian, M. (2007). Specificity in two‐component signal transduction pathways. Annual Review of Genetics, 41(1), 121–145. 10.1146/annurev.genet.41.042007.170548 [DOI] [PubMed] [Google Scholar]
  23. Leandro, T. , França, L. , Nobre, M. F. , Rainey, F. A. , & da Costa, M. S. (2013). Heliimonas saccharivorans gen. nov., sp. nov., a member of the family Chitinophagaceae isolated from a mineral water aquifer, and emended description of Filimonas lacunae . International Journal of Systematic and Evolutionary Microbiology, 63, 3793–3799. 10.1099/ijs.0.050021-0 [DOI] [PubMed] [Google Scholar]
  24. Leandro, T. , França, L. , Nobre, M. F. , Schumann, P. , Rosselló‐Móra, R. , & da Costa, M. S. (2012). Nevskia aquatilis sp. nov. and Nevskia persephonica sp. nov., isolated from a mineral water aquifer and the emended description of the genus Nevskia . Systematic and Applied Microbiology, 35(5), 297–301. 10.1016/j.syapm.2012.05.001 [DOI] [PubMed] [Google Scholar]
  25. Manske, C. , Schell, U. , & Hilbi, H. (2016). Metabolism of myo‐inositol by Legionella pneumophila promotes infection of amoeba and macrophages. Applied and Environmental Microbiology, 82(16), 5000–5014. 10.1128/AEM.01018-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Morais, P. V. , & da Costa, M. S. (1990). Alterations in the major heterotrophic bacterial populations isolated from a still bottled mineral water. Journal of Applied Bacteriology, 69(5), 750–757. 10.1111/j.1365-2672.1990.tb01572.x [DOI] [PubMed] [Google Scholar]
  27. Müller, W. J. , Tlalajoe, N. , Cason, E. D. , Litthauer, D. , Reva, O. , Brzuszkiewicz, E. , & van Heerden, E. (2016). Whole Genome Comparison of Thermus sp. NMX2.A1 Reveals Principal Carbon Metabolism Differences with Closest Relation Thermus scotoductus SA‐01. G3: GENES, GENOMES, GENETICS, 6(9), 2791–2797. 10.1534/g3.116.032953 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Nielsen, P. , Fritze, D. , & Priest, F. G. (1995). Phenetic diversity of alkaliphilic Bacillus strains: Proposal for nine new species. Microbiology, 141(7), 1745–1761. 10.1099/13500872-141-7-1745 [DOI] [Google Scholar]
  29. Nobre, A. , Alarico, S. , Fernandes, C. , Empadinhas, N. , & da Costa, M. S. (2008). A unique combination of genetic systems for the synthesis of trehalose in Rubrobacter xylanophilus: Properties of a rare actinobacterial TreT. Journal of Bacteriology, 190(24), 7939–7946. 10.1128/JB.01055-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Parks, D. H. , Imelfort, M. , Skennerton, C. T. , Hugenholtz, P. , & Tyson, G. W. (2015). CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Research, 25(7), 1043–1055. 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Rojo, F. (2009). Degradation of alkanes by bacteria: Minireview. Environmental Microbiology, 11(10), 2477–2490. 10.1111/j.1462-2920.2009.01948.x [DOI] [PubMed] [Google Scholar]
  32. Shi, J. , Jiang, Y. , Jiang, Z. , Wang, X. , Wang, X. , Zhang, S. , … Yang, C. (2015). Enzymatic conversion of carbon dioxide. Chemical Society Reviews, 44, 5981–6000. 10.1039/c5cs00182j [DOI] [PubMed] [Google Scholar]
  33. Smits, T. H. M. , Witholt, B. , & van Beilen, J. B. (2003). Functional characterization of genes involved in alkane oxidation by Pseudomonas aeruginosa . Antonie Van Leeuwenhoek, International Journal of General and Molecular Microbiology, 84(3), 193–200. 10.1023/A:1026000622765 [DOI] [PubMed] [Google Scholar]
  34. Sohaskey, C. D. , & Wayne, L. G. (2003). Role of narK2X and narGHJI in Hypoxic Upregulation of Nitrate Reduction by Mycobacterium tuberculosis . Journal of Bacteriology, 185(24), 7247–7256. 10.1128/JB.185.24.7247-7256.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Vachée, A. , Mossel, D. , & Leclerc, H. (1997). Antimicrobial activity among Pseudomonas and related strains of mineral water origin. Journal of Applied Microbiology, 83(5), 652–658. 10.1046/j.1365-2672.1997.00274.x [DOI] [PubMed] [Google Scholar]
  36. Van Beilen, J. B. , Neuenschwander, M. , Smits, T. H. M. , Roth, C. , Balada, S. B. , & Witholt, B. (2002). Rubredoxins involved in alkane oxidation. Journal of Bacteriology, 184(6), 1722–1732. 10.1128/JB.184.6.1722-1732.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Warburton, D. W. (1993). A review of the microbiological quality of bottled water sold in Canada. Part 2. The need for more stringent standards and regulations. Canadian Journal of Microbiology, 39(2), 158–168. 10.1139/m93-022 [DOI] [PubMed] [Google Scholar]
  38. Xie, C. , Mao, X. , Huang, J. , Ding, Y. , Wu, J. , Dong, S. , … Wei, L. (2011). KOBAS 2.0: A web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Research, 39, W316–W322. 10.1093/nar/gkr483 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Yoshida, K. I. , Yamaguchi, M. , Morinaga, T. , Kinehara, M. , Ikeuchi, M. , Ashida, H. , & Fujita, Y. (2008). myo‐Inositol Catabolism in Bacillus subtilis . Journal of Biological Chemistry, 283(16), 10415–10424. 10.1074/jbc.M708043200 [DOI] [PubMed] [Google Scholar]
  40. Zarilla, K. A. , & Perry, J. J. (1984). Thermoleophilum album gen. nov. and sp. nov., a bacterium obligate for thermophily and n‐alkane substrates. Archives of Microbiology, 137(4), 286–290. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The genome sequence of G. occulta F2‐233T is publicly available in the SRA under the accession number SRR7537062 and the genome assembly under the accession number QQZY00000000.


Articles from MicrobiologyOpen are provided here courtesy of Wiley

RESOURCES