Skip to main content
G3: Genes | Genomes | Genetics logoLink to G3: Genes | Genomes | Genetics
. 2019 May 31;9(8):2489–2496. doi: 10.1534/g3.119.400123

Systematic Analysis of Metabolic Pathway Distributions of Bacterial Energy Reserves

Liang Wang *,†,1, Jianye Yang *, Yue Huang *, Qinghua Liu , Yaping Xu *, Xue Piao *,, Michael J Wise §,**
PMCID: PMC6686919  PMID: 31151997

Abstract

Previous bioinformatics studies have linked gain or loss of energy reserves with host-pathogen interactions and bacterial virulence based on a comparatively small number of bacterial genomes or proteomes. Thus, understanding the theoretical distribution patterns of energy reserves across bacterial species could provide a shortcut route to look into bacterial lifestyle and physiology. So far, five major energy reserves have been identified in bacteria due to their capacity to support bacterial persistence under nutrient deprivation conditions. These include polyphosphate (polyP), glycogen, wax ester (WE), triacylglycerol (TAG), and polyhydroxyalkanoates (PHAs). Although the enzymes related with metabolism of energy reserves are well understood, there is a lack of systematic investigations into the distribution of bacterial energy reserves from an evolutionary point of view. In this study, we sourced 8282 manually reviewed bacterial reference proteomes and combined a set of hidden Markov sequence models (HMMs) to search homologs of key enzymes related with the metabolism of energy reserves. Our results revealed that specific pathways like trehalose-related glycogen metabolism and enzymes such as wax ester synthase/acyl-CoA:diacylglycerol acyltransferase (WS/DGAT) are mainly restricted within specific types of bacterial groups, which provides evolutionary insights into the understanding of their origins and functions. In addition, the study also confirms that loss of energy reserves like polyP metabolism absence in Mollicutes is correlated with bacterial genome reduction. Through this analysis, a clearer picture about the metabolism of energy reserves in bacteria is presented, which could serve as a guide for further theoretical and experimental analyses of bacterial energy metabolism.

Keywords: Energy reserve, Hidden Markov model, Evolution, Proteome, Metabolic pathway


Due to the diversity of environmental niches that bacteria have colonized through millions of years of adaptation and evolution, bacteria have evolved specialized sets of metabolic pathways to live optimally in these environments, which can be reflected in their characteristic genomes, gene transcription profiles, and proteomes (Zhu et al. 2016). Previously, comparative genomics studies have shown that the loss of glycogen metabolism could serve as an indicator for bacterial parasitic lifestyle while the gain of polyphosphate (polyP) metabolism seems to link with free-living lifestyle and higher bacterial virulence (Wang et al. 2018; Henrissat et al. 2002). In addition, it has been observed that loss of glycogen or polyP metabolism is associated with reduced genome size, providing a hint about genome reduction (Wang and Wise 2011; Wang et al. 2018). It is well known that both glycogen and polyP are important energy sources in bacteria. Thus, presence or absence of energy reserves in bacteria could be important for in silico analysis of bacterial physiology and lifestyle, especially when large numbers of sequenced bacterial genomes are available and many of those are unculturable by traditional laboratory techniques.

It is well known now that energy reserves play essential roles in bacteria for their regular activities to sense and respond to changing environments and different types of stresses, such as temperature fluctuation and nutrient deprivation, etc. (Wang and Wise 2011). Although there are many different energy-related compounds, not all of them can be classified as energy reserves. According to Wilkinson, three principles should be satisfied for a compound to be considered as an energy reserve, which are: 1) accumulation when energy is over-supplied, 2) utilization when energy is insufficient, and 3) apparent advantages by consuming the compound when compared with those without it (Wilkinson 1959). Through physiological and biochemical studies, five energy storage compounds have been regarded to meet the criteria, which are wax ester (WE), triacylglycerol (TAG), polyhydroxyalkanoates (PHAs), polyphosphates (polyP) and glycogen (Wang et al. 2017; Wang and Wise 2011; Wang et al. 2018; Wilkinson 1959).

Several studies have investigated the distribution patterns of energy reserves in bacteria, most of which were based on small sets of bacterial genomes or proteomes. No comprehensive and systematic phylogenetic analysis currently exists (Zhang et al. 2002; Whitehead et al. 2014; Achbergerová and Nahálka 2014; Kalscheuer and Steinbüchel 2003; Wang and Wise 2011; Wang et al. 2018). In this study, we collected 8282 manually reviewed bacterial proteomes from UniProt database and sourced key enzymes directly involved in the metabolism of five energy reserves from public literature and database (Table 1) (The UniProt Consortium 2017). Hidden Markov sequence models were used for searching homologs in bacterial proteomes. Distribution patterns of representative metabolic pathways of the fiver major reserves are presented in Table S1. In order to gain an explicit view about distributions of the enzymes along the evolutionary paths, we incorporated the enzyme distributions into phylogenetic trees constructed via NCBI taxonomy identifiers (Federhen 2011). Through a combinational analysis of the pathways in the phylogenetic trees, we have identified interesting distribution patterns of metabolic pathways that are linked with bacterial groups specific lifestyles, which may improve our understanding of the functions of energy reserves in bacteria. In addition, our systematic analysis also gives us an overview of enzyme distributions, which could serve a as guide for further theoretical and experimental analyses of energy reserves in bacteria.

Table 1. Key enzymes and corresponding UniProt sequences used in this study for statistical modeling via HMMER package.

Reference Species Gene Enzyme Length UniProt ID Pfam Domain ID
Escherichia coli ppk1 Polyphosphate kinase 688 E7QTB5# PF02503, PF13090, PF17941, PF13089
Mycobacterium tuberculosis ppk2 Polyphosphate kinase 2 295 O05877 PF03976
Escherichia coli ppx Ppx/GppA phosphatase 513 P0AFL6 PF02541
Escherichia coli glgC Glucose-1-phosphate adenylyltransferase 431 P0A6V1 PF00483
Escherichia coli glgA Glycogen synthase 477 P0A6U8 PF08323, PF00534
Thermococcus kodakaraensis glgB* Apha-1,4-glucan branching enzyme (GH57) 675 Q5JDJ7# PF09210, PF03065, PF14520
Escherichia coli glgB Alpha-1,4- glucan branching enzyme (GH13) 728 P07762# PF02922, PF00128, PF02806
Mycobacterium tuberculosis treS Trehalose synthase/amylase 601 P9WQ19! PF00128, PF16657
Mycobacterium tuberculosis pep2 Maltokinase 455 Q7DAF6! PF18085
Mycobacterium tuberculosis glgE Alpha-1,4-glucan: maltose-1-phosphate Maltosyltransferase 701 P9WQ17! PF00128, PF11896
Mycobacterium tuberculosis Rv3032 Glycogen synthase 414 P9WMY9 PF13439, PF00534
Cupriavidus necator phaA Acetyl-CoA acetyltransferase 246 P14611 PF02803, PF00108
Cupriavidus necator phaB Acetoacetyl-CoA reductase 393 P14697 PF00106
Allochromatium vinosum phaC Group 1& Poly(3-hydroxyalkanoate) polymerase subunit C 355 P45370 PF00561
Pseudomonas aeruginosa phaC Group 2& Class II poly(R)-hydroxyalkanoic acid synthase 559 Q51513 PF07167
Escherichia coli fabG 3-Oxoacyl-[acyl-carrier-protein] reductase 244 P0AEK2 PF13561
Pseudomonas aeruginosa phaJ (R)-Enoyl-CoA hydratase/enoyl-CoA hydratase I 156 Q9LBK2 PF01575
Escherichia coli fabD Malonyl CoA-acyl carrier protein transacylase 209 P0AAI9 PF00698
Clostridium kluyveri sucD Succinic semialdehyde dehydrogenase 453 P38947 PF00171
Clostridium kluyveri 4hbD NAD-dependent 4-hydroxybutyrate dehydrogenase 371 P38945 PF00465
Clostridium kluyveri orfZ 4-Hydroxybutyrate CoA-transferase 437 A0A1L5FD42 PF02550, PF13336
Saccharomyces cerevisiae PDAT^ Phospholipid: diacylglycerol acyltransferase 661 P40345 PF02450
Acinetobacter baylyi wax-dgaT Wax Ester Synthase/Acyl Coenzyme A: Diacylglycerol Acyltransferase 458 Q8GGG1 PF06974, PF03007
*

Archaeal type of glycogen branching enzyme (encoded by glgB gene) belonging to GH57 class in CATH database. #Hidden Markov models of protein sequences with more than 2 non-redundant domains were constructed from scratch.! HMMs of TreS, Pep2, and GlgE were constructed from scratch for full sequence models due to the existence of very short PFAM domains (Malt_amylase_C and Mak_N_cap) or unknown function PFAM domain (DUF3416) in their sequences. &phaC Group 1 with PFAM domain Abhydrolase_1 (PF00561) and Group 2 with PFAM domain PhaC_N (PF07167) should not be confused with the four types of PHA synthases that are classified based on primary sequences, substrate specificity, and subunit composition. ^TAG synthesis enzyme PDAT dominantly present in eukaryotes and only 42 out of 8282 bacterial proteomes show single or double homologs of the enzyme.

Materials and Methods

Proteomes and enzymes collection

Bacterial proteomes were downloaded from UniProt database by using two keywords, Bacteria and Reference Proteomes, as filters (The UniProt Consortium 2017). A total of 8282 bacterial proteomes were collected, reflecting the state of knowledge as at 2018. Of these, 68 proteomes were removed due to outdated taxonomy identifiers that cannot be identified in NCBI taxonomy database when constructing phylogenetic trees (Federhen 2011). A complete list of all the 8282 bacteria with UniProt proteome identifiers, NCBI taxonomy identifiers, bacterial names, proteome sizes, and distribution patterns of key enzymes is available in the Table S1. For each of the five major energy reserves, only key enzymes were considered. For the synthesis of WE and TAG, the bifunctional enzyme wax ester synthase/acyl-CoA:diacylglycerol acyltransferase (WS/DGAT) was studied due to its pivotal role in these processes (Kalscheuer and Steinbüchel 2003). In addition, the enzyme phospholipid: diacylglycerol acyltransferase (PDAT) that catalyses the acyl-CoA-independent formation of triacylglycerol in yeast and plants were also screened in bacterial proteomes (Dahlqvist et al. 2000). For metabolism of polyP, the key enzyme polyphosphate kinase (PPK1) for synthesis and two degradation enzymes, intracellular polyphosphate kinase 2 (PPK2) and extracellular Ppx/GppA phosphatase (PPX), were included (Wang et al. 2018). For glycogen metabolism, two synthesis pathways were considered. The first one involves glucose-1-phosphate adenylyltransferase (GlgC), glycogen synthase (GlgA), and glycogen branching enzyme (GlgB) (Wang and Wise 2011). The second pathway includes trehalose synthase/amylase (TreS), maltokinase (Pep2), α-1,4-glucan: maltose-1-phosphate maltosyltransferase (GlgE), and glycogen branching enzyme (GlgB) (Chandra et al. 2011). Key enzyme Rv3032 for the elongation of α-1,4-glucan in another pathway relevant to glycogen metabolism and capsular glucan was included (Chandra et al. 2011). In addition, archaeal type GlgB belonging to the glycosyl hydrolase family 57 (GH57) was also investigated for comparative analysis due to its importance. As for PHAs, major enzymes responsible for synthesis such as Acetyl-CoA acetyltransferase (PhaA), acetoacetyl-CoA reductase (PhaB) and poly(3-hydroxyalkanoate) polymerase subunit C (PhaC) have been identified (Handrick et al. 2004; Verlinden et al. 2007). Based on Pfam analysis, PhaC could be further divided into two groups, PhaC Group 1 with PFAM domain Abhydrolase_1 (PF00561) and Group 2 with PFAM domain PhaC_N (PF07167). In addition, other enzymes in the PHAs synthesis pathway were also studied, which are 3-oxoacyl-[acyl-carrier-protein] reductase (FabG), (R)-Enoyl-CoA hydratase/enoyl-CoA hydratase I (PhaJ), malonyl CoA-acyl carrier protein transacylase (FabD), succinic semialdehyde dehydrogenase (SucD), NAD-dependent 4-hydroxybutyrate dehydrogenase (4HbD), and 4-hydroxybutyrate CoA-transferase (OrfZ) (Breuer 2010). Together with poly(3-hydroxyalkanoate) polymerase subunit C, these enzymes are able to synthesize PHAs in bacteria. All the enzymes are screened for presence and absence in collected bacterial proteomes. For details of these enzymes, please refer to Table 1.

De novo construction of HMMs

Each selected seed protein was used for building HMMs via HMMER package (Pearson and Eddy 2011). In specificity, after obtaining sequences for all seed proteins from UniProt database, remote BLAST was performed to collect homologous sequences for each seed protein from the NCBI non-redundant database of protein sequences (McGinnis and Madden 2004). Usearch was used to remove the homologous sequences with more than 98% similarity from the selected proteins (Edgar 2010). The standalone command-line version of MUSCLE was used so the multiple-sequence alignments were created automatically (Edgar 2004). N- and C-termini of multiple sequence alignments tend to be more inconsistent (Wise 2010). Thus, all MSAs were manually edited to remove these parts by using JalView (Waterhouse et al. 2009). HMMER was selected for the construction of HMMs through hmmbuild command. Since HMMER only recognizes STOCKHOLM format, all MSAs results were converted from FASTA to STOCKHOLM format. In addition, another set of HMM models sourced directly from PFAM database based on the domain structures of seed proteins were collected. For seed proteins with more than two domains, de novo constructed HMM models were used for homologous screening, while those with two or less domains, established PFAM domains were used. For searching homologs in bacterial proteomes, routine procedures were performed by following HMMER User’s Guide (eddylab.org/software/hmmer3/3.1b2/Userguide.pdf). Only those hits meeting the criteria of E-value less than 1e-10 and hit length greater than 60% of query domains will be considered. Results obtained from HMM screening can be found in Table S1, that is, the presence (copy numbers) or absence of a specific enzyme in a certain bacterial proteome.

Data visualization

Phylogenetic trees were first constructed based on NCBI taxonomy identifiers for all bacteria in this study via the commercial web server PhyloT (https://phylot.biobyte.de/), and were then visualized through the online interactive Tree of Life (iTOL) server (https://itol.embl.de/) (Letunic and Bork 2016). Distribution patterns of enzymes and their combinations in terms of energy reserves were added to the trees through iTOL pre-defined tol_simple_bar template (Letunic and Bork 2016).

Statistical analysis

Unpaired two-tailed Student’s t-test was used for statistical analysis through R package. Significant difference was defined as P-value < 0.05.

Data Availability Statement

Supplemental material available at https://gsajournals.figshare.com/s/08e2bf674f15b9704676. Supplemental material available at FigShare: https://doi.org/10.25387/g3.8138951.

Results

Wax ester and triacylglycerol

The key enzyme that is involved in both WE and TAG synthesis in bacteria is WS/DGAT. Through the screening of HMM-based statistical models, 673 out of 8282 bacterial species harbor a single copy or multiple copies of WS/DGAT homologs, which are mainly present in phylum Actinobacteria and the super-phylum Proteobacteria. Only a few bacteria in groups such as FCB (also known as Sphingobacteria) and PVC (also known as Planctobacteria), etc. have WS/DGAT. No species belonging to phylum Firmicutes has WS/DGAT. As for the unclassified bacteria, although no WS/DGAT was identified, they will not be studied due to their uncertain classification at current stage. For details, please refer to Figure 1A. By comparing the proteome sizes of bacteria species with or without WS/DGAT, we found that bacteria with WS/DGAT have average proteome size of 5294 proteins/proteome while those without WS/DGAT have average proteome size of 3117 proteins/proteome (P-value < 0.001).

Figure 1.

Figure 1

Distribution patterns of key enzymes and representative pathways along the NCBI taxonomy ID-based phylogenetic tree. (A) Distribution of key enzyme WS/DGAT (green circle) and PDAT (blue circle), together with bacterial proteome sizes (red circle); (B) distribution of the classical PHA synthesis pathway PhaABC (orange circle) and the two groups of PHA synthases, Group I with Abhydrolase_1 domain (blue circle) and Group II with PhaC_N domain (green circle); (C) distribution of key enzyme polyphosphate kinase PPK1 (blue circle), polyphosphate kinase 2 PPK2 (green circle), and exopolyphosphatase PPX (orange circle); (D) distribution of two glycogen synthesis pathways and four key enzymes, that is, classical synthesis pathway (yellow circle) including GlgC, GlgA, and GlgB, trehalose-based glycogen synthesis pathway including TreS, Pep2, GlgE, GlgB (black circle), glycogen synthases (Rv3032 in blue circle and GlgA in green circle) and glycogen branching enzymes (GH57 GlgB in orange circle and GH13 GlgB in violet circle). Five groups of bacteria are highlighted, which are Firmicutes (green), Actinobacteria (blue), Proteobacteria (orange), Unclassified Bacteria (violet), and FCB group (brown).

Within the major phylum of Proteobacteria, WS/DGAT is not evenly distributed and γ-, δ-, and ε-Proteobacteria sub-divisions have more species harboring WS/DGAT genes. In addition, two orders, Rhodobacterales (305 species) and Enterobacterales (168 species) that belong to α- and γ-Proteobacteria phylum, respectively, do not have any WS/DGAT except for one species Plesiomonas shigelloides 302-73 (NCBI taxonomy ID 1315976). As for the phylum Actinobacteria, two WS/DGAT abundant regions (R1 and R3) and one WS/DGAT absence region (R2) in the phylogenetic tree are worth further exploration. R1 region includes three closely related families with most species harboring WS/DGATs, which includes Streptomycetaceae, Pseudonocardiaceae, and Nocardioidaceae. R3 includes only one family Mycobacteriaceae (115 species) in which bacteria have up to 10 homologs of WS/DGAT. R4 is the family Corynebacteriaceae (69 species) that only includes WS/DGAT-free bacteria.

As for phospholipid:diacylglycerol acyltransferase (PDAT), it is involved in TAG storage in yeasts and plants. Recently, PDAT activity was proven in Streptomyces coelicolor for the first time (Arabolaza et al. 2008). However, no bacterial homologs were known yet with similarity to the respective eukaryotic sequences (Röttig et al. 2016). Our systematic screening of Saccharomyces cerevisiae PDAT unexpectedly identified 46 proteins belonging to 42 out of 8282 bacterial species (Table S2), 38 homologs belong to small genome-sized unclassified bacteria while 8 homologs were present in Terrabacteria group. Both E-values (>1E-10) and target sequence lengths (> 60% query sequence length) were also given to confirm the significance of the results. Thus, presence of eukaryotic PDAT in bacteria is theoretically true, though functionality of the enzymes requires further experimental validation. Although most of the sequences were annotated as uncharacterized proteins in the UniProt database, some were described as acyltransferases. Interestingly, the PDAT homolog in Clostridium butyricum E4 str. BoNT E BL5262 was thought to be a putative prophage LambdaBa01 acyltransferase, which indicated that the enzyme could jump around among species via horizontal gene transfer.

Polyhydroxyalkanoates

Although TAG and WE are a more common storage lipid in several groups of bacteria, the majority of all bacteria store PHA rather than TAG or WE (Rottig and Steinbuchel 2013). Three major enzymes (PhaA, PhaB and PhaC) involved in the synthesis of PHAs, forming the classical synthesis pathway and two enzymes (intracellular PhaZ and extracellular PhaZ) involved in the utilization of PHAs in bacteria. In this study, we mainly focused on the distribution patterns of PHA synthesis pathway PhaABC. PhaC has been further divided into four classes depending on substrate specificities and subunit compositions, represented by Cupriavidus necator (class I), Pseudomonas aeruginosa (class II), Allochromatium vinosum (class III), and Bacillus megaterium (class IV) (Rehm 2003). However, based on Pfam analysis, the four classes of PhaC could be included into two groups, PhaC Group 1 with PFAM domain Abhydrolase_1 (PF00561) and Group 2 with PFAM domain PhaC_N (PF07167), which were also thoroughly investigated for their distribution patterns. However, PhaE and PhaR as PhaC subunits were not be considered in this study. On the other hand, there are another four confirmed pathways for PHAs synthesis, which are 1) FabG, 2) PhaJ, 3) FabD, 4) SucD, 4HbD and OrfZ (Breuer 2010). All the enzymes are screened for presence and absence in collected bacterial proteomes (Table S1).

Preliminary analysis showed that 3166 bacterial species with average proteome size of 3772 proteins/proteome have PhaABC pathway while 1036 bacterial species with average proteome size of 1059 proteins/proteome do not have the pathway (P-value < 0.001). In general, PHA synthesis was widely distributed across bacterial species. It is interesting to notice that Group II PHA synthase is dominantly present in Proteobacteria Phylum while Group I PHA synthase is abundantly present in Actinobacteria Phylum. In addition, three representative regions R1, R2, and R3 with loss of the classical PHA synthesis pathway were highlighted, that is, Bacteroidia Class, Epsilon-Proteobacteria Subdivision Class, and Tenericutes Phylum. In these regions, two bacterial species, Lentimicrobium saccharophilum and Helicobacter sp. 13S00477-4 actually harbored the complete PhaABC pathway. Distribution patterns of the other four PHA synthesis pathways mentioned above were not visualized along phylogenetic tree since they were distributed with no featured patterns (Table S1). Please refer to Figure 1B for the distribution patterns of the studied enzymes and the pathway.

Polyphosphate

Three key enzymes PPK1, PPK2, and PPX are related with polyP metabolism. PPK1 is responsible for polyP synthesis. In this study, a total of 5273 bacterial species have PPK1. PPK2 and PPX are used for intracellular and extracellular polyP degradation, respectively. 2584 bacterial species have PPK1, PPK2 and PPX enzymes while 2211 bacteria species do not have any of the three enzymes. The average proteome sizes of the two groups of bacteria are 4571 proteins/proteome and 1615 proteins/proteome, respectively, which are significantly different (P-value < 0.001). In our analysis, we independently reviewed the distribution patterns of the three enzymes along phylogenetic trees and the result is displayed in Figure 1C. The three enzymes are widely distributed across bacterial species, which reflects the essentiality of the polymer in bacterial physiology. In addition, comparison shows that Firmicutes phylum seems to favor PPX more than PPK2 for polyP degradation. In addition, although it was observed that several regions had missing synthesis enzyme or degradation enzyme, only unclassified bacteria and Mollicutes class (94 bacterial species) showed apparent lack of the three polyP metabolism enzymes.

Glycogen

Glycogen metabolism in bacteria has multiple pathways, which include the classical pathway (GlgC, GlgA, GlgB, GlgP and GlgX) (Wang and Wise 2011), trehalose pathway (TreS, Pep2, GlgE, and GlgB), and the novel Rv3032 pathway (Wang et al. 2019). Rv3032 is an alternative enzyme for the elongation of α-1,4-glucan, which was compared for distribution patterns with GlgA. We also focused on the two glycogen synthesis pathways and compared their distribution patterns. In addition, there are two types of glycogen branching enzymes. One is the common bacterial GlgB, belonging to GH13 in CATH database, and the other one is known as archaeal GlgB, belonging to GH57 in CATH database (Orengo et al. 1997). We also looked into their distribution patterns in bacteria since GlgB is essential in determining the branched structure of glycogen. Our study showed that 3137 bacteria have the classical synthesis pathway (GlgC, GlgA, and GlgB) and their average proteome size is 3966 proteins/proteome while only 510 bacterial species (average proteome size of 1120 proteins/proteome) do not have these enzymes (P-value < 0.001). Comparison of the two synthesis pathways confirmed that full classical glycogen synthesis pathway (yellow circle in Figure 1D) distributes widely in bacteria except for Actinobacteria phylum, FCB group and unclassified bacteria group. Random loss of the classical pathway can also be spotted. In contrast, trehalose pathway is tightly associated with Actinobacteria phylum. It was also shown that Rv3032 was actually more widespread than GlgA. As for the two GlgBs, GH13 GlgB is widely distributed in 4500 bacterial species with a trend of random loss, while GH57 GlgB is identified in only 785 bacterial species that are mainly fall into Terrabacteria Group and PVC group, etc. In addition, GH57 GlgB is also identified in PVC group, Spirochaetes, Acidobacteria, Fusobacteria, Thermotogae, Nitrospirae, Aquificae, Synergistetes, Elusimicrobia, Nitrospinae/Tectomicrobia group, Thermodesulfobacteria, Rhodothermaeota, and Dictyoglomi.

Discussion

From an evolutionary point of view, if an organism can obtain compounds from other sources, it would tend to discard the corresponding biosynthetic pathways (Moran 2002). For example, strictly intracellular bacteria such as Rickettsia species, Mycoplasma species, and Buchnera, etc. have extensively reduced genome sizes and common pathways relevant to energy metabolism have been eliminated (Moran 2002). Although a common belief is that organism should evolve toward high complexity, recent analysis reported that reduction and simplification could be the dominant mode of evolution while increasing complexity is just a transitional stage according to the neutral genetic loss and streamlining hypothesis (Wolf and Koonin 2013). Independent analyses of the distribution patterns of the five energy reserves in bacteria found a consistent and statistically significant correlation between energy reserve loss and reduced proteome size, as shown in the results. Previous studies also reported this correlation in terms of glycogen and polyP metabolism in bacteria (Wang and Wise 2011; Wang et al. 2018). In this study, we extended the conclusion by adding the neutral lipid reserves of WE, TAG, and PHA. It was also confirmed that bacteria losing energy reserve metabolism capacity tended to have a niche-dependent or host-dependent lifestyle (Wang and Wise 2011; Wang et al. 2017; Wang et al. 2018). Thus, by looking into bacterial energy reserve metabolism, we could obtain preliminary views in terms of bacterial lifestyles, though other evidence is required to verify this hypothesis. It is worth mentioning that proteomes that we used in this study were derived from translated coding genes, not based on condition-specific expression of proteins (The UniProt Consortium 2017). Thus, there is no bias in protein coverage or gene expression level. From this point of view, this equivalent to bacterial genome analysis, except for that HMM models perform much better for protein sequences than DNA sequences in terms of remote homolog identification. It is also worth mentioning that by comparing with other tools such as BLAST for ortholog identification, HMM provides more accuracy with less bias because it is a statistical sequence model built on the alignments of multiple homologous sequences (Pearson and Eddy 2011).

WS/DGAT is a bifunctional enzyme and key to the biosynthesis of WE and TAG in bacteria. It was previously thought that WE and TAG are very uncommon lipid storage compounds in bacteria when compared with plants and animals, until this novel enzyme was identified (Kalscheuer and Steinbüchel 2003). From our analysis, it could be seen that many bacteria belonging to both Gram-positive and Gram-negative categories have the potential to synthesize WE and TAG. However, studies about WS/DGAT are mainly restricted to Mycobacteria genus (Actinobacteria phylum) and Acinetobacter genus (γ-Proteobacteria phylum) due to their clinical significance and potentially industrial use. WS/DGAT genes in the phylum Actinobacteria tend to have more paralogs than other phyla, especially for the bacteria in the R1 and R3 regions, which include Mycobacteriaceae, Dietziaceae, Gordoniaceae, Nocardiaceae, Tsukamurellaceae, Williamsiaceae, Nocardioidaceae and Pseudonocardiales. On the other hand, no WS/DGAT is found in the family of Corynebacteriaceae (R2 region), although Corynebacteriaceae is closely related with Mycobacteriacea (Pukall 2014). In addition, bacteria in phylum Firmicutes do not have any WS/DGAT enzymes. Screening of Phospholipid:diacylglycerol acyltransferase (PDAT), an enzyme that catalyses the acyl-CoA-independent formation of triacylglycerol in yeast and plants, found 46 homologs in bacteria (Dahlqvist et al. 2000). This contradicts that eukaryotic PDAT is exclusively present in higher organisms with no homologs in prokaryotic genomes (Rottig and Steinbuchel 2013).

The family Corynebacteriaceae contains the genera Corynebacterium and monospecific genus Turicella (Kämpfer 2010). Mycobacterium tuberculosis is the dominant species in Mycobacteriaceae (97 species). Mycolic acid (MA), with wax ester as the oxidized form of MA in Mycobacterium tuberculosis, is present in the cell wall and plays essential roles in host invasion, environmental persistence, and also drug resistance (Barkan et al. 2009). In addition, Mycobacterium tuberculosis also relies on wax ester for dormancy, though specific functions of WE in M. tuberculosis require further investigation. Thus, abundance of WS/DGAT in Mycobacteriacea has selective advantages in evolution. Considering the abundance of wax ester and its slow degradation, it could also contribute to the long-term survival (more than 360 days) of M. tuberculosis in environment (Wang et al. 2017). On the other hand, Corynebacterium does not rely on oxidized mycolic acid while Turicella does not have mycolic acid at all (Marrakchi et al. 2014; Funke et al. 1994). Thus, there is no need for them to be equipped with the WS/DGAT enzyme. However, how Mycobacteriaceae gains WS/DGAT, or Corynebacteriaceae loses it, is not clear and needs more investigation. As for Firmicutes, it is the low G+C counterpart of the high G+C Actinobacteria. Most of its species can form endospores and are resistant to extreme environmental conditions such as desiccation, temperature fluctuation, and nutrient deprivation, etc. (Galperin 2013). Thus, they may not need compounds such as WE or TAG for storing energy and dealing with harsh external conditions. How G+C content in the two phyla may impact on the gain or loss of wax ester metabolism is currently not known.

PHAs are a group of compounds that include but are not limited to components such as polyhydroxybutyrate (PHB) and polyhydroxyvalerate (PHV), etc., among which PHB is the most common and most prominent member in bacteria (Verlinden et al. 2007; Uchino et al. 2007). Currently, there are eight pathways responsible for the synthesis of PHB in bacteria (Breuer 2010). The complexity of the metabolic pathways in bacteria is worthy of a separate and complete investigation and is unable to be fully addressed in this study. Here, we only focused on the classical pathway that mainly involves PhaA, PhaB and PhaC (Verlinden et al. 2007). Its phylogenetic analysis revealed that PHA synthesis is widely distributed across bacterial species, which is consistent with its role as a dominant neutral lipid reserve in bacteria. As discussed above, PHA synthase is further divided into four classes. However, HMM analysis of representative sequences revealed that differences of the four-class enzymes at domain level only involves Abhydrolase_1 (PF00561) and PhaC_N (PF07167) domains. Further exploration of all bacterial PhaC domain structures shows higher heterogeneity, indicating that a more complex classification system for this enzyme should be introduced (L. Wang, unpublished data) and will be investigated in future study.

PolyP is known to be ubiquitous in different life domains and claimed to be present in all types of cells in Nature due to its essential roles as energy and phosphate sources (Wang et al. 2018). Although a number of enzymes are directly linked with polyP metabolism, we only focused on PPK1, PPK2, and PPX in this study because they are most essential enzymes to polyP metabolism. Figure 1C gives an overview of the distribution patterns of the three enzymes. Although 2212 bacterial species across the phylogenetic tree lack of all three enzymes, an apparent gap was only evident in the phylum Tenericutes and was further confirmed to be Mollicutes. A previous analysis of 944 bacterial proteomes showed that bacteria which have completely lost the polyP metabolic pathways (PPK1, PPK2, PAP, SurE, PPX, PpnK and PpgK) are heavily host-dependent and tend to adopt obligate intracellular or symbiotic lifestyles (Wang et al. 2018). Consistently, Mollicutes is a group of parasitic bacteria that have evolved from a common Firmicutes ancestor through reductive evolution (Gojobori et al. 2014). From here, we could infer that not only loss of complete metabolism pathway, but also even loss of key enzymes for energy reserve metabolism could give a hint at bacterial lifestyle.

For glycogen metabolism, we compared two synthesis pathways, the classical pathway (GlgC, GlgA and GlgB) and the newly identified trehalose-related pathway (TreS, Pep2, GlgE and GlgB) (Wang and Wise 2011; Chandra et al. 2011). Although initial analysis via BLAST search showed in 2011 that GlgE pathway is represented in 14 % of sequenced genomes from diverse bacteria, our studies showed that, when searching for the complete GlgE pathway by including another three enzymes, it is dominantly restricted to Actinobacteria phylum while the classical pathway is widely present in the phylogenetic tree as seen in Figure 1D (Chandra et al. 2011). In addition, the two types of GlgBs also showed interesting distribution patterns. Although GH13 GlgB is widely identified in 54.33% bacteria, GH57 GlgB is only present in 9.48% bacteria with skewed distribution in groups such as the Terrabacteria phylum and PVC group, etc. Another study of 427 archaea proteomes found that the 11 archaea have GH13 GlgB, while 18 archaea have GH57 GlgB (Wang et al. 2019). Thus, the two GlgBs are rarely present in archaea and mainly exist in bacteria. However, why trehalose-related glycogen metabolism pathway is markedly associated with Actinobacteria phylum still needs more experimental exploration.

Conclusions

Distribution patterns of key enzymes and their combined pathways in bacteria provided a comprehensive view of how energy reserves are incorporated and lost. In general, polyP, PHA, and glycogen are widely distributed across bacterial species as energy storage compounds. The other two neutral lipids investigated in this study are comparatively minor energy reserves in bacteria and mainly found in the super phylum Proteobacteria and phylum Actinobacteria. Within the group, more bacteria have the capacity to accumulate WE and TAG due to the abundance of WS/DGAT homolog. Comparatively, polyP acts as a transient energy reserve while neutral lipids are a more sustainable energy provider (Finkelstein et al. 2010; Wang and Wise 2011). Thus, neutral lipids could be major players for bacterial persistence under harsh conditions such terrestrial and aquatic environments. As for glycogen, its ability to enhance bacterial environmental viability is still controversial. Its widespread distribution in bacteria indicates that its metabolism is tightly linked with bacterial essential activities. In sum, through this study, we obtained a much clearer picture about how key enzymes responsible for the metabolism of energy reserves are distributed in bacteria. Further investigation via incorporating bacterial physiology and lifestyle could supply additional explanations to illustrate the distribution patterns, although experimental evidence is indispensable to confirm the computational analysis.

Acknowledgments

We thank Professor Ximiao He from Huazhong University of Science and Technology and the anonymous reviewers for genuine comments and suggestions, which greatly improves the quality of this study. The work was supported by the Startup Foundation for Excellent Researchers at Xuzhou Medical University [D2016007], Innovative and Entrepreneurial Talent Scheme of Jiangsu Province (2017), and Natural Science Foundation of Jiangsu Province [BK20180997].

Footnotes

Supplemental material available at FigShare: https://doi.org/10.25387/g3.8138951.

Communicating editor: R. Kulathinal

Literature Cited

  1. Achbergerová L., and Nahálka J., 2014.  PPK1 and PPK2 — which polyphosphate kinase is older? Biologia 69: 263–269. 10.2478/s11756-013-0324-x [DOI] [Google Scholar]
  2. Arabolaza A., Rodriguez E., Altabe S., Alvarez H., and Gramajo H., 2008.  Multiple Pathways for Triacylglycerol Biosynthesis in Streptomyces coelicolor. Appl. Environ. Microbiol. 74: 2573–2582. 10.1128/AEM.02638-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barkan D., Liu Z., Sacchettini J. C., and Glickman M. S., 2009.  Mycolic Acid Cyclopropanation is Essential for Viability, Drug Resistance, and Cell Wall Integrity of Mycobacterium tuberculosis. Chem. Biol. 16: 499–509. 10.1016/j.chembiol.2009.04.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Breuer U., 2010.  Plastics from Bacteria - Natural Functions and Applications. By Guo-Qiang Chen (Editor), Alexander Steinbüchel (Series Editor). Biotechnol. J. 5:1351–1351. [Google Scholar]
  5. Chandra G., Chater K. F., and Bornemann S., 2011.  Unexpected and widespread connections between bacterial glycogen and trehalose metabolism. Microbiology 157: 1565–1572. 10.1099/mic.0.044263-0 [DOI] [PubMed] [Google Scholar]
  6. Dahlqvist A., Stahl U., Lenman M., Banas A., Lee M. et al. , 2000.  Phospholipid:diacylglycerol acyltransferase: An enzyme that catalyzes the acyl-CoA-independent formation of triacylglycerol in yeast and plants. Proc. Natl. Acad. Sci. USA 97: 6487–6492. 10.1073/pnas.120067297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Edgar R. C., 2004.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32: 1792–1797. 10.1093/nar/gkh340 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Edgar R. C., 2010.  Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26: 2460–2461. 10.1093/bioinformatics/btq461 [DOI] [PubMed] [Google Scholar]
  9. Federhen S., 2011.  The NCBI Taxonomy database. Nucleic Acids Res. 40: D136–D143. 10.1093/nar/gkr1178 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Finkelstein D. B., Brassell S. C., and Pratt L. M., 2010.  Microbial biosynthesis of wax esters during desiccation: Adaptation for colonization of the earliest terrestrial environments? Geology 38: 247–250. 10.1130/G30398.1 [DOI] [Google Scholar]
  11. Funke G., Stubbs S., Altwegg M., Carlotti A., and Collins M. D., 1994.  Turicella otitidis gen. nov., sp. nov., a Coryneform Bacterium Isolated from Patients with Otitis Media. Int. J. Syst. Bacteriol. 44: 270–273. 10.1099/00207713-44-2-270 [DOI] [PubMed] [Google Scholar]
  12. Galperin M. Y., 2013.  Genome Diversity of Spore-Forming Firmicutes. Microbiol. Spectr. 1: 1–15. 10.1128/microbiolspectrum.TBS-0015-2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gojobori T., Grosjean H., Breton M., Sirand-Pugnet P., Tardy F. et al. , 2014.  Predicting the Minimal Translation Apparatus: Lessons from the Reductive Evolution of Mollicutes. PLoS Genet. 10: e1004363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Handrick R., Reinhardt S., Kimmig P., and Jendrossek D., 2004.  The “intracellular” poly(3-hydroxybutyrate) (PHB) depolymerase of Rhodospirillum rubrum is a periplasm-located protein with specificity for native PHB and with structural similarity to extracellular PHB depolymerases. J. Bacteriol. 186: 7243–7253. 10.1128/JB.186.21.7243-7253.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Henrissat B., Deleury E., and Coutinho P. M., 2002.  Glycogen metabolism loss: a common marker of parasitic behaviour in bacteria? Trends Genet. 18: 437–440. 10.1016/S0168-9525(02)02734-8 [DOI] [PubMed] [Google Scholar]
  16. Kalscheuer R., and Steinbüchel A., 2003.  A Novel Bifunctional Wax Ester Synthase/Acyl-CoA:Diacylglycerol Acyltransferase Mediates Wax Ester and Triacylglycerol Biosynthesis inAcinetobacter calcoaceticusADP1. J. Biol. Chem. 278: 8075–8082. 10.1074/jbc.M210533200 [DOI] [PubMed] [Google Scholar]
  17. Kämpfer P., 2010.  Actinobacteria, pp. 1819–1838 in Handbook of Hydrocarbon and Lipid Microbiology, Springer, Berlin, Heidelberg. [Google Scholar]
  18. Letunic I., and Bork P., 2016.  Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 44: W242–W245. 10.1093/nar/gkw290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Marrakchi H., Lanéelle M.-A., and Daffé M., 2014.  Mycolic Acids: Structures, Biosynthesis, and Beyond. Chem. Biol. 21: 67–85. 10.1016/j.chembiol.2013.11.011 [DOI] [PubMed] [Google Scholar]
  20. McGinnis S., and Madden T.L., 2004. BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Research 32 (Web Server):W20–W25. 10.1093/nar/gkh435 [DOI] [PMC free article] [PubMed]
  21. Moran N. A., 2002.  Microbial Minimalism. Cell 108: 583–586. 10.1016/S0092-8674(02)00665-7 [DOI] [PubMed] [Google Scholar]
  22. Orengo C. A., Michie A. D., Jones S., Jones D. T., Swindells M. B. et al. , 1997.  CATH – a hierarchic classification of protein domain structures. Structure 5: 1093–1109. 10.1016/S0969-2126(97)00260-8 [DOI] [PubMed] [Google Scholar]
  23. Pearson W. R., and Eddy S. R., 2011.  Accelerated Profile HMM Searches. PLOS Comput. Biol. 7: 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Pukall R., 2014.  The Family Dietziaceae, pp. 327–338 in The Prokaryotes, Springer-Verlag, Berlin, Heidelberg. [Google Scholar]
  25. Rehm B. H., 2003.  Polyester synthases: natural catalysts for plastics. Biochem. J. 376: 15–33. 10.1042/bj20031254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Rottig A., and Steinbuchel A., 2013.  Acyltransferases in Bacteria. Microbiol. Mol. Biol. Rev. 77: 277–321. 10.1128/MMBR.00010-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Röttig A., Strittmatter C. S., Schauer J., Hiessl S., Poehlein A. et al. , 2016.  Role of Wax Ester Synthase/Acyl Coenzyme A:Diacylglycerol Acyltransferase in Oleaginous Streptomyces sp. Strain G25. Appl. Environ. Microbiol. 82: 5969–5981. 10.1128/AEM.01719-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Uchino K., Saito T., Gebauer B., and Jendrossek D., 2007.  Isolated Poly(3-Hydroxybutyrate) (PHB) Granules Are Complex Bacterial Organelles Catalyzing Formation of PHB from Acetyl Coenzyme A (CoA) and Degradation of PHB to Acetyl-CoA. J. Bacteriol. 189: 8250–8256. 10.1128/JB.00752-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. The UniProt Consortium , 2017.  UniProt: the universal protein knowledgebase. Nucleic Acids Res. 45: D158–D169. 10.1093/nar/gkw1099 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Verlinden R. A., Hill D. J., Kenward M. A., Williams C. D., and Radecka I., 2007.  Bacterial synthesis of biodegradable polyhydroxyalkanoates. J. Appl. Microbiol. 102: 1437–1449. 10.1111/j.1365-2672.2007.03335.x [DOI] [PubMed] [Google Scholar]
  31. Wang L., Liu Q., Wu X., Huang Y., Wise M. J. et al. , 2019.  Bioinformatics Analysis of Metabolism Pathways of Archaeal Energy Reserves. Sci. Rep. 9: 1034 10.1038/s41598-018-37768-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Wang L., Liu Z., Dai S., Yan J., and Wise M. J., 2017.  The Sit-and-Wait Hypothesis in Bacterial Pathogens: A Theoretical Study of Durability and Virulence. Front. Microbiol. 8: 2167 10.3389/fmicb.2017.02167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Wang L., and Wise M. J., 2011.  Glycogen with short average chain length enhances bacterial durability. Naturwissenschaften 98: 719–729. 10.1007/s00114-011-0832-x [DOI] [PubMed] [Google Scholar]
  34. Wang L., Yan J., Wise M. J., Liu Q., Asenso J. et al. , 2018.  Distribution Patterns of Polyphosphate Metabolism Pathway and Its Relationships With Bacterial Durability and Virulence. Front. Microbiol. 9: 782 10.3389/fmicb.2018.00782 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Waterhouse A. M., Procter J. B., Martin D. M. A., Clamp M., and Barton G. J., 2009.  Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25: 1189–1191. 10.1093/bioinformatics/btp033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Whitehead M.P., Eagles L., Hooley P.,and Brown M.R.W., 2014. Most bacteria synthesize polyphosphate by unknown mechanisms. Microbiology 160 (Pt_5):829–831. 10.1099/mic.0.075366-0 [DOI] [PubMed]
  37. Wilkinson J., 1959.  The problem of energy-storage compounds in bacteria. Exp. Cell Res. 7: 111–130. 10.1016/0014-4827(59)90237-X [DOI] [Google Scholar]
  38. Wise M. J., 2010.  No so HoT - heads or tails is not able to reliably compare multiple sequence alignments. Cladistics 26: 438–443. 10.1111/j.1096-0031.2009.00292.x [DOI] [PubMed] [Google Scholar]
  39. Wolf Y. I., and Koonin E. V., 2013.  Genome reduction as the dominant mode of evolution. BioEssays 35: 829–837. 10.1002/bies.201300037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Zhang H., Ishige K., and Kornberg A., 2002.  A polyphosphate kinase (PPK2) widely conserved in bacteria. Proc. Natl. Acad. Sci. USA 99: 16678–16683. 10.1073/pnas.262655199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Zhu B., Ibrahim M., Cui Z., Xie G., Jin G. et al. , 2016.  Multi-omics analysis of niche specificity provides new insights into ecological adaptation in bacteria. ISME J. 10: 2072–2075. 10.1038/ismej.2015.251 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Supplemental material available at https://gsajournals.figshare.com/s/08e2bf674f15b9704676. Supplemental material available at FigShare: https://doi.org/10.25387/g3.8138951.


Articles from G3: Genes|Genomes|Genetics are provided here courtesy of Oxford University Press

RESOURCES