Summary
In this issue of Chemistry & Biology, Rhee and co-workers develop a metabolic profiling technique using a mycobacterial small molecule extract and use it to correctly assign enzyme function to a previously mis-annotated gene product of Mycobacterium tuberculosis (de Carvalho et al., 2010).
With the advent of whole-genome sequencing comes the daunting task of assigning cellular and molecular function to the thousands of proteins encoded in a single genome. This challenge has inspired the creation of a new field termed ‘functional genomics’, in which researchers set out to assign function to a gene whose physiological role is not known. Historically, the function of an unknown gene was putatively assigned based on the similarity of the gene product’s amino acid sequence with sequences of proteins whose functions are known (Lee et al., 2007). Although this method has proven useful in successfully annotating the function of a large number of genes, several shortcomings still persist. For example, protein sequence similarity does not necessarily equate to cellular or molecular functional similarity (Saghatelian and Cravatt, 2005). Furthermore, it is estimated that an error rate of 5–40% exists in the functional annotation of enzymes, where the large range in reported error arises from varying definitions of the term “function” (Furnham et al., 2009; Valencia, 2005).
In order to unambiguously assign protein function to a particular gene found within the genome of Mycobacterium tuberculosis (Mtb), Rhee and co-workers (de Carvalho et al., 2010) devised an ingenious technique to analyze and assign protein function in the presence of Mtb’s metabolic milieu. By incubating the recombinant protein of interest with cellular extract of Mtb, the protein is presented with the most physiologically germane and unbiased collection of substrates. Successful assignment of protein function in this work, achieved by detection of both consumption and production of two metabolites in the presence of the Mtb metabolome, was largely attributed to the advances in liquid chromatography (LC) separation techniques as well as the increased sensitivity and accuracy of mass spectrometry (MS) instruments (Bucholz et al. 2001).
The protein of interest in this study was the gene product of Rv1248c from Mtb, which was originally assigned, based on sequence homology, as the α-ketoglutarate (α-KG) decarboxylase (E1) component of an α-KG dehydrogenase complex (Cole et al. 1998). Although recombinant Rv1248c was shown in vitro to produce succinic semialdehyde (SSA) via decarboxylation of α-KG (Tian et al., 2005), Rhee and coworkers show that when Rv1248c is presented with its physiological substrate pool, SSA is in fact a product of a slower side reaction and not the physiologic product of Rv1248c.
To unveil the true, physiological function of Rv1248c, the authors incubate recombinant protein and potential cofactors with a small molecule extract of Mycobacterium bovis and analyze the time-dependant consumption and production of small molecules using LC-MS. They refer to this technique as Activity Based Metabolomic Profiling (ABMP) and argue its merit for use in identifying the functions of proteins encoded by essential genes in microorganisms. Analysis of the LC-MS experiments showed only two small molecules whose abundance changed over time. These changes, which required the presence of the Rv1248c enzyme, Mg2+, and thiamine diphosphate, included the decrease in abundance of α-KG and concomitant increase in abundance of 5-hydroxylevulinic acid (HLA).
The authors propose the catalytic mechanism of HLA formation as shown in Figure 1A. Once bound to the protein-sequestered thiamine diphosphate (TDP) cofactor, the activated aldehyde α-KG intermediate undergoes condensation with a second substrate, glyoxylate (GLX) to form the unstable β-keto acid, 2-hydroxy-3-oxoadipate (HOA), which then undergoes non-enzymatic decarboxylation to form HLA. Confirmation of this mechanism was achieved via several rigorous experiments. First, recombinant Rv1248c was incubated with pure substrates (α-KG and GLX) and cofactors (TDP and Mg2+), and the reaction progress was monitored by LC-MS; both consumption of α-KG and production of HLA occurred only when all reaction components were present. Next, the authors further confirmed the proposed mechanism by trapping the HOA intermediate via methylation (using trifluoromethanesulfonate) to provide dimethyl HOA (Figure 1B), which, unlike HOA, was detectable by LC-MS. Finally, the time-dependant disappearance of α-KG and production of HLA were monitored by 1H-NMR and found to be conditional on the inclusion of protein, substrates and cofactors. Collectively, these biochemical studies suggest that the Rv1248c gene product be referred to as an HOA synthase.
Figure 1.
A. Proposed catalytic mechanism of Rv1248c. Following decarboxylation, the activated aldehyde derivative of α-ketoglutarate (α-KG) undergoes condensation with glyoxylate (GLX) to form 2-hydroxy-3-oxoadipate (HOA). HOA spontaneously decarboxylates to form 5-hydroxylevulinate (HLA). B. Trapping method used to observe the HOA intermediate.
The novel work presented by Rhee and coworkers in this issue of Chemistry & Biology is sure to have an impact on the field of functional genomics and its imperative quest to unambiguously assign cellular function to gene products. Not only did the authors apply their developed ABMP method to assign the correct function to a previously mis-annotated gene of Mtb, but they also demonstrated that ABMP could be implemented in assigning function to essential genes in which knockout mutants can not be created. Since essential microbial gene products have often been targeted for therapeutic development, this profiling method will ultimately aid in both the assignment of promising, novel antimicrobial protein targets and deciphering the vital biochemical function of those targets.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Bucholz A, Takors R, Wandre C. Anal Biochem. 2001;295:129–137. doi: 10.1006/abio.2001.5183. [DOI] [PubMed] [Google Scholar]
- Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon SV, Eiglmeier K, Gas S, Barry CE, 3rd, et al. Nature. 1998;393:537–544. doi: 10.1038/31159. [DOI] [PubMed] [Google Scholar]
- de Carvalho LPS, Zhao H, Dickinson CE, Arango NM, Lima CD, Fischer SM, Ouerfelli O, Nathan C, Rhee KY. Chem & Biol. 2010 doi: 10.1016/j.chembiol.2010.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Furnham N, Garavelli JS, Apweiler R, Thornton JM. Nat Chem Biol. 2009;5:521–525. doi: 10.1038/nchembio0809-521. [DOI] [PubMed] [Google Scholar]
- Lee D, Redfern O, Orengo C. Nat Rev Mol Cell Biol. 2007;8:995–1005. doi: 10.1038/nrm2281. [DOI] [PubMed] [Google Scholar]
- Saghatelian A, Cravatt BF. Nat Chem Biol. 2005;1:130–142. doi: 10.1038/nchembio0805-130. [DOI] [PubMed] [Google Scholar]
- Tian J, Bryk R, Itoh M, Suematsu M, Nathan C. Proc Natl Acad Sci U S A. 2005;102:10670–10675. doi: 10.1073/pnas.0501605102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valencia A. Curr Opin Struct Biol. 2005;15:267–274. doi: 10.1016/j.sbi.2005.05.010. [DOI] [PubMed] [Google Scholar]

