Insight into the 3D structure and substrate specificity of previously uncharacterized GNAT superfamily acetyltransferases from pathogenic bacteria

Karolina A Majorek; Tomasz Osinski; David T Tran; Alina Revilla; Wayne F Anderson; Wladek Minor; Misty L Kuhn

doi:10.1016/j.bbapap.2016.10.011

. Author manuscript; available in PMC: 2018 Jan 1.

Published in final edited form as: Biochim Biophys Acta. 2016 Oct 23;1865(1):55–64. doi: 10.1016/j.bbapap.2016.10.011

Insight into the 3D structure and substrate specificity of previously uncharacterized GNAT superfamily acetyltransferases from pathogenic bacteria

Karolina A Majorek ^1,⁴, Tomasz Osinski ^1,⁴, David T Tran ², Alina Revilla ², Wayne F Anderson ^3,⁴, Wladek Minor ^1,^4,^*, Misty L Kuhn ^2,^*

PMCID: PMC5127773 NIHMSID: NIHMS828278 PMID: 27783928

Abstract

Members of Gcn5-related N-acetyltransferase (GNAT) superfamily catalyze the acetylation of a wide range of small molecule and protein substrates. Due to their abundance in all kingdoms of life and diversity of their functions, they are implicated in many aspects of eukaryotic and prokaryotic physiology. Although numerous GNATs have been identified thus far, many remain structurally and functionally uncharacterized. The elucidation of their structures and functions is critical for broadening our knowledge of this diverse and important superfamily. In this work, we present the structural and kinetic analyses of two previously uncharacterized bacterial acetyltransferases - SACOL1063 from Staphylococcus aureus strain COL and CD1211 from Clostridium difficile strain 630. Our structures of SACOL1063 show substantial flexibility of a loop that is likely responsible for substrate recognition and binding compared to structures of other homologs. In the CoA complex structure, we found two CoA molecules bound in both the canonical AcCoA/CoA-binding site and the acceptor-substrate-binding site. Our work also provides initial clues regarding the substrate specificity of these two enzymes; however, their native function(s) remain unknown. We found both proteins act as N-rather than O-acetyltransferases and preferentially acetylate L-threonine. The combination of structural and kinetic analyses of these two previously uncharacterized GNATs provides fundamental knowledge and a framework on which future studies can be built to elucidate their native functions.

Keywords: acetyltransferase, Gcn5-related N-acetyltransferase, GNAT, amino acid acetylation, serine and threonine acetylation

1. Introduction

The Gcn5-related N-acetyltransferase (GNAT) superfamily consists of evolutionarily related acetyltransferases from all kingdoms of life [1-3]. GNATs catalyze the transfer of an acyl group from acyl-coenzyme A (most often acetyl-coenzyme A (AcCoA)) to a primary amine or hydroxyl group, but the range of observed substrates is very wide. GNAT substrates include aminoglycoside antibiotics, polyamines, arylalkylamines, glucosamine-6-phosphate, and other small molecules [3]. GNATs are also involved in protein acetylation, including both N-terminal (Nα) acetylation of protein and peptide termini and N-epsilon (Nε) acetylation of internal lysine residues [2-6]. Although the structures and functions of many GNATs have been identified, the substrate specificities and biological activities of the majority of them remain unknown. Based on the functions of GNATs characterized to date, it is clear these biological roles are very diverse and are not necessarily easy to identify.

Structural characterization of proteins often provides new insight into their function, therefore it is important to characterize new members of diverse protein superfamilies. A number of GNATs have been structurally characterized and many reveal very interesting features such as different forms of dimer formation and numerous variants of domain swapping [3]. One approach we have used to identify possible substrates for structural and functional studies of uncharacterized GNATs is to screen them using a broad-substrate-screening assay [7]. This assay provides a mechanism to identify initial ‘hits’ from a variety of classes of substrates and then further characterize the enzymes both structurally and kinetically based on these ‘hits.’ One advantage to this assay is that it limits the possible classes of substrates that a GNAT uses. Even when substrates are not identified, the screen provides useful information as to what classes of substrates are not used. We previously screened ten uncharacterized GNATs from pathogenic bacteria using this assay and found that two of them preferentially acetylated L-amino acids—SACOL1063 from Staphylococcus aureus strain COL and CD1211 from Clostridium difficile strain 630. Since these proteins had neither been structurally nor functionally characterized and represent uncharacterized subgroups of the GNAT superfamily, we decided to further characterize them here. The information we gained from this set of experiments will be useful for future studies to identify the native substrates of these enzymes.

2. MATERIALS AND METHODS

2.1 Materials

All substrates were purchased from Sigma-Aldrich at the highest quality available. Acetyl coenzyme A was purchased as the tri-lithium salt.

2.2 Cloning, expression and purification

The SACOL1063 from Staphylococcus aureus subsp. aureus COL and CD1211 from Clostridium difficile 630 genes were cloned into the pMCSG7 pET21-derived expression vector. The vector encodes an N-terminal 6×His-tag with a spacer followed by a tobacco etch virus (TEV) protease cleavage site. The amino acid sequence Ser-Asn-Ala remains attached to the N-terminus of the protein after tag-cleavage with TEV protease. The fusion protein was overexpressed in E. coli BL21-RIPL (DE3) cells (Stratagene). The cells were grown in M9 SeMET (selenomethionine) High-Yield Growth Medium for protein that was used for crystallization and in TB for protein for kinetic characterization. Cells were induced, harvested, and the proteins were purified to near homogeneity as described previously [7, 8]. The TEV cleaved protein was passed through a Superdex 200 column in a buffer containing 10 mM Tris-HCl pH 7.5 and 150 mM NaCl. After gel filtration, fractions containing the protein were pooled and concentrated to 18 and 9 mg/ml (SACOL1063 and CD1211, respectively).

2.3 Site-directed mutagenesis

Mutants of SACOL1063 and CD1211 were generated using the QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's instructions. The wild-type genes were used as templates and the presence of mutations was confirmed by DNA sequencing.

2.4 Crystallization

We used the Xtaldb [9] and LabDB [10] to track protein preparation and all crystallization experiments. The crystals of SACOL1063 were grown at 16°C using vapor diffusion and a sitting drop setup for the apo-form, and hanging drop for CoA-complex. The crystallization drops were a 1:1 mixture of protein solution and the precipitant solution from the wells (0.2 M Sodium thiocyanate and 20% PEG3350 in the case of the apo form, and 0.1 M Bis-tris propane pH 7.0 and 2.2 M DL-Malic acid in case of the CoA-complex). The apo form crystal grew due to micro-seeding process that occurred on the fiber of unknown source that was present in the crystallization drop. For the structure of the complex with CoA, the SACOL1063 protein was incubated on ice with 10mM CoA for 1 hour prior to crystallization setup. Prior to data collection, each crystal was transferred to a solution containing a 2:1 mixture of precipitant solution and ethylene glycol for the apo structure and LV Cryo Oil (MiTeGen) for the CoA-complex. Crystals were immediately cryo-cooled in liquid nitrogen.

2.5 Data collection, structure determination and refinement

Data collection was performed at the 19-ID beam line of the Structural Biology Center [11] at the Advanced Photon Source (APS). Data were collected at a temperature of 100 K and processed with HKL-2000 [12]. The structures were determined by Se single-wavelength anomalous diffraction (SAD) using HKL-3000 [13] integrated with SHELEXC/D/E [14, 15], CCP4 [16] and ARP/wARP [17]. Refinement was performed using HKL-3000 coupled with REFMAC5 [18], COOT [19, 20], and selected programs from the CCP4 package [16]. The B-factors were refined using Translation/Libration/Screw (TLS) groups assigned by the TLSMD server [21]. Validation of the structures was performed using MOLPROBITY [22] and ADIT [23]. The coordinates and structures factors have been deposited in the Protein Data Bank. (PDB codes: 5JQ4, 5JPH). Statistics describing crystallographic data collection and refinement are summarized in Table 2.

Table 2.

Crystallographic data and refinement statistics for SACOL1063 structures. Values in parenthesis refer to the highest resolution shell. AU=asymmetric unit.

PDB code	5JQ4 (apo)	5JPH (CoA complex)
Data collection
Wavelength (Å)	0.98	0.98
Unit cell (Å,°)	a = 129.5, b = 33.8, c = 67.5	a = 50.1, b = 93.8, c = 68.9
	α = γ = 90, β = 109.5	α = γ = 90, β = 97.2
Space group	C2	P2₁
Resolution range (Å)	50.0-1.8	50.0-1.46
Highest resolution shell (Å)	1.8-1.83	1.46-1.49
Unique reflections	x	x
Redundancy	4.2 (4.1)	4.7 (4.6)
Completeness (%)	99.9 (100.0)	99.1 (98.3)
R_merge (%)	0.100 (0.666)	0.060 (0.672)
Average I/σ(I)	26.1 (2.1)	36.7 (2.1)
Refinement
R (%)	18.0	14.2
R_free (%)	21.2	17.0
Mean B value (Å²)	27	21
B from Wilson plot (Å²)	22.1	20.1
RMS deviation bond lengths (Å)	0.02	0.01
RMS deviation bond angles (°)	1.8	1.5
Ramachandran plot
Most favored regions (%)	98.6	98.4

Open in a new tab

2.6 Activity screening and measurement of kinetic activity

To determine the kinetic parameters of the SACOL1063 and CD1211 enzymes, we produced substrate saturation curves for substrates identified from the previous broad-screening assay (see section 3.8 and (19)) by varying concentration of one substrate while holding the concentration of the other substrate constant. For instance, if we were testing the kinetic parameters for AcCoA, we varied its concentration (0-2 mM) while holding the acceptor substrate concentration constant at 10 mM. Alternatively, when we tested kinetic parameters for acceptor substrates, we varied their concentrations (0-10 mM) while holding AcCoA concentration constant at 1 mM. Curves were produced in triplicate and data were fitted to a modified Hill equation as described previously [8]. In order to compare our data from before, all assays were performed as described previously [7, 8] in 50 μL reaction mixtures containing 50 mM Tris-HCl pH 8.0 buffer, 10 mM acceptor substrate, and 1 mM AcCoA unless otherwise stated. Reactions were allowed to proceed for 10 min at 37°C and were initiated with 10 μl of enzyme (final concentration of WT SACOL1063 and CD1211 enzymes was 0.19 μM and 0.084 μM).

In order to determine the kinetic mechanism for each enzyme we produced a series of AcCoA substrate saturation curves at set concentrations of L-Thr (2.5, 5, 10, and 20 mM) in triplicate using the same reaction conditions and concentrations of enzymes as described above. The data were fitted to a set of kinetic models as described previously [24] to determine which model was most compatible with the data. To evaluate whether conserved tyrosine residues were critical for catalysis, we compared specific activities of SACOL1063 WT, SACOL1063 Y114F, CD1211 WT, and CD1211 Y133F enzymes in triplicate. Reactions contained 50 mM Tris-HCl pH 8.0 buffer, 10 mM L-Thr, and 1 mM AcCoA. The final concentration of enzyme in each reaction was 0.12 μM, 0.11 μM, 1.2 μM, or 1.1 μM, respectively. Protein from the monomer peaks for SACOL1063 and CD1211 after gel-filtration chromatography was used for all kinetic assays. Under the described assay conditions, one unit of enzyme activity is defined as one μmol of CoA produced per minute.

2.7 Sequence analyses

Searches of the NCBI non-redundant (nr) database, using the sequences of SACOL1063 and CD1211 proteins as queries, were performed using PSI-BLAST [25] with a stringent expectation (e)-value threshold of 1e-30. These sequences were used for phylogenetic distribution analysis and to select sequences for the alignment. A multiple sequence alignment (MSA) of selected sequences was generated using PROMALS3D [26] and manually adjusted. In order to analyze the occurrence of particular amino acids in N-termini of proteins in Clostridium difficile strain 630 and Staphylococcus aureus strain COL we downloaded all protein sequences for these strains from the NCBI Protein database. We removed identical sequences to obtain a non-redundant data set and calculated the incidence of each amino acid in the position following the N-terminal Met.

2.8 Modeling of CD1211 protein structure

Secondary structure prediction and Fold-Recognition (FR) analysis of CD1211 were carried out via the GeneSilico Meta-Server gateway (for references to original methods see https://genesilico.pl/meta2) [27]. FR alignments to the top-scoring templates from the Protein Data Bank were compared, evaluated and ranked by PCONS [28]. Based on the FR analysis the structures of PaiA acetyltransferase from Bacillus subtilis (PDB ID: 1TIQ) and the tabtoxin resistance protein from Pseudomonas syringae (PDB ID: 1GHE) were chosen as templates for modeling the CD1211 structure. Alignments reported by FR methods were manually refined to maximize the number of aligned homologous residues and preserve the continuity of predicted secondary structure elements, and they were then used for automated comparative modeling using MODELLER [29]. Energy minimization of the model was performed using MOE (Molecular Operating Environment (47). software with an AMBER99 force field to a root mean square deviation gradient of 0.1 kcal/mol/Å².

In order to assess the model accuracy, we used the meta-server MetaMQAP that obtains scores from a number of third-party model quality assessment programs and calculates a predicted deviation between the position of each residue in the computational model and its position in the real (unknown) structure [30]. Independently, global evaluation of model quality was performed with the PROQ method [31]. For crude modeling of the CD1211:AcCoA complex, the CD1211 apo protein model was superimposed with the structure 1GHE and the AcCoA was copied from the 1GHE structure. Energy minimization of AcCoA molecule was performed using MOE with an MMF94x force field, followed by energy minimization of the whole complex with an AMBER99 force field, and again energy minimization of the AcCoA alone with MMF94x. The resulting CD1211:AcCoA complex was considered an approximation of how CD1211 will interact with AcCoA.

3. RESULTS

3.1 Identification and sequence comparison of SACOL1063 and CD1211 related proteins

The Staphylococcus aureus (strain COL) gene SACOL1063 encodes a 144 amino acid protein classified as a member of the UPF0039 (ElaA) family in UniProtKB and the PF00583 acetyltransferase family in the Pfam database. Homologs of SACOL1063 were identified using a PSI-Blast search of the non-redundant (nr) database using a stringent expectation (e)-value threshold of 1e-30 and the phylogenetic distribution of the identified proteins were analyzed (Supplementary Table 1). The data set comprised mostly sequences from Bacteria, but some sequences from Archaea and Eukaryota were also present. The Archaeal sequences included proteins that were predominantly from Halobacteriaceae, but also included some from Methanosarcina. The Eukaryotic proteins were mainly from Fungi, Euglenozoa, Amebozoa, and Oomycetes. The sequence alignment of selected proteins from organisms representing each of these groups is presented in Figure 1A.

Sequences are denoted by a five-letter abbreviation for genus and species followed by GI number. The positions showing at least 80% identity/similarity are background-colored. Secondary structure elements were determined using STRIDE [44] and are shown above the alignment with α-helices represented by tubes, β-strands by arrows, and loops by continuous lines.

The Clostridium difficile (strain 630) gene CD1211 encodes a 159 amino acid protein, which is also classified as a member of the PF00583 acetyltransferase family in the Pfam database. The phylogenetic distribution based on a stringent PSI-Blast search indicated that the closest homologs of CD1211 (Supplementary Table 2) are present in Bacteria and some Eukaryota, but not in Archaea. The Eukaryotic proteins are found mostly in Fungi, but our search also identified a few proteins from Chlorella, Trichoplax and one protein from Acanthamoeba. Figure 1B shows the sequence alignment of CD1211 homologs selected from organisms representing each of these taxonomic groups.

3.2 Overall structure of SACOL1063

In the apo-form, SACOL1063 crystallized in the space group C2 with two molecules in the asymmetric unit. SACOL1063 has a standard GNAT fold with the core β-sheet composed of seven, mainly antiparallel β-strands, and helices on both sides of the central β-sheet. When we compared the two molecules in the asymmetric unit, we observed a significant difference in the conformations of the loop between β6 and β7 (residues 121-135) (Figure 2A). Moreover, when we displayed two symmetry-related molecules we saw that this loop binds in the cavity of the neighboring protein molecule in both molecules, even though the conformation of the loop is different in each of them. The Cys108 residues from the two molecules of the asymmetric unit are positioned next to each other and form a disulfide bond (Figure 2B). The way the loop binds to the neighboring molecule causes it to block both the AcCoA-binding site and the acceptor-substrate-binding site (Figure 2C). However, the conformation of the loop is likely a result of crystal packing and it does not necessarily represent the exact conformation of the enzyme in solution. Since this flexible loop blocks the active site of the protein, we were unable use this crystal form for the crystallographic study of the interactions with ligands.

(A) Superposition of each molecule in the asymmetric unit, (B) SACOL1063 (bright colors) with the symmetry-related molecules (pale colors). Cys108 residues that form disulfide bridges are shown as red sticks. (C) SACOL1063 apo structure containing both molecules in the asymmetric unit (gray and red) with CoA copied from the superimposed structure of the SACOL1063 CoA complex. CoA is shown as sticks, with cyan carbon atoms of the CoA bound in the canonical AcCoA-binding site and yellow carbon atoms of CoA bound in the canonical acceptor-substrate-binding site. The loop between β6 and β7 of one molecule in the asymmetric unit (red) blocks both the AcCoA-binding and the acceptor-substrate-binding site of the neighboring molecule (gray).

3.3 Structure of SACOL1063 in complex with CoA

We co-crystallized the SACOL1063 protein in the presence of CoA in the space group P2₁ with three protein molecules in the asymmetric unit. The conformation of the protein in all three molecules was the same and each protein molecule had two molecules of CoA bound - one in the canonical AcCoA/CoA-binding site and one in the acceptor-substrate-binding site (Figure 3A). The thiol groups of the two CoA molecules are directed towards each other with a distance of 2.9 Å between the two sulphur atoms (Figure 3B). The CoA binding mode in the AcCoA/CoA-binding site is similar to previously determined GNAT structures in complex with AcCoA/CoA [1, 32]. The classical structural characteristic features of GNATs are maintained with a “β-bulge” in β4 and the splay between β4 and β5 strands that creates a V-shaped cleft in the β-sheet to provide a binding site for the pantetheine of CoA. The pantetheine makes hydrogen bonds with the main chain of β4, mimicking an antiparallel β-strand. The conformation of CoA bound in the AcCoA/CoA-binding site, with flexion of the pantothenate moiety, is also characteristic for GNATs. The adenosine moiety of CoA is located on the surface of the protein and interacts with the protein only through water molecules. The pyrophosphate moiety of CoA is coordinated mostly by the main chain nitrogen atoms of the conserved phosphate binding loop (Arg78-Gly79-Gln80-Gly81-Met82-Gly83) between β4 and α3 (the conserved motif in GNATs is (Q/R)-x-x-G-x-(G/A)) [33, 34]. Arg84, which is not part of the GNAT motif, coordinates the pyrophosphate with its main chain nitrogen and the 3'-phosphate of the ribose with its side chain.

(A) Two CoA molecules were bound - one in the canonical AcCoA/CoA-binding site (cyan), and one in the acceptor-substrate-binding site (yellow). 2Fo-Fc electron density map of the bound CoA molecules is presented (σ=1.0). (B) The CoA molecule bound in the acceptor-substrate-binding site (yellow) is very sharply bent. The distance of 2.9 Å between the two sulphur atoms of the two CoA molecules, and the closest distance (3.4 Å) between the pantothenate and adenine groups of the CoA molecule bound in the acceptor-substrate-binding site are indicated. (C) Residues involved in binding of CoA in the acceptor-substrate-binding site.

The CoA molecule bound in the acceptor-substrate-binding site is very sharply bent (Figure 3B), and the closest distance between the pantothenate and adenine groups is 3.4 Å. Binding CoA in this site displays an interaction of the pyrophosphate of CoA with Asn105 and an interaction of the ribose of CoA with Glu129. The majority of the interactions of CoA in the acceptor-substrate-binding site come from the pantothenate moiety with Asp35, Arg57, and Arg70 (Figure 3C). The combination of these hydrogen bonding interactions with CoA in the acceptor-substrate-binding site suggests that CoA may mimic substrate binding. If so, these residues that interact with CoA may be important for substrate recognition in this site.

3.4 Comparison of SACOL1063 apo- and CoA structures

In the apo-SACOL1063 structure, we observed substantial flexibility of the loop between β6 and β7. This loop adopts a different conformation upon binding CoA in the acceptor-substrate-binding site compared the apo structure. Binding CoA in the acceptor-substrate site causes β6 and β7 to become extended and the loop between them shortens significantly. Additionally, the entire β6-loop-β7 fragment shifts much closer to α1 and α2 helicies (residues 18-35) and ‘closes’ the acceptor-substrate-binding site (Figure 4A). These regions of the protein are conserved in SACOL1063 homologs, which may suggest their importance in substrate recognition and binding. The conformation of the β6-loop-β7 fragment in this ‘closed’ form is very similar to its conformation in structures of homologous proteins, especially the PA0115 N-acyltransferase from Pseudomonas aeruginosa (PDB ID: 1XEB) (Figure 4B). The protein structures of SACOL1063 homologs, including the PA0115 protein, an uncharacterized N-acetyltransferase YjcF from Bacillus subtilis (PDB ID: 1Q2Y), and an uncharacterized N-acetyltransferase from Lactobacillus plantarum (PDB ID: 3EFA), all display a similar conformation of this loop even though they do not contain ligands bound in the acceptor-substrate-binding site. A comparison of the conformation of this loop in both apo-form molecules and the CoA-bound form shows that this region is highly flexible and suggests that binding substrates may induce a conformational change.

(A) Superposition of SACOL1063 structures in the apo form (chains A (blue) and B (green)) and in the CoA-bound form (gray). Conformational change of the loop between β6 and β7 is observed upon CoA binding. (B) Superposition of SACOL1063 structure in the CoA-bound form (gray) with the PA0115 N-acyltransferase from *Pseudomonas aeruginosa* (PDB ID: 1XEB) (purple). A similar conformation of the loop between β6 and β7 is visible.

3.5 Comparison of SACOL1063 structure to other GNATs

A structural similarity search of the PDB using DALI [35] revealed that SACOL1063 showed the highest structural similarity to several uncharacterized GNATs. The hits with the highest scores included the structure of the uncharacterized N-acetyltransferase YjcF from B. subtilis (PDB ID: 1Q2Y; root mean square deviation (rmsd) value of 1.5 Å and 38% identity between the two sequences), an uncharacterized N-acetyltransferase from L. plantarum (PDB ID: 3EFA; rmsd of 1.6 Å and 27% identity), and the PA0115 N-acyltransferase from P. aeruginosa (PDB ID: 1XEB; rmsd of 1.8 Å and 29% identity). Since the GNAT fold is relatively conserved, SACOL1063 also showed some structural similarity to known proteins, including protein Nα-acetyltransferases RimI from S. enterica (PDB ID: 2CNS [34]; rmsd 2.3 Å and 17% identity), RimL from S. enterica (PDB ID: 1S7N [36]; rmsd 2.9 Å and 8% identity), and RimJ from V. fischeri (PDB ID: 3IGR; rmsd 2.7 Å and 9% identity), as well as the histone acetyltransferase TGCN5 from T. thermophila (PDB ID: 1M1D; rmsd 2.7 Å and 15% identity). The largest differences between the Rim structures and SACOL1063 were observed in helices α1 and α2.

3.6 Comparative model of CD1211

Despite significant efforts, we were unable to obtain suitable crystals for data collection of the CD1211 protein in the apo-form or in complex with AcCoA/CoA. There are no close homologs of CD1211 (e.g. with 30% or higher sequence identity) with a known structure. Therefore, to gain insight as to the positions of residues potentially important for the activity of the CD1211 enzyme, we used bioinformatics methods to produce a comparative (homology) model of the CD1211 structure and its complex with AcCoA. Fold-recognition (FR) methods, which try to find folds of the known protein structures that are compatible with a particular sequence of interest, revealed clear similarity of CD1211 to other GNATs [27]. Based on the FR analysis, we chose structures of the PaiA acetyltransferase from B. subtilis (PDB ID: 1TIQ) [37] and the tabtoxin resistance protein from P. syringae (PDB ID: 1GHE) [38] as templates for modeling the CD1211 structure.

The alignments for most of the protein sequence were unambiguous, but there were some differences that were most prominent in the α2 helix and the loop between α1 and α2 helicies between the template and CD1211protein. Since no structure of a close homolog has been determined, the accuracy of the prediction in this region may be lower than for the remainder of the protein structure because it is most variable between different GNATs. The model of the CD1211 protein was evaluated using MetaMQAP [30] and PROQ [31] and is shown in Figure 5A. MetaMQAP predicted a very high global accuracy of our model, with an rmsd to “real structure” predicted to be ~ 2.1 Å and the GDT_TS (global distance test total score) predicted to be 80.7. The PORQ program gave an LG (Levitt-Gerstein) score of 5.3, indicating a potentially “extremely good” model.

(A) The model is colored according to the predicted accuracy from blue (highly confident) to red (predicted low accuracy). AcCoA (orange sticks) was copied from the tabotoxin resistance protein structure (PDB ID: 1GHE) and the entire model was subjected to energy minimization (see section 2.8). The frames indicate the region showing the most significant differences between CD1211 and the corresponding region of the homologs - the tabotoxin resistance protein structure (PDB ID: 1GHE) (pink) and the PaiA acetyltransferase template structure (PDB ID:1TIQ) (green). (B) Position of conserved residues of CD1211 that are directed toward the acceptor-substrate-binding site and may play a role in substrate recognition and/or binding. Catalytic Tyr133 is also indicated.

The main differences between the modeled CD1211 structure and the templates used to produce the model are in the α2 helix, the loop between α1 and α2 helicies, and the length of the loop between β6 and β7 (Figure 5A). These loops are shorter in CD1211, which could potentially allow binding of a larger substrate. The optimized alignment with the templates is shown in Figure 1B. We also prepared a model of the structure of the CD1211:AcCoA complex (see section 2.8). Based on this model, we could unambiguously determine that the conserved Tyr133 is perfectly positioned to play an important role in catalysis. We also analyzed conservation of other residues in the acceptor-substrate-binding site between the closest homologs of CD1211. The residues that are conserved in this region and could potentially play a role in catalysis or substrate recognition and binding include Glu83, Lys85, Arg86, Asp121 and the more distant Glu44 (Figure 1B, Figure 5B). Comparison of structures of SACOL1063 and CD1211 show similarities; however, the acceptor-substrate-binding pocket is in a more “open” conformation in CD1211 compared to the more “closed” CoA bound form of SACOL1063 (Figure 6).

(A) SACOL1063 in CoA bound form (gray) and CD1211 homology model (blue), (B) overlay of SACOL1063 in CoA bound form (gray), SACOL1063 in apo-form (magenta), and CD1211 homology model (blue).

3.7 Oligomeric states of SACOL1063 and CD1211

We performed gel-filtration chromatography during purification of both SACOL1063 and CD1211 proteins and found SACOL1063 eluted as monomers, whereas the CD1211 protein eluted as predominantly monomers but a subpopulation of dimers also eluted (Supplementary Figure 1). This result indicates that the CD1211 protein may exist in multiple oligomeric states. If the apo protein structure of SACOL1063 is used as a query in the PISA server [39], dimeric or tetrameric states are predicted to be the stable oligomeric forms; however, the large surface of the interaction between the two molecules in the asymmetric unit is likely caused by contacts that are the result of crystal packing. Therefore, they are most likely not biologically relevant. When the CoA-bound form is used as a query, the server predicts the protein is monomeric. The combination of gel filtration data and structural analysis suggests that the SACOL1063 protein functions as a monomer.

3.8 Substrate specificity and kinetic parameters of SACOL1063 and CD1211 proteins

Since the native acetyl-acceptor substrates for SACOL1063 and CD1211 are unknown, we previously performed a broad-substrate screening assay to identify potential classes of substrates for these enzymes [7]. The assay is composed of a variety of compounds including antibiotics, polyamines, organic building blocks, all 20 standard amino acids, small peptides, and other metabolites. Our prior screening revealed that both enzymes had a preference for acetylating some of the 20 standard L-amino acids [7], but we did not perform further kinetic assays to determine substrate specificity or kinetic parameters since the aim was strictly screening in nature. During our prior screening, we identified L-threonine, L-tryptophan, and L-tyrosine as possible substrates for SACOL1063 and L-serine, O-acetyl-L-serine, L-threonine, L-methionine, L-tryptophan, L-valine, L-isoleucine, L-leucine, L-homoserine, 7-aminocephalosporanic acid, thiamine pyrophosphate, and thiamine as possible substrates for CD1211 [7]. Here, we kinetically characterized the enzymes to determine substrate specificity and kinetic parameters of both enzymes.

SACOL1063 showed a strong preference for acetylating L-Thr and displayed a one order of magnitude lower kcat and catalytic efficiency for L-Trp (Table 1). The enzyme also exhibited positive cooperativity toward AcCoA in the presence of L-Thr, whereas no cooperativity was observed toward AcCoA in the presence of L-Trp (Table 1). Additionally, we determined that L-Tyr was not a viable substrate for SACOL1063 under the described assay conditions. Given that SACOL1063 was active toward L-Thr and L-Trp but not N-methyl-L-Thr and N-acetyl-L-Trp [7], we can conclude that SACOL1063 is specific for acetylating the alpha amino group of amino acids and is therefore an N- not O-acetyltransferase.

Table 1.

Kinetic parameters of SACOL1063 and CD1211 enzymes.

AcCoA Substrate Saturation Curves		k_cat (s⁻¹)	S_0.5 (μM)	n	Catalytic Efficiency k_cat/S_0.5 (M⁻¹ s⁻¹)
SACOL1063
	Constant [L-Thr]^a	5.37	419 ± 22	1.43 ± 0.08	1.28 ×10⁴
	Constant [L-Trp]^a	0.365	72.1 ± 7.9	1.05 ± 0.11	5.06 ×10³
CD1211
	Constant [L-Thr]^a	7.97	248 ± 16	1.56 ± 0.13	3.09 ×10⁴
	Constant [L-Ser]^b	2.47	83.6 ± 6.8	1.79 ± 0.20	2.95 ×10⁴
	Constant [O-acetyl-L-Ser]^b	0.447	18.9 ± 1.5	1.91 ± 0.24	2.37 ×10⁴
	Constant [L-Met]^b	0.245	10.8 ± 0.4	2.90 ± 0.50	2.27 ×10⁴

Acceptor Substrate (L-Thr) Saturation Curves		k_cat (s⁻¹)	S_0.5 (mM)	n	Catalytic Efficiency k_cat/S_0.5 (M⁻¹ s⁻¹)
SACOL1063	Constant [AcCoA]^c	10.3	11.6 ± 0.7	1.14 ± 0.02	8.90 ×10²
CD1211	Constant [AcCoA]^c	8.32	5.96 ± 0.41	1.04 ± 0.02	1.40 ×10³

Open in a new tab

Concentration of acceptor substrate was held at 10 mM. The data were fitted to a modified Hill equation as described in Materials and Methods over the concentration range of 0-2 mM AcCoA.

Concentration of acceptor substrate was held at 10 mM. The data were fitted to a modified Hill equation as described in Materials and Methods over the concentration range of 0-0.5 mM AcCoA because slight substrate inhibition was seen at higher concentrations of AcCoA.

Concentration of AcCoA was held at 1 mM. The data were fitted to a modified Hill equation as described in Materials and Methods over the concentration range of 0-10 mM L-Thr.

During our kinetic characterization of CD1211, we found the enzyme acetylated L-Thr, L-Ser, O-acetyl-L-Ser, and L-Met with comparable catalytic efficiencies (Table 1). The highest efficiencies were displayed toward L-Thr and L-Ser, and the kcat toward L-Thr was nearly 3.2-, 17.8-, and 32.5-fold higher compared to L-Ser, O-acetyl-L-Ser, and L-Met, respectively (Table 1). The CD1211 enzyme was not active under the described assay conditions toward the remainder of the compounds identified as possible substrates from the broad-substrate screening assay. Unlike the SACOL1063 enzyme that exhibited positive cooperativity toward AcCoA in the presence of only L-Thr, CD1211 displayed positive cooperativity toward AcCoA in the presence of all four substrates—L-Thr, L-Ser, O-acetyl-L-Ser, and L-Met (Table 1). Effectively no cooperativity was observed for either SACOL1063 or CD1211 toward L-Thr (Table 1). This indicates SACOL1063 and CD1211 only exhibit positive cooperativity toward AcCoA, not both substrates of the bisubstrate reaction. This type of asymmetric cooperativity, where the enzyme only displays cooperativity toward one of the two substrates of a bisubstrate reaction, has been previously observed in some pyruvate kinases [40, 41].

The CD1211 enzyme was active toward L-Thr, L-Ser, and L-Met, but not N-methyl-L-Thr, N-methyl-L-Ser, and N-acetyl-L-Met from the previous screening assay [7]; therefore, the enzyme acts as an N-acetyltransferase by acetylating the alpha amino group of the amino acids. The compound O-acetyl-L-Ser has a modified side chain hydroxyl group and free alpha amino and alpha carboxyl groups available for additional acetylation. Since CD1211 displays a relatively similar catalytic efficiency toward this compound compared to L-Ser, the enzyme activity is not significantly affected when the side chain of L-Ser is modified. This may be important when identifying the native substrate of this enzyme for two main reasons: 1) if CD1211 acts as an N-terminal protein acetyltransferase the protein substrate's N-terminal residue could be either unmodified or post-translationally modified at this position, and 2) if the enzyme acetylates a non-protein substrate the modification of the side chain of O-acetyl-L-Ser may provide clues regarding substituents that are acceptable in this position.

3.9 Catalytic residues and kinetic mechanism

The mechanism of acetyl transfer that is considered to be most common in GNATs is one in which the acetyl group is transferred directly from the AcCoA donor to a primary amine of the acceptor. In this mechanism the ternary complex of the enzyme, which contains acceptor-substrate and AcCoA, is formed followed by nucleophilic attack of the primary amine on the carbonyl carbon of AcCoA. Both SACOL1063 and CD1211 have a Tyr residue that is conserved and suitably positioned in the structure for a role in catalysis. The corresponding Tyr in this position in other GNATs has been shown to be important [8, 34], and it likely functions as a general acid by donating a proton to the thiolate anion of CoA formed after the transfer of the acetyl group. To determine if Tyr114 (SACOL1063) and Tyr133 (CD1211) were important for catalysis, we replaced these residues with phenylalanine and measured their specific activity compared to the wild-type proteins in the presence of 1 mM AcCoA and 10 mM L-Thr. The specific activity of the SACOL1063 Y114F mutant decreased 176-fold compared to the WT protein (8.20 U/mg compared to 0.0466 U/mg). The specific activity of the CD1211 Y133F mutant decreased 8080-fold compared to the WT protein (19.3 U/mg compared to 0.00239 U/mg). These results indicate that the conserved tyrosine residues in both SACOL1063 and CD1211 proteins are indeed critical for catalysis.

To determine whether the SACOL1063 and CD1211 enzymes used the common direct transfer kinetic mechanism or an alternative mechanism that involves an acyl-enzyme intermediate, we tested the fitting of a set of kinetic models using a previously described procedure [24]. We used data obtained from AcCoA substrate saturation curves in the presence of different concentrations of L-Thr (Figure 7). The most compatible model for SACOL1063 was a bireactant random steady-state model (Model VIII, [24]) and a bireactant random steady-state 2 model (Model IX, [24]) for CD1211 (Figure 7, Supplementary Table 3, Supplementary Figure 1), which indicates that both enzymes prefer a direct transfer kinetic mechanism. The minor differences between these two models indicate that both enzymes share a common core kinetic mechanism to acetylate L-Thr. For both enzymes the rate of product formation (k_p) is much greater than the rate constants of the formation of intermediates, and as a result the enzymes prefer a steady-state rather than rapid equilibrium kinetic mechanism. The binding order of L-Thr or AcCoA is random for both enzymes, and either path will produce product. However, it appears that the path of binding AcCoA prior to L-Thr may be preferred because less buildup of intermediate will occur compared to binding L-Thr first.

AcCoA curves are shown in the presence of 2.5, 5, 10, or 20 mM L-Thr as filled circles, open triangles, filled triangles, and open upside down triangles, respectively. The fitting to the most compatible models (ones with the lowest AICc values; Supplementary Table 3) is shown: The models that were most compatible with the data are shown below the graphs: SACOL1063 to the bireactant steady-state model and CD1211 data to the bireactant steady-state 2 model. Equations for each model are shown in [24]. Kinetic parameters obtained from the model for SACOL1063 include K_a (0.77 μM), K_b (53 μM), c (3.0), k_{off_a} (0.72s⁻¹), k_{off_b} (1.1 s⁻¹), and k_p (66 s⁻¹). Calculated rate constants for the top path include k_{on_a} (0.93 s⁻¹μM⁻¹) and k_{on_b} (0.0070 s⁻¹μM⁻¹) and rate constants for the bottom path are k_{on_a} (0.31 s⁻¹μM⁻¹) and k_{on_b} (0.021 s⁻¹μM⁻¹). Kinetic parameters obtained from the model for CD1211 include K_a (0.064 μM), K_b (8.5 μM), c (5.0), k_{off_a1} (0.25 s⁻¹), k_{off_a2} (1.1 s⁻¹), k_{off_b1} (0.96 s⁻¹), k_{off_b2} (0.78 s⁻¹), and k_p (15 s⁻¹). Calculated rate constants include k_{on_a1} (3.9 s⁻¹μM⁻¹), k_{on_a2} (3.3 s⁻¹μM⁻¹), k_{on_b1} (0.11 s⁻¹μM⁻¹), and k_{on_b2} (0.018 s⁻¹μM⁻¹).

4. DISCUSSION

Although many GNATs have been identified in all kingdoms of life, the biological activities of many of them remain unknown and are very diverse. To provide a framework for downstream studies of GNATs from pathogenic bacteria, our laboratories seek to determine the 3D structures of uncharacterized bacterial GNATs and study their kinetic properties. Our contribution toward addressing the need for increased functional and structural characterization of bacterial GNATs is presented in this work where we identified and characterized two prokaryotic N-acetyltransferases from Gram-positive pathogenic bacteria - SACOL1063 from S. aureus and CD1211 C. difficile. These proteins represent two different and previously uncharacterized subgroups within the large superfamily of GNATs.

Using X-ray crystallography, we determined the structure of SACOL1063 in the apo-form and in complex with CoA. In the CoA complex, we observed two molecules of CoA—one bound in the canonical AcCoA/CoA binding site, and one bound in the acceptor-substrate-binding site. A significant conformational change of this loop upon binding CoA in the acceptor-substrate-binding site occurs, and the manner in which the protein interacts with CoA in this site suggests how the enzyme may interact with its substrate. The loop that ‘closes’ the acceptor-substrate-binding site upon binding CoA in this site was observed to be substantially more flexible in the apo structure and its conformation has not been observed in structures of homologous proteins. Both SACOL1063 and CD1211 proteins were extremely difficult to crystallize. The flexibility of the aforementioned loop in SACOL1063 may be a culprit in the inability to form suitable crystals. In the case of the SACOL1063 apo structure the crystal grew due to micro seeding, while for the CoA complex the loop was stabilized by the bound CoA. We were unable to obtain suitable crystals of CD1211 for data collection despite numerous attempts, therefore we used bioinformatics methods to generate a comparative (homology) model of CD1211 to gain insight into its three dimensional structure.

We previously screened the SACOL1063 and CD1211 proteins to identify potential substrates and found that they both preferentially acetylate amino acids [7]; however, a thorough kinetic characterization with these substrates was not performed until now. The broad-substrate screen used for identification of potential substrates provides a starting point for further kinetic characterization and narrows the possible classes of substrates for analysis to a more manageable number. Our results showed SACOL1063 has a very narrow substrate specificity compared to CD1211, which could acetylate several amino acids (L-Thr, L-Ser, O-acetyl-L-Ser, and L-Met). Although both SACOL1063 and CD1211 showed higher activity with L-Thr, they exhibit different kinetic characteristics. We analyzed the phylogenetic distribution of the proteins that showed high sequence similarity to our enzymes and found when SACOL1063 was used as a query we identified its homolog from Clostridium difficile 630 that was about 30% identical (GI: 126699123). However, when CD1211 was used as a query we did not identify any similar protein in Staphylococcus aureus. These two distinct enzymes from different GNAT subgroups are likely involved in very specific roles within their respective organisms.

Since these enzymes both N-acetylate the alpha amino group of amino acids, it is tempting to speculate that they may be involved in N-terminal protein acetylation (Nα-acetylation), but they may also acetylate other substrates like antimicrobial peptides, lipopeptides, other metabolites, or catalyze the acylation of lipoamino acids. Antimicrobial peptides are host defense molecules naturally found in all living organisms, but are also common as synthetic alternatives to other antimicrobial agents like classical antibiotics. Antimicrobial peptides undergo numerous post-translational modifications, including N-acetylation, that change their physicochemical properties [42]. Nα-acetylation occurs when an acetyl group is transferred from AcCoA to the N-terminal (Nα) amine of a protein substrate. It is one of the earliest modifications that can occur during protein synthesis and is considered an irreversible modification that is either on the N-terminal methionine or the consecutive residue after removal of the initial methionine [2, 4, 6, 43]. Although N-terminal protein acetylation is relatively well characterized in eukaryotes, little is known about it in bacteria [2, 4, 6, 43]. Recently, Nα-acetylation in bacteria has been described as being more abundant than previously recognized and a number of substrate proteins have been identified [5]; however, knowledge regarding which enzymes catalyze these reactions remain obscure.

We wanted to know if L-Thr Nα-acetylation could be possible in S. aureus and C. difficile, therefore, we analyzed the occurrence of each amino acid in the position following the N-terminal Met in the S. aureus strain COL and the C. difficile strain 630 (Figure 8). The incidence of Thr in the position after the initiator Met residue in all proteins of S. aureus strain COL is more than 11%, which, if SACOL1063 is a protein Nα-acetyltransferase, may reflect the tighter substrate specificity of SACOL1063 for L-Thr. Distribution of other amino acids in the C. difficile strain 630 is generally similar to that of S. aureus strain COL; however, Thr is only present in the position following the N-terminal Met in just over 2% of proteins from C. difficile strain 630.

*Staphylococcus aureus strain COL* (blue) and *Clostridium difficile strain 630* (red).

5. CONCLUSIONS

The physiological substrates for both SACOL1063 and CD1211 enzymes remain to be determined, but our work provides a foundation that will be beneficial for interpreting structural and kinetic details of how these enzymes may function. Further characterization of these proteins in the context of their in vivo environment may reveal the true substrates for these enzymes and increase our understanding of their function(s) in bacteria or within a host.

Supplementary Material

NIHMS828278-supplement-1.pdf^{(282.7KB, pdf)}

NIHMS828278-supplement-2.xls^{(948.5KB, xls)}

NIHMS828278-supplement-3.xls^{(172KB, xls)}

NIHMS828278-supplement-04.pdf^{(54.9KB, pdf)}

Highlights.

We characterized SACOL1063 and CD1211 GNATs of unknown function
SACOL1063 preferentially acetylates the amino acid L-threonine
CD1211 prefers L-Thr, but can also acetylate L-Ser, O-acetyl-L-Ser, and L-Met
The SACOL1063 structure shows substantial conformational changes of an important loop
Two molecules of CoA were bound in donor and acceptor sites of SACOL1063

ACKNOWLEDGEMENTS

We thank Dr. Sergii Pshenychnyi at the Recombinant Protein Production Core at Northwestern University for protein expression and purification services. This project was funded in whole or in part with Federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health (NIH), Department of Health and Human Services, under Contracts No. HHSN272200700058C and HHSN272201200026C (CSGID). Additional funding for this project includes San Francisco State University Startup Funds (to MLK). Some results shown in this report are derived from work performed at the Structural Biology Center Sector 19 at the Advanced Photon Source. The Advanced Photon Source, an Office of Science User Facility, is operated for the U.S. Department of Energy (DOE) Office of Science by Argonne National Laboratory. Argonne is operated by UChicago Argonne, LLC, for the U.S. DOE Office of Biological and Environmental Research under contract DE-AC02-06CH11357.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCES

1.Vetting MW, S.d.C. LP, Yu M, Hegde SS, Magnet S, Roderick SL, Blanchard JS. Structure and functions of the GNAT superfamily of acetyltransferases. Arch Biochem Biophys. 2005;433:212–226. doi: 10.1016/j.abb.2004.09.003. [DOI] [PubMed] [Google Scholar]
2.Soppa J. Protein acetylation in archaea. bacteria, and eukaryotes, Archaea. 2010;2010 doi: 10.1155/2010/820681. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Salah Ud-Din AI, Tikhomirova A, Roujeinikova A. Structure and Functional Diversity of GCN5-Related N-Acetyltransferases (GNAT) International journal of molecular sciences. 2016;17 doi: 10.3390/ijms17071018. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Hollebeke J, Van Damme P, Gevaert K. N-terminal acetylation and other functions of Nalpha-acetyltransferases. Biol Chem. 2012;393:291–298. doi: 10.1515/hsz-2011-0228. [DOI] [PubMed] [Google Scholar]
5.Ouidir T, Jarnier F, Cosette P, Jouenne T, Hardouin J. Characterization of N-terminal protein modifications in Pseudomonas aeruginosa PA14. Journal of proteomics. 2015;114:214–225. doi: 10.1016/j.jprot.2014.11.006. [DOI] [PubMed] [Google Scholar]
6.Jones JD, O'Connor CD. Protein acetylation in prokaryotes. Proteomics. 2011;11:3012–3022. doi: 10.1002/pmic.201000812. [DOI] [PubMed] [Google Scholar]
7.Kuhn ML, Majorek KA, Minor W, Anderson WF. Broad-substrate screen as a tool to identify substrates for bacterial Gcn5-related N-acetyltransferases with unknown substrate specificity. Protein Sci. 2013;22:222–230. doi: 10.1002/pro.2199. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Majorek KA, Kuhn ML, Chruszcz M, Anderson WF, Minor W. Structural, functional and inhibition studies of a GNAT superfamily protein PA4794: a new C-terminal lysine protein acetyltransferase from Pseudomonas aeruginosa. J Biol Chem. 2013 doi: 10.1074/jbc.M113.501353. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Zimmerman MD, Chruszcz M, Koclega KD, Otwinowski Z, Minor W. The Xtaldb system for project salvaging in high-throughput crystallization. Acta Crystallogr. Sect. A. 2005;61:c178–c179. [Google Scholar]
10.Zimmerman MD, Grabowski M, Domagalski MJ, Maclean EM, Chruszcz M, Minor W. Data management in the modern structural biology and biomedical research environment. Methods in molecular biology. 2014;1140:1–25. doi: 10.1007/978-1-4939-0354-2_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Rosenbaum G, Alkire RW, Evans G, Rotella FJ, Lazarski K, Zhang RG, Ginell SL, Duke N, Naday I, Lazarz J, Molitsky MJ, Keefe L, Gonczy J, Rock L, Sanishvili R, Walsh MA, Westbrook E, Joachimiak A. The Structural Biology Center 19ID undulator beamline: facility specifications and protein crystallographic results. J Synchrotron Radiat. 2006;13:30–45. doi: 10.1107/S0909049505036721. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Otwinowski Z, Minor W. Methods in enzymology: Macromolecular crystallography , part A. Vol. 276. Academic Press; New York: 1997. Processing of X-ray diffraction data collected in oscillation mode. pp. 307–326. [DOI] [PubMed] [Google Scholar]
13.Minor W, Cymborowski M, Otwinowski Z, Chruszcz M. HKL-3000: the integration of data reduction and structure solution--from diffraction images to an initial model in minutes. Acta Crystallogr D Biol Crystallogr. 2006;62:859–866. doi: 10.1107/S0907444906019949. [DOI] [PubMed] [Google Scholar]
14.Sheldrick GM. A short history of SHELX. Acta Crystallogr A. 2008;64:112–122. doi: 10.1107/S0108767307043930. [DOI] [PubMed] [Google Scholar]
15.Sheldrick GM. Experimental phasing with SHELXC/D/E: combining chain tracing with density modification. Acta Crystallogr D Biol Crystallogr. 2010;66:479–485. doi: 10.1107/S0907444909038360. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.N. Collaborative Computional Project The CCP4 suite: programs for protein crystallography. Acta Crystallogr, Sect D: Biol. Crystallogr. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]
17.Perrakis A, Morris R, Lamzin VS. Automated protein model building combined with iterative structure refinement. Nat Struct Biol. 1999;6:458–463. doi: 10.1038/8263. [DOI] [PubMed] [Google Scholar]
18.Murshudov GN, Skubak P, Lebedev AA, Pannu NS, Steiner RA, Nicholls RA, Winn MD, Long F, Vagin AA. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr D Biol Crystallogr. 2011;67:355–367. doi: 10.1107/S0907444911001314. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
20.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D Biol Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Painter J, Merritt EA. TLSMD web server for the generation of multi-group TLS models. Journal of Applied Crystallography. 2006;39:109–111. [Google Scholar]
22.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, 3rd, Snoeyink J, Richardson JS, Richardson DC. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 2007;35:W375–383. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Yang H, Guranovic V, Dutta S, Feng Z, Berman HM, Westbrook JD. Automated and accurate deposition of structures solved by X-ray diffraction to the Protein Data Bank. Acta Crystallographica Section D. 2004;60:1833–1839. doi: 10.1107/S0907444904019419. [DOI] [PubMed] [Google Scholar]
24.Filippova EV, Kuhn ML, Osipiuk J, Kiryukhina O, Joachimiak A, Ballicora MA, Anderson WF. A novel polyamine allosteric site of SpeG from Vibrio cholerae is revealed by its dodecameric structure. J Mol Biol. 2015;427:1316–1334. doi: 10.1016/j.jmb.2015.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Pei J, Kim BH, Grishin NV. PROMALS3D: a tool for multiple protein sequence and structure alignments. Nucleic Acids Res. 2008;36:2295–2300. doi: 10.1093/nar/gkn072. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Kurowski MA, Bujnicki JM. GeneSilico protein structure prediction meta-server. Nucleic Acids Res. 2003;31:3305–3307. doi: 10.1093/nar/gkg557. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Lundstrom J, Rychlewski L, Bujnicki J, Elofsson A. Pcons: a neural-network-based consensus predictor that improves fold recognition. Protein Sci. 2001;10:2354–2362. doi: 10.1110/ps.08501. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993;234:779–815. doi: 10.1006/jmbi.1993.1626. [DOI] [PubMed] [Google Scholar]
30.Pawlowski M, Gajda MJ, Matlak R, Bujnicki JM. MetaMQAP: a meta-server for the quality assessment of protein models. BMC Bioinformatics. 2008;9:403. doi: 10.1186/1471-2105-9-403. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Wallner B, Elofsson A. Can correct protein models be identified? Protein Sci. 2003;12:1073–1086. doi: 10.1110/ps.0236803. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Dyda F, Klein DC, Hickman AB. GCN5-related N-acetyltransferases: a structural overview. Annu Rev Biophys Biomol Struct. 2000;29:81–103. doi: 10.1146/annurev.biophys.29.1.81. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Neuwald AF, Landsman D. GCN5-related histone N-acetyltransferases belong to a diverse superfamily that includes the yeast SPT10 protein. Trends Biochem Sci. 1997;22:154–155. doi: 10.1016/s0968-0004(97)01034-7. [DOI] [PubMed] [Google Scholar]
34.Vetting MW, Bareich DC, Yu M, Blanchard JS. Crystal structure of RimI from Salmonella typhimurium LT2, the GNAT responsible for N(alpha)-acetylation of ribosomal protein S18. Protein Sci. 2008;17:1781–1790. doi: 10.1110/ps.035899.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Holm L, Rosenstrom P. Dali server: conservation mapping in 3D. Nucleic Acids Res. 2010;38:W545–549. doi: 10.1093/nar/gkq366. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Vetting MW, de Carvalho LP, Roderick SL, Blanchard JS. A novel dimeric structure of the RimL Nalpha-acetyltransferase from Salmonella typhimurium. J Biol Chem. 2005;280:22108–22114. doi: 10.1074/jbc.M502401200. [DOI] [PubMed] [Google Scholar]
37.Forouhar F, Lee IS, Vujcic J, Vujcic S, Shen J, Vorobiev SM, Xiao R, Acton TB, Montelione GT, Porter CW, Tong L. Structural and functional evidence for Bacillus subtilis PaiA as a novel N1-spermidine/spermine acetyltransferase. J Biol Chem. 2005;280:40328–40336. doi: 10.1074/jbc.M505332200. [DOI] [PubMed] [Google Scholar]
38.He H, Ding Y, Bartlam M, Sun F, Le Y, Qin X, Tang H, Zhang R, Joachimiak A, Liu J, Zhao N, Rao Z. Crystal structure of tabtoxin resistance protein complexed with acetyl coenzyme A reveals the mechanism for beta-lactam acetylation. J Mol Biol. 2003;325:1019–1030. doi: 10.1016/s0022-2836(02)01284-6. [DOI] [PubMed] [Google Scholar]
39.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
40.Munoz ME, Ponce E. Pyruvate kinase: current status of regulatory and functional properties, Comparative biochemistry and physiology. Part B. Biochemistry & molecular biology. 2003;135:197–218. doi: 10.1016/s1096-4959(03)00081-2. [DOI] [PubMed] [Google Scholar]
41.Faustova I, Järv J. Kinetic analysis of cooperativity of phosphorylated L-type pyruvate kinase. Proc. Estonian Acad. Sci. Chem. 2006;55:179–189. [Google Scholar]
42.Wang G. Post-translational Modifications of Natural Antimicrobial Peptides and Strategies for Peptide Engineering. Current biotechnology. 2012;1:72–79. doi: 10.2174/2211550111201010072. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Bradshaw RA, Brickey WW, Walker KW. N-terminal processing: the methionine aminopeptidase and N alpha-acetyl transferase families. Trends Biochem Sci. 1998;23:263–267. doi: 10.1016/s0968-0004(98)01227-4. [DOI] [PubMed] [Google Scholar]
44.Heinig M, Frishman D. STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 2004;32:W500–502. doi: 10.1093/nar/gkh429. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHMS828278-supplement-1.pdf^{(282.7KB, pdf)}

NIHMS828278-supplement-2.xls^{(948.5KB, xls)}

NIHMS828278-supplement-3.xls^{(172KB, xls)}

NIHMS828278-supplement-04.pdf^{(54.9KB, pdf)}

[R1] 1.Vetting MW, S.d.C. LP, Yu M, Hegde SS, Magnet S, Roderick SL, Blanchard JS. Structure and functions of the GNAT superfamily of acetyltransferases. Arch Biochem Biophys. 2005;433:212–226. doi: 10.1016/j.abb.2004.09.003. [DOI] [PubMed] [Google Scholar]

[R2] 2.Soppa J. Protein acetylation in archaea. bacteria, and eukaryotes, Archaea. 2010;2010 doi: 10.1155/2010/820681. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Salah Ud-Din AI, Tikhomirova A, Roujeinikova A. Structure and Functional Diversity of GCN5-Related N-Acetyltransferases (GNAT) International journal of molecular sciences. 2016;17 doi: 10.3390/ijms17071018. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Hollebeke J, Van Damme P, Gevaert K. N-terminal acetylation and other functions of Nalpha-acetyltransferases. Biol Chem. 2012;393:291–298. doi: 10.1515/hsz-2011-0228. [DOI] [PubMed] [Google Scholar]

[R5] 5.Ouidir T, Jarnier F, Cosette P, Jouenne T, Hardouin J. Characterization of N-terminal protein modifications in Pseudomonas aeruginosa PA14. Journal of proteomics. 2015;114:214–225. doi: 10.1016/j.jprot.2014.11.006. [DOI] [PubMed] [Google Scholar]

[R6] 6.Jones JD, O'Connor CD. Protein acetylation in prokaryotes. Proteomics. 2011;11:3012–3022. doi: 10.1002/pmic.201000812. [DOI] [PubMed] [Google Scholar]

[R7] 7.Kuhn ML, Majorek KA, Minor W, Anderson WF. Broad-substrate screen as a tool to identify substrates for bacterial Gcn5-related N-acetyltransferases with unknown substrate specificity. Protein Sci. 2013;22:222–230. doi: 10.1002/pro.2199. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Majorek KA, Kuhn ML, Chruszcz M, Anderson WF, Minor W. Structural, functional and inhibition studies of a GNAT superfamily protein PA4794: a new C-terminal lysine protein acetyltransferase from Pseudomonas aeruginosa. J Biol Chem. 2013 doi: 10.1074/jbc.M113.501353. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Zimmerman MD, Chruszcz M, Koclega KD, Otwinowski Z, Minor W. The Xtaldb system for project salvaging in high-throughput crystallization. Acta Crystallogr. Sect. A. 2005;61:c178–c179. [Google Scholar]

[R10] 10.Zimmerman MD, Grabowski M, Domagalski MJ, Maclean EM, Chruszcz M, Minor W. Data management in the modern structural biology and biomedical research environment. Methods in molecular biology. 2014;1140:1–25. doi: 10.1007/978-1-4939-0354-2_1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Rosenbaum G, Alkire RW, Evans G, Rotella FJ, Lazarski K, Zhang RG, Ginell SL, Duke N, Naday I, Lazarz J, Molitsky MJ, Keefe L, Gonczy J, Rock L, Sanishvili R, Walsh MA, Westbrook E, Joachimiak A. The Structural Biology Center 19ID undulator beamline: facility specifications and protein crystallographic results. J Synchrotron Radiat. 2006;13:30–45. doi: 10.1107/S0909049505036721. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Otwinowski Z, Minor W. Methods in enzymology: Macromolecular crystallography , part A. Vol. 276. Academic Press; New York: 1997. Processing of X-ray diffraction data collected in oscillation mode. pp. 307–326. [DOI] [PubMed] [Google Scholar]

[R13] 13.Minor W, Cymborowski M, Otwinowski Z, Chruszcz M. HKL-3000: the integration of data reduction and structure solution--from diffraction images to an initial model in minutes. Acta Crystallogr D Biol Crystallogr. 2006;62:859–866. doi: 10.1107/S0907444906019949. [DOI] [PubMed] [Google Scholar]

[R14] 14.Sheldrick GM. A short history of SHELX. Acta Crystallogr A. 2008;64:112–122. doi: 10.1107/S0108767307043930. [DOI] [PubMed] [Google Scholar]

[R15] 15.Sheldrick GM. Experimental phasing with SHELXC/D/E: combining chain tracing with density modification. Acta Crystallogr D Biol Crystallogr. 2010;66:479–485. doi: 10.1107/S0907444909038360. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.N. Collaborative Computional Project The CCP4 suite: programs for protein crystallography. Acta Crystallogr, Sect D: Biol. Crystallogr. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]

[R17] 17.Perrakis A, Morris R, Lamzin VS. Automated protein model building combined with iterative structure refinement. Nat Struct Biol. 1999;6:458–463. doi: 10.1038/8263. [DOI] [PubMed] [Google Scholar]

[R18] 18.Murshudov GN, Skubak P, Lebedev AA, Pannu NS, Steiner RA, Nicholls RA, Winn MD, Long F, Vagin AA. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr D Biol Crystallogr. 2011;67:355–367. doi: 10.1107/S0907444911001314. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]

[R20] 20.Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D Biol Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Painter J, Merritt EA. TLSMD web server for the generation of multi-group TLS models. Journal of Applied Crystallography. 2006;39:109–111. [Google Scholar]

[R22] 22.Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, 3rd, Snoeyink J, Richardson JS, Richardson DC. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 2007;35:W375–383. doi: 10.1093/nar/gkm216. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Yang H, Guranovic V, Dutta S, Feng Z, Berman HM, Westbrook JD. Automated and accurate deposition of structures solved by X-ray diffraction to the Protein Data Bank. Acta Crystallographica Section D. 2004;60:1833–1839. doi: 10.1107/S0907444904019419. [DOI] [PubMed] [Google Scholar]

[R24] 24.Filippova EV, Kuhn ML, Osipiuk J, Kiryukhina O, Joachimiak A, Ballicora MA, Anderson WF. A novel polyamine allosteric site of SpeG from Vibrio cholerae is revealed by its dodecameric structure. J Mol Biol. 2015;427:1316–1334. doi: 10.1016/j.jmb.2015.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Pei J, Kim BH, Grishin NV. PROMALS3D: a tool for multiple protein sequence and structure alignments. Nucleic Acids Res. 2008;36:2295–2300. doi: 10.1093/nar/gkn072. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Kurowski MA, Bujnicki JM. GeneSilico protein structure prediction meta-server. Nucleic Acids Res. 2003;31:3305–3307. doi: 10.1093/nar/gkg557. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Lundstrom J, Rychlewski L, Bujnicki J, Elofsson A. Pcons: a neural-network-based consensus predictor that improves fold recognition. Protein Sci. 2001;10:2354–2362. doi: 10.1110/ps.08501. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993;234:779–815. doi: 10.1006/jmbi.1993.1626. [DOI] [PubMed] [Google Scholar]

[R30] 30.Pawlowski M, Gajda MJ, Matlak R, Bujnicki JM. MetaMQAP: a meta-server for the quality assessment of protein models. BMC Bioinformatics. 2008;9:403. doi: 10.1186/1471-2105-9-403. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Wallner B, Elofsson A. Can correct protein models be identified? Protein Sci. 2003;12:1073–1086. doi: 10.1110/ps.0236803. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Dyda F, Klein DC, Hickman AB. GCN5-related N-acetyltransferases: a structural overview. Annu Rev Biophys Biomol Struct. 2000;29:81–103. doi: 10.1146/annurev.biophys.29.1.81. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Neuwald AF, Landsman D. GCN5-related histone N-acetyltransferases belong to a diverse superfamily that includes the yeast SPT10 protein. Trends Biochem Sci. 1997;22:154–155. doi: 10.1016/s0968-0004(97)01034-7. [DOI] [PubMed] [Google Scholar]

[R34] 34.Vetting MW, Bareich DC, Yu M, Blanchard JS. Crystal structure of RimI from Salmonella typhimurium LT2, the GNAT responsible for N(alpha)-acetylation of ribosomal protein S18. Protein Sci. 2008;17:1781–1790. doi: 10.1110/ps.035899.108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] 35.Holm L, Rosenstrom P. Dali server: conservation mapping in 3D. Nucleic Acids Res. 2010;38:W545–549. doi: 10.1093/nar/gkq366. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] 36.Vetting MW, de Carvalho LP, Roderick SL, Blanchard JS. A novel dimeric structure of the RimL Nalpha-acetyltransferase from Salmonella typhimurium. J Biol Chem. 2005;280:22108–22114. doi: 10.1074/jbc.M502401200. [DOI] [PubMed] [Google Scholar]

[R37] 37.Forouhar F, Lee IS, Vujcic J, Vujcic S, Shen J, Vorobiev SM, Xiao R, Acton TB, Montelione GT, Porter CW, Tong L. Structural and functional evidence for Bacillus subtilis PaiA as a novel N1-spermidine/spermine acetyltransferase. J Biol Chem. 2005;280:40328–40336. doi: 10.1074/jbc.M505332200. [DOI] [PubMed] [Google Scholar]

[R38] 38.He H, Ding Y, Bartlam M, Sun F, Le Y, Qin X, Tang H, Zhang R, Joachimiak A, Liu J, Zhao N, Rao Z. Crystal structure of tabtoxin resistance protein complexed with acetyl coenzyme A reveals the mechanism for beta-lactam acetylation. J Mol Biol. 2003;325:1019–1030. doi: 10.1016/s0022-2836(02)01284-6. [DOI] [PubMed] [Google Scholar]

[R39] 39.Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]

[R40] 40.Munoz ME, Ponce E. Pyruvate kinase: current status of regulatory and functional properties, Comparative biochemistry and physiology. Part B. Biochemistry & molecular biology. 2003;135:197–218. doi: 10.1016/s1096-4959(03)00081-2. [DOI] [PubMed] [Google Scholar]

[R41] 41.Faustova I, Järv J. Kinetic analysis of cooperativity of phosphorylated L-type pyruvate kinase. Proc. Estonian Acad. Sci. Chem. 2006;55:179–189. [Google Scholar]

[R42] 42.Wang G. Post-translational Modifications of Natural Antimicrobial Peptides and Strategies for Peptide Engineering. Current biotechnology. 2012;1:72–79. doi: 10.2174/2211550111201010072. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] 43.Bradshaw RA, Brickey WW, Walker KW. N-terminal processing: the methionine aminopeptidase and N alpha-acetyl transferase families. Trends Biochem Sci. 1998;23:263–267. doi: 10.1016/s0968-0004(98)01227-4. [DOI] [PubMed] [Google Scholar]

[R44] 44.Heinig M, Frishman D. STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 2004;32:W500–502. doi: 10.1093/nar/gkh429. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Insight into the 3D structure and substrate specificity of previously uncharacterized GNAT superfamily acetyltransferases from pathogenic bacteria

Karolina A Majorek

Tomasz Osinski

David T Tran

Alina Revilla

Wayne F Anderson

Wladek Minor

Misty L Kuhn

Abstract

1. Introduction

2. MATERIALS AND METHODS

2.1 Materials

2.2 Cloning, expression and purification

2.3 Site-directed mutagenesis

2.4 Crystallization

2.5 Data collection, structure determination and refinement

Table 2.

2.6 Activity screening and measurement of kinetic activity

2.7 Sequence analyses

2.8 Modeling of CD1211 protein structure

3. RESULTS

3.1 Identification and sequence comparison of SACOL1063 and CD1211 related proteins

Figure 1. Sequence alignments of selected proteins from organisms representing SACOL1063 (A) and CD1211 (B) homologs.

3.2 Overall structure of SACOL1063

Figure 2. Structure of SACOL1063 in the apo form.

3.3 Structure of SACOL1063 in complex with CoA

Figure 3. Structure of SACOL1063 in complex with CoA.

3.4 Comparison of SACOL1063 apo- and CoA structures

Figure 4. Superposition of SACOL1063 structures.

3.5 Comparison of SACOL1063 structure to other GNATs

3.6 Comparative model of CD1211

Figure 5. Homology model of CD1211.

Figure 6. Comparison of homology model of CD1211 with structure of SACOL1063.

3.7 Oligomeric states of SACOL1063 and CD1211

3.8 Substrate specificity and kinetic parameters of SACOL1063 and CD1211 proteins

Table 1.

3.9 Catalytic residues and kinetic mechanism

Figure 7. Series of AcCoA curves at varying concentrations of L-threonine for both SACOL1063 and CD1211.

4. DISCUSSION

Figure 8. The incidence of each amino acid in the position following the N-terminal Met.

5. CONCLUSIONS

Supplementary Material

Highlights.

ACKNOWLEDGEMENTS

Footnotes

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases