Abstract
Many proteins are transported across lipid membranes by protein translocation systems in living cells. The twin-arginine transport (Tat) system identified in bacteria and plant chloroplasts is a unique system that transports proteins across membranes in their fully-folded states. Up to date, the detailed molecular mechanism of this process remains largely unclear. The Escherichia coli Tat system consists of three essential transmembrane proteins: TatA, TatB and TatC. Among them, TatB and TatC form a tight complex and function in substrate recognition. The major component TatA contains a single transmembrane helix followed by an amphipathic helix, and is suggested to form the translocation pore via self-oligomerization. Since the TatA oligomer has to accommodate substrate proteins of various sizes and shapes, the process of its assembly stands essential for understanding the translocation mechanism. A structure model of TatA oligomer was recently proposed based on NMR and EPR observations, revealing contacts between the transmembrane helices from adjacent subunits. Herein we report the construction and stabilization of a dimeric TatA, as well as the structure determination by solution NMR spectroscopy. In addition to more extensive inter-subunit contacts between the transmembrane helices, we were also able to observe interactions between neighbouring amphipathic helices. The side-by-side packing of the amphipathic helices extends the solvent-exposed hydrophilic surface of the protein, which might be favourable for interactions with substrate proteins. The dimeric TatA structure offers more detailed information of TatA oligomeric interface and provides new insights on Tat translocation mechanism.
Introduction
Distinct to the ubiquitous Sec-pathway that translocates proteins across lipid membranes in unfolded states via a threading mechanism, the twin-arginine transport (Tat) system translocates proteins across membranes in fully folded states, some in homo- or hetero-oligomeric forms, or harboring cofactors or metal centers [1], [2]. In consequence, transportation of these folded proteins across membranes would require a highly dynamic and versatile translocation pore. Thus far, the molecular mechanism for Tat system to carry out the challenging task of translocating proteins with various shapes and sizes remains unclear.
The Escherichia coli Tat system is a model system that has been extensively investigated. It contains three essential components: two single-pass transmembrane proteins TatA and TatB, and a six-pass transmembrane protein TatC. Among these, TatA is the major component and is suggested to form the protein translocation pore by self-oligomerization [3]–[5]. Experimental evidences suggest that the TatB and TatC components function as a complex in substrate recognition and binding [6]–[8]. In the presence of proton motive force (PMF), the substrate-binding event triggers the recruitment of TatA subunits, which subsequently assemble into a high-oligomeric state for substrate translocation [9]. The high-oligomeric TatA has been suggested to function as the protein translocation pore, and the mechanism for TatA oligomer assembly proves essential for understanding Tat-mediated protein translocation.
A number of studies have been carried out to investigate the structure and oligomerization mechanism of the TatA subunit [10]–[13]. E. coli TatA is an 89 amino acid protein containing one transmembrane helix (TMH) and one amphipathic helix (APH). Both in vitro electron microscopy (EM) [3] and in vivo single-molecule imaging [5] studies showed that TatA self-oligomerizes into complexes of variable sizes and form a channel-like structure. Electron paramagnetic resonance (EPR) data suggested that residues Ile12 and Val14 in the TMH are the main contributors to TatA assembly [14]. Recently, the NMR structure of a TatA T22P mutant in dodecylphosphocholine (DPC) micelles was solved [12]. This mutant exists in the monomeric form but displays a spectrum similar to the oligomeric form. Based on this structure, a 9-mer TatA oligomer was further modelled using distance restraints from both EPR and NMR results [12]. However, inter-subunit NOE restraints are sparse and limited only in the TMH segment. On the other hand, previous biochemical studies demonstrated that the TatA function is more sensitive to mutations in the APH segment than in the TMH [15]. This raises the question of how APH is involved in the oligomerization.
Herein we report the construction and structure determination of TatA dimer by solution NMR spectroscopy. The dimeric TatA is stabilized by the introduction of a disulphide cross-linkage in the N-terminus without disrupting the overall structure. By using this dimeric sample in combination with high-resolution NMR spectroscopy, we were able to observe more extensive inter-subunit contacts in not only the TMH region, but also the APH region. The dimeric TatA structure offers a more detailed atomic picture of TatA oligomerization interface and provides valuable clues for further investigations.
Materials and Methods
NMR Samples Preparations
Two TatA constructs MCG-TatA and MCG-TatA1–55 were prepared, both containing three additional amino acids Met-Cys-Gly at the N-terminus. In the MCG-TatA1–55 construct, the C-terminal region was truncated and only residues Met1-Gln55 were kept. Expression and purification procedures of TatA proteins were similar to previously reported [10]. TatA protein was purified in DPC (Anatrace) and exchanged to a buffer containing 50 mM sodium phosphate (pH 7.0). For N-terminal linked dimeric TatA samples, purified MCG-TatA or MCG-TatA1–55 was first incubated with 10 mM H2O2 for 10 minutes at room temperature, followed by ultrafiltration to remove the oxidant. For 13C/15N-filtered-NOESY experiments, the sample was prepared by mixing unlabeled and 13C/15N-labeled MCG-TatA at 2∶1 molar ratio, then treated with 20 mM DTT in ∼300 mM deuterated DPC (dDPC, Anatrace). Subsequently, DTT and dDPC concentrations were gradually reduced by ultrafiltration. The mixed protein sample was then treated with H2O2 as described above. In the final sample, the detergent-to-protein molar ratio (DPR) was about 80 (Table S1 in File S1).
For paramagnetic relaxation enhancement (PRE) experiments [16], we constructed two TatA mutants TatA1–55-I12C and TatA1–55-I11C. Unlabeled TatA1–55-I12C or TatA1–55-I11C protein (0.5 mM) was dissolved in 50 mM sodium phosphate buffer (pH 7.0) with 150 mM DPC. 1-oxy-2,2,5,5-tetramethyl-D-pyrroline-3-methyl methanethiosulfonate (MTSL, Toronto Research Chemicals, Inc.) was added to a final concentration of 5 mM. The sample was mixed with 0.5 mM 15N-labeled TatA1–55 protein and incubated overnight at room temperature. Excess MTSL and DPC were removed by buffer exchange. The final DPR was 40. To obtain the diamagnetic reference spectra, MTSL was reduced by the addition of 4 µL of a 500 mM ascorbic acid solution.
NMR Spectroscopy
The NMR experiments were performed at 35°C on Bruker Avance 600, 700 and 800 MHz spectrometers equipped with four RF channels and triple-resonance cryo-probes with pulsed-field gradients. The chemical shift assignments were obtained by conventional triple resonance experiments [17] using 13C/15N-labeled TatA sample in dDPC micelles. 3D 15N- and 13C-edited NOESY-HSQC spectra (mixing times 100 ms and 200 ms) for TatA in dDPC micelles were collected to confirm the assignments and obtain distance restraints. For N-terminal linked dimeric TatA comprising mixed unlabeled and 13C/15N-labeled subunits, 3D 13C/15N-filtered 13C-edited NOESY-HSQC experiments (mixing time of 200 ms, 300 ms and 400 ms) and 13C-edited NOESY-HSQC experiment (mixing time of 150 ms) were recorded to obtain intermolecular NOEs.
RDC Measurements
The backbone N-H RDC measurements of E. coli TatA were performed using the liquid crystalline phase of G-tetrad DNA [18]. 1H-15N IPAP-HSQC spectra were collected for the weakly aligned and the isotropic samples at 35°C, and the RDCs were extracted from the differences in 1H-15N splitting [19]. The data were analyzed using software packages PALES [20] and MODULE [21].
Structure Calculation of TatA
The structure calculation of E. coli TatA was carried out using inter-proton NOE-derived distance restraints and dihedral angle restraints. The C-terminal tail (K49-V89) is largely unstructured based on NOESY spectra analysis and was excluded during the calculation. The program TALOS was used to predict dihedral angles ψ and φ restraints [22].
Calculations of monomeric and dimeric TatA structures were carried out using the program Xplor-NIH [23] with or without intermolecular NOEs following a procedure similar to previously described [10]. For TatA monomer, 100 structures were calculated in each cycle, and 50 conformers with the lowest total energy were selected for fitting with backbone N-H RDCs using the single value decomposition method by the program PALES [20]. In the final cycle, 29 conformers showed correlation coefficient values higher than 0.90. Among these, 20 conformers with the lowest energy were selected as the representative structures. The calculation for TatA dimer was similar as above. In the final cycle, 10 lowest energy conformers with correlation coefficient values higher than 0.80 were selected to represent the dimeric structure. The atomic coordinates of monomeric and dimeric TatA have been deposited in the Protein Data Bank with accession code PDB 2MN7 and 2MN6 respectively.
Backbone Dynamics
The backbone 15N relaxation parameters including the longitudinal and transverse relaxation rates R1 and R2 were determined for both the wild-type TatA sample at high DPC concentration (DPR of 200) and TatA1–55 sample at low DPC concentration (DPR of 40). The experiments were performed on a Bruker Avance 800-MHz NMR spectrometer at 35°C. For the wild-type TatA sample at high DPC concentration, the delays used for the R1 experiments were 10 (×3), 100, 200, 300, 400, 600, 800, 1,000, 1,600, 2,400, 3,200 and 4,000 ms, and those used for the R2 experiments were 7.4 (×2), 14.8, 22.3, 29.7, 37.1, 52.0, 66.8, 89.0, 111.4 and 133.6 ms. For TatA1–55 sample at low DPC concentration, the delays used for the R1 experiments were 10 (×2), 100, 300, 600, 1,000, 1,600, 2,400 and 3,200 ms, and those used for the R2 experiments were 7.4 (×2), 14.8, 22.3, 29.7, 52.0 and 66.8 ms. The relaxation rate constants were obtained by fitting the peak intensities to a single exponential function using the nonlinear least-squares method as described [24].
Blue-Native PAGE Analysis
Blue-native PAGE of wild-type TatA and MCG-TatA was performed as described [25]. For this experiment, the TatA proteins were purified by Ni(II)-affinity chromatography similarly as the NMR sample, except that the detergent C12E9 (Anatrace) was used instead of AZ314 (Anatrace) and DPC. Subsequently, the purified protein was subjected to gel filtration on a Superdex 200 column using an ÄKTA FPLC system (Amersham Biosciences) in 20 mM MOPS, 200 mM NaCl, 0.1% C12E9, pH 7.2. Purified proteins were separated on a 5%–14% acrylamide gradient gel and stained by Coomassie brilliant blue.
Functional Complementation Assay
In vivo functional assay was carried out using the E. coli tatA gene knockout strain (CGSC: JW3813-1). The genes of wild type TatA (wt-TatA), MCG-TatA and a truncated version TatA1–55, were constructed into pET-21a plasmids (Novagen) and transformed into the ΔtatA strain. The sodium dodecyl sulfate (SDS) resistance phenotype of TatA variants was tested on LB agar plates containing 2% SDS [26]. All ΔtatA strains carrying plasmids with tatA variant genes were first grown in LB media with ampicillin and kanamycin at 35°C until OD600 reached 0.6 and induced with 20 mg/L isopropyl-β-D-thiogalactoside (IPTG) for 2 h. The culture was diluted to OD600 of 0.2 and tenfold dilution series were spotted on agar plates. The plates were anaerobically incubated at 35°C using an AnaeroPack system (Mitsubishi Gas Chemical, Tokyo, Japan) for 72 hours.
Results
Characterization of the Oligomeric State of TatA
The E. coli TatA protein was first characterized under a relatively high DPR of 300. Under this experimental condition, the sample shows a unique set of cross peaks in the 1H-15N heteronuclear single quantum coherence (HSQC) spectrum. By using 13C/15N-uniformly labeled TatA protein solubilized in dDPC micelles, we were able to obtain near complete backbone and side chain assignments for the full-length protein (Figure S1 in File S1). Secondary structural analysis by consensus chemical shifts index (CSI) demonstrates that the protein comprises two helices in the N-terminal region and an unstructured C-terminus (Figure S2 in File S1) [27], [28], which is in accordance with previously reported TatA structures [10], [12].
During sample preparation and optimization, we observed that changes in DPR would affect the oligomerization state of TatA. At a high DPR (>200), the HSQC spectrum shows one set of cross-peaks. When the DPR decreases, new sets of cross peaks appeared, while intensities of the original set gradually decreased (see Supporting Results and Discussion and Figure S3 in File S1 for more details). The appearance of new peaks was observed for residues in the TMH, the hinge region, as well as the N-terminal part of APH (residues Leu25-Gly33, designated as ‘proximal APH’ hereafter following a previous publication by Rodriguez et al [12]), suggesting these regions exist in two oligomeric states that are in slow exchange with each other. The observation that TatA oligomerization affects not only the TMH but also the APH region is consistent with the previous observation of a truncated version of TatA (residue 1–49) [12]. On the other hand, residues in the C-terminal part of APH (residues Ala34-Ser44, designated as ‘distal APH’ hereafter following a previous publication [12]) only show chemical shift perturbations but not appearance of new peaks, whereas the unstructured C-terminus is unaffected by reducing the DPR. For clarity, we designated the original set of cross peaks as the 1st set, and the new set of peaks as the 2nd set.
To overcome the problem of spectral overlap, we generated a truncated TatA mutant to remove the unstructured C-terminal region, which has been shown to be unessential for TatA function [29]. This mutant contains only residues Met1-Gln55, and is designated as TatA1–55 hereafter. To better characterize the oligomeric property of TatA, a TatA1–55 sample was prepared in which the DPC concentration adjusted to an appropriate level (DPR ∼70) so that the 1H-15N HSQC spectrum contains two sets of peaks (Figure 1A). Backbone 15N longitudinal and transverse relaxation rates were measured, and the R2/R1 ratios were calculated to analyze the oligomerization states of the two sets of peaks. As shown in Figure 1B, the average R2/R1 ratio of residues on the TMH region is 26 for the 1st set of signals, and 52 for the 2nd set. The data of the 1st set correlates well with a protein-micelle complex showing an average diameter of ∼51 Å, which can be fitted to a monomeric transmembrane helix surrounded by ∼54 DPC molecules. The R2/R1 ratio of the 2nd set corresponds to a diameter of 57 Å, which fits well to a dimeric model showing a ∼3 Å increase in radius (see Supporting Results and Discussion and Figure S6 in File S1 for more details).
Construction of a Stable TatA Dimer
To verify that the 2nd set of peaks corresponds to a dimeric form of TatA, we mutated residue Ile12 to cysteine in the TatA1–55 construct, and tagged the cysteine residue with MTSL for paramagnetic relaxation enhancement (PRE) analysis. For clarity, we abbreviate this sample as TatA1–55-I12C-MTSL hereafter. We then mixed equal molar of unlabeled TatA1–55-I12C-MTSL with 15N-labeled TatA1–55, and optimized the DPC concentration so that the HSQC spectrum contains two sets of peaks. The PRE effect was obtained by comparing the peak intensities in the PRE and reference spectra (see Materials and Methods). Our results showed that the inter-molecular PRE effect is evident on the 2nd set of peaks, while the 1st set is scarcely perturbed (Figure 1C, 1D). An experiment using the I11C mutation gave similar results (Figure 1D). These observations suggest that the 2nd set of peaks represents a condition in which two TatA subunits are closer in space, such as two TatA monomers trapped in one micelle. For the PRE experiment using the TatA1–55-I12C-MTSL sample, the general intensity reduction for the 2nd set of peaks is about 25%, which is in good agreement with the expected result based on a dimeric model, but not a trimetric or a tetrameric model (see Supporting Results and Discussion and Figure S8 in File S1 for more details). The observation that the PRE effect induced by the I11C mutation appears more significant than the I12C mutation could be explained by the fact that Ile11 is not located on the dimerization interface and the flexibility of the attached paramagnetic tag allows more populations to be affected (Figure S8 in File S1).
Finally, for preparation of a stabilized TatA dimer, we constructed a mutant protein in which the N-terminus of wild-type TatA was extended by three residues Met-Cys-Gly, named as MCG-TatA. This mutant protein was cross-linked together by a disulfide bond under oxidizing condition, generating a covalently linked TatA dimer, abbreviated as d-MCG-TatA (‘d’ designates dimer). The introduction of a disulfide bond at the flexible N-terminus brings two TatA molecules close together, while at the same time avoids perturbation at structured regions. The formation of the TatA dimer was validated by SDS-PAGE as well as NMR spectroscopy (Figure S9 in File S1). In addition, the formation of a disulfide bridge is confirmed by the Cβ chemical shift at ∼41 ppm for the cysteine residue in the tripeptide extension. The backbone 15N dynamics data of d-MCG-TatA showed similar R2/R1 values compared to the 2nd peak set of TatA1–55 (Figure S7 in File S1). The oligomerization pattern of MCG-TatA and d-MCG-TatA were both verified to be similar to the wild-type protein by blue native gel analysis (Figure 2A).
Moreover, we used a modified SDS resistance assay to test MCG-TatA function in vivo. The assay was designed by combining two previously reported growth tests of tat deletion mutants, namely the growth in the presence of SDS or under anaerobic atmosphere [26], to enhance the phenotype of ΔtatA in the presence of the tatA gene homolog tatE. We discovered that the ΔtatA bacterial strain can grow in the presence of SDS under aerobic condition or under anaerobic in the absence of SDS, but cannot grow on SDS plates incubated anaerobically. We therefore used this system to perform the functional complementation assay of wild-type TatA and its variants. As shown in Figure 2B, MCG-TatA is able to partially restore SDS resistance in E. coli ΔtatA strain under anaerobic growth condition, whereas its activity is lower compared to wt-TatA. However, MCG-TatA supports the bacterial growth better than the TatA1–55, which was suggested to retain the functional activity of TatA as previously reported under a different experimental condition [29]. Our current results suggest that the N-terminal tripeptide extension has less effect on the TatA activity than the truncation of C-terminal residues under the severe growth condition, and indicate that the d-MCG-TatA sample can be used as a mimic for characterizing the dimeric TatA structure.
Figure 3 shows the 2D 1H-15N HSQC spectrum of d-MCG-TatA in DPC micelles. Spectral comparisons between MCG-TatA, d-MCG-TatA and TatA (Figure S10 in File S1) indicate that d-MCG-TatA closely resemble the dimeric form (2nd peak set) of TatA. The chemical shift assignments were obtained for 96% of backbone amides except for proline residues. In comparison with the monomeric TatA, the cross peaks corresponding to the N-terminal of TatA (residue 1–44) are broadened and show chemical shift perturbation, while those in the C-terminal unstructured region are unaffected.
Structures of Monomeric and Dimeric TatA
Solution structures of monomeric and dimeric TatA were calculated based on NOE derived distance restraints. The atomic coordinates have been deposited in the Protein Data Bank (PDB code: 2MN7 and 2MN6). For the calculation of the dimeric structure, inter-molecular NOEs were determined by 3D 13C/15N-filtered NOESY experiments using d-MCG-TatA sample containing mixed 13C/15N-labeled and unlabeled protomers. A total of 75 unambiguous inter-subunit restraints were first identified and subsequently used to extend NOE assignments in 3D 13C, 15N-filtered-13C-edited NOESY and 3D 13C-edited NOESY spectra. Finally, 139 inter-subunit NOE restraints were identified and used in the calculation of dimeric TatA structures. Structural statistics for both monomeric and dimeric TatA are summarized in Table 1.
Table 1. Structural statistics for E. coli TatA.
Experimental restraints | monomer | dimer |
Total NOE | 1636 | 3477 |
Total unambiguous NOE | 1067 | 2204 |
Intra-residue | 451 | 902 |
Inter-residue | ||
Sequential (|i-j| = 1) | 328 | 656 |
Medium-range (|i-j|≤4) | 257 | 510 |
Long-range (|i-j|≥5) | 68 | 136 |
Total Ambiguous NOE | 569 | 1134 |
Intermolecular* | 139 | |
Total dihedral angle restraints (φ+ψ) | 65 (32+33) | 130 (64+66) |
Restraint Violations | ||
Distance restraint violations >0.5 Å | 1 | 4 (<0.6 Å) |
Dihedral angle restraint violations >5° | 0 | 1 (<5.5°) |
RMSD** from mean structure (Å) | ||
All heavy atoms | 1.140±0.262 | 1.254±0.381 |
All backbone atoms | 0.844±0.252 | 1.005±0.346 |
Secondary structure heavy (6–20, 24–42) | 1.084±0.258 | 1.222±0.364 |
Secondary structure backbone (6–20, 24–42) | 0.751±0.235 | 0.980±0.328 |
Ramachandran Statistics† (%) | ||
Most favored regions | 95.9 | 87.7 |
Additional allowed regions | 3.9 | 12.1 |
Generously allowed regions | 0.2 | 0.2 |
Disallowed regions | 0 | 0 |
*The total intermolecular NOEs include all unique and ambiguous NOEs identified from 3D 13C/15N-filtered 13C-edited NOESY-HSQC experiment and 13C-edited NOESY-HSQC experiment.
**The RMSD values were calculated for residues 5–44.
The values of Ramachandran statistics were calculated for residues 5–44. The C-terminal unstructured region is excluded from the analysis.
As shown in Figure 4A, the structure of the monomeric E. coli TatA consists of a 15-residue TMH (residues Ile6-Phe20), a 21-residue APH (residues Lys24-Met43) and a mostly unstructured C-terminal region. The two helices are linked by a highly conserved four-residue hinge F20G21T22K23, and pack into an L-shaped structure. The structure is generally similar to the E. coli TatA (T22P) structure previously reported [12]. When we align the two structures using backbone atoms from all structural regions (residues Ile6-Met43), the derived root mean square deviation (RMSD) of the Cα atoms is 2.5 Å. However, when aligning the TMH and APH separately, higher structural convergence is observed with Cα RMSD values of 0.6 Å and 1.6 Å, respectively. The subtle difference between the two structures resides in the relative orientation between the two helices and the kink at Gly33, which may be due to the one-residue mutation or differences in sample condition. Despite this variance, similar interaction network was observed among residues Val16, Val17, Phe20, Thr22/Pro22 and Leu25 at the hinge between the two helices.
As shown in Figure 4B, the TatA dimer is formed by the side-by-side packing of two TatA subunits. The structures of the two individual protomers are essentially similar to the monomeric TatA structure. The characteristic α-helical NOE patterns are preserved in both TMH and APH. The intra-molecular NOE contacts between the two helices, in particular the interaction network at the hinge region are the identical in monomeric and dimeric TatA. Furthermore, the RDC data of the d-MCG-TatA sample fit reasonably well with both the monomeric and dimeric structures, with correlation coefficient values R of 0.88 and 0.81 respectively, suggesting no significant changes in the TMH-APH orientation upon dimerization.
The TatA Oligomerization Interface
In the dimeric TatA structure, inter-subunit NOEs are observed for both the TMH and APH segments. Among these, two-thirds are contributed by residues in the TMH. The number of inter-subunit NOEs observed in the filtered-NOESY spectrum for residues in the APH is about half of that for the TMH, and the intensities of the NOE peaks for the APH are generally weaker than those in the TMH (∼40%). In addition, the NOE peaks for residues in the distal APH are weaker than those in the proximal APH. Considering the fact that residues in the distal APH show only peak shifts during the DPR-dependent TatA oligomerization as presented above, a possible scenario is that the TMH interactions are stronger and play a main role in TatA dimer formation, whereas the APH interactions are more dynamic and undergo fast exchanges between ‘contact’ and ‘off-contact’ states (Figure S12 in File S1). Moreover, residues in the hinge region and residue Gly33 in the APH show relatively weak intramolecular NOE signals possibly due to conformational exchanges. We were therefore unable to detect intermolecular NOEs for these residues although they reside on the dimeric interface.
The interface between two TMHs is formed by residues Leu9, Ile12, Val16 and Leu19 from one subunit, and residues Leu10, Val14, Val17 and Leu18 from the other (Figure 5A, 5C). Residues Ile12 and Val14 are located in the center of the TMH contacting surface and are close to each other, which is consistent with previous EPR observations [14]. In a recent NMR study of the E. coli TatA structure, a 9-mer TatA complex was modeled based on the monomeric TatA-T22P mutant structure [12]. To compare the dimerization pattern, we extracted two adjacent protomers from the 9-mer TatA complex (PDB: 2LZS) and aligned it with our dimeric structure. The interface between two TMHs is generally identical between the two structures, which is comprised mainly of hydrophobic residues Leu9, Ile12, Val14, Val16, Leu18 and Phe20 (Figure 5D). Most residues identified to display inter-subunit NOE signals, including Leu9, Ile12, Val14, Val16, Leu18 and Phe20, are the same in both structures. However, the two structures display subtle difference in the relative orientations between neighboring TMHs. This could probably be attributed to the different procedures used for building the models. It is also possible that the TMH orientations could be fine-tuned in correspondence to TatA oligomers of various sizes.
On the other hand, we also identified inter-subunit NOEs in the APH segment by using 13C/15N-filtered NOESY experiments. In contrast to the TMH, inter-APH NOE cross-peaks are generally weaker, and have not been observed in the previous studies [12]. The inter-APH NOE contacts are observed between residues Ile28, Leu32, Ser35, Ile36 and Phe39 from one subunit, and residues Leu25, Ser30, Gly33, Ala34, Lys37 and Lys40 from the other (Figure 6A-C). Among these, the long-chain hydrophobic residues Ile28, Leu32 and Ile36 are aligned on one side of the APH, forming a highly hydrophobic strip. On the opposite side of the helix, three glycine residues (Gly26, Gly29 and Gly33) align into another strip. These two strips of residues pack side by side at the dimeric interface (Figure 6E).
Functionality of TatA has been demonstrated to crucially depend on the APH segment, since many cysteine substitutions abolished Tat activity [15], [30], [31]. These mutations may affect either TatA self-assembly or TatA-substrate interactions. In particular, single point mutations at the dimeric interface including G29C, G33S, G33A, G33D, I28T, L32C, I36T and I36C have been reported to abolish the Tat function [15], [31], [32], whereas non-charged residues such as Ser27, Ala34 and Ser35 that do not contribute to the APH assembly appear non-essential. Moreover, cross-linking experiments have shown that G33S and G33D mutations significantly affect TatA self-assembly. The G33D mutation showed no cross-linking activity at all and an electrostatic repulsion by this mutation was suggested [32].
The Hydrophilic Surface is Extended upon Dimerization
The structure of TatA shows a clear segregation of hydrophobic and hydrophilic residues. The TMH segment is highly hydrophobic. The APH segment contains a relatively hydrophobic side that faces the TMH and a hydrophilic side that is exposed to the aqueous solution. The exposed hydrophilic surface comprises most of the charged residues including Ser27, Ser30, Asp31, Lys37 and Lys41 from APH, as well as residues Lys23-Lys24 from the hinge. In the dimeric form of TatA, the hydrophilic surfaces of two protomers are positioned side-by-side, and form a larger highly charged area as shown in Figure 6D. This area displays an electrostatic potential pattern with negative charges in the center, sandwiched by two positively charged zones.
Previous functional studies suggested that a charged surface may be essential for translocation of well-folded proteins, since cysteine mutations of Lys23, Lys24, Asp31, Lys37 and Lys40 or glutamine substitutions of three lysine residues (Lys37, Lys40 and Lys41) all abolished protein transport by the Tat system [15]. A possible scenario is that upon formation of higher oligomeric states, this charged surface is more significantly extended, and better facilitate interactions with substrate proteins that are fully folded and contain generally hydrophilic surfaces.
Discussion
The mechanism of TatA self-assembly and its role in protein transport has long been the center of research interest. Two major models for TatA mechanism have been proposed based on a variety of biochemical, structural and theoretical data emerged in recent years, the observations and interpretations of which are sometimes controversial [33]. The ‘trap-door’ or ‘pore-forming’ model that gained earlier popularity is based on the observed pore structures by cryo-EM and the possibility of APH to flip through the lipid bilayer, suggesting a protein translocation pore with hydrophilic interior formed by the APH [3], [34]. The ‘membrane-weakening’ model suggests TatA does not form pores of different sizes but aggregate to disrupt local lipid-bilayer structures, resulting in the thinning of the lipid and facilitating protein translocation [35]. This model become more favorable recently, and was supported by a number of experimental evidences [11], [36]. In both models, although the role of TMH in mediating TatA oligomerization is better characterized by multiple experimental techniques [12], [14], the role of the APH region in TatA assembly remains less well understood.
The NMR and molecular dynamics studies by Rodriguez et al support the ‘membrane-weakening’ model [12]. In their 9-mer TatA structure, the APH was placed on the outer side of the TMH ring. A model was postulated in which the APH contacts the substrate protein and is pulled into the lipid bilayer during translocation so that the angle between TMH and APH increases. Although no APH interactions were observed in the 9-mer TatA structure, as the number of TatA subunits increases and as the APH is pulled into the bilayer, the distance between neighboring APH becomes closer. At this stage, the weak and dynamic interaction between adjacent APHs may come to play an important role in holding multiple APHs together to form a continuous and charged surface. This could increase the interaction surface between TatA complex and fully-folded substrate proteins, and may help in substrate protein quality control or sealing of the pore to prevent leakage of small molecule, but remains to be further investigated.
Recently, Walther et al introduced a ‘charge zipper mechanism’ by a simulated model in which the APH and the unstructured charged C-terminus form complementary charge interactions to allow TatA assembly [37]. This model was primarily based on Bacillus subtilis TatAd structure, and is more in line with the ‘pore-forming’ model. The APH segments, however, are not in close contact with each other based on this model. Moreover, the primary sequences of TatA from E. coli and other Gram-negative bacteria are somewhat different from the Gram-positive B. subtilis TatA sequences. In particular, the highly conserved negative charges that were suggested to be important for this ‘charge zipper’ is located immediately after the last turn of the APH in E. coli TatA sequence (residues D45D46E47), whereas in B. subtilis TatAd and TatAy sequences the negatively charged motifs are separated from the APH by a 4 or 6-residue loop. In the absence this connecting loop, it would be difficult for the negatively charged sequence of E. coli TatA to fold back and form the zipper conformation without unwinding the last turn of the APH. On the other hand, mutagenesis studies demonstrated this charged D45D46E47 sequence is essential for TatA function, whereas its exact role remains unclear [38]. In our current study, we observed intermolecular contact between two APHs, but not between APH and the charged C-terminus. However, one intriguing observation is that the truncation of the C-terminal region appears to promote DPC-dependent aggregation of TatA, suggesting a negative regulation on TatA oligomerization by the C-terminus, which remains to be further investigated.
Conclusion
In summary, by using a combination of state of the art solution NMR and protein engineering techniques, we constructed a stable TatA dimer and observed more extensive inter-subunit contacts in not only the TMH region, but also the APH region. The dimeric TatA structure offers a more detailed atomic picture of TatA oligomerization interface and provides clues for further investigation.
Supporting Information
Acknowledgments
All NMR data were collected at Beijing NMR Center (BNMRC), Peking University. We thank Dr. Hongwei Li from BNMRC for his efforts in this project. We thank Prof. Jiyan Ma from Ohio State University for critical reading and discussions on the manuscript.
Data Availability
The authors confirm that all data underlying the findings are fully available without restriction. All coordinate files are available from the PDB database accession numbers 2MN7, 2MN6.
Funding Statement
This work is supported by National Natural Science Foundation of China Grant 31070649 and the Grant for Innovative Methods 2010IM030700 from China Ministry of Science and Technology (to C.J.). Y.H. acknowledges a special support from China Postdoctoral Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Sargent F (2007) The twin-arginine transport system: moving folded proteins across membranes. Biochem Soc Trans 35: 835–847. [DOI] [PubMed] [Google Scholar]
- 2. Natale P, Bruser T, Driessen AJ (2008) Sec- and Tat-mediated protein secretion across the bacterial cytoplasmic membrane—distinct translocases and mechanisms. Biochim Biophys Acta 1778: 1735–1756. [DOI] [PubMed] [Google Scholar]
- 3. Gohlke U, Pullan L, McDevitt CA, Porcelli I, de Leeuw E, et al. (2005) The TatA component of the twin-arginine protein transport system forms channel complexes of variable diameter. Proc Natl Acad Sci U S A 102: 10482–10486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Dabney-Smith C, Mori H, Cline K (2006) Oligomers of Tha4 organize at the thylakoid Tat translocase during protein transport. J Biol Chem 281: 5476–5483. [DOI] [PubMed] [Google Scholar]
- 5. Leake MC, Greene NP, Godun RM, Granjon T, Buchanan G, et al. (2008) Variable stoichiometry of the TatA component of the twin-arginine protein transport system observed by in vivo single-molecule imaging. Proc Natl Acad Sci U S A 105: 15376–15381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Bolhuis A, Mathers JE, Thomas JD, Barrett CM, Robinson C (2001) TatB and TatC form a functional and structural unit of the twin-arginine translocase from Escherichia coli . J Biol Chem 276: 20213–20219. [DOI] [PubMed] [Google Scholar]
- 7. Orriss GL, Tarry MJ, Ize B, Sargent F, Lea SM, et al. (2007) TatBC, TatB, and TatC form structurally autonomous units within the twin arginine protein transport system of Escherichia coli . FEBS Lett 581: 4091–4097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Tarry MJ, Schafer E, Chen S, Buchanan G, Greene NP, et al. (2009) Structural analysis of substrate binding by the TatBC component of the twin-arginine protein transport system. Proc Natl Acad Sci U S A 106: 13284–13289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Palmer T, Berks BC (2012) The twin-arginine translocation (Tat) protein export pathway. Nat Rev Microbiol 10: 483–496. [DOI] [PubMed] [Google Scholar]
- 10. Hu Y, Zhao E, Li H, Xia B, Jin C (2010) Solution NMR structure of the TatA component of the twin-arginine protein transport system from gram-positive bacterium Bacillus subtilis . J Am Chem Soc 132: 15942–15944. [DOI] [PubMed] [Google Scholar]
- 11. Walther TH, Grage SL, Roth N, Ulrich AS (2010) Membrane alignment of the pore-forming component TatA(d) of the twin-arginine translocase from Bacillus subtilis resolved by solid-state NMR spectroscopy. J Am Chem Soc 132: 15945–15956. [DOI] [PubMed] [Google Scholar]
- 12. Rodriguez F, Rouse SL, Tait CE, Harmer J, De Riso A, et al. (2013) Structural model for the protein-translocating element of the twin-arginine transport system. Proc Natl Acad Sci U S A 110: E1092–1101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Chan CS, Haney EF, Vogel HJ, Turner RJ (2011) Towards understanding the Tat translocation mechanism through structural and biophysical studies of the amphipathic region of TatA from Escherichia coli . Biochim Biophys Acta 1808: 2289–2296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. White GF, Schermann SM, Bradley J, Roberts A, Greene NP, et al. (2010) Subunit organization in the TatA complex of the twin arginine protein translocase: a site-directed EPR spin labeling study. J Biol Chem 285: 2294–2301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Greene NP, Porcelli I, Buchanan G, Hicks MG, Schermann SM, et al. (2007) Cysteine scanning mutagenesis and disulfide mapping studies of the TatA component of the bacterial twin arginine translocase. J Biol Chem 282: 23937–23945. [DOI] [PubMed] [Google Scholar]
- 16. Battiste JL, Wagner G (2000) Utilization of site-directed spin labeling and high-resolution heteronuclear nuclear magnetic resonance for global fold determination of large proteins with limited nuclear overhauser effect data. Biochemistry 39: 5355–5365. [DOI] [PubMed] [Google Scholar]
- 17. Sattler M, Schleucher J, Griesinger C (1999) Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Prog Nucl Magn Reson Spectros 34: 93–158. [Google Scholar]
- 18. Lorieau J, Yao L, Bax A (2008) Liquid crystalline phase of G-tetrad DNA for NMR study of detergent-solubilized proteins. J Am Chem Soc 130: 7536–7537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Ottiger M, Delaglio F, Bax A (1998) Measurement of J and dipolar couplings from simplified two-dimensional NMR spectra. J Magn Reson 131: 373–378. [DOI] [PubMed] [Google Scholar]
- 20. Zweckstetter M (2008) NMR: prediction of molecular alignment from structure using the PALES software. Nat Protoc 3: 679–690. [DOI] [PubMed] [Google Scholar]
- 21. Dosset P, Hus JC, Marion D, Blackledge M (2001) A novel interactive tool for rigid-body modeling of multi-domain macromolecules using residual dipolar couplings. J Biomol NMR 20: 223–231. [DOI] [PubMed] [Google Scholar]
- 22. Cornilescu G, Delaglio F, Bax A (1999) Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J Biomol NMR 13: 289–302. [DOI] [PubMed] [Google Scholar]
- 23. Schwieters CD, Kuszewski JJ, Tjandra N, Clore GM (2003) The Xplor-NIH NMR molecular structure determination package. J Magn Reson 160: 65–73. [DOI] [PubMed] [Google Scholar]
- 24. Fushman D, Cahill S, Cowburn D (1997) The main-chain dynamics of the dynamin pleckstrin homology (PH) domain in solution: analysis of 15N relaxation with monomer/dimer equilibration. J Mol Biol 266: 173–194. [DOI] [PubMed] [Google Scholar]
- 25. Schagger H, von Jagow G (1991) Blue native electrophoresis for isolation of membrane protein complexes in enzymatically active form. Anal Biochem 199: 223–231. [DOI] [PubMed] [Google Scholar]
- 26. Stanley NR, Findlay K, Berks BC, Palmer T (2001) Escherichia coli strains blocked in Tat-dependent protein export exhibit pleiotropic defects in the cell envelope. J Bacteriol 183: 139–144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Wishart DS, Sykes BD (1994) The 13C chemical-shift index: a simple method for the identification of protein secondary structure using 13C chemical-shift data. J Biomol NMR 4: 171–180. [DOI] [PubMed] [Google Scholar]
- 28. Schwarzinger S, Kroon GJ, Foss TR, Chung J, Wright PE, et al. (2001) Sequence-dependent correction of random coil NMR chemical shifts. J Am Chem Soc 123: 2970–2978. [DOI] [PubMed] [Google Scholar]
- 29. Lee PA, Buchanan G, Stanley NR, Berks BC, Palmer T (2002) Truncation analysis of TatA and TatB defines the minimal functional units required for protein translocation. J Bacteriol 184: 5871–5879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Barrett CM, Mangels D, Robinson C (2005) Mutations in subunits of the Escherichia coli twin-arginine translocase block function via differing effects on translocation activity or tat complex structure. J Mol Biol 347: 453–463. [DOI] [PubMed] [Google Scholar]
- 31. Barrett CML, Mathers JE, Robinson C (2003) Identification of key regions within the Escherichia coli TatAB subunits. FEBS Letters 537: 42–46. [DOI] [PubMed] [Google Scholar]
- 32. Hicks MG, Lee PA, Georgiou G, Berks BC, Palmer T (2005) Positive selection for loss-of-function tat mutations identifies critical residues required for TatA activity. J Bacteriol 187: 2920–2925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Patel R, Smith SM, Robinson C (2014) Protein transport by the bacterial Tat pathway. Biochim Biophys Acta. In press. [DOI] [PubMed]
- 34. Gouffi K, Gerard F, Santini CL, Wu LF (2004) Dual topology of the Escherichia coli TatA protein. J Biol Chem 279: 11608–11615. [DOI] [PubMed] [Google Scholar]
- 35. Bruser T, Sanders C (2003) An alternative model of the twin arginine translocation system. Microbiol Res 158: 7–17. [DOI] [PubMed] [Google Scholar]
- 36. Fröbel J, Rose P, Müller M (2011) Early Contacts between Substrate Proteins and TatA Translocase Component in Twin-arginine Translocation. J Biol Chem 286: 43679–43689. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Walther TH, Gottselig C, Grage SL, Wolf M, Vargiu AV, et al. (2013) Folding and Self-Assembly of the TatA Translocation Pore Based on a Charge Zipper Mechanism. Cell 152: 316–326. [DOI] [PubMed] [Google Scholar]
- 38. Warren G, Oates J, Robinson C, Dixon AM (2009) Contributions of the Transmembrane Domain and a Key Acidic Motif to Assembly and Function of the TatA Complex. J Mol Biol 388: 122–132. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The authors confirm that all data underlying the findings are fully available without restriction. All coordinate files are available from the PDB database accession numbers 2MN7, 2MN6.