Abstract
Coiled coils are the best-understood protein fold, as their backbone structure can uniquely be described by parametric equations. This level of understanding has allowed their manipulation in unprecedented detail. They do not seem a likely source of surprises, yet we describe here the unexpected formation of a new type of fiber by the simple insertion of two or six residues into the underlying heptad repeat of a parallel, trimeric coiled coil. These insertions strain the supercoil to the breaking point, causing the local formation of short β-strands, which move the path of the chain by 120° around the trimer axis. The result is an α/β coiled coil, which retains only one backbone hydrogen bond per repeat unit from the parent coiled coil. Our results show that a substantially novel backbone structure is possible within the allowed regions of the Ramachandran space with only minor mutations to a known fold.
DOI: http://dx.doi.org/10.7554/eLife.11861.001
Research Organism: None
eLife digest
Proteins are made up of building blocks called amino acids. Groups of amino acids within the protein can then fold into three-dimensional shapes, one of the most common being a helical structure known as an α-helix. Two or more α-helices may be wound around each other to form a bundle called a coiled coil, which is found in many proteins. Each complete turn of an α-helix contains a set number of amino acids, but the number of amino acids in the turns of a coiled coil can vary. The most common pattern in a coiled coil has 7 amino acids over two turns, which is known as a heptad repeat.
When amino acids are added into or deleted from the heptad repeats, the number of amino acids in the turns of a coiled coil changes. However, it cannot increase too far beyond the number of amino acids in each turn of a normal α-helix because there is a limit to the amount of coiling that the helices can tolerate. Many naturally occurring coiled coils have regions where the overall α-helical structure is retained, even though there are small sections where the number of amino acids in a turn is disrupted. This may be due to insertions of small numbers of amino acids. Although the impact of some insertions (e.g. three or four at a time) has been studied, the effect of inserting other amounts of amino acids was not clear.
Hartmann et al. investigated what would happen when two or six amino acids were inserted into the heptad repeats of a coiled coil within a protein from bacteria. These numbers of amino acids have been predicted to cause the greatest strain on the coiled coil structure. The experiments show that inserting these numbers of amino acids caused so much strain that the three α-helices making up the coiled coil break apart and refold into a completely different type of structure called a β-strand. The three short β-strands then associate into a triangular structure that Hartmann et al. named a β-layer.
Further experiments showed that inserting the same numbers of amino acids into the heptad repeats of other coiled coil proteins also resulted in the formation of β-layers. Hartmann et al.’s findings suggest that the alternating α-helix and β-strand structures may help to make the proteins stronger and enable them to carry out more versatile roles in cells.
Introduction
α-Helical coiled coils are ubiquitous protein domains, found in a wide range of structural and functional contexts (Lupas, 1996). They were the first protein fold described in atomic detail (Crick, 1953b) and are also the only one whose backbone structure can be computed with parametric equations (Crick, 1953a), placing them at the forefront of protein design efforts (Huang et al., 2014; Joh et al., 2014; Thomson et al., 2014; Woolfson, 2005).
The structure of coiled coils is understood at a level unrivaled by any other fold. They consist of at least two α-helices, wound into superhelical bundles and held together by a mostly hydrophobic core. In their most prevalent form they follow a heptad sequence repeat pattern. The seven positions in a heptad are labeled a – g, where positions a and d are oriented towards the core of the bundle and are thus mostly hydrophobic. Beyond the heptad repeat, a range of other periodicities is accessible to coiled coils, which is only restrained by the periodicity of the unperturbed α-helix (Gruber and Lupas, 2003). This restraint is responsible for the supercoiling of the bundle: As an ideal, straight α-helix has a periodicity of about 3.63 residues per turn, the heptad coiled coil has a left-handed twist to reduce the periodicity to 3.5 residues per turn with respect to the bundle axis. In hendecad coiled coils, the situation is reversed: 11 residues are accommodated in 3 helical turns, resulting in 11/3 = 3.67 residues per turn. As this is slightly above 3.63, hendecads are slightly right-handed. With the periodicity of pentadecad coiled coils, 15/4 = 3.75 residues per turn, right-handedness is as pronounced as left-handedness is in heptad coiled coils.
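The relation between periodicity and supercoil handedness described above can be checked with a few lines of arithmetic. The following is a minimal sketch rather than part of the original analysis: the function name `supercoil` is hypothetical, and treating 3.63 residues per turn as a sharp threshold is an assumption made for the example.

```python
from fractions import Fraction

# Periodicity of an ideal, straight alpha-helix (residues per turn), as quoted in the text.
# Using it as a sharp threshold is an assumption of this sketch.
STRAIGHT_HELIX = Fraction(363, 100)

def supercoil(residues, turns):
    """Return (periodicity, handedness) for a repeat of `residues` over `turns` helical turns."""
    periodicity = Fraction(residues, turns)
    if periodicity < STRAIGHT_HELIX:
        handedness = "left-handed"
    elif periodicity > STRAIGHT_HELIX:
        handedness = "right-handed"
    else:
        handedness = "straight"
    return float(periodicity), handedness

# Heptad, hendecad and pentadecad repeats discussed in the text.
for name, residues, turns in [("heptad", 7, 2), ("hendecad", 11, 3), ("pentadecad", 15, 4)]:
    periodicity, handedness = supercoil(residues, turns)
    print(f"{name}: {residues}/{turns} = {periodicity:.2f} residues per turn, {handedness}")
```

The deviation from 3.63 is about 0.13 for the heptad and 0.12 for the pentadecad, which is the sense in which the two are comparably supercoiled in opposite directions.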
Many naturally occurring coiled coils contain transitions between segments of different periodicity (Alvarez et al., 2010; Hartmann et al., 2014) or harbor discontinuities that retain the α-helical structure, but perturb the periodicity locally (Parry, 2014). The best understood discontinuities are insertions of 3 or 4 residues, which are close to the periodicity of 3.63 of α-helices (Brown et al., 1996; Hicks et al., 2002; Lupas and Gruber, 2005). The insertion of 3 residues is termed a stammer, the insertion of 4 residues a stutter. With 3 residues being less than one full turn of a helix, stammers lead to a local decrease in periodicity and an increase of left-handedness. Stutters have the opposite effect. Inserted into a heptad coiled coil, a stutter can locally extend one heptad to form a hendecad (7 + 4 = 11 -> 11/3) or, being delocalized over multiple heptads, lead to even higher periodicities like 18 residues over 5 turns (7 + 7 + 4 = 18 -> 18/5). Other periodicities can be brought about by the insertion of multiple stammers or stutters (e.g. 7 + 4 + 4 = 15 -> 15/4). These relationships are illustrated in Figure 1, which shows the effects on coiled-coil periodicity resulting from consecutive insertions of stammers (blue lines) and stutters (green lines), and from their progressive delocalization (red lines).
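The insertion arithmetic in this paragraph can be reproduced with a short sketch. It assumes, for illustration only, that each heptad spans two helical turns and that each inserted stammer or stutter adds one further turn; this reproduces the worked examples in the text, but the function name `periodicity` and its signature are hypothetical.

```python
from fractions import Fraction

def periodicity(heptads, stammers=0, stutters=0):
    """Residues per turn for heptads extended by stammers (+3) and stutters (+4).

    Assumption of this sketch: a heptad spans two helical turns and every
    inserted stammer or stutter contributes one additional turn.
    """
    residues = 7 * heptads + 3 * stammers + 4 * stutters
    turns = 2 * heptads + stammers + stutters
    return Fraction(residues, turns)

examples = {
    "one stutter in one heptad": periodicity(1, stutters=1),
    "one stutter over two heptads": periodicity(2, stutters=1),
    "two stutters in one heptad": periodicity(1, stutters=2),
    "one stammer in one heptad": periodicity(1, stammers=1),
}
for label, p in examples.items():
    print(f"{label}: {p.numerator}/{p.denominator} = {float(p):.2f}")
```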
However, there are limits to the periodicities coiled coils can assume, imposed by the degree of supercoiling the constituent helices can tolerate. The insertion of a stammer into a heptad coiled coil, leading locally to a periodicity of 10/3 = 3.33, was predicted to cause an overwinding of the helices (Brown et al., 1996). We could verify this experimentally: the structure of a stammer showed that the local overwinding introduced sufficient strain to cause the formation of a short 3₁₀-helical segment (Hartmann et al., 2009). We therefore assume that 3.33 (10/3) residues per turn mark the lower limit for periodicities. As this is about 0.3 residues per turn less than the periodicity of a perfectly straight helix, one might expect the upper limit at a periodicity of about 3.9. In fact, the vast majority of known coiled-coil structures deviating from the heptad repeat have periodicities higher than 3.5 and the most extreme example is found in the trimeric autotransporter YadA, which has a local periodicity of 3.8 (19/5) (Alvarez et al., 2010).
In contrast to stammers and stutters, accommodating insertions of 1 or 5 residues is more demanding for the bundle. According to Figure 1 they have to be delocalized over more than one heptad, as periodicities of 4.0 ((7+1)/2) or 2.67 ((7+1)/3) do not fall into the accessible range, and neither do 2.5 ((0+5)/2), 4.0 ((7+5)/3) or 3.0 ((7+5)/4). To retain α-helical structure, insertions of both 1 and 5 residues have to be delocalized over at least two heptads, leading to periodicities of 3.75 (15/4) and 3.8 (19/5), respectively. Interestingly, these periodicities can also be brought about by the insertion of 2 (15/4) and 3 (19/5) consecutive stutters. Alternatively, insertions of 1 residue (skip residues) can be accommodated by the local formation of a π-turn in the α-helix, leaving the remaining coiled coil largely unperturbed (Lupas, 1996).
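The reasoning for insertions of 1 and 5 residues can be sketched as a small enumeration: for a given insertion spread over a given number of heptads, list every residues-per-turn ratio and keep those inside the accessible range. The bounds used here are read from the text (10/3 as the lower limit, "about 3.9" as the expected upper limit); treating them as hard cutoffs, and the function names, are assumptions of this illustration.

```python
from fractions import Fraction

# Limits read from the text; using them as hard cutoffs is an assumption.
LOWER = Fraction(10, 3)
UPPER = Fraction(39, 10)

def accessible(insert, heptads):
    """Periodicities within [LOWER, UPPER] for `insert` residues spread over `heptads` heptads."""
    residues = 7 * heptads + insert
    found = []
    for turns in range(1, residues + 1):
        p = Fraction(residues, turns)
        if LOWER <= p <= UPPER:
            found.append(p)
    return found

def minimal_delocalization(insert, max_heptads=5):
    """Smallest number of heptads over which the insertion gives an accessible periodicity."""
    for heptads in range(0, max_heptads + 1):
        hits = accessible(insert, heptads)
        if hits:
            return heptads, hits
    return None

for insert in (1, 3, 4, 5):
    heptads, hits = minimal_delocalization(insert)
    ratios = ", ".join(f"{p.numerator}/{p.denominator}" for p in hits)
    print(f"insertion of {insert}: {heptads} heptad(s), periodicity {ratios}")
```

Under these assumptions, stammers and stutters fit within a single heptad (10/3 and 11/3), whereas insertions of 1 and 5 residues first become accessible over two heptads (15/4 and 19/5), in line with the paragraph above.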
Still missing for a complete picture of coiled-coil periodicities is the understanding of insertions of 2 and 6 residues, which should cause the greatest strain on α-helical geometry. We find that they indeed break the α-helices to form short β-strands, which associate into a triangular supersecondary structure we name the β-layer. β-Layers are found, also repetitively, in natural coiled coils, where they form regular fibers with alternating α- and β-structure, a protein fold that has not been described so far.
Results and discussion
A β-layer in the coiled-coil stalk of Actinobacillus OMP100
We have a long-standing interest in trimeric autotransporter adhesins (TAA), fibrous proteins of the Gram-negative bacterial surface (Bassler et al., 2015; Hartmann et al., 2012; Hoiczyk et al., 2000; Szczesny and Lupas, 2008), whose domains we routinely fuse to stabilizing adaptor coiled coils for biochemical and biophysical study (Deiss et al., 2014; Hernandez Alvarez et al., 2008). In the process, we have repeatedly gained insights into aspects of coiled-coil structure (Alvarez et al., 2010; Grin et al., 2014; Hartmann et al., 2012; 2009; Leo et al., 2011), such as for example into a recurrent polar motif of the hydrophobic core (the N@d layer), in which asparagines in position d of the core coordinate anions at their center (Hartmann et al., 2009). As part of that study, we identified a putative TAA in Actinobacillus actinomycetemcomitans, OMP100, which carries insertions of 2 and of 3 residues within the heptad repeats of its stalk. The insertion of 2 residues extends a heptad to the 9-residue motif IENKADKAD and occurs between three N-terminal and two C-terminal heptads carrying N@d layers; the insertion of 3 residues is directly downstream. This observation was highly puzzling, since the heptad register of the protein could be assigned with great confidence, leaving no doubt that an insertion of 2 residues had occurred, but this insertion could not be explained by coiled-coil theory. For structural characterization, we therefore expressed residues 133Q-198K, covering the two insertions and the five N@d layers, fused N- and C-terminally to the trimeric form of the GCN4 leucine zipper, GCN4-pII (Table 1). The construct yielded a typical α-helical CD spectrum and, upon heating, unfolded cooperatively with a transition midpoint at 91°C. We obtained crystals in space group C2 that diffracted to a resolution of 2.3 Å, with one symmetric trimer in the asymmetric unit. The structure showed a continuous heptad coiled coil with two discontinuities (Figure 2). 
As expected, the insertion of 3 residues C-terminal to the N@d layers led to the formation of a decad, with a short 3₁₀-helical segment, as for the stammer we had described previously (Hartmann et al., 2009).
Table 1.
Construct | Protein sequence | Final buffer |
---|---|---|
OMP100 | (GCN4-pII)N-IQNVDVR STENAAR SRANEQK IAENKKA IENKADKAD VEKNRAD IAANSRA IATFRSSSQN IAALTTK-(GCN4-pII)c-KLHHHHHH | 20 mM Tris pH 7.5, 400 mM NaCl, 5% Glycerol |
Tcar0761 | (GCN4-N16V)N-ITLMQAN –––MATKDD LARMATKDD IANMATKDD IANMATKDD IAKLDVK IENLNTK-(GCN4-N16V)c-GSGHHHHHH | 20 mM MOPS pH 7.2, 500 mM NaCl, 5% Glycerol, 2 M Urea |
T6 | (6xH-TEV)-(GCN4-N16V)N-MATKDD-(GCN4-N16V)c | 20 mM HEPES pH 7.4, 50 mM NaCl, 5% Glycerol, 1 M Urea |
T9 | (6xH-TEV)-(GCN4-N16V)N-MATKDDIAN-(GCN4-N16V)c | 20 mM HEPES pH 7.4, 50 mM NaCl, 5% Glycerol, 1 M Urea |
A6 | (6xH-TEV)-(GCN4-N16V)N-IENKAD-(GCN4-N16V)c | 20 mM HEPES pH 7.4, 50 mM NaCl, 5% Glycerol, 1 M Urea |
A7 | (6xH-TEV)-(GCN4-N16V)N-IENKKAD-(GCN4-N16V)c | 20 mM HEPES pH 7.4, 50 mM NaCl, 5% Glycerol, 1 M Urea |
A9 | (6xH-TEV)-(GCN4-N16V)N-IENKADKAD-(GCN4-N16V)c | 20 mM HEPES pH 7.4, 50 mM NaCl, 5% Glycerol, 1 M Urea |
A9b | (6xH-TEV)-(GCN4-N16V)N-IANKEDKAD-(GCN4-N16V)c | 20 mM HEPES pH 7.5, 50 mM NaCl, 10% Glycerol, 1 M Urea |
(GCN4-pII)N MKQIEDKIEEILSKIYHIENEIARIKKL.
(GCN4-pII)C MKQIEDKIEEILSKIYHIENEIARIKKLI.
(GCN4-N16V)N MKQLEMKVEELLSKVYHLENEVARLKKL.
(GCN4-N16V)C MKQLEWKVEELLSKVYHLENEVARLKKLV.
(6xH-TEV) MKHHHHHHPMSDYDIPTTENLYFQGH.
However, the insertion of 2 residues led to a sharp break in the coiled coil: In the middle of the IENKADKAD motif, the three chains of the trimer cross each other to form a triangular plane perpendicular to the coiled-coil axis. Thereby only the central three residues of the motif, KAD, deviate from α-helical structure (Figure 2). All three fall into the β region of the Ramachandran plot, but only the central residue, alanine, forms backbone hydrogen bonds with the alanine residues of the other chains. We call this structural element a β-layer. It is essentially the same β-layer we described as an adaptor between α-helical and β-stranded segments of TAAs (Hartmann et al., 2012). Here it directly connects two α-helical segments, where the C-terminal one is rotated counterclockwise by ~120° around the trimer axis, as viewed from the N-terminus (Figure 2).
The first three residues of the IENKADKAD sequence motif occupy heptad positions a, b and c of the N-terminal α-helical segment, the last three residues positions e, f and g of the C-terminal segment. Therefore the β-layer, formed by the three central residues KAD, occurs in place of position d. The two segments are stabilized in their relative orientation by backbone hydrogen bonds from the last (c position) residue of each N-terminal helix to the first (e position) residue in the C-terminal helix of the neighboring chain (Figure 2C). This extends the continuous backbone hydrogen-bond network of each α-helix across the chains.
The nature of the discontinuity represented by this β-layer is related to the nature of stammers, but its effects are much stronger. With the insertion of 3 residues, stammers place a major strain on the conformation of the constituent helices of the coiled coil. In all examples to date, the resulting overwinding of the helices is absorbed by a short 3₁₀-helical segment. While these stammers are best described as part of a decad, the β-layer in OMP100 occurs in a motif of nine residues, a nonad. As the demands of a nonad on its helices would be even more extreme than those of a decad, the strategy for its accommodation is a local but complete departure from helical structure.
β-layers in GCN4 fusions
Given the structural simplicity of β-layers, we wondered whether these could be brought about more generally by insertions of 2 residues into heptad coiled coils. Furthermore we wondered whether insertions of 6 residues, which pose similar demands on the coiled coil (Figure 1), also lead to the formation of β-layers. To tackle these questions experimentally, we designed a set of constructs that had either 6 or 9 residues inserted between two consecutive GCN4 N16V adaptors, based on two different sequence motifs (Figure 3, Table 1). One motif is IENKADKAD from Actinobacillus OMP100. The other, MATKDDIAN, is from a second family of prokaryotic coiled-coil proteins that we found to contain nonads and related periodicities; it occurs for example in 14 consecutive repeats in the protein Tcar0761 of Thermosinus carboxydivorans. From Actinobacillus OMP100 we derived the constructs A9 with the full IENKADKAD motif and A6 with the shortened motif IENKAD, as well as the ‘control’ construct A7 with the 7-residue motif IENKKAD. From Thermosinus Tcar0761 we derived the constructs T9, with the full MATKDDIAN motif, and T6, with the shortened motif MATKDD. The GCN4 N16V variant can form both dimeric and trimeric coiled coils and was chosen for these constructs to test for the oligomerization specificity of the inserts. All five constructs were resistant to proteolysis by proteinase K, showed typical α-helical CD spectra, did not melt upon heating to 95°C and yielded well-diffracting crystals. The structures of all constructs were trimeric and could be solved by molecular replacement, using the trimeric GCN4 structure as a search model. For T9, two structures were solved in alternative conformations (Figure 3). Apart from A7, which carries a heptad insert, all structures formed β-layers. These are identical in their structure (Figure 4), although they are not accommodated in the same way.
In A9 and in one of the T9 structures, T99, the β-layers are formed as in Actinobacillus OMP100. The first three and last three residues of the insert are in heptad register with the flanking GCN4 adaptors and therefore constitute positions a, b, c and e, f, g. The middle three residues, KAD in A9, KDD in T99, form the β-layer in place of position d (Figure 3C). The A6 structure follows the same principle: again the first three residues of the insert are in heptad register with the N-terminal GCN4 adaptor, constituting positions a, b and c. The other three residues of the insert, KAD, form the β-layer in place of position d. As a consequence, the register of the C-terminal GCN4 adaptor is shifted at the junction to the insert, starting with position e instead of position a. This register conflict is resolved further downstream by the formation of a hendecad (highlighted in pink in Figure 3A and C), so that the second half of the C-terminal adaptor retains its original register. In essence, the A6 structure shows a 9-residue element with the same structure as those found in A9 and T99, where the sequence IENKADMKQ borrows the last three residues from the C-terminal GCN4 adaptor and changes the periodicity of the latter at the junction.
In contrast, the T6 structure shows a ‘real’ 6-residue element. Here, the β-layer is formed by the first three residues of the insert, MAT (Figure 3D). The last three residues of the insert assume geometrically clear e, f and g positions and the C-terminal GCN4 adaptor follows in its native register, starting with an a position. Therefore, the β-layer occurs again in place of position d. A conflict with the native register of the N-terminal adaptor is avoided with just a small ‘twist’ to the adaptor’s last residue: The C-terminal leucine, natively occurring in position g, is rotated outward from the core of the bundle by about 15° so that its Crick angle is biased towards the angle of a c position. In Figure 3 this is noted as a g/c position, as it is close enough to an ideal position g for the preceding coiled coil to stay in register and close enough to a position c for the formation of the subsequent β-layer (highlighted in pink in Figure 3D). Surprisingly, the alternative T9 structure, T96, starts out in the same way, forming the β-layer with the first three residues of the insert, MAT, after a g/c position. Consequently, the middle three residues constitute positions e, f and g. It thereby shows the same 6-residue element as the structure T6. The last three residues of the insert are accommodated as a sharply localized stammer, before the C-terminal adaptor starts in position a (highlighted in pink in Figure 3A and D). Thus, the two structures T6 and T96 show that 6-residue elements are accommodated as N-terminally shortened 9-residue elements. While β-layers seem to strictly dictate the downstream register to start with an e position, they can occur after both c and g positions.
The observation that the T9 construct could accommodate the MATKDDIAN insert in two ways, forming the β-layer either at MAT or at KDD, led us to wonder whether the same could happen with the A9 insert, IENKADKAD, if the glutamate was interchanged with the central alanine to mirror the first six residues of the T9 insert (A9b, IANKEDKAD). We had previously found that β-layers which occur as connectors between TAA domains prefer small, hydrophobic residues in their central position (Hartmann et al., 2012; Bassler et al., 2015). We therefore thought that the central aspartate of the T9 insert might have been sufficiently unfavorable (T99) that an alternative, with the alanine of MAT at the center of the β-layer (T96), became observable, even though T99 allows the flanking coiled-coil segments to remain unperturbed and T96 requires their distortion. We reasoned that the larger glutamate residue at the center of A9b might even be sufficiently unfavorable to move this construct quantitatively to the alternative structure, with the β-layer formed over the first three residues (IAN). A9b in fact crystallized in two alternative structures (Figure 3), but in both the β-layer formed over the central glutamate. Since this residue was indeed too large and polar to be accommodated without distortion, the first turns of the downstream helices are perturbed to different extents in both instances, leading to a pronounced kink in one of the structures (highlighted in pink in Figure 3A). We were surprised to see that the penalty introduced by the central glutamate was not sufficient to produce the alternative structure; the reasons for this are unclear to us at present.
The α/β coiled coil
Expecting to obtain a continuous fiber of alternating α and β elements, we built a construct with repeating nonads, based on Thermosinus Tcar0761 (Figure 5). The 14 consecutive, almost perfect MATKDDIAN repeats in this protein are flanked by long heptad segments. In our construct we omitted the middle ten nonad repeats and trimmed the N- and C-terminal heptad segments for in-register fusion to GCN4-N16V (Table 1; red sequence in Figure 5). Crystallization trials yielded crystals in space group P63, diffracting to a resolution of 1.6 Å, with one chain in the asymmetric unit and the trimer built by crystallographic symmetry around the c axis. The structure could be solved by molecular replacement using fragments of the T6 and A9 structures. It shows the anticipated α/β coiled coil with four consecutive β-layers. These layers are formed by the residues MAT of the repeats; the other residues, corresponding to KDDIAN, constitute positions e, f, g, a, b, c of the segments between the β-layers. Therefore, in accordance with heptad notation, the repeats can be written as IANMATKDD, with the isoleucine forming classical hydrophobic a layers and the MAT forming β-layers in place of position d (Figure 5). Only the first β-layer is part of a 6-residue element (hexad) and occurs after a position g of the preceding heptad. This g position is biased towards a c position, as described above for the structures T6 and T96, yielding the same g/c position. With its alternating a- and β-layers, the α/β coiled coil is a new class of protein fiber, based on a novel supersecondary structure element.
The α/β coiled coil of Thermosinus Tcar0761 is built of nonads and thus contains six residues per repeat in the α region of the Ramachandran plot, which retain one backbone hydrogen bond characteristic of α-helical structure. We think that it should be possible to reduce this structure by removing three α-helical residues and thus the single remaining backbone hydrogen bond from the parent structure. Such a minimalistic α/β coiled coil would be built of hexads, with three residues in the β and three in the α region of the Ramachandran plot. We have not so far detected coiled-coil proteins with β-layers in hexad spacing, nor have we been successful in constructing such a structure by fusion of MATKDD repeats between GCN4-N16V adaptors. However, as we will show in the next section, a tail-fiber protein from a Streptococcus pyogenes prophage (2C3F) contains an α/β coiled coil with four β-layers, two of which are in a hexad spacing.
β-Layers in proteins of known structure
At the beginning of this project we had identified nonads in the stalks of TAAs and in the N-terminal coiled coils of a family of prokaryotic endonucleases listed in Pfam as PD-(D/E)XK, specifically in the crenarchaeal representatives of this family. The bacterial representatives, where they had the coiled-coil stalk, lacked nonads or related periodicities (in Pfam however, all the coiled-coil segments of this family are grouped together in entry DUF3782). Surprisingly, we found that some bacteria contain coiled-coil proteins that lack the endonuclease domain, but are very similar to the coiled coils of the crenarchaeal proteins; Thermosinus Tcar0761 belongs to these. The β-layers in this family have the consensus sequence [aliphatic]–A–T–K–[polar]–[DE] (Figure 6). Pattern searches with this motif led us to the discovery of a family of integral membrane proteins found in prokaryotes and mitochondria (DUF1640), which carry this motif prominently at the beginning of their C-terminal stalk (Figure 6). However, our sequence searches, both based on sequence patterns and on the discovery of relevant insertions into the heptads of coiled coils, have progressed slowly, as they require much case-by-case analysis. This is due on the one hand to the frequency of the β-layer sequence patterns in coiled coils and on the other to the difficulty of establishing reliably the local register of coiled coils that deviate from the heptad repeat with existing software. Indeed, as we have described for TAAs (Szczesny and Lupas, 2008; Bassler et al., 2015), many of these escape detection entirely with current programs.
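A pattern search of the kind described above can be sketched with a regular expression. This is an illustration only, not the search procedure used in the study: the membership of the "aliphatic" and "polar" residue classes is an assumption (the text names the classes but does not list them), and the helper `find_motifs` is hypothetical. The test sequence is the insert region of the Tcar0761 construct from Table 1.

```python
import re

# Residue classes are assumptions made for this sketch; the text gives only
# the class names "aliphatic" and "polar", not their membership.
ALIPHATIC = "AILMV"
POLAR = "DEHKNQRST"

# Consensus [aliphatic]-A-T-K-[polar]-[DE] from the text.
BETA_LAYER_MOTIF = re.compile(rf"[{ALIPHATIC}]ATK[{POLAR}][DE]")

# Insert region of the Tcar0761 construct (Table 1), without adaptors and tag.
TCAR0761_INSERT = (
    "ITLMQAN" "MATKDD" "LARMATKDD" "IANMATKDD" "IANMATKDD" "IAKLDVK" "IENLNTK"
)

def find_motifs(sequence):
    """Return (0-based start, matched residues) for each non-overlapping motif hit."""
    return [(m.start(), m.group()) for m in BETA_LAYER_MOTIF.finditer(sequence)]

hits = find_motifs(TCAR0761_INSERT)
starts = [start for start, _ in hits]
spacings = [b - a for a, b in zip(starts, starts[1:])]
print(hits)      # four MATKDD hits
print(spacings)  # distance between consecutive hits, in residues
```

On this sequence the hits are spaced nine residues apart, reflecting the nonad repeat. As the text notes, such short patterns are frequent in coiled coils, so a match on its own does not establish a β-layer.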
Given the structural identity between the β-layers resulting from hexads and nonads in coiled coils, and the supersecondary structures we characterized at the transition between coiled-coil segments and β-stranded domains in TAAs (Hartmann et al., 2012), we searched systematically for other instances of β-layers in proteins of known structure. The results are collected in Figure 7 and Table 2. All proteins we identified are homotrimers, except for the SLH domain, which is a monomer with pseudo-threefold symmetry; a majority are from viruses, mainly bacteriophage. Most β-layers occur in the context of coiled coils and we have termed these ‘canonical’. They are usually found capping one of the ends of the coiled coil, more often the N-terminal than the C-terminal one, and we have found only two further examples of coiled coils with internal β-layers: MPN010, a protein of unknown function from Mycoplasma pneumoniae (2BA2), and the aforementioned tail-fiber protein with hyaluronidase activity from the Streptococcus pyogenes prophage SF370.1 (2C3F). The latter contains a coiled coil with four β-layers, one near the N-terminus, two internal, and one at the C-terminal end; the two internal β-layers have the sequence LQQKADKETVYTKAE and are thus in a hexad spacing, with the first resembling the β-layer sequence of OMP100 and the second the one of Tcar0761. Remarkably, this second β-layer deviates from canonical β-layer structure, which we attribute to the serine in the first core position of the downstream coiled coil. This serine spans a water network in the core of the trimer, which invades the β-interactions of the β-layer with bridging waters, leading to a largely increased diameter of the layer. This wider diameter might be further promoted by the bulky tyrosine side chain of the central β-layer residue, which is bent out of the core. 
Nevertheless, such tandems with the consensus sequence LxxKADKxxVYTKxE occur in many bacterial ORFs (also in some TAAs, such as Neisseria meningitidis NadA4) and thus probably constitute a co-optimized module.
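The tandem consensus can likewise be written as a regular expression, reading each "x" as any single residue. The sketch below applies it to the 2C3F segment listed in Table 2; it only illustrates the pattern and is not the search that was run against bacterial ORFs.

```python
import re

# Consensus LxxKADKxxVYTKxE from the text, with "x" read as any single residue.
TANDEM = re.compile(r"L..KADK..VYTK.E")

# Segment of the 2C3F tail-fiber coiled coil as listed in Table 2.
SEGMENT_2C3F = "IDGLATKVETAQKLQQKADKETVYTKAESKQE"

match = TANDEM.search(SEGMENT_2C3F)
if match:
    # Offset is 0-based and relative to the start of the listed segment.
    print(match.start(), match.group())
```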
Table 2.
PDB | Type | Protein | Domain | Species | Structural annotation / sequence | Similar structures |
---|---|---|---|---|---|---|
Cellular proteins (canonical) | | | | | | |
2YO3 | cc-to-β | SadA | TAA DALL1 | Salmonella enterica | abcdefgβββEEEECC / 1306-LKASEAGSVRYETNAD-1321 | 3WPA, 3WPO, 3WPP, 3WQA (Acinetobacter sp. Tol5), 4USX (Burkholderia pseudomallei) |
2YO2 | cc-to-β | SadA | TAA DALL2 | Salmonella enterica | abcdefgβββEECCC / 310-VAGLAEDALLWDESI-324 | 3ZMF, 2YNZ (Salmonella enterica) |
2YO3 | β-to-cc | SadA | TAA Short neck | Salmonella enterica | EEEECCCβββefgabcdefghijklmno / 1345-AAVNDTDAVNYAQLKRSVEEANTYTDQK-1372 | 4LGO (Bartonella quintana), 3WP8, 3WPA, 3WPR (Acinetobacter sp. Tol5), 1P9H (Yersinia enterocolitica), 2XQH (Escherichia coli), 3D9X (Bartonella henselae), 2YO0 (Salmonella enterica), 3S6L, 4USX (Burkholderia pseudomallei), 2GR7 (Haemophilus influenzae) |
2YO2 | β-to-cc | SadA | TAA Long neck | Salmonella enterica | EEEEβββefgabcdefg / 349-DSTDAVNGSQMKQIEDK-365 | 2YNZ, 3ZMF (Salmonella enterica), 3EMO (Haemophilus influenzae), 3LAA, 3LA9, 4USX (Burkholderia pseudomallei), 3WPA, 3WPO, 3WPP, 3WPR, 3WQA (Acinetobacter sp. Tol5), 3NTN, 3PR7 (Moraxella catarrhalis) |
1S7M | β-to-cc | Hia | TAA Insert neck 1 | Haemophilus influenzae | EEβββefgabc / 642-NTAATVGDLRG-652 | 3EMF (Haemophilus influenzae) |
4C47 | Nterm-to-cc | SadB | - | Salmonella enterica | CCβββefgabcdefg / 23-DYFADKHLVEEMKEQ-37 | - |
5APP | cc-to-cc | OMP100 | TAA Stalk | Actinobacillus actinomycetemcomitans | abcdefgabcβββefgabcdefg / 153-IAENKKAIENKADKADVEKNRAD-175 | - |
5APZ | cc-to-cc | Tcar0761 | DUF3782 | Thermosinus carboxydivorans | abcdefgβββefgabc / 68-ITLMQANMATKDDLAR-83 | - |
2BA2 | Nterm-to-cc | MPN010 | DUF16 | Mycoplasma pneumoniae | CCCβββefghijk / 5-GTRYVTHKQLDEK-17 | - |
2BA2 | cc-to-cc | MPN010 | DUF16 | Mycoplasma pneumoniae | hijkabcβββefgabcdefghijk / 14-LDEKLKNFVTKTEFKEFQTVVMES-37 | - |
3PYW | coil-to-cc | S-layer protein Sap | SLH | Bacillus anthracis | CCCCEβββefghijkabcdef / 35-FEPGKELTRAEAATMMAQILN-55 ... 94-FEPNGKIDRVSMASLLVEAYK-114 ... 156-WEPKKTVTKAEAAQFIAKTDK-176 | - |
Phage and virus proteins (canonical) | | | | | | |
2C3F | cc-to-cc | Tail fiber hyaluronidase | - | Streptococcus pyogenes (prophage SF370.1) | abcβββefghijkabcβββefgβββefghijk / 69-IDGLATKVETAQKLQQKADKETVYTKAESKQE-99 | 2DP5 (Streptococcus pyogenes) |
2C3F | cc-to-β | Tail fiber hyaluronidase | - | Streptococcus pyogenes (prophage SF370.1) | defgabcβββCEEEEE / 97-SKQELDKKLNLKGGVM-112 | 2DP5 (Streptococcus pyogenes) |
2C3F | β-to-cc | Tail fiber hyaluronidase | TAA short neck homolog | Streptococcus pyogenes (prophage SF370.1) | EEEECCEβββefghijkabcdefg / 310-DPTANDHAATKAYVDKAISELKKL-327 | 2DP5, 2WH7, 2WB3 (Streptococcus pyogenes) |
4MTM | coil-to-cc | gp53 | - | Bacteriophage AP22 | CCCCEβββefgabcdefg / 155-NDVGSALSAAQGKVLNDK-172 | - |
1YU4 | β-to-cc | Major tropism determinant U1 variant (Mtd-U1) | - | Bordetella Phage BMP-1 | CCCCEEβββefgab / 41-TAGGFPLARHDLVK-54 | - |
1TSP | cc-to-β | Tailspike protein | Phage P22-tail | Phage P22 | defghijkβββEEE / 113-YSIEADKKFKYSVK-126 | 1CLW, 2XC1, 2VFM, 2VFP, 2VFQ, 2VFO, 2VFN [...] (Phage P22), 4OJP, 4OJ5, 4OJL [...] (E. coli Bacteriophage CBA120), 2V5I (Bacteriophage Det7), 2X3H (Enterobacteria phage K1-5) |
2POH | cc-to-β | Phage P22 tail needle gp26 | - | Phage P22 | abcdefgβββCEEC / 133-ISALQADYVSKTAT-146 | 3C9I, 4LIN, 4ZKP, 4ZKU, 5BU5, 5BU8, 5BVZ (Phage P22) |
1H6W | β-to-cc | Short fiber | Receptor binding domain | Bacteriophage T4 | EEEEEECCEEβββefgabcde / 321-MTGGYIQGKRVVTQNEIDRTI-341 | 1OCY, 1PDI, 2XGF, 2FKK, 2FL8 (Bacteriophage T4) |
4A0T | cc-to-coil | gp17 | gp37_C | Bacteriophage T7 | cdefghijkβββCCCC / 454-WLDAYLRDSFVAKSKA-469 | 4A0U (Bacteriophage T7) |
1MG1 | α-to-cc | Maltose-binding protein GP21 | TLV_coat | Primate T-lymphotrophic virus 1 (HTLV-1) | HHHHHHEβββefgabcdefghijk / 364-AAQTNAAAMSLASGKSLLHEVDKD-387 | - |
3DUZ | coil-to-cc | GP64 | Baculo_gp64 | Autographa californica Multiple Nucleopolyhedrovirus | CCCβββefgabcdefg / 293-EGDTATKGDLMHIQEE-308 | - |
4NKJ | Nterm-to-cc | Hemagglutinin | Hemagglutinin HA2 | Influenza B virus | Eβββefgabcdefghijk / 4-VAADLKSTQEAINKITKN-21 | 1QU1 (Influenza A virus) |
Unusual β-layer proteins | | | | | | |
4NQJ | α-to-α | TRIM Ubiquitin E3 ligase | DUF3583 | Homo sapiens | HHHHHHHβββHHHHHHH / 143-SVGQSKEFLQISDAVHF-159 | - |
2F0C | (cc-to-)coil-to-β | Receptor binding protein (ORF49) | - | Lactophage tp901-1 | abcdefgabCCCCβββCEEC / 22-LEAINSELTSGGNVVHKTGD-41 | 3D8M, 3DA0 (Lactophage tp901-1) |
1AA0 | (cc-to-)coil-to-β | Fibritin | Fibritin_C | Bacteriophage T4 | abcdefgCβββEEEEE / 450-VQALQEAGYIPEAPRD-465 | 1AVY, 2BSG, 2IBL, 2WW6, 2WW7, 3ALM (Bacteriophage T4), 5C0R (Influenza A), 2LP7 (Human Immunodeficiency Virus 1), 1NAY |
2XGF | coil-to-coil | Long tail fiber needle | - | Bacteriophage T4 | EEEECCCCCCCCβββCCCCEEEE / 934-EAWNGTGVGGNKMSSYAISYRAG-956 | - |
1H6W | coil-to-coil-(to-β) | Short fiber | - | Bacteriophage T4 | CCCβββCCCCEEEEE / 284-NADVIHQRGGQTING-298 | - |
4UXG | β-to-coil | Proximal long tail fibre protein gp34 | - | Bacteriophage T4 | EEEβββCCCCCC / 1233-FVQVFDGGNPPQ-1244 | - |
4UXG | α-to-coil | Proximal long tail fibre protein gp34 | - | Bacteriophage T4 | HHHHCβββCCCEEE / 1245-PSDIGALPSDNATM-1258 | - |
3QC7 | α-to-coil | Head fiber | - | Bacteriophage Phi29 | HHHHHHHβββCCCCCCC / 221-NLRTMIGAGVPYSLPAA-237 | - |
A structural analysis of canonical β-layers in light of their conserved sequence patterns (Figure 6, Table 2) shows that they favor hydrophobic residues in β1 and β2 (Figure 6), and that the β2 residue in particular tends to be small (i.e. A or V). They can follow upon either position c or g of the preceding coiled coil, but always lead into positions e, f, g of the following coiled coil. Thus, when they follow upon position c they yield the register a-b-c-β1-β2-β3-e-f-g (seen in nonads), whereas when they follow upon position g they bias the residue in position g towards c to yield the register e-f-g/c-β1-β2-β3-e-f-g (seen in hexads). For the purpose of the following discussion we will refer to these two registers collectively as α1-α2-α3-β1-β2-β3-e-f-g.
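The two registers can be written out explicitly to make the repeat lengths visible. The following is a minimal sketch; the names `NONAD_REGISTER`, `HEXAD_UNIT`, `tandem` and `hexad_context` are illustrative, and treating β1-β2-β3-e-f-g/c as the six-residue repeat unit of a hexad is our reading of the register given above.

```python
# Registers as given in the text; the constant and function names are illustrative.
NONAD_REGISTER = ("a", "b", "c", "β1", "β2", "β3", "e", "f", "g")

# Assumed six-residue repeat unit of a hexad: the last residue is both
# position g of one coiled-coil segment and the c-like residue before the next layer.
HEXAD_UNIT = ("β1", "β2", "β3", "e", "f", "g/c")


def tandem(unit, repeats):
    """Join `repeats` copies of a repeat unit into a register string."""
    return "-".join(unit * repeats)


def hexad_context():
    """Register of one β-layer in a hexad background, as written in the text."""
    return "-".join(("e", "f", "g/c") + HEXAD_UNIT[:-1] + ("g",))


if __name__ == "__main__":
    print(len(NONAD_REGISTER), tandem(NONAD_REGISTER, 1))
    print(len(HEXAD_UNIT), hexad_context())
    print(tandem(HEXAD_UNIT, 2))
```

Written this way, the nonad spans nine positions and the hexad six, which matches the names used for the two repeat types.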
For β-layers that occur at the C-terminal end of coiled coils (for example in the DALL1 and DALL2 domains of TAAs), the flanking residues do not form conserved mainchain or sidechain interactions with the layer or with each other, and their conservation pattern is dominated by interactions with the downstream domain. Since β-layers can form interaction networks that provide a C-cap to the preceding coiled coil (see below), it is surprising that they do not do so in most structures where they occur at the C-terminal end of coiled coils.
For β-layers that occur at the N-terminal end (for example in the necks of TAAs or in influenza hemagglutinin HA2), the β3 residue acts as an N-cap for the following helix, coordinating the backbone NH group of residue g (Figure 8); it is thus almost always D, N, T, or S (the capping role of this residue has been described in detail for the fusion-pH structure of influenza hemagglutinin HA2; Chen et al., 1999). In return, the sidechain of the residue in position g forms a hydrogen bond with the backbone NH group of the β3 residue, closing a ring of sidechain-backbone interactions between these two residues; the residue in position g is thus almost always D, E, or Q. Where it is D or E, it can further form a salt bridge to the residue in position e of the neighboring chain (clockwise as viewed from the N-terminus), which is broadly conserved as K or R. This residue essentially always forms either this salt bridge, or a hydrogen bond with the backbone carbonyl group of the β1 residue, as depicted in Figure 8. This interaction network allows β-layers to form stably at the N-terminal end of coiled-coil proteins, as seen in the crystal structures of SadB, MPN010, and the fusion-pH structure of influenza hemagglutinin HA2.
β-Layers that occur within coiled coils show substantially the same interactions and conservation patterns as the ones that act as N-caps, when the residue in β1 is hydrophobic. Only occasionally does the K or R in position e show a third conformation, coordinating the backbone carbonyl of the α2 residue of the preceding helix in the neighboring chain (counterclockwise) and thus providing a C-capping interaction. However, when the β1 residue is K (mainly in the stalks of TAAs and related phage proteins), the interaction network changes entirely from an N-cap of the following helices to a C-cap of the preceding ones. The K in β1 reaches across the core of the trimer to form one, two, or all three of the following interactions: coordination of the backbone carbonyls of the α1 and β1 residues, and of the sidechain of the β3 residue, all from the neighboring chain (clockwise). Additionally, the K or R in position e is found exclusively in the C-capping conformation. In all cases, the network is completed by backbone hydrogen bonds from the α3 residues to the residues in e (clockwise), as already described for OMP100 (Figure 2C). These considerations suggest that in a tandem of β-layers with hexad spacing, the first layer should favor a C-capping network, with K in position β1, and the second an N-capping network, with a hydrophobic residue in β1. This is in fact observed in the Streptococcus prophage tail-fiber protein (2C3F).
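The residue preferences of the N-capping network can be checked against individual Table 2 entries by aligning the register annotation with the sequence. The sketch below is illustrative only: the function names are ours, and it tests just the three preferences stated above (β3 as D/N/T/S, g as D/E/Q, e as K/R), using the annotation and sequence strings of the 3DUZ and 4MTM entries as they appear in Table 2.

```python
def extract_positions(annotation, sequence):
    """Map β1, β2, β3, e, f, g to residues, given a Table 2 style annotation."""
    if len(annotation) != len(sequence):
        raise ValueError("annotation and sequence must have equal length")
    start = annotation.index("βββ")  # first β-layer in the annotation
    if annotation[start + 3:start + 6] != "efg":
        raise ValueError("β-layer is not followed by positions e, f, g")
    labels = ("β1", "β2", "β3", "e", "f", "g")
    return dict(zip(labels, sequence[start:start + 6]))


def n_cap_features(positions):
    """Evaluate the three residue preferences described in the text."""
    return {
        "β3 in D/N/T/S": positions["β3"] in "DNTS",
        "g in D/E/Q": positions["g"] in "DEQ",
        "e in K/R": positions["e"] in "KR",
    }


if __name__ == "__main__":
    entries = {
        "3DUZ": ("CCCβββefgabcdefg", "EGDTATKGDLMHIQEE"),
        "4MTM": ("CCCCEβββefgabcdefg", "NDVGSALSAAQGKVLNDK"),
    }
    for pdb_id, (annotation, sequence) in entries.items():
        positions = extract_positions(annotation, sequence)
        print(pdb_id, positions, n_cap_features(positions))
```

In this reading, the 3DUZ segment satisfies all three preferences, whereas the 4MTM segment has A rather than K or R in position e, consistent with that position being described only as broadly conserved.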
Conclusions
The range of periodicities that α-helical coiled coils can assume is limited by the strain they impose on the constituent helices, as they progressively deviate from the 3.63 residues per turn of an undistorted α-helix. Insertions of 3 residues into a heptad background (stammers, 10/3 = 3.33) lead to the largest strain observed so far in continuous coiled coils and are accommodated by the local distortion of the α-helix into a 3₁₀ helix. We find that increasing the strain further by insertions of 2×3 or 3×3 residues leads to a complete loss of helical structure and the local formation of short β-strands. These cross to form a triangular plane, which moves the path of each chain by 120° counterclockwise around the trimer axis. Within this plane, the central residues of the three β-strands form backbone hydrogen bonds whose geometry deviates substantially from that seen in β-sheets. We have named these structures β-layers and show that they can be brought about in a straightforward way by the insertion of 6 or 2 (9 = 2 modulo 7) residues into a heptad background. We propose that β-layers offer two clear advantages to protein fibers: they increase the resilience of the fibers by tightly interleaving the monomers, and they offer a simple mechanism to integrate β-stranded domains into these fibers, thus increasing their functional complexity. Our results show that a novel backbone structure is accessible to the 20 proteinogenic amino acids in the allowed regions of the Ramachandran plot with only minor mutations to a known fold.
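The arithmetic behind these periodicities is simple enough to write down. The sketch below uses only values quoted in this article (3.63 residues per turn for the undistorted α-helix, 7 residues over 2 turns for heptads, 10 over 3 for stammers); the function and constant names are illustrative.

```python
ALPHA_HELIX = 3.63  # residues per turn of an undistorted α-helix, as quoted in the text

# (residues, turns) of the repeat patterns mentioned in the text
PERIODICITIES = {
    "heptad": (7, 2),
    "stammer": (10, 3),
}


def residues_per_turn(residues, turns):
    """Periodicity of a repeat pattern in residues per helical turn."""
    return residues / turns


def deviation(residues, turns):
    """How far a periodicity falls below that of the undistorted α-helix."""
    return ALPHA_HELIX - residues_per_turn(residues, turns)


if __name__ == "__main__":
    for name, (residues, turns) in PERIODICITIES.items():
        print(f"{name}: {residues_per_turn(residues, turns):.2f} residues/turn, "
              f"deviation {deviation(residues, turns):.2f}")
    # A 9-residue insertion shifts the heptad register like a 2-residue insertion.
    print(9 % 7)
```

The stammer deviates from the undistorted helix by more than twice as much as the heptad, which illustrates why it is described as the largest strain tolerated in continuous coiled coils.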
Materials and methods
Cloning
If not otherwise indicated, constructs were amplified by primer extension. Primers used for amplification, cloning and mutagenesis are listed in Table 3.
Table 3.
Construct | Primer |
---|---|
OMP100 |
P omp1: 5`-GACCATGGTCTCCGATTCAGAACGTGGATGTGCGCAGCACCGAAAACGCGGCGCGCAGCCGCGCGAACGAACAG P omp2: 5`-GCTTTATCCGCTTTGTTTTCAATCGCTTTTTTGTTTTCCGCAATTTTCTGTTCGTTCGCGCGGCTGC P omp3: 5`-GAAAACAAAGCGGATAAAGCGGATGTGGAAAAAAACCGCGCGGATATTGCGGCGAACAGCCGCGCGATTGCGACCTTTCG P omp4: 5`-GACCATGGTCTCCTCATTTTGGTGGTCAGCGCCGCAATGTTCTGGCTGCTGCTGCGAAAGGTCGCAATCGCGCG |
pASK IBA GCN4 N16V |
P iba1: 5`-ACAAAAATCTAGATAACGAGGGCAAAAAATGAAACAGCTGGAAATGAAAGTTGAAGAACTGCTGTCCAAAGTCTACCACCTGGAAAACGA P iba2: 5`-CTCGAGGGATCCCCGGGTACCGAGCTCGAATTCGGGACCATGGTCTCCCAGTTTTTTCAGACGCGCAACTTCGTTTTCCAGGTGGTAGAC P iba3: 5`-GTACCCGGGGATCCCTCGAGAGGGGGACCATGGTCTCAATGAAACAGCTGGAATGGAAAGTTGAAGAACTGCTGTCCAAAGTCTACCACC P iba4: 5`-CACAGGTCAAGCTTATTAGTGATGGTGATGGTGATGGCCAGAACCAACCAGTTTTTTCAGACGCGCAACTTCGTTTTCCAGGTGGTAGACTTTGGACAGC |
T6 |
T6 p1: 5`-GGAATTCCATATGAAGCAGCTGGAAGACAAGGTGGAGGAACTGTGTCCAAAGTGTACCATCTGGAAAACGAGGTGGCGCGTCTGAAGAAG T6 p2: 5`-CTTGGACAGCAGTTCTTCCACCTTATCTTCCAGCTGCTTCAATCATCTTTGGTCGCCATCAGCTTCTTCAGACGCGCCACCTC T6 p3: 5`-GGTGGAAGAACTGCTGTCCAAGGTGTATCATCTGGAGAATGAGTGGCGCGTCTGAAGAAGCTGGTGGGCGAACGCTGAGGATCCCG T6 p4: 5`-CGGGATCCTCAGCGTTCGCCCACCAGCTTCTTCAGACGCGCCACTCATTCTCCAGATGATACACCTTGGACAGCAGTTCTTCCACC |
T9 |
T9 p1: 5`-GGAATTCCATATGAAGCAGCTGGAAGATAAGGTGGAAGAGCTGCTGTCAAAGTGTACCATCTGGAAAACGAAGTGGCGCGTCTGAAGAAG T9 p2: 5`-CAGCAGTTCTTCCACCTTATCTTCCAGCTGCTTCATGTTCGCAATGTCATCTTTGGTCGCCATCAGCTTCTTCAGACGCGCCACTTC T9 p3: 5`-GATAAGGTGGAAGAACTGCTGTCCAAAGTGTACCATCTGGAAAACGAAGTGGCGCGTCTGAAGAAACTGGTGGGCGAACGCTGAGGATCCCG T9 p4: 5`-CGGGATCCTCAGCGTTCGCCCACCAGTTTCTTCAGACGCGCCACTTCGTTTTCCAGATGGTACACTTTGGACAGCAGTTCTTCCACCTTATC |
A6 |
A6 p1: 5`-GGAATTCCATATGAAGCAACTTGAAGACAAAGTCGAAGAGCTTCTCTCAAGTTTATCATCTTGAGAACGAAGTTGCTCGTCTTAAG A6 p2: 5`-CCTTAGAAAGAAGTTCTTCGACCTTATCCTCAAGTTGCTTCATATCGCTTTGTCTCAATGAGTTTCTTAAGACGAGCAACTTCG A6 p3: 5`-CGAAGAACTTCTTTCTAAGGTTTACCATCTCGAAAATGAGGTTGTCGTTCAGAAGCTTGTTGGCGAACGCTGAGGATCCCG A6 p4: 5`-CGGGATCCTCAGCGTTCGCCAACAAGCTTCTTGAGACGAGCAACCCATTTCGAGATGGTAAACCTTAGAAAGAAGTTCTTCG |
A7 |
MP A6+K se: 5`-CTTAAGAAACTCATTGAGAACAAGAAAGCCGATATGAAGCAAC MP A6+K as: 5`-GTTGCTTCATATCGGCTTTCTTGTTCTCAATGAGTTTCTTAAG |
A9 |
MP A6+KAD se: 5`-CATTGAGAACAAAGCCGATAAGGCTGACATGAAGCAACTTGAGG MP A6+KAD as: 5`-CCTCAAGTTGCTTCATGTCAGCCTTATCGGCTTTGTTCTCAATG |
The OMP100 construct encompasses residues 133–198 of OMP100 from Actinobacillus actinomycetemcomitans (Genbank BAB86905.1), fused at both the N- and the C-terminus in heptad register to the trimeric leucine zipper GCN4-pII. The amplified DNA fragment was cloned in Eco31I-sites of pIBA-GCN4tri-His (Hernandez Alvarez et al., 2008).
The Tcar0761 construct is derived from open reading frame 0761 of Thermosinus carboxydivorans Nor1 (Genbank ZP_01667343.1). A DNA fragment encoding residues 68–101, fused directly to a fragment encoding 191–211, was made by gene synthesis (GenScript) and cloned in the Eco31I-sites of pIBA-GCN4 N16V-His.
The GCN4 N16V version of the pIBA-GCN4 series allows for the expression of protein fragments fused at both termini to GCN4 adaptors carrying the N16V mutation, a variant of the leucine zipper that forms a mixture of dimers and trimers. pIBA-GCN4 N16V-His was constructed by replacing the XbaI/HindIII fragment of pASK IBA2 by a DNA fragment containing the XbaI site, ribosomal binding site, N-terminal GCN4 N16V adaptor, multiple cloning site, C-terminal GCN4 N16V adaptor, (His)6-tag and the HindIII site. Aspartate residues in position f of the first heptad were replaced by methionine and tryptophan in the N- and C-terminal GCN4 adaptors, respectively, as described before (Deiss et al., 2014).
Constructs A6, T6 and T9 were amplified and cloned into the NdeI and BamHI sites of the expression vector pETHis1a_Nde1, a modified version of pETHis1a (Bogomolovas et al., 2009) allowing for expression of the constructs with an N-terminal (His)6-tag and a TEV-protease cleavage site. A7 and A9 were constructed by site-directed mutagenesis using DNA fragment A6 as a template following the instructions of the QuikChange II XL Site-Directed Mutagenesis Kit. The DNA fragment coding for variant A9b was produced by gene synthesis (GenScript) and cloned in the NdeI and BamHI sites of pETHis1a_Nde1.
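For the two mutagenesis constructs, the sense and antisense primers in Table 3 are expected to be exact reverse complements of each other. A short sanity check of that property can be sketched as follows; the primer sequences are copied from Table 3, while the function names are illustrative and not part of any kit or library.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# (sense, antisense) mutagenesis primers for A7 and A9, as listed in Table 3.
PRIMER_PAIRS = {
    "A7": ("CTTAAGAAACTCATTGAGAACAAGAAAGCCGATATGAAGCAAC",
           "GTTGCTTCATATCGGCTTTCTTGTTCTCAATGAGTTTCTTAAG"),
    "A9": ("CATTGAGAACAAAGCCGATAAGGCTGACATGAAGCAACTTGAGG",
           "CCTCAAGTTGCTTCATGTCAGCCTTATCGGCTTTGTTCTCAATG"),
}


def check_pairs(pairs):
    """Report, per construct, whether antisense == reverse complement of sense."""
    return {name: reverse_complement(sense) == antisense
            for name, (sense, antisense) in pairs.items()}


if __name__ == "__main__":
    print(check_pairs(PRIMER_PAIRS))
```

Both pairs pass this check, so a transcription error in either strand of a pair would be detected by the same comparison.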
Protein expression and purification
A6, A7, A9, A9b, T6 and T9 were expressed in E. coli strain C41 (DE3), and the OMP100 and Tcar0761 constructs in XL1-Blue. Cells were grown at 37°C until OD600 = 0.6, then expression was induced by addition of 1 mM isopropyl β-D-1-thiogalactopyranoside. Cells were cultivated for another 5 hr, harvested by centrifugation and disrupted using a French press cell (SLM Aminco). All proteins were purified under denaturing conditions. 6 M guanidinium chloride was added to the cell lysate and the sample stirred for 1 hr at room temperature. After centrifugation, the supernatant was loaded on a NiNTA column equilibrated with 20 mM Tris, pH 7.9, 400 mM NaCl, 10% glycerol, 6 M guanidinium chloride, and bound proteins were eluted with a linear gradient of 0–0.5 M imidazole. Proteins were refolded by dialysis. Corresponding refolding buffers are listed in Table 1. Refolded OMP100 was additionally subjected to a Superdex 75 column. For A6, A7, A9, A9b, T6 and T9 the N-terminal histidine tags were removed before crystallization. As the TEV cleavage site proved inaccessible to TEV protease, the N-terminal tag was digested with Proteinase K. Subsequent analysis of the proteins by mass spectrometry showed intact proteins lacking only the N-terminal extension including the histidine tag and the TEV cleavage site.
X-ray crystallography and structure analysis
Crystallization trials were set up in 96-well sitting-drop plates with drops consisting of 400 nl protein solution + 400 nl reservoir solution (RS) and reservoirs containing 75 µl RS. Crystallization and cryo-protection conditions for all crystal structures are listed in Table 4. All crystals were loop-mounted and flash-frozen in liquid nitrogen, and all data were collected at the SLS (Paul Scherrer Institute, Villigen, Switzerland) under cryo conditions at 100 K using the beamlines and detectors indicated in Table 5. Data were processed and scaled using XDS (Kabsch, 1993). Structures were solved by molecular replacement using MOLREP (Vagin and Teplyakov, 2000). For OMP100 and the GCN4-fusion constructs, trimmed models of the SadAK3 structure (2WPQ) were used as search models. For Tcar0761, fragments of the T6 and A9 structures were used. After rebuilding with ARP/wARP (Perrakis et al., 1999), all structures were completed by cycles of manual modeling in Coot (Emsley and Cowtan, 2004) and refinement with REFMAC5 (Murshudov et al., 1999). Analysis with Procheck (Laskowski et al., 1993) showed excellent geometries for all structures. Data collection and refinement statistics are summarized in Table 5. Periodicity plots were calculated based on the output of TWISTER (Strelkov and Burkhard, 2002). Molecular depictions were prepared using MolScript (Kraulis, 1991), Raster3D (Merritt and Bacon, 1997) and PyMOL (Schrödinger, LLC, New York, NY).
Table 4.
Structure | Protein solution & concentration | Reservoir solution (RS) | Cryo solution |
---|---|---|---|
OMP100 | 20 mM Tris pH 7.5, 150 mM NaCl, 3% (v/v) Glycerol, 3 mg/ml protein | 0.1 M tri-Sodium citrate pH 5.5, 2% (v/v) Dioxane, 15% (w/v) PEG 10,000 | RS + 15% (v/v) PEG 400 |
A6 | 20 mM HEPES pH 7.2, 50 mM NaCl, 2% (v/v) Glycerol, 1 M Urea, 15 mg/ml protein | 95 mM tri-Sodium citrate pH 5.6, 19% (v/v) Isopropanol, 19% (w/v) PEG 4000, 5% (v/v) Glycerol | - |
A7 | 20 mM HEPES pH 7.3, 50 mM NaCl, 1 M Urea, 15 mg/ml protein | 0.1 M Citric acid pH 3.5, 3 M NaCl | - |
A9 | 20 mM HEPES pH 7.2, 50 mM NaCl, 2% (v/v) Glycerol, 1.5 M Urea, 17 mg/ml protein | 1.6 M tri-Sodium citrate pH 6.5 | - |
A9b black | 50 mM HEPES, 50 mM NaCl, 1 M Urea, 7.5 mg/ml protein | 2.4 M Sodium malonate pH 5.0 | - |
A9b grey | 50 mM HEPES, 50 mM NaCl, 1 M Urea, 7.5 mg/ml protein | 0.2 M Sodium citrate, 0.1 M Bis Tris propane pH 6.5, 20% (w/v) PEG 3350 | - |
T6 | 20 mM HEPES pH 7.2, 50 mM NaCl, 1 M Urea, 13 mg/ml protein | 0.2 M CaCl2, 0.1 M HEPES pH 7.5, 30% (w/v) PEG 4000 | - |
T96 | 20 mM HEPES pH 7.2, 50 mM NaCl, 2% (v/v) Glycerol, 1.5 M Urea, 15 mg/ml protein | 0.2 M Ammonium phosphate, 0.1 M TRIS pH 8.5, 50% (v/v) MPD | - |
T99 | as for T96 | 0.1 M Citric acid pH 5.0, 20% (v/v) Isopropanol | RS + 1 M Urea + 25% Glycerol |
Tcar0761 | 20 mM MOPS pH 7.2, 400 mM NaCl, 5% (v/v) Glycerol, 1.5 M Urea, 7 mg/ml protein | 0.1 M tri-Sodium citrate pH 4.0, 30% (v/v) MPD | - |
Table 5.
Structure | OMP100 | A6 | A7 | A9 | A9b black | A9b grey | T6 | T96 | T99 | Tcar0761 |
---|---|---|---|---|---|---|---|---|---|---|
Beamline/Detector* | PXII / M | PXII / M | PXII / M | PXIII / M | PXII / P | PXII / P | PXII / P | PXII / M | PXIII / M | PXII / P |
Wavelength (Å) | 0.9786 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
Trimers/AU | 1 | 1 | 1/3 | 1 | 1 | 2 | 1 | 1 | 1 | 1/3 |
Space group | C2** | C2** | P321 | P21 | P21 | P21 | P21 | P21 | C2** | P63 |
a (Å) | 62.1 | 60.4 | 38.2 | 65.2 | 26.2 | 71.1 | 34.2 | 25.1 | 60.8 | 37.9 |
b (Å) | 35.9 | 34.8 | 38.2 | 34.6 | 37.5 | 35.0 | 27.0 | 38.3 | 35.1 | 37.9 |
c (Å) | 198.5 | 104.2 | 87.1 | 67.5 | 95.0 | 106.2 | 101.0 | 105.0 | 112.2 | 179.2 |
β (°) | 96.0 | 101.1 | 90 | 117.7 | 92.6 | 101.7 | 93.9 | 93.3 | 100.4 | 90 |
Resolution range (Å)*** | 32.9–2.30 (2.44–2.30) | 30.0–2.10 (2.23–2.10) | 18.2–1.37 (1.45–1.37) | 33.7–1.80 (1.91–1.80) | 34.9–1.35 (1.43–1.35) | 38.1–2.00 (2.12–2.00) | 34.1–1.60 (1.70–1.60) | 34.9–1.80 (1.91–1.80) | 19.5–2.00 (2.12–2.00) | 32.3–1.60 (1.69–1.60) |
Completeness (%) | 92.4 (86.5) | 97.3 (96.2) | 99.0 (98.6) | 98.9 (97.4) | 95.9 (92.1) | 92.4 (98.9) | 98.2 (96.1) | 97.1 (95.4) | 98.7 (97.5) | 99.2 (96.9) |
Redundancy | 2.84 (2.52) | 3.71 (3.71) | 6.35 (6.33) | 3.70 (3.67) | 3.72 (3.47) | 3.29 (3.31) | 3.04 (2.89) | 3.94 (3.81) | 3.73 (3.73) | 3.69 (3.65) |
I/σ(I) | 14.0 (1.88) | 15.5 (2.28) | 18.2 (2.52) | 14.3 (2.07) | 17.6 (2.10) | 13.9 (2.43) | 13.6 (2.33) | 14.5 (2.14) | 19.5 (2.25) | 20.3 (2.23) |
Rmerge (%) | 4.2 (44.8) | 4.8 (62.1) | 5.1 (75.5) | 5.1 (61.7) | 3.4 (66.6) | 5.0 (51.5) | 4.4 (42.3) | 7.2 (71.7) | 4.0 (63.2) | 2.9 (60.2) |
Rcryst (%) | 22.5 | 20.8 | 19.5 | 20.6 | 16.3 | 20.6 | 17.4 | 18.7 | 21.1 | 17.7 |
Rfree (%) | 25.4 | 25.1 | 23.8 | 25.6 | 19.9 | 25.3 | 20.5 | 22.6 | 25.5 | 21.3 |
PDB code | 5APP | 5APQ | 5APS | 5APT | 5APU | 5APV | 5APW | 5APX | 5APY | 5APZ |
*M = MARRESEARCH mar225 CCD detector; P = DECTRIS PILATUS 6M detector.
**twinned with apparent H32 symmetry and twinning operators 1/2*h-3/2*k,-1/2*h-1/2*k,-1/2*h+1/2*k-l and 1/2*h+3/2*k,1/2*h-1/2*k,-1/2*h-1/2*k-l.
***values in parentheses refer to the highest resolution shell.
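The two twinning operators in the Table 5 footnote can be written as 3×3 matrices acting on reflection indices (h, k, l). The sketch below, with illustrative names, uses exact fractions to confirm that each operator is its own inverse (a twofold operation) and to show how an example reflection is re-indexed. The example index (1, 1, 2) is an arbitrary choice with h + k even, as required for reflections in a C-centered cell.

```python
from fractions import Fraction as F

# Rows give h', k', l' as linear combinations of (h, k, l), copied from the footnote.
TWIN_OPS = (
    ((F(1, 2), F(-3, 2), F(0)),    # h' = 1/2*h - 3/2*k
     (F(-1, 2), F(-1, 2), F(0)),   # k' = -1/2*h - 1/2*k
     (F(-1, 2), F(1, 2), F(-1))),  # l' = -1/2*h + 1/2*k - l
    ((F(1, 2), F(3, 2), F(0)),     # h' = 1/2*h + 3/2*k
     (F(1, 2), F(-1, 2), F(0)),    # k' = 1/2*h - 1/2*k
     (F(-1, 2), F(-1, 2), F(-1))), # l' = -1/2*h - 1/2*k - l
)


def apply(op, hkl):
    """Apply a twin operator to a reflection index triple."""
    return tuple(sum(c * x for c, x in zip(row, hkl)) for row in op)


def is_twofold(op):
    """True if applying the operator twice returns every basis vector unchanged."""
    basis = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
    return all(apply(op, apply(op, e)) == e for e in basis)


if __name__ == "__main__":
    for op in TWIN_OPS:
        print(is_twofold(op), apply(op, (1, 1, 2)))
```

Because h + k is even for all allowed reflections in a C-centered cell, the half-integer coefficients always map integer indices onto integer indices.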
Bioinformatics
Sequence similarity searches were carried out at the National Center for Biotechnology Information (NCBI; http://blast.ncbi.nlm.nih.gov/) and in the MPI Bioinformatics Toolkit (http://toolkit.tuebingen.mpg.de; Biegert et al., 2006), using PSI-Blast (Altschul et al., 1997) at NCBI and PatternSearch, CS-Blast (Biegert and Söding, 2009), HMMER (Eddy, 2011), HHblits (Remmert et al., 2011) and HHpred (Söding et al., 2005) in the MPI Toolkit. The sequence relationships of proteins identified in these searches were explored by clustering them according to their pairwise Blast P-values in CLANS (Frickey and Lupas, 2004). Sequence logos were created from representative, non-redundant alignments using the WebLogo3 web server (Crooks et al., 2004) with composition correction switched off.
Secondary structure propensity was evaluated in the MPI Toolkit with the meta-tools Quick2D and Ali2D, and coiled-coil propensity was estimated with COILS/PCOILS (Lupas et al., 1991; Gruber et al., 2006) and MARCOIL (Delorenzi and Speed, 2002).
Searches for structures containing β-layers were performed over the Protein Data Bank (PDB, Dec 8 2015) in a two-step procedure: First, their torsion angles were scanned with seven-residue sliding windows of βββαααα and ααααβββ, where α must satisfy -70° ≤ ψ ≤ -10° and -180° ≤ φ ≤ -40°, and β must satisfy 20° ≤ ψ ≤ 180° and -180° ≤ φ ≤ -40°. Second, the central β residue of putative β-layer strands was required to form backbone hydrogen bonds (N-O distance ≤ 3.5 Å) to the equivalent residue of another β-layer strand within a biological assembly. All matches were verified by visual inspection. These searches were complemented by extensive interactive analyses of fibrous proteins in PDB.
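The two-step search can be illustrated with a short sketch. This is not the code used in the study: the function names are ours, the torsion angles are assumed to be supplied as (φ, ψ) pairs in degrees for consecutive residues of one chain, and the hydrogen-bond test is reduced to the N-O distance criterion stated above.

```python
from math import dist

PATTERNS = ("βββαααα", "ααααβββ")  # seven-residue windows described in the text


def classify(phi, psi):
    """Assign α, β or '-' to a residue from its backbone torsion angles."""
    if -180.0 <= phi <= -40.0:
        if -70.0 <= psi <= -10.0:
            return "α"
        if 20.0 <= psi <= 180.0:
            return "β"
    return "-"


def scan(torsions):
    """Return (start index, pattern) for every seven-residue window that matches."""
    states = "".join(classify(phi, psi) for phi, psi in torsions)
    return [(i, states[i:i + 7])
            for i in range(len(states) - 6)
            if states[i:i + 7] in PATTERNS]


def within_hbond_distance(n_xyz, o_xyz, cutoff=3.5):
    """Second step: N-O distance criterion (in Å) for a backbone hydrogen bond."""
    return dist(n_xyz, o_xyz) <= cutoff


if __name__ == "__main__":
    helix, strand = (-60.0, -45.0), (-120.0, 130.0)  # idealized example angles
    chain = [helix] * 4 + [strand] * 3 + [helix] * 4
    print(scan(chain))
    print(within_hbond_distance((0.0, 0.0, 0.0), (2.9, 0.0, 0.0)))
```

A helix-strand-helix segment of this kind yields one hit for each window type, and in a full search each hit would then be kept only if the central β residue passes the distance test against the equivalent residue of another chain in the biological assembly.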
Acknowledgements
We thank Reinhard Albrecht, Kerstin Baer and Silvia Deiss for technical assistance and are very grateful to the staff of beamline X10SA/Swiss Light Source for excellent technical support. This work was supported by the German Science Foundation (SFB 766, TP B4) and by institutional funds of the Max Planck Society.
Funding Statement
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Funding Information
This paper was supported by the following grants:
Max-Planck-Gesellschaft to Marcus D Hartmann, Claudia T Mendler, Jens Bassler, Ioanna Karamichali, Oswin Ridderbusch, Andrei N Lupas, Birte Hernandez Alvarez.
Deutsche Forschungsgemeinschaft SFB766 (TP B4) to Andrei N Lupas, Birte Hernandez Alvarez.
Additional information
Competing interests
The authors declare that no competing interests exist.
Author contributions
MDH, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.
CTM, Acquisition of data, Analysis and interpretation of data.
JB, Acquisition of data, Analysis and interpretation of data.
IK, Acquisition of data, Analysis and interpretation of data.
OR, Acquisition of data, Analysis and interpretation of data.
ANL, Conception and design, Analysis and interpretation of data, Drafting or revising the article.
BHA, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.
References
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389.
- Alvarez BH, Gruber M, Ursinus A, Dunin-Horkawicz S, Lupas AN, Zeth K. A transition from strong right-handed to canonical left-handed supercoiling in a conserved coiled-coil segment of trimeric autotransporter adhesins. Journal of Structural Biology. 2010;170:236–245. doi: 10.1016/j.jsb.2010.02.009.
- Bassler J, Hernandez Alvarez B, Hartmann MD, Lupas AN. A domain dictionary of trimeric autotransporter adhesins. International Journal of Medical Microbiology. 2015;305:265–275. doi: 10.1016/j.ijmm.2014.12.010.
- Biegert A, Mayer C, Remmert M, Söding J, Lupas AN. The MPI bioinformatics toolkit for protein sequence analysis. Nucleic Acids Research. 2006;34:W335–W339. doi: 10.1093/nar/gkl217.
- Biegert A, Söding J. Sequence context-specific profiles for homology searching. Proceedings of the National Academy of Sciences of the United States of America. 2009;106:3770–3775. doi: 10.1073/pnas.0810767106.
- Bogomolovas J, Simon B, Sattler M, Stier G. Screening of fusion partners for high yield expression and purification of bioactive viscotoxins. Protein Expression and Purification. 2009;64:16–23. doi: 10.1016/j.pep.2008.10.003.
- Brown JH, Cohen C, Parry DA. Heptad breaks in alpha-helical coiled coils: stutters and stammers. Proteins. 1996;26:134–145. doi: 10.1002/(SICI)1097-0134(199610)26:2<134::AID-PROT3>3.0.CO;2-G.
- Chen J, Skehel JJ, Wiley DC. N- and C-terminal residues combine in the fusion-pH influenza hemagglutinin HA2 subunit to form an N cap that terminates the triple-stranded coiled coil. Proceedings of the National Academy of Sciences. 1999;96:8967–8972. doi: 10.1073/pnas.96.16.8967.
- Crick F. The Fourier transform of a coiled-coil. Acta Crystallographica. 1953a;6:685–689.
- Crick F. The packing of alpha-helices: simple coiled-coils. Acta Crystallographica. 1953b;6:689–697.
- Crooks G. WebLogo: a sequence logo generator. Genome Research. 2004;14:1188–1190. doi: 10.1101/gr.849004.
- Deiss S, Hernandez Alvarez B, Bär K, Ewers CP, Coles M, Albrecht R, Hartmann MD. Your personalized protein structure: Andrei N. Lupas fused to GCN4 adaptors. Journal of Structural Biology. 2014;186:380–385. doi: 10.1016/j.jsb.2014.01.013.
- Delorenzi M, Speed T. An HMM model for coiled-coil domains and a comparison with PSSM-based predictions. Bioinformatics. 2002;18:617–625. doi: 10.1093/bioinformatics/18.4.617.
- Eddy SR, Pearson W. Accelerated profile HMM searches. PLoS Computational Biology. 2011;7:e1002195. doi: 10.1371/journal.pcbi.1002195.
- Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallographica Section D Biological Crystallography. 2004;60:2126–2132. doi: 10.1107/S0907444904019158.
- Frickey T, Lupas A. CLANS: a Java application for visualizing protein families based on pairwise similarity. Bioinformatics. 2004;20:3702–3704. doi: 10.1093/bioinformatics/bth444.
- Grin I, Hartmann MD, Sauer G, Hernandez Alvarez B, Schütz M, Wagner S, Madlung J, Macek B, Felipe-Lopez A, Hensel M, Lupas A, Linke D. A trimeric lipoprotein assists in trimeric autotransporter biogenesis in enterobacteria. Journal of Biological Chemistry. 2014;289:7388–7398. doi: 10.1074/jbc.M113.513275.
- Gruber M. Historical review: another 50th anniversary – new periodicities in coiled coils. Trends in Biochemical Sciences. 2003;28:679–685. doi: 10.1016/j.tibs.2003.10.008.
- Gruber M, Söding J, Lupas AN. Comparative analysis of coiled-coil prediction methods. Journal of Structural Biology. 2006;155:140–145. doi: 10.1016/j.jsb.2006.03.009.
- Hartmann MD, Ridderbusch O, Zeth K, Albrecht R, Testa O, Woolfson DN, Sauer G, Dunin-Horkawicz S, Lupas AN, Alvarez BH. A coiled-coil motif that sequesters ions to the hydrophobic core. Proceedings of the National Academy of Sciences of the United States of America. 2009;106:16950–16955. doi: 10.1073/pnas.0907256106.
- Hartmann MD, Grin I, Dunin-Horkawicz S, Deiss S, Linke D, Lupas AN, Hernandez Alvarez B. Complete fiber structures of complex trimeric autotransporter adhesins conserved in enterobacteria. Proceedings of the National Academy of Sciences of the United States of America. 2012;109:20907–20912. doi: 10.1073/pnas.1211872110.
- Hartmann MD, Dunin-Horkawicz S, Hulko M, Martin J, Coles M, Lupas AN. A soluble mutant of the transmembrane receptor Af1503 features strong changes in coiled-coil periodicity. Journal of Structural Biology. 2014;186:357–366. doi: 10.1016/j.jsb.2014.02.008.
- Hernandez Alvarez B, Hartmann MD, Albrecht R, Lupas AN, Zeth K, Linke D. A new expression system for protein crystallization using trimeric coiled-coil adaptors. Protein Engineering Design and Selection. 2008;21:11–18. doi: 10.1093/protein/gzm071.
- Hicks MR, Walshaw J, Woolfson DN. Investigating the tolerance of coiled-coil peptides to nonheptad sequence inserts. Journal of Structural Biology. 2002;137:73–81. doi: 10.1006/jsbi.2002.4462.
- Hoiczyk E, Roggenkamp A, Reichenbecher M, Lupas A, Heesemann J. Structure and sequence analysis of Yersinia YadA and Moraxella UspAs reveal a novel class of adhesins. The EMBO Journal. 2000;19:5989–5999. doi: 10.1093/emboj/19.22.5989.
- Huang P-S, Oberdorfer G, Xu C, Pei XY, Nannenga BL, Rogers JM, DiMaio F, Gonen T, Luisi B, Baker D. High thermodynamic stability of parametrically designed helical bundles. Science. 2014;346:481–485. doi: 10.1126/science.1257481.
- Joh NH, Wang T, Bhate MP, Acharya R, Wu Y, Grabe M, Hong M, Grigoryan G, DeGrado WF. De novo design of a transmembrane Zn2+-transporting four-helix bundle. Science. 2014;346:1520–1524. doi: 10.1126/science.1261172.
- Kabsch W. Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants. Journal of Applied Crystallography. 1993;26:795–800. doi: 10.1107/S0021889893005588.
- Kraulis P. Molscript: a program to produce both detailed and schematic plots of protein structures. Journal of Applied Crystallography. 1991;24:946–950.
- Laskowski R, MacArthur M, Moss D, Thornton J. Procheck: a program to check the stereochemical quality of protein structures. Journal of Applied Crystallography. 1993;26:283–291.
- Leo JC, Lyskowski A, Hattula K, Hartmann MD, Schwarz H, Butcher SJ, Linke D, Lupas AN, Goldman A. The structure of E. coli IgG-binding protein D suggests a general model for bending and binding in trimeric autotransporter adhesins. Structure. 2011;19:1021–1030. doi: 10.1016/j.str.2011.03.021.
- Lupas A, Van Dyke M, Stock J. Predicting coiled coils from protein sequences. Science. 1991;252:1162–1164. doi: 10.1126/science.252.5009.1162.
- Lupas A. Coiled coils: new structures and new functions. Trends in Biochemical Sciences. 1996;21:375–382. doi: 10.1016/S0968-0004(96)10052-9.
- Lupas AN, Gruber M. The structure of alpha-helical coiled coils. Advances in Protein Chemistry. 2005;70:37–78. doi: 10.1016/S0065-3233(05)70003-6.
- Merritt EA, Bacon DJ. Raster3D: photorealistic molecular graphics. Methods in Enzymology. 1997;277:505–524. doi: 10.1016/s0076-6879(97)77028-9.
- Murshudov G, Vagin A, Lebedev A, Wilson K, Dodson E. Efficient anisotropic refinement of macromolecular structures using FFT. Acta Crystallographica Section D Biological Crystallography. 1999;55:247–255. doi: 10.1107/S090744499801405X.
- Parry DA. Fifty years of fibrous protein research: a personal retrospective. Journal of Structural Biology. 2014;186:320–334. doi: 10.1016/j.jsb.2013.10.010.
- Perrakis A, Morris R, Lamzin VS. Automated protein model building combined with iterative structure refinement. Nature Structural Biology. 1999;6:458–463. doi: 10.1038/8263.
- Remmert M, Biegert A, Hauser A, Söding J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nature Methods. 2011;9:173–175. doi: 10.1038/nmeth.1818.
- Strelkov SV, Burkhard P. Analysis of alpha-helical coiled coils with the program TWISTER reveals a structural mechanism for stutter compensation. Journal of Structural Biology. 2002;137:54–64. doi: 10.1006/jsbi.2002.4454.
- Szczesny P, Lupas A. Domain annotation of trimeric autotransporter adhesins--daTAA. Bioinformatics. 2008;24:1251–1256. doi: 10.1093/bioinformatics/btn118.
- Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Research. 2005;33:W244–W248. doi: 10.1093/nar/gki408.
- Thomson AR, Wood CW, Burton AJ, Bartlett GJ, Sessions RB, Brady RL, Woolfson DN. Computational design of water-soluble α-helical barrels. Science. 2014;346:485–488. doi: 10.1126/science.1257452.
- Vagin A, Teplyakov A. An approach to multi-copy search in molecular replacement. Acta Crystallographica Section D Biological Crystallography. 2000;56:1622–1624. doi: 10.1107/S0907444900013780.
- Woolfson DN. The design of coiled-coil structures and assemblies. Advances in Protein Chemistry. 2005;70:79–112. doi: 10.1016/S0065-3233(05)70004-8.