Abstract
Bacteria exploit surface proteins to adhere to other bacteria, surfaces and host cells. Such proteins need to project away from the bacterial surface and resist significant mechanical forces. SasG is a protein that forms extended fibrils on the surface of Staphylococcus aureus and promotes host adherence and biofilm formation. Here we show that although monomeric and lacking covalent cross-links, SasG maintains a highly extended conformation in solution. This extension is mediated through obligate folding cooperativity of the intrinsically disordered E domains that couple non-adjacent G5 domains thermodynamically, forming interfaces that are more stable than the domains themselves. Thus, counterintuitively, the elongation of the protein appears to be dependent on the inherent instability of its domains. The remarkable mechanical strength of SasG arises from tandemly arrayed ‘clamp' motifs within the folded domains. Our findings reveal an elegant minimal solution for the assembly of monomeric mechano-resistant tethers of variable length.
Staphylococcal biofilm formation is promoted by the surface protein SasG. Here, the authors characterize the structure and remarkable mechanical strength of the repeat region of SasG, and show how elongation is achieved by obligate folding of the disordered regions within the repeating units.
Bacteria adhere to surfaces, host cells and one another via specialized surface proteins (for example, pili, fimbriae or cellulosomes), which need to project away from the bacterial surface and be resistant to mechanical stress1,2. The mechanisms commonly used by bacterial adhesins to achieve both extension and mechanical strength are multimeric assembly and/or covalent stabilization1,3,4,5,6. SasG from Staphylococcus aureus and accumulation-associated protein from S. epidermidis are homologous proteins that promote host colonization and biofilm formation7,8,9. Staphylococcal biofilms are clinically important functional micro-communities of bacteria10 that cause hospital-acquired infections11 and promote exchange of antibiotic resistance genes12, presenting a significant global challenge.
Both SasG and accumulation-associated protein contain a central region with a variable (strain dependent) number of 128 amino acid repeats. The proteins are covalently attached to the cell wall at their carboxy terminus and are visible, using electron microscopy, as thin, highly extended fibrils on the cell surface (53–160 nm; strain dependent)7,13. Many bacterial surface fibrils, such as pili, are multi-chain assemblies1,4,14,15, which can have inter- and/or intramolecular covalent cross-links to maintain strong, highly extended structures5,16,17. Monomeric proteins that are formed from α-helical bundles can provide length18 but are intrinsically mechanically weak19. Alternatively, monomeric proteins with tandemly arrayed β-sandwich domains (for example, titin and fibronectin) have been shown to be mechanically strong20,21, but these do not form highly elongated structures22,23. The tandem repeats of SasG fold into two structurally related domains: E (50 residues) and G5 (78 residues), formed from single-layer triple-stranded β-sheets, framing a central collagen-like triple-helical region24,25 (Fig. 1). Interestingly, the E domain is disordered both in isolation and when preceded by a folded G5 domain (G5-E), but folded in E-G5 and G5-E-G5 to form elongated structures24. Whether and how such tandemly repeated, relatively unstable, β-sheet domains could underpin an elongated and mechanically strong structure is unclear.
Figure 1. Overview of the SasG system.
(a) Crystal structure of G51-E-G52 (PDB accession: 3TIQ) illustrating the head-to-tail arrangement of G5 (red) and E (blue) domains in SasG. The lengths of G51-E-G52 and individual domains, determined from the crystal structure, are shown. The 128 amino acid sequence repeat, covering a G5 and an E domain, is also indicated. (b) Schematic representation of the domain topology for G5 and E. The domains share the same fold (β-triple helix-β), defined by two single-layer, triple-stranded β-sheets connected via a central collagen-like triple-helical region (also indicated in a).
Here we use a multidisciplinary approach, combining structural, computational and biophysical methods to demonstrate that the repeat region of SasG maintains a monomeric and highly extended conformation in solution, which is resistant to significant mechanical stress. The length is achieved by obligate folding of intrinsically disordered E domains to form stable interfaces that couple non-adjacent G5 domains and promote long-range cooperativity. The mechanical strength of SasG arises from tandemly arrayed ‘clamp' motifs within the folded G5 and E domains, formed from directly hydrogen-bonded β-strands. Our study reveals the molecular basis for the efficient formation of elongated and mechanically resistant bacterial adhesins from a single polypeptide chain.
Results
SasG is an extended monomeric molecule
Only SasG with five or more E-G5 repeats promotes biofilm formation, likely to be due to the need to project above the layer of other cell surface components7. To generate a model for the tandemly arrayed repeats, we first determined the structure of G52-E-G53 (Supplementary Fig. 1a–h, Supplementary Table 1 and Supplementary Discussion), which has the highest sequence identity to other repeats. A model of contiguous repeats comprising all domains from G51 to G57 (generated by iterative superposition) had a length of 71 nm (Supplementary Fig. 1i). To assess the validity of this model experimentally, we produced constructs starting from G51-E-G52 with E-G5 increments up to G51–G57 (Fig. 2a); all were monomeric and monodisperse (Supplementary Fig. 2a and Supplementary Table 2). We determined their shape in solution using small-angle X-ray scattering (SAXS) (Fig. 2b–f, Supplementary Fig. 2b–g and Supplementary Discussion). The maximum particle dimension, Dmax of G51-E-G52 matched the crystal structure (19 nm; Fig. 2c) and the calculated scattering matched the experimental SAXS data (χ2=1.12; Fig. 2e and Supplementary Fig. 2f). The Dmax increased incrementally with stepwise addition of E-G5 units (Fig. 2b,c). The Dmax of 63 nm for G51–G57 is only 8 nm (∼11%) shorter than our (maximally extended) crystal structure-based model (Supplementary Fig. 1i). At the same time, the maximum particle cross-section (Fig. 2d and Supplementary Table 2) remains nearly unchanged and displays only a minor increase from 2.0 nm for G51–G52 to 2.3 nm for G51–G57. SAXS ab initio models (Fig. 2b) suggest that although highly extended on average, the particles may slightly coil or bend, which explains the observed increase in the effective cross-section.
Figure 2. SAXS analysis of SasG particle shape and size.
(a) Schematic of SasG in S. aureus strain NCTC 8325-4; signal sequence (S; cleaved), adhesion domain (A), E-G5 repeats, C-terminal wall (W) region and LPKTG signal for cell wall attachment; expression constructs are illustrated below. (b–f) SAXS studies of G51–G52, G51–G53, G51–G54, G51–G55, G51–G56 and G51–G57 (colour legend defined in c): (b) five ab initio bead models and their filtered average (grey); (c) distance distribution functions P(r) and (d) cross-sectional distance distribution functions Pc(r) (for presentation purposes, P(r) and Pc(r) have been scaled relative to construct length); (e) SAXS profiles (offset on log scale) and calculated scattering fits (black dashed lines) for: the G51–G52 X-ray crystal structure and a representative Gasbor ab initio model of all other constructs (see Supplementary Table 2 for χ2-values). (f) Porod exponents (Log I(s) versus Log s offset on log scale; linear regression analysis (black dashed lines)) fitted to the mid-s data range (white squares with black outline).
To further corroborate the SAXS results, we adapted a high-resolution single-molecule imaging technique (SHRImP26; Fig. 3a and Supplementary Discussion). G51–G57, labelled with fluorophores at both ends, was visualized using total internal reflection fluorescence microscopy (TIRFM) and exhibited end-to-end distances consistent with the SAXS analysis (mean±s.e.=59±5 nm, Fig. 3b and Supplementary Fig. 3a,b). As this is the first reported application of SHRImP to the measurement of an end-to-end distance in an elongated protein, a control experiment using DNA of similar predicted length was performed (Fig. 3b inset and Supplementary Fig. 3a,b inset). All-atom molecular dynamics (MD) simulations of G51-E-G52 show that the individual G5 and E domains are relatively rigid and do not bend (Fig. 3c and Supplementary Fig. 4a). Analysis of the trajectories suggests that the collagen-like triple helix region provides rigidity within the domains (Supplementary Fig. 4b). Flexing of the molecule occurs almost exclusively at the interfaces between domains, which nonetheless maintain an overall linear profile (Fig. 3c and Supplementary Fig. 4), explaining the small difference between the rigid modelling from the crystal structure (Supplementary Fig. 1i) and the solution SAXS-based model (Fig. 2b). Thus, SasG is monomeric and extended in solution on length scales consistent with the extended fibrils observed on the bacterial cell surface.
Figure 3. Conformational flexibility of SasG.
(a) SHRImP-TIRFM experimental method: Fluorophores were attached to the N- and C-termini of G51–G57 via engineered cysteine residues. Fluorescently labelled protein was immobilized on a quartz slide surface using poly-D-lysine and visualized by prism-coupled TIRFM. By stepwise fluorophore photobleaching, individual point spread functions (PSFs) were analysed and interfluorophore distances calculated. (b) Interfluorophore distances for Alexa Fluor 488-labelled G51–G57 and 198 bp DNA (inset) on a poly-D-lysine (100 μg ml−1)-treated quartz surface. Solid lines indicate Gaussian fits to the histograms (mean=54 and 51 nm, respectively). Dashed lines indicate end-to-end distances based on crystallographic data for G51–G57 and B-form DNA (71 nm (Supplementary Fig. 1i) and 67 nm, respectively); the dotted line indicates the SAXS determined Dmax. (c) All-atom MD simulations of G51-E-G52. The N-to-C distance as a function of time is plotted for G51-E-G52 (black) and individual domains within the protein (G51—red, E—green and G52—blue). G51-E-G52 most frequently adopts an extended conformation (with the length of ∼170 Å, as observed in the crystal structure), but rare bent conformations (Supplementary Fig. 4b) are also sampled at room temperature (N-to-C distance fluctuates between 75 and 180 Å). As the individual domains maintain their fully extended crystallographic conformations throughout the simulation, the observed flexibility of G51-E-G52 appears due to bending at the G5-E and E-G5 interfaces (Supplementary Fig. 4).
SasG behaves as series of overlapping cooperative units
To determine the molecular mechanism whereby SasG maintains an extended conformation, we investigated the stability and folding kinetics of G52 in isolation and in tandem with E (E-G52). The unfolding was monitored by both intrinsic tyrosine fluorescence and specific FRET (Förster resonance energy transfer) measurements (Fig. 4). Irrespective of the method used, equilibrium unfolding for both G52 and E-G52 was reversible and characterized by a single transition, indicating two-state, all-or-none unfolding for both constructs (Fig. 4). We did not observe any separate folding or unfolding events when specific FRET probes were used to monitor the folding of only E or G52 in the context of E-G52 (Fig. 4 and Supplementary Fig. 5). E-G52 is more stable than G52 alone, by ∼3.5 kcal mol−1 (Table 1). Although E remains unfolded24 in G51-E, once folded cooperatively with G52, E interacts with and stabilizes the G51 domain such that G51-E-G52 unfolds at even higher concentrations of denaturant and again in an apparently cooperative manner (Fig. 4b and Table 1). Hence, each G5 repeat is stabilized by the folding of the following G5, even though they are not in direct contact. The instability of E is required for the long-range cooperativity between G5 domains; this cooperativity would not exist if the E domains folded independently. The entire repeat region can be considered as a series of overlapping G5-E-G5 cooperative units (Fig. 2a) and thus will form a single cooperative unit.
Figure 4. Equilibrium denaturation curves for SasG domains.
(a) Structure of the two-domain fragment of SasG—E-G52. E and G52 domains are shown in blue and red, respectively. The location of tyrosines and residues used to engineer the FRET pairs are indicated and colour-coded. E and G5 domains each contain a single tyrosine residue (grey) located in the last strand of the C-terminal β-sheet. FRET pair T501CA488-E613CA594 (magenta) results in FRET when both E and G52 domains are folded and was used to monitor (un)folding of E-G52. FRET pair E500W-E532CIAEDANS (cyan) results in FRET when the E domain is folded and was used to monitor (un)folding of E in the context of E-G52. (b) Equilibrium unfolding (closed circles) and refolding (open circles) data for wild-type G51 (green), G51-E (orange), G52 (red), E-G52 (blue) and G51-E-G52 (purple), as well as E-G52-T501CA488 -E613CA594 (magenta) and E-G52-E500W-E532CIAEDANS (cyan). Data for the wild-type proteins were collected by monitoring the change in intrinsic tyrosine fluorescence, whereas the FRET signal was measured as the change in acceptor fluorescence (Alexa Fluor 594 and 1,5-IAEDANS). All data were fit to a two-state model of unfolding (see Table 1 for thermodynamic parameters).
Table 1. Apparent equilibrium parameters for SasG domains.
G51-WT | G51-E-WT | G52-WT | E-G52-WT | E-G52-T501CA488-E613CA594 | E-G52-E500W-E532CIAEDANS | G51-E-G52-WT | |
---|---|---|---|---|---|---|---|
mD−N (kcal mol−1 M−1) | 1.0±0.1 | 1.0±0.1 | 1.0±0.1 | 1.4±0.1 | 1.4±0.1 | 1.4±0.1 | 1.5±0.1 |
[urea]50% (M) | 3.2±0.1 | 3.2±0.1 | 2.8±0.1 | 4.4±0.1 | 4.3±0.1 | 4.3±0.1 | 4.9±0.1 |
![]() |
3.2±0.1 | 3.2±0.1 | 2.8±0.1 | 6.3±0.2 | 6.1±0.3 | 6.1±0.4 | 7.5±0.2 |
The parameters were obtained by fitting the data to a two-state model. The errors were calculated based on the errors of the fits.
Long-range cooperativity is mediated by intrinsic disorder
In all-β tandem repeat proteins such as titin, with immunoglobulin-like domains, the individual domains are significantly more stable than the domains in SasG, but the domains do not fold cooperatively27 and the proteins are not fully elongated in solution22,28. The cooperativity in SasG is mediated by the intrinsic instability of the E repeats and by the relative stability of the interfaces. As explained in more detail in the Supplementary Discussion, we can estimate minimal stabilities conferred by the interfaces. The G51-E and E-G52 interfaces must confer at least 1.5 and 6 kcal mol−1, respectively, compared with the stabilities of the individual domains G51, E and G52 (−3.2, ≥+2.5 and −2.8 kcal mol−1, respectively). Herein lies the explanation for the remarkable elongation of the SasG molecule. In most tandem domain protein systems, the ‘weak link' is the inter-domain interface. In SasG, the interfaces provide more stability than the domains themselves and are maintained even in the most flexed structures in the MD simulations (Supplementary Fig. 4b). It is these interfaces, formed on the folding of E, that mediate the long-range cooperativity.
The role of intrinsic disorder in promoting folding cooperativity29 and allosteric coupling30,31 has been reported for other systems. In multi-domain proteins, where all domains are folded, cooperativity is short range and limited strictly to neighbouring domains; as has been clearly demonstrated for spectrin domains, domain i will not be affected by domain i+2 if domain i+1 is stable and folded32. At the other extreme, repeat proteins, such as proteins with helical ankyrin repeats, behave cooperatively, because the individual units are highly unstable and the linking interfaces are strong29. The SasG system is an intermediate case: the regions of disorder (E domains) are interspersed with stably folded G5 domains, but the high stability of the G5-E and E-G5 interfaces (relative to inherent instability of E) induces cooperativity between non-adjacent G5 domains. Thus, when G52 folds, this induces the folding of E and facilitates formation of the G51-E interface, which imparts stability to G51.
SasG has remarkable mechanical strength
SasG is anchored to the cell wall at the C-terminus and to other cells via its N-terminus9; thus, SasG molecules are likely to experience longitudinal mechanical stress in vivo. We used atomic force microscopy (AFM) to measure the mechanical strength of SasG G51–G57 (Fig. 5a). Approach-retract cycles were performed at various retraction velocities (200, 800, 1,500, 3,000 and 5,000 nm s−1), recording the force as a function of extension (Fig. 5a,b, Supplementary Fig. 6 and Table 2). Contrary to chemical denaturation, under force, SasG domains unfold independently of each other. Irrespective of the pulling speed, two types of unfolding peaks were observed: lower- and higher-force peaks (Supplementary Fig. 6). The lower-force unfolding peaks are associated with a contour length gain (ΔLc) of ∼150 Å (ranges from 145 to 154 Å for different retraction rates; Supplementary Fig. 6 and Table 2) and, as the difference in length between folded and unfolded E is ∼143 Å, these events correspond to unfolding of E domains. The difference in length between folded and fully extended G5 is ∼213 Å; hence, the higher-force peaks associated with ΔLc of ∼220 Å (ranges from 216 to 227 Å for different retraction rates; Supplementary Fig. 6 and Table 2) represent unfolding of G5 domains. SasG domains show remarkable mechanical strength (Fig. 5b and Table 2). The weaker E domain, unstable in isolation, unfolds at forces of 250 pN (at 800 nm s−1 retraction rate), which is higher than needed to unfold the ‘strength-paradigm' 27th immunoglobulin domain from titin (I27; ∼180 pN)33, as well as other ‘strong' proteins34. The mechanical resistance of the G5 domain (420 pN at 800 nm s−1 pulling speed) falls into the upper limit of mechano-stability for any protein domain stabilized solely by non-covalent interactions34,35. Mechanical unfolding of SasG domains was also predicted by MD simulations revealing that the strength originates from tandemly arrayed ‘mechanical clamps'34 (Fig. 5c and Supplementary Fig. 7). Mechanical clamps are structural elements that determine the protein's primary resistance to tensile force; for example, the mechanical clamps of globular β-sandwich domains, such as I27, are formed by directly hydrogen-bonded N- and C-terminal β-strands. In the case of SasG, E and G5 domains contain an N-terminal β-sheet clamp, formed by two anti-parallel β-strands, and a C-terminal β-sheet clamp, composed of two parallel β-strands. Both N- and C-terminal clamps involve long stretches of hydrogen bonds and associated side-chain packing interactions along the β-strands (Fig. 5c and Supplementary Fig. 7). G5 domains have significantly longer N-terminal clamps than E domains, explaining the higher mechanical resistance of G5 relative to E.
Figure 5. Mechanical resistance of SasG domains probed by AFM.
(a) A sample AFM force-extension profile for a SasG construct containing seven G5 and six E domains (G51–G57; grey trace). Schematic of the experimental setup and the correlating scatter plot superposed with a smooth density histogram are also shown. The yellow trace represents the AFM cantilever tip approaching the surface. The six lower-force unfolding peaks (around 250 pN at a retraction rate of 800 nm s−1; labelled and indicated in blue) are associated with a contour length change (ΔLc) of 145 Å. The difference in length between a folded and unfolded E is ∼143 Å and these events thus correspond to unfolding of E domains. The seven higher-force peaks (around 420 pN at a retraction rate of 800 nm s−1; labeled and indicated in red) relate to a ΔLc of 216 Å, and hence represent unfolding of G5 domains (the difference in length between a folded and unfolded G5 is ∼213 Å). The last force peak (indicated in green) represents the detachment of the protein from the tip or the surface. (b) Plot of mode unfolding forces (three independent experiments were performed for each pulling speed; see Supplementary Fig. 6 and Table 2 for details) against retraction velocity for SasG domains (E shown in blue and G5 in red) and I27 (shown in green; data taken from Best et al.70). Error bars represent s.d. E, G5 and I27 show a similar dependence of unfolding force on pulling speed, but SasG domains are mechanically more stable than I27. (c) Topology diagrams of SasG domains (E, G51 and G52) with ‘mechanical clamps' indicated as shaded boxes. Hydrogen bonds are shown as dotted lines. As revealed by MD simulations (Supplementary Fig. 7), the remarkable mechanical strength of SasG domains originates from tandemly arrayed mechanical clamps involving long stretches of hydrogen bonds and associated side-chain packing interactions along the β-strands.
Table 2. AFM parameters for SasG domains.
Retraction rate (nm s−1) |
n |
Force (pN)* |
ΔLc
(Å)* |
|||
---|---|---|---|---|---|---|
E | G5 | E | G5 | E | G5 | |
200 | 126 | 52 | 212±25 | 351±34 | 149±15 | 219±9 |
800 | 409 | 146 | 250±35 | 421±36 | 145±12 | 216±6 |
1,500 | 597 | 229 | 229±27 | 379±28 | 154±10 | 224±8 |
3,000 | 258 | 149 | 254±35 | 397±53 | 152±12 | 224±13 |
5,000 | 547 | 263 | 265±37 | 408±35 | 153±13 | 227±9 |
AFM, atomic force microscopy.
Note that three independent experiments were performed at each pulling speed.
*Values (mode ± s.d.) were obtained from Gaussian fits to the histograms (see Supplementary Fig. 6).
Under mechanical force, the protein is no longer behaving as a cooperative unit. E and G5 domains unfold independently; the E domains act as force ‘buffers' relieving mechanical stress without complete unfolding of the SasG molecule. As the E domains fold rapidly when the G5 domains are folded, this allows for rapid recovery once stress is released. We speculate that SasG can oscillate between a ‘flexible' state under force, with uncoupled G5 domains and E unfolded, and a ‘stiff' state in the absence of force with all domains folded.
Discussion
Nature can form elongated single-chain proteins or strong single-chain proteins. Here we have discovered how nature can form, from apparently insubstantial building blocks, a monomeric structure that is both long and strong. The length is maintained by using intrinsic disorder to form highly cooperative and stable interfaces that mediate communication between non-adjacent, stiff domains. Strength results from the optimal use of small domains that nonetheless contain long, aligned β-strands. Our findings provide a paradigm for efficient formation of a strong, highly elongated protein structure from a single polypeptide chain. Such protein rods of tunable length and requiring no additional covalent stabilization have significant potential for incorporation into novel biomaterials.
Methods
Chemicals
All chemicals were purchased from Sigma-Aldrich, unless otherwise stated.
Protein production and characterization
Purification of short SasG repeats (G51–G52; residue range 420–629) was described previously24. A construct of G52–G53 (residue range 545–757) was expressed and purified similarly. The coding sequences of G51–G53 (residue range 419–757), G51–G54 (419–885), G51–G55 (419–1,013), G51–G56 (419–1,141) and G51–G57 (419–1,269) were synthesized by Genewiz and fused to an N-terminal hexahistidine-3C protease-cleavable tag and a non-cleavable C-terminal Strep-tag II (WSHPQFEK) in a modified pET28 vector, pET-YSBLIC (His6-3C-G51-G5x–Strep). A second G51–G57 construct incorporated two cysteine residues at the C-terminus of the Strep affinity tag for AFM force unfolding studies generating His6-3C-G51-G57–Strep-CysCys using primer pair:
5′-TCCAGGGACCAGCAATGGCACCTAAGACCATCACC-3′
5′-TGAGGAGAAGGCGCGTTAGCAGCATTTTTCAAATTGAGGAT GAGACCAGGTCTCCGGACCATACTC-3′
A third G51–G57 construct was engineered into a XhoI/NdeI-digested pET26b(+) vector (Novagen), to generate a non-cleavable C-terminal His tag; cysteine residues replaced E418 and T1269 for SHRImP-TIRFM studies (Cys-G51-G57–Cys-His6) with primers:
5′-AAGAAGGAGATATACATATGGGTGGCTGCGCACCTAAGACCATCAC-3′
5′-GGTGGTGGTGCTCGAGGCCGCTGCTGCCGCACTCCGGACCATACTCGG-3′
Liquid cultures of LB supplemented with 50 μg ml−1 kanamycin were inoculated with transformed Escherichia coli BL21(DE3) cells (Novagen) and grown at 37 °C to OD600 0.5–0.6; expression was induced with addition of 1 mM isopropyl-β-D-thiogalactopyranoside, followed by incubation at 20 °C for 20 h. Cells were resuspended in 500 mM NaCl, 20 mM imidazole, 20 mM Tris HCl, pH 7.5, supplemented with EDTA-free protease inhibitor cocktail (Roche) and lysed by sonication. Soluble extract was purified by nickel affinity chromatography. Constructs with 3C protease sites and Strep tags were digested with HRV-3C protease and purified by Strep-Trap affinity chromatography (GE Healthcare). Eluted protein was separated by size-exclusion chromatography (SEC) on a Superdex 200 26/60 column (GE Healthcare) in 200 mM NaCl, 1 mM EDTA and 20 mM Tris HCl, pH 7.5. His6-3C-G51-G57–Strep-CysCys and Cys-G51-G57–Cys-His6 were also purified by SEC with the addition of 2.5 mM dithiothreitol (DTT). Molecular masses were characterized by electrospray ionization mass spectrometry. SAXS samples were dialysed against SEC buffer to generate matched solvent for buffer scattering subtraction. Protein concentrations were estimated by A280. SEC-multi-angle laser light scattering–quasi-elastic light scattering analysis of samples in the range 1.5–2 mg ml−1 was performed using a Superdex 200 10/300 GL column (GE healthcare) in line with a Dawn HELEOS-II 18-angle light-scattering detector, Optilab rEX refractive index monitor and quasi-elastic light scattering detector (Wyatt).
Crystallography
G52–G53 was concentrated to 26.2 mg ml−1 and screened for crystallization at 18 °C. Crystals grew in 3 days in JCSG-plus (Molecular Dimensions) condition D2 (0.2 M MgCl2, 0.1 M Na HEPES pH 7.5, 30% v/v PEG 400). Crystals were vitrified in liquid N2 and two data sets collected from a single crystal by offsetting the crystal by κ=40° using a mini-Kappa goniometer at Diamond light source beamline I04, wavelength 0.9795 Å at 100 K. Data were integrated using XDS36, merged and scaled using Aimless37, and molecular replacement performed using PhaserMR38. E1G52 (PDB accession: 3TIP) was separated into two search models: E1 (residue range 502–548) and G52 (residue range 549–629). The model was partially built using Buccaneer39, completed with Coot40 and refined with Refmac5 (ver. 5.8.0073)41. TLSMD server was used to define translation/libration/screw groups (residue ranges 543–588, 589–677 and 678–757)42. Aimless, PhaserMR, Buccaneer and Refmac5 were implemented through the CCP4 interface43. The final model was evaluated with MolProbity44 and the Ramachandran plot showed that all residues have favourable ϕ/ψ geometry. G51–G52 (PDB accession: 3TIQ) superposition was performed using Gesamt45. The G51–G57 particle was modelled by superposition of G51–G52 and G52–G53, and iterative superposition of G52–G53 by secondary structure matching46.
Small-angle X-ray scattering
SAXS intensity data, I(s) versus s, (s=4πsinθ/λ, where 2θ is the scattering angle, 0.04–4.4 nm−1) were collected from protein samples and matched solvent blanks at the EMBL-P12 beamline at PETRAIII (DESY, Hamburg) employing automated data acquisition and radial averaging protocols47. Guinier (ln I(s) versus s2) and modified Guinier analysis for extracting cross-sectional terms (ln (I(s)s) versus s2) were performed using Primus48. Indirect inverse Fourier transformations of the data were calculated using GNOM49, to generate probable real-space atom-pair distance distributions (P(r) versus r) and distance distributions of cross-sections (Pc(r)). Porod exponents were calculated from the slope of linear fits to the decay in scattering intensity of Log I(s) versus Log s through the mid-s regions of the profiles50,51. The molecular masses (MMs) were evaluated from the forward-scattering intensity I(0) placed on an absolute scale (cm−1) using the scattering from water52, combined with calculated values of concentration, partial specific volume and scattering-length density contrasts53 derived from amino acid sequence and atomic compositions. The MM was additionally calculated relative to the scattering calibrated using a BSA standard54 and SAXS.MoW was used to estimate the concentration-independent MM based on apparent volumes55. Ab initio shape restorations using a dummy residue algorithm Gasbor56 were performed five times against each SAXS data set, and aligned and averaged using Damaver57 to generate a consensus three-dimensional shape. The model fit of the X-ray crystal structure against the SAXS data was calculated using Crysol58. Rigid body modelling was performed using SASREF59.
Production of fluorescently labelled protein
Cysteine residues were engineered into the N- and C termini of G51–G57 where quenching by nearby residues is least probable. The protein (50 μM) was incubated with 20 × molar excess of Alexa Fluor 488 C5 maleimide (Life Technologies) in 20 mM MOPS, 200 mM NaCl, 0.1 mM tris(2-carboxyethyl)phosphine, pH 7.0, at 25 °C for 2 h. The reaction was quenched by the addition of DTT to a final concentration of 5 mM. The protein was then dialysed into storage buffer (20 mM KH2PO4/K2HPO4, 200 mM NaCl, 1 mM DTT, pH 7.5) and purified by SEC on an S200 10/300 column (Amersham), to remove all remaining free dye before storage at −80 °C.
Production of fluorescently labelled DNA
A 215-bp DNA fragment was PCR amplified from bacteriophage T7 DNA using fluorescently labelled primers
5′-CCCCAAGCTTCATCTZGTCAGATGAGACTACCCCTCTGAA-3′
and
5′-CCZCAAAGTCTGTACTTTTAGTAGGTCTTATAGTCC-3′
where ‘Z' indicates an internal Alexa Fluor 488-dT modification, such that the fluorophores are separated by 198 bp. An inter-fluorophore distance of 62–67 nm was predicted using the DNA curvature analysis tool ( http://www.lfd.uci.edu/~gohlke/dnacurve/) and model.it60, which takes into account sequence-dependent static bending.
Sample preparation for TIRF microscopy
Poly-D-lysine-coated quartz slides were prepared by incubating the slides at 20–22 °C for 30 min in 20–500 μg ml−1 poly-D-lysine hydrobromide (30–70 kDa, Sigma), 10 mM MOPS, pH 7.0. The slides were then rinsed by dipping in deionized water and dried with filtered air. Alexa Fluor 488-labelled protein and DNA samples were prepared by diluting concentrated stocks into imaging buffer (10 mM HEPES, 10 mM NaCl, 5 μM β-mercaptoethanol, 1 mM Trolox, pH 7.0) with streptavidin-derivatized quantum dots emitting at 655 nm (100 pM, Quantum Dot Corporation). Five-micrometre silica beads (∼0.1 mg ml−1) were included in all samples, to specify the height of flow cells and to prevent coupling of excitation light into the coverslip. Flow cells were constructed by applying sample (25 μl) to the slide, covering with coverslip (No. 1, 22 mm × 64 mm, Menzel-Gläser) and sealing two opposite short sides with nail varnish. Ten minutes were allowed for the protein to immobilize. Any unbound material was subsequently washed out with two volumes of imaging buffer and the flow cell sealed with nail varnish.
TIRF microscopy
Prism-coupled TIRFM was performed using a custom-modified inverted IM35 microscope (Carl Zeiss AG) at 20–22 °C. Fluorophores were excited with 488 and 561 nm lasers (Coherent) operated at 10 mW and 30–50 mW output, respectively. A 488-nm zero-order quarter-wave plate (Edmund Optics) was installed to circularly polarize the incident laser light, thus removing any dye orientation-dependent fluorescence excitation. The quantum dots were used as an image-focusing aid (with 561 nm laser illumination only), thus minimizing Alexa Fluor 488 dye photobleaching before video acquisition. Fluorescence emission was captured through a Plan-Apochromat × 100/numerical aperture 1.4 oil-immersion objective (Carl Zeiss AG). A dual-view image splitter (OptoSplit II, Cairn Research) with appropriate emission dichroic (580 nm long pass, Zeiss) and bandpass filters for the Alexa Fluor 488 dye (ET525/50M, Chroma) and quantum dots (ET605/70M, Chroma) was used to split the image into two fluorescence emission channels. Video data were collected using an Evolve 512 electron-multiplying CCD (charge-coupled device) camera (Photometrics), cooled to −70 °C and operated through μManager61 with 500 ms exposure (2 fps). Pixel size was equivalent to 157 nm in the magnified image as determined using a USAF calibration target (Edmund Optics).
Detection and localization of single fluorophores
Fluorescent spots were detected using the multi-step test (using standardized full width at half maximum and intensity threshold values) in GMimPro62. Fluorophores that photobleached in two steps were manually selected and their x,y coordinates exported. Spot intensities for individual particles were then extracted in ImageJ63 as an image stack for a 10 × 10 pixel2 area. Subsequently in MATLAB (MathWorks, Cambridge, UK), the intensity profile of each particle was processed using a Chung-Kennedy filter64 and a derivative-based step detection method was implemented to identify the three intensity states of the particle: I1, two fluorophores fluorescing; I2, one fluorescing; I3, none fluorescing. Mean images of the seven frames before and after the first step were used to calculate I1 and I2, respectively. Fluorophore locations were determined by fitting a two-dimensional Gaussian function (with integration over each pixel) to I2 and the difference between I1 and I2:
![]() |
where A is amplitude with centre (x0, y0), σx and σy describe the widths, θ specifies rotation of the function about the centre and B accounts for background intensity. End-to-end distances were calculated for pairs of fluorophores with eccentricity <0.04 and presented as histograms. A bin size of 25 nm was chosen to satisfy the Freedman–Diaconis rule65. A Gaussian distribution of the form
![]() |
was fitted to each histogram.
MD simulations
Simulations were performed using a united-atom force field (CHARMM19) and implicit solvent model (FACTS). All simulations were performed at 300 K, with Langevin dynamics using the leapfrog integrator, a timestep of 2 fs and a friction coefficient of 3 ps−1, and run using CHARMM. The CHARMM19 FACTS parameters used were: dielectric constant=2.0, nonpolar surface tension coefficient=0.015 kcal mol−1 Å−2. Within the FACTS implicit solvent, the influence of salt has been taken into account on the Debye-Hückel level66. The friction coefficient of 3 ps−1 corresponds to a viscosity about 20 times smaller than that of water, thus enhancing sampling efficiency by a corresponding factor, but it is sufficiently large to ensure that dynamics is diffusive and mechanisms observed independent on the friction coefficient67. Trajectory frames (and associated analysis parameters) were recorded every 500 steps. Simulation for G51-E-G52 was started from the crystal structure (PDB accession: 3TIQ) and continued for 930 ns. Forced unfolding was simulated by attaching an ideal spring to the N and C atoms of the two termini and retracting them at constant speed. Simulations were performed in conditions corresponding to monovalent salt concentration of both 0 and 200 mM (experiments were performed in solution containing between 140 and 200 mM NaCl). No significant differences were observed between low (0 mM) and higher salt (200 mM) conditions.
FRET labelling
Tryptophan (E500W) and cysteine (T501C, E532C and E613C) residues were introduced by site-directed mutagenesis using the following primers:
E500W
5′–GTTCTGTTCCAGGGGCCCTGGACGATCGCGCCGGGTC-3′
5′-GACCCGGCGCGATCGTCCAGGGCCCCTGGAACAGAAC-3′
T501C
5′-GTTCCAGGGGCCCGAATGTATCGCGCCGGGTCACC-3′
5′-GGTGACCCGGCGCGATACATTCGGGCCCCTGGAAC-3′
E532C
5′-CCGGGTATCAAAAACCCGTGTACCGGTGACGTTGTTCGTCC-3′
5′–GGACGAACAACGTCACCGGTACACGGGTTTTTGATACCCGG-3′
E613C
5′-CATCTCTAAAGGTGAATCTAAAGAATGTATCACCAAAGACCCGATCAACGAAC-3′
5′-GTTCGTTGATCGGGTCTTTGGTGATACATTCTTTAGATTCACCTTTAGAGATG-3′
Labelling was carried out using thiol-reactive probes, according to manufacturer's procedures. Labelling of E-G52-E500W-E532C was performed by mixing reduced protein with a 20-fold molar excess of 5-((((2-iodoacetyl)amino)ethyl)amino)naphthalene-1-sulfonic acid (1,5-IAEDANS; Life Technologies; acceptor). The labelling reaction proceeded for ∼14 h at 4 °C. Unreacted dye was removed by gel filtration (HiTrap Desalting column; GE Healthcare) and the degree of labelling was estimated as ∼100%. Labelling of E-G52-T501C-E613C was carried out using Alexa Fluor 488 (donor) and Alexa Fluor 594 (acceptor) maleimides (Life Technologies). The dyes were added to reduced protein simultaneously in equimolar ratios and incubated at 4 °C for ∼14 h. Unreacted dyes were removed by gel filtration (HiTrap Desalting column; GE Healthcare) and the hetero-doubly labelled protein (E-G52-T501CA488/594-E613CA488/594) was isolated on a HiTrapQ HP column (GE Healthcare).
Equilibrium studies
The free energy of unfolding of the proteins was determined by chemical denaturation using urea. Fluorescence measurements were carried out on a Perkin Elmer LS55 fluorescence spectrometer under standard conditions (PBS, 25 °C). The samples were equilibrated for at least 2 h before data collection. In the intrinsic tyrosine fluorescence studies, the protein concentration was 5 μM, the excitation wavelength used was 276 nm and the emission was followed at 305 nm. In the case of E-G52-E500W-E532CIAEDANS, the protein concentration was 500 nM, the excitation wavelength was 280 nm (tryptophan excitation) and the emission was followed at 493 nm (IAEDANS fluorescence). The data for E-G52-T501CA488-E613CA594 were collected at a protein concentration of 50 nM, the excitation wavelength was 495 nm (Alexa Fluor 495 excitation) and the emission was followed at 612 nm (Alexa Fluor 594 fluorescence). The data were fit to a standard two-state equation.
Kinetic studies
The kinetic experiments monitoring fluorescence change were carried out using an Applied Photophysics SX.20 stopped-flow fluorimeter maintained at a temperature of 25 °C. The final protein concentration and the excitation wavelength used were the same as described for equilibrium studies. No cut-off filter was used in the experiments for wild-type proteins monitoring the change in tyrosine fluorescence. Cut-off filters (435 and 590 nm) were used to collect the data for E-G52-E500W-E532CIAEDANS and E-G52-T501CA488/594-E613CA488/594, respectively. At least 20 traces were averaged for a typical measurement at a given urea concentration. Kinetic traces were analyzed using Kaleidagraph 4.1.3 (Synergy Software). All rate constants were independent of protein concentration under the experimental conditions. Chevron plots (the dependence of the logarithm of the observed rate constant on the concentration of urea) were fit either to a standard two-state model or a sequential transition state model.
Atomic force microscopy
AFM measurements were performed using a G51–G57 construct with two cysteine residues incorporated at the C terminus, to allow attachment to the gold-covered surfaces via gold-thiol covalent attachment. All AFM measurements were carried out in 20 mM Tris (pH 7.5), 150 mM NaCl, at 25 °C, using an Asylum Research MFP-3D microscope. Silicon nitride cantilevers with nominal spring constant of 30 pN nm−1 (Bruker MLCT) were used and calibrated using the thermal method68. One hundred-microlitre protein solution (250 μg ml−1 in AFM buffer) was adsorbed onto a gold surface and the AFM cantilever tip was used to pick it up by nonspecific adhesion, and then retracted at a constant speed (200, 800, 1,500, 3,000 and 5,000 nm s−1), measuring the force exerted by the protein in the process. Three independent experiments (different cantilevers and surfaces) were performed for each pulling speed. The unfolding force for all events from acceptable traces were measured and their force-extension profiles fitted to the worm-like chain model69 (with the persistence length fixed to 400 pm) using the IGOR Pro 6 software (WaveMetrics) to obtain ΔLc values. The data from triplicates were pooled and the force and ΔLc probability histograms were generated for each retraction rate. The modal force and ΔLc values were calculated from Gaussian fits to the histograms using Mathematica 10 (Wolfram Research).
Additional information
Accession codes: The atomic coordinates have been deposited in the Protein Data Bank under accession code number 4WVE. SAXS data and models have been deposited in the Small Angle Scattering Biological Data Bank www.sasbdb.org with codes SASDA37, SASDA47, SASDA57, SASDA67, SASDA77 and SASDA87.
How to cite this article: Gruszka, D. T. et al. Cooperative folding of intrinsically disordered domains drives assembly of a strong elongated protein. Nat. Commun. 6:7271 doi: 10.1038/ncomms8271 (2015).
Supplementary Material
Supplementary Figures 1-7, Supplementary Tables 1-2, Supplementary Discussion and Supplementary References
Acknowledgments
This research was supported by Biotechnology and Biological Research Council Grants BB/J006459/1 (D.T.G. and J.C.), BB/J005029/1 (F.W. and J.R.P), BB/G019452/1 (O.E.F. and D.J.B.) and BB/G020671/1 (C.G.B. and J.R.P.). H.K.H.F. is supported by a studentship from a Wellcome Trust 4-year PhD programme (WT095024MA). C.M.J. is supported by the German Federal Ministry of Education and Research (BMBF), grant BIOSCAT (contract number 05K12YE1). J.C. is a Wellcome Trust Senior Research Fellow (WT/095195). J.R.P. holds a British Heart Foundation Senior Basic Science Fellowship (FS/12/36/29588). We acknowledge the use of EMBL SAXS beamline P12 at Petra-3 (DESY, Hamburg, Germany) and Maxim Petoukhov (EMBL) for providing a modified version of SASREF. The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007–2013) under BioStruct-X (grant agreement number 283570). We thank Diamond Light Source for beamtime (proposal mx-7864) and Johan Turkenburg and Sam Hart for assistance with crystal testing and data collection.
Footnotes
Author contributions D.T.G., F.W., J.R.P. and J.C. designed the investigation. F.W. produced proteins for SAXS, single-molecule fluorescence imaging and AFM, and characterized these samples. F.W., C.B. and C.M.J. collected SAXS data. F.W. and C.M.J. processed and analysed SAXS data. C.M.J. performed SAXS ab initio and rigid body modelling. H.K.H.F. and C.G.B. performed the single-molecule fluorescence imaging experiments. D.T.G. produced samples and performed folding studies. D.T.G. and O.E.F. collected AFM data, which were analysed by D.T.G., O.E.F. and D.J.B. E.P. performed MD simulations. E.P. and D.T.G. analysed MD simulations. D.T.G., F.W., J.R.P. and J.C. wrote the manuscript. All other authors discussed and commented on the manuscript.
References
- Lillington J., Geibel S. & Waksman G. Biogenesis and adhesion of type 1 and P pili. Biochim. Biophys. Acta 1840, 2783–2793 (2014). [DOI] [PubMed] [Google Scholar]
- Schoeler C. et al. Ultrastable cellulosome-adhesion complex tightens under load. Nat. Commun. 5, 5635 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kwon H., Squire C. J., Young P. G. & Baker E. N. Autocatalytically generated Thr-Gln ester bond cross-links stabilize the repetitive Ig-domain shaft of a bacterial cell surface adhesin. Proc. Natl Acad. Sci. USA 111, 1367–1372 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hilleringmann M. et al. Molecular architecture of Streptococcus pneumoniae TIGR4 pili. EMBO J. 28, 3921–3930 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang H. J., Coulibaly F., Clow F., Proft T. & Baker E. N. Stabilizing isopeptide bonds revealed in Gram-positive bacterial pilus structure. Science 318, 1625–1628 (2007). [DOI] [PubMed] [Google Scholar]
- Kang H. J. & Baker E. N. Intramolecular isopeptide bonds: protein crosslinks built for stress? Trends Biochem. Sci. 36, 229–237 (2011). [DOI] [PubMed] [Google Scholar]
- Corrigan R. M., Rigby D., Handley P. & Foster T. J. The role of Staphylococcus aureus surface protein SasG in adherence and biofilm formation. Microbiology 153, 2435–2446 (2007). [DOI] [PubMed] [Google Scholar]
- Rohde H. et al. Induction of Staphylococcus epidermidis biofilm formation via proteolytic processing of the accumulation-associated protein by staphylococcal and host proteases. Mol. Microbiol. 55, 1883–1895 (2005). [DOI] [PubMed] [Google Scholar]
- Roche F. M., Meehan M. & Foster T. J. The Staphylococcus aureus surface protein SasG and its homologues promote bacterial adherence to human desquamated nasal epithelial cells. Microbiology 149, 2759–2767 (2003). [DOI] [PubMed] [Google Scholar]
- Costerton J. W., Stewart P. S. & Greenberg E. P. Bacterial biofilms: a common cause of persistent infections. Science 284, 1318–1322 (1999). [DOI] [PubMed] [Google Scholar]
- Hall-Stoodley L., Costerton J. W. & Stoodley P. Bacterial biofilms: from the natural environment to infectious diseases. Nat. Rev. Microbiol. 2, 95–108 (2004). [DOI] [PubMed] [Google Scholar]
- Ghigo J. M. Natural conjugative plasmids induce bacterial biofilm development. Nature 412, 442–445 (2001). [DOI] [PubMed] [Google Scholar]
- Banner M. A. et al. Localized tufts of fibrils on Staphylococcus epidermidis NCTC 11047 are comprised of the accumulation-associated protein. J. Bacteriol. 189, 2793–2804 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ton-That H., Marraffini L. A. & Schneewind O. Sortases and pilin elements involved in pilus assembly of Corynebacterium diphtheriae. Mol. Microbiol. 53, 251–261 (2004). [DOI] [PubMed] [Google Scholar]
- Craig L., Pique M. E. & Tainer J. A. Type IV pilus structure and bacterial pathogenicity. Nat. Rev. Microbiol. 2, 363–378 (2004). [DOI] [PubMed] [Google Scholar]
- Kang H. J., Paterson N. G., Gaspar A. H., Ton-That H. & Baker E. N. The Corynebacterium diphtheriae shaft pilin SpaA is built of tandem Ig-like modules with stabilizing isopeptide and disulfide bonds. Proc. Natl Acad. Sci. USA 106, 16967–16971 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang H. J. et al. A slow-forming isopeptide bond in the structure of the major pilin SpaD from Corynebacterium diphtheriae has implications for pilus assembly. Acta Crystallogr. D 70, 1190–1201 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sakamoto S. et al. Electron microscopy and computational studies of Ebh, a giant cell-wall-associated protein from Staphylococcus aureus. Biochem. Biophys. Res. Commun. 376, 261–266 (2008). [DOI] [PubMed] [Google Scholar]
- Rief M., Pascual J., Saraste M. & Gaub H. E. Single molecule force spectroscopy of spectrin repeats: low unfolding forces in helix bundles. J. Mol. Biol. 286, 553–561 (1999). [DOI] [PubMed] [Google Scholar]
- Rief M., Gautel M., Oesterhelt F., Fernandez J. M. & Gaub H. E. Reversible unfolding of individual titin immunoglobulin domains by AFM. Science 276, 1109–1112 (1997). [DOI] [PubMed] [Google Scholar]
- Oberhauser A. F., Badilla-Fernandez C., Carrion-Vazquez M. & Fernandez J. M. The mechanical hierarchies of fibronectin observed with single-molecule AFM. J. Mol. Biol. 319, 433–447 (2002). [DOI] [PubMed] [Google Scholar]
- Wang K., Ramirez-Mitchell R. & Palter D. Titin is an extraordinarily long, flexible, and slender myofibrillar protein. Proc. Natl Acad. Sci. USA 81, 3685–3689 (1984). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erickson H. P., Carrell N. & McDonagh J. Fibronectin molecule visualized in electron microscopy: a long, thin, flexible strand. J. Cell Biol. 91, 673–678 (1981). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gruszka D. T. et al. Staphylococcal biofilm-forming protein has a contiguous rod-like structure. Proc. Natl Acad. Sci. USA 109, E1011–E1018 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Conrady D. G., Wilson J. J. & Herr A. B. Structural basis for Zn2+-dependent intercellular adhesion in staphylococcal biofilms. Proc. Natl Acad. Sci. USA 110, E202–E211 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gordon M. P., Ha T. & Selvin P. R. Single-molecule high-resolution imaging with photobleaching. Proc. Natl Acad. Sci. USA 101, 6462–6465 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scott K. A., Steward A., Fowler S. B. & Clarke J. Titin: a multidomain protein that behaves as the sum of its parts. J. Mol. Biol. 315, 819–829 (2002). [DOI] [PubMed] [Google Scholar]
- von Castelmur E. et al. A regular pattern of Ig super-motifs defines segmental flexibility as the elastic mechanism of the titin chain. Proc. Natl Acad. Sci. USA 105, 1186–1191 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aksel T., Majumdar A. & Barrick D. The contribution of entropy, enthalpy, and hydrophobic desolvation to cooperativity in repeat-protein folding. Structure 19, 349–360 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hilser V. J. & Thompson E. B. Intrinsic disorder as a mechanism to optimize allosteric coupling in proteins. Proc. Natl Acad. Sci. USA 104, 8311–8315 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simon A. J., Vallee-Belisle A., Ricci F. & Plaxco K. W. Intrinsic disorder as a generalizable strategy for the rational design of highly responsive, allosterically cooperative receptors. Proc. Natl Acad. Sci. USA 111, 15048–15053 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Batey S., Nickson A. A. & Clarke J. Studying the folding of multidomain proteins. HFSP J. 2, 365–377 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carrion-Vazquez M. et al. Mechanical and chemical unfolding of a single protein: a comparison. Proc. Natl Acad. Sci. USA 96, 3694–3699 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann T., Tych K. M., Hughes M. L., Brockwell D. J. & Dougan L. Towards design principles for determining the mechanical stability of proteins. Phys. Chem. Chem. Phys. 15, 15767–15780 (2013). [DOI] [PubMed] [Google Scholar]
- Valbuena A. et al. On the remarkable mechanostability of scaffoldins and the mechanical clamp motif. Proc. Natl Acad. Sci. USA 106, 13791–13796 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kabsch W. XDS. Acta Crystallogr. D 66, 125–132 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evans P. R. An introduction to data reduction: space-group determination, scaling and intensity statistics. Acta Crystallogr. D 67, 282–292 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCoy A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cowtan K. The Buccaneer software for automated model building. 1. Tracing protein chains. Acta Crystallogr. D 62, 1002–1011 (2006). [DOI] [PubMed] [Google Scholar]
- Emsley P., Lohkamp B., Scott W. G. & Cowtan K. Features and development of Coot. Acta Crystallogr. D 66, 486–501 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murshudov G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D 67, 355–367 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Painter J. & Merritt E. A. TLSMD web server for the generation of multi-group TLS models. J. Appl. Crystallogr. 39, 109–111 (2006). [Google Scholar]
- Bailey S. The CCP4 suite: programs for protein crystallography. Acta Crystallogr. D 50, 760–763 (1994). [DOI] [PubMed] [Google Scholar]
- Lovell S. C. et al. Structure validation by Calpha geometry: phi,psi and Cbeta deviation. Proteins 50, 437–450 (2003). [DOI] [PubMed] [Google Scholar]
- Krissinel E. Enhanced fold recognition using efficient short fragment clustering. J. Mol. Biochem. 1, 76–85 (2012). [PMC free article] [PubMed] [Google Scholar]
- Krissinel E. & Henrick K. Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. D 60, 2256–2268 (2004). [DOI] [PubMed] [Google Scholar]
- Franke D., Kikhney A. G. & Svergun D. I. Automated acquisition and analysis of small angle X-ray scattering data. Nucl. Instr. Meth. Phys. Res. A 689, 52–59 (2012). [Google Scholar]
- Konarev P. V., Volkov V. V., Sokolova A. V., Koch M. H. J. & Svergun D. I. PRIMUS: a Windows PC-based system for small-angle scattering data analysis. J. Appl. Crystallogr. 36, 1277–1282 (2003). [Google Scholar]
- Svergun D. I. Determination of the regularization parameter in indirect-transform methods using perceptual criteria. J. Appl. Crystallogr. 25, 495–503 (1992). [Google Scholar]
- Schmidt P. W. Small-angle scattering studies of disordered, porous and fractal systems. J. Appl. Crystallogr. 24, 414–435 (1991). [Google Scholar]
- Beaucage G. Small-angle scattering from polymeric mass fractals of arbitrary mass-fractal dimension. J. Appl. Crystallogr. 29, 134–146 (1996). [Google Scholar]
- Orthaber D., Bergmann A. & Glatter O. SAXS experiments on absolute scale with Kratky systems using water as a secondary standard. J. Appl. Crystallogr. 33, 218–225 (2000). [Google Scholar]
- Whitten A. E., Cai S. Z. & Trewhella J. MULCh: modules for the analysis of small-angle neutron contrast variation data from biomolecular assemblies. J. Appl. Crystallogr. 41, 222–226 (2008). [Google Scholar]
- Mylonas E. & Svergun D. I. Accuracy of molecular mass determination of proteins in solution by small-angle X-ray scattering. J. Appl. Crystallogr. 40, S245–S249 (2007). [Google Scholar]
- Fischer H., Neto M. D., Napolitano H. B., Polikarpov I. & Craievich A. F. Determination of the molecular weight of proteins in solution from a single small-angle X-ray scattering measurement on a relative scale. J. Appl. Crystallogr. 43, 101–109 (2010). [Google Scholar]
- Svergun D. I., Petoukhov M. V. & Koch M. H. J. Determination of domain structure of proteins from X-ray solution scattering. Biophys. J. 80, 2946–2953 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Volkov V. V. & Svergun D. I. Uniqueness of ab initio shape determination in small-angle scattering. J. Appl. Crystallogr. 36, 860–864 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Svergun D., Barberato C. & Koch M. H. J. CRYSOL - a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates. J. Appl. Crystallogr. 28, 768–773 (1995). [Google Scholar]
- Petoukhov M. V. & Svergun D. I. Global rigid body modeling of macromolecular complexes against small-angle scattering data. Biophys. J. 89, 1237–1250 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Munteanu M. G., Vlahovicek K., Parthasarathy S., Simon I. & Pongor S. Rod models of DNA: sequence-dependent anisotropic elastic modelling of local bending phenomena. Trends Biochem. Sci. 23, 341–347 (1998). [DOI] [PubMed] [Google Scholar]
- Edelstein A., Amodaj N., Hoover K., Vale R. & Stuurman N. Computer control of microscopes using microManager. Curr. Protoc. Mol. Biol. 92, Chapter 14, Unit 14.20 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mashanov G. I. & Molloy J. E. Automatic detection of single fluorophores in live cells. Biophys. J. 92, 2199–2211 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schneider C. A., Rasband W. S. & Eliceiri K. W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chung S. H. & Kennedy R. A. Forward-backward non-linear filtering technique for extracting small biological signals from noise. J. Neurosci. Methods 40, 71–86 (1991). [DOI] [PubMed] [Google Scholar]
- Freedman D. & Diaconis P. On the histogram as a density estimator: L2 theory. Z. Wahrscheinlichkeit. 57, 453–476 (1981). [Google Scholar]
- Srinivasan J., Trevathan M. W., Beroza P. & Case D. A. Application of a pairwise generalized born model to proteins and nucleic acids: inclusion of salt effects. Theor. Chem. Acc. 101, 426–434 (1999). [Google Scholar]
- Best R. B. & Hummer G. Diffusive model of protein folding dynamics with Kramers turnover in rate. Phys. Rev. Lett. 96, 228104 (2006). [DOI] [PubMed] [Google Scholar]
- Hutter J. L. & Bechhoefer J. Calibration of atomic-force microscope tips. Rev. Sci. Instrum. 64, 1868–1873 (1993). [Google Scholar]
- Marko J. F. & Siggia E. D. Stretching DNA. Macromolecules 28, 8759–8770 (1995). [Google Scholar]
- Best R. B. & Clarke J. What can atomic force microscopy tell us about protein folding? Chem. Commun. 3, 183–192 (2002). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary Figures 1-7, Supplementary Tables 1-2, Supplementary Discussion and Supplementary References