Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2009 Jul 1;284(37):25065–25076. doi: 10.1074/jbc.M109.007054

The Functional Curli Amyloid Is Not Based on In-register Parallel β-Sheet Structure*

Frank Shewmaker , Ryan P McGlinchey , Kent R Thurber §, Peter McPhie , Fred Dyda , Robert Tycko §, Reed B Wickner ‡,1
PMCID: PMC2757210  PMID: 19574225

Abstract

The extracellular curli proteins of Enterobacteriaceae form fibrous structures that are involved in biofilm formation and adhesion to host cells. These curli fibrils are considered a functional amyloid because they are not a consequence of misfolding, but they have many of the properties of protein amyloid. We confirm that fibrils formed by CsgA and CsgB, the primary curli proteins of Escherichia coli, possess many of the hallmarks typical of amyloid. Moreover we demonstrate that curli fibrils possess the cross-β structure that distinguishes protein amyloid. However, solid state NMR experiments indicate that curli structure is not based on an in-register parallel β-sheet architecture, which is common to many human disease-associated amyloids and the yeast prion amyloids. Solid state NMR and electron microscopy data are consistent with a β-helix-like structure but are not sufficient to establish such a structure definitively.


Interest in amyloid is largely because of its association with many late onset human diseases, including Alzheimer disease (Aβ),2 Parkinson disease (α-synuclein), type II diabetes (amylin), and the transmissible spongiform encephalopathies (PrP). In each case a particular endogenous protein becomes incorporated into large aggregates known as amyloid, which was originally defined by pathologists as a tissue deposit staining like starch (1). However, the term amyloid has come to mean a filamentous protein aggregate with cross-β secondary structure (cross-β means that the β-strands that form β-sheets in the amyloid fibrils run approximately perpendicular to the long axis of the fibril with interstrand hydrogen bonds that run approximately parallel to the long axis) and protease resistance. Morphologically amyloid fibrils may vary in length from tens of nanometers to micrometers and have diameters in the range of 3–10 nm, although lateral association can produce much larger apparent diameters.

Proteins from a variety of organisms can form amyloid both in vitro and in vivo, and the propensity to form amyloid may be a common property of many proteins (2). In addition to disease-associated amyloids, there are several confirmed cases of functional amyloid (for a review, see Ref. 3). For example, hydrophobins are amyloid-like proteins that coat the surface of fungal cells, and amyloid fibrils coating fish eggs protect them from dehydration (4, 5). The [Het-s] prion of Podospora anserina is involved in heterokaryon incompatibility, a recognition of non-self reaction believed to be important as a defense against fungal virus infection (6).

Curli are extracellular filamentous structures of Enterobacteriaceae (7) that are integral to biofilm formation and are the major protein component of the extracellular matrix of these organisms (8). Curli of Escherichia coli are composed of the secreted proteins CsgA and CsgB. The latter is believed to prime the polymerization of the former and anchor the fibrils to the outer membrane (9). Both CsgA and CsgB fibrils are β-sheet-rich and, like amyloids, stain with the dye Congo red (10, 11).

Because amyloid fibrils are non-crystalline and insoluble, solution NMR and x-ray crystallography are not directly applicable in structural studies. Solid state NMR and electron spin resonance have both been useful in obtaining constraints on amyloid structures and, in some cases, determining detailed structural information. The disease-associated amyloids formed by Aβ, amylin, α-synuclein, and tau along with the infectious amyloids of several yeast prions each have in-register parallel β-sheet structure (1219).

Here we confirm that the fibrils formed in vitro by CsgA and CsgB proteins are amyloids and explore their structure using solid state NMR and electron microscopy. Our results indicate that, unlike the pathogenic amyloids of humans and yeast, CsgA and CsgB amyloids are not in-register parallel β-sheet structures. Solid state NMR and electron microscopy data are consistent with a β-helix-like structure but do not establish such a structure definitively.

EXPERIMENTAL PROCEDURES

CsgA and CsgB Purification

The csgA and csgB genes, with carboxyl-terminally encoded His tags, were cloned into pET101/D (Invitrogen) bacterial expression plasmids (supplemental Table 1). Protein overexpression was performed in E. coli strain BL21(DE3) grown in Luria broth at 37 °C. For single isotope labeling, cells were grown in defined amino acid medium as described previously (17, 18). For uniform 13C,15N labeling, cells were grown in a defined medium containing the following: 4 g/liter glucose, 0.2 g/liter adenine sulfate, 0.2 g/liter uracil, 0.03 g/liter thiamine, 1.2 g/liter MgSO4, 13 g/liter KH2PO4, 10 g/liter K2HPO4, 9 g/liter Na2HPO4, 2 g/liter NH4Cl, 0.05 g/liter EDTA, 0.06 g/liter FeSO4, 0.06 g/liter CaCl2, 12 mg/liter MnCl2, 8 mg/liter CoCl2, 7 mg/liter ZnSO4, 3 mg/liter CuCl2, 2 mg/liter H3BO3, and 2.5 mg/liter (NH4)6Mo7O24. When the cells reached an A600 of ∼0.8, they were harvested and resuspended to twice their original volume in the same medium but with [13C]glucose and 15NH4Cl. Protein expression was induced with 1 mm isopropyl β-d-thiogalactosidase when cell densities reached an A600 of ∼0.5–0.7. Cells were harvested after 3–4 h of protein induction.

CsgA and CsgB were purified similarly. Following protein expression, cell pellets were resuspended in a phosphate buffer (10 mm K2HPO4, pH 7.4, 150 mm NaCl, and 1 mm EDTA) supplemented with Roche Applied Science Complete protease inhibitor mixture. The cells were lysed by sonication and then spun at 20,000 rpm for 20 min. The supernatant was discarded, and the pellet was resuspended in Denaturing Buffer (8 m guanidine, 100 mm NaCl, and 100 mm K2HPO4, pH 8). After 2–3 days of incubation at room temperature with occasional agitation, the lysate was spun at 30,000 rpm for 20 min. The pellet was discarded, and the supernatant was mixed with nickel-nitrilotriacetic acid-agarose (Qiagen; 1 ml/liter of culture) and incubated at 4 °C with mild agitation for ∼1 h. The lysate and nickel-nitrilotriacetic acid-agarose were poured into a glass column and washed with 10 column volumes of 8.5 m urea, 20 mm Tris, pH 7.5, 160 mm NaCl, and 20 mm imidazole. The protein was eluted with 8 m urea, 50 mm Na2HPO4, pH 8, 100 mm NaCl, and 200 mm imidazole. Truncated CsgB(Δ1–8) was purified the same way except there was no sonication step, and cells were directly lysed in Denaturing Buffer.

CsgA and CsgB fibrils were prepared by dialyzing purified protein into 25 mm Tris, pH 7.5, 100 mm NaCl, and 0.5 mm EDTA and incubating at room temperature with or without mild agitation for several days. The fibrils are resistant to 8.5 m urea and were sometimes further cleaned of residual contaminants by briefly washing with 5–6 m urea. For solid state NMR the fibrils were harvested by centrifugation, washed several times with water, and finally lyophilized. Uniformly 13C,15N-labeled CsgA fibrils were rehydrated prior to two-dimensional NMR spectroscopy.

Solid State Circular Dichroism

Suspensions of fibrils of CsgA or CsgB in water were dispensed (100 μl; ∼1 mg/ml) onto a quartz slide and dried under vacuum. The resulting films were analyzed using a Jasco J-715 spectropolarimeter as described previously (20). Soluble bovine serum albumin (BSA) was similarly dried and analyzed as a control.

Thioflavin-T Fluorescence Assay

A thioflavin-T fluorescence assay was performed using a Photon Technology International QuantaMaster fluorometer as described previously (21). Soluble BSA and fibrils of CsgB(Δ1–8) and Ure2p were dispensed into a quartz cuvette to a final concentration 10 μg/ml (800-μl final volume). The assay was performed in 200 mm Tris, pH 7.5, 200 mm NaCl, and 0.01 mg/ml thioflavin-T (filtered) with constant stirring at room temperature with a fluorescence excitation of 420 nm.

Proteinase K Digestion

Comparative proteolysis was performed by incubating 20 ng of Proteinase K with 10 μg of soluble or fibrous CsgA for 0, 5, 10, and 20 min. Reactions were performed in 125 mm Tris, pH 7.5, and 0.8 m urea at room temperature in a total volume of 20 μl. The reactions were terminated by the addition of 100 μl of 88% formic acid, which also serves to depolymerize CsgA fibrils for subsequent gel electrophoresis (22). The samples were lyophilized, then resuspended in 8 m urea and 2% SDS, and heated for 3 min at 72 °C. The samples were evaluated by SDS-PAGE using a 10–20% Tricine gel.

X-ray Fiber Diffraction

CsgA and CsgB fibrils were washed extensively with water, lyophilized, and packed into a 0.7-mm-diameter, thin walled, boron-rich glass capillary by centrifugation. The samples were placed in an x-ray beam (0.15418 nm) generated by a rotating anode source operated at 50 kV and 100 mA coupled with a multilayer focusing optic. Diffraction data were collected in 20-min exposures during which the samples were continuously rotated around an axis perpendicular to the beam. Diffraction data were recorded on an RaxisIV two-dimensional imaging plate detector placed 15 cm from the capillary. Background data were collected similarly but exposing a part of the same capillary that was free of fibers. The data were converted to intensity versus 2θ plots by circular summation around the center of the two-dimensional images using the R-axis Display Program (Rigaku Americas). The backgrounds were subtracted, and the resulting plots were displayed by Origin 7 (OriginLab).

Electron Microscopy

Aqueous suspensions of curli fibrils were applied to carbon-coated copper grids, incubated for 2–3 min, briefly washed with water, and stained with 1–2% uranyl acetate for 2–3 min. Grids were then blotted and air-dried. Electron micrographs were produced with an FEI Co. Morgagni transmission electron microscope operating at 80 kV.

For electron diffraction, aqueous suspensions of CsgB fibrils were applied to carbon-coated copper grids and washed several times with water. The diffraction from the protein fibrils is sensitive to damage caused by the electron beam, so regions on the grid with CsgB fibrils were found under low magnification with low electron beam intensity prior to diffraction imaging. Immediately after diffraction imaging, regular images were taken to confirm the region used for diffraction. All electron diffraction images were taken with an 80-kV electron beam and 350-mm camera length. The diffraction distances were calibrated with the diffraction rings of thallous chloride (Electron Microscopy Sciences) under the same microscope conditions. Single crystals of molybdenum oxide (MoO3; Electron Microscopy Sciences) were used to measure the rotation between regular images and diffraction images. Also MoO3 crystals were used to confirm that the electron beam does not shift between regular and diffraction imaging by comparison of regular and defocused diffraction images under the measurement conditions.

Solid State NMR Spectroscopy

Solid state NMR experiments with carbonyl-labeled samples were performed at room temperature at 9.39 teslas (100.4-MHz 13C NMR frequency) with an InfinityPlus spectrometer (Varian, Palo Alto, Ca). Lyophilized CsgA fibrils were packed into 3.2-mm-diameter magic angle spinning (MAS) NMR rotors. Measurements were performed with Varian MAS NMR probes. For certain measurements, the fibrils were rehydrated by addition of water in the rotor. Previous solid state NMR measurements on amyloid fibrils have shown that lyophilization does not disrupt amyloid structures and that rehydration of lyophilized samples leads to solid state NMR spectra with sharp lines that are indistinguishable from spectra of fully hydrated samples that had not been lyophilized (17, 23, 24).

The PITHIRDS-CT technique (25) was used to measure intermolecular 13C-13C magnetic dipole-dipole couplings in carbonyl-labeled samples. These couplings depend on intermolecular distances as 1/R3. Experimental conditions were the same as in earlier PITHIRDS-CT measurements on various amyloid fibrils (19, 26, 27). Raw PITHIRDS-CT data were corrected for contributions from the 1.1% natural abundance 13C at all carbonyl positions, assuming the 17% carbonyl signal decay observed in PITHIRDS-CT measurements on an unlabeled CsgA sample. Differences in T2 decays between naturally occurring 13C and 13C labels were small (supplemental Fig. 3B) but were included in the correction for relative contribution by using the relative signals remaining following a period equal to the constant time period of the PITHIRDS-CT experiment.

Two-dimensional NMR spectra of uniformly 15N,13C-labeled CsgA fibrils were obtained at 14.1 teslas (150.7-MHz 13C NMR frequency) with MAS at 13.50 kHz. Approximately 2 mg of lyophilized fibrils were loaded into a 3.2-mm MAS rotor and rehydrated with Teflon spacers to keep the sample centered in the radiofrequency (rf) coil of the NMR probe. Two-dimensional 13C-13C spectra were obtained with either finite pulse radiofrequency-driven recoupling (fpRFDR) (28, 29) or rf-assisted diffusion (RAD) (30, 31) in the mixing period between t1 and t2 periods. In fpRFDR experiments, the mixing period was 2.37 ms, 13C π pulses were 20.0 μs, and proton decoupling fields were 105 kHz. In RAD experiments, the mixing period was 200 ms with a 6.75-kHz proton rf field. Total experiment times were 44 h for the two-dimensional fpRFDR spectrum and 112 h for the two-dimensional RAD spectrum. Two-dimensional 15N-13C spectra were obtained with mixing periods comprising a 5-ms 15N-13C cross-polarization period (∼31-kHz 15N rf field and 18-kHz 13C rf field) followed by a 13C-13C polarization transfer period under fpRFDR. For the two-dimensional 15N-13Cα/13CX spectrum, the 13C carrier frequency was centered in the 13Cα region during 15N-13C cross-polarization. The fpRFDR period was 2.37 ms with 25.0-μs 13C π pulses and with the carrier frequency centered in the aliphatic region. For the two-dimensional 15N-13CO/13CX spectrum, the 13C carrier frequency was centered in the 13CO region during 15N-13C cross-polarization. The fpRFDR period was 2.37 ms with 10.0-μs 13C π pulses and with the carrier frequency centered between 13CO and 13Cα signals. Total experiment times were 12 h for the two-dimensional 15N-13Cα/13CX spectrum and 15 h for the two-dimensional 15N-13CO/13CX spectrum. Both spectra were acquired with 32 t1 points and a 96.0-μs t1 increment. Spectra with longer t1 periods showed only a minor improvement in resolution in the 15N dimension at the expense of signal to noise.

RESULTS

CsgA Forms Amyloid Fibrils

Milligram quantities of CsgA were obtained using pET101/D in E. coli BL21(DE3) (see “Experimental Procedures”). Amino-terminal sequencing and mass spectrometry showed that purified CsgA lacked its signal peptide and otherwise had the expected mass (Fig. 1). After CsgA samples were removed from denaturing conditions and incubated in non-denaturing buffer, small fibrils formed within 1 day, and then larger filamentous networks formed over subsequent days as observed by electron microscopy (Fig. 1C and supplemental Fig. 1). This time course of fibril formation corroborates previous observations of purified CsgA (32). Also the in vitro prepared CsgA fibrils required formic acid for depolymerization prior to SDS-PAGE (Fig. 1B) as observed previously for curli derived directly from E. coli (10).

FIGURE 1.

FIGURE 1.

Recombinant CsgA is correctly processed and forms SDS/urea-resistant fibrils. A, the primary sequence of CsgA is shown as coded by the pET expression plasmid pFPS149. Underlined amino acids indicate locations of specific carbonyl labeling with 13C for the PITHIRDS experiments. Amino-terminal sequencing of the purified product indicated that the protein was cleaved between Ala-20 and Gly-21 as indicated above (*). The theoretical mass of the processed protein with the histidine tag is 13916 Da. Mass spectrometry of purified CsgA indicated a mass of 13,918 Da for the unlabeled protein and 13,921 Da for the sample isotopically labeled at the three phenylalanine carbonyl positions. B, the electrophoretic migration of CsgA is slightly slower than would be predicted for its mass. Equal amounts of CsgA fibrils were treated with 88% formic acid (lanes 1 and 2) or with 8 m urea (lanes 3 and 4) prior to SDS-PAGE. C, electron micrographs of purified recombinant CsgA after incubation in Tris-saline buffer.

CsgA fibrils prepared in vitro possessed the physical characteristics typical of amyloid. Fibrils were examined by x-ray diffraction, which indicated a primary atomic spacing of ∼4.7 Å and an additional spacing of ∼9 Å (Fig. 2A). These two spacings are common for amyloid and are interpreted, respectively, as the distance between β-strands within β-sheets and the distance between layers of β-sheets. Solid state circular dichroism spectroscopy was performed on CsgA fibrils (Fig. 2B). This method provides details of structural composition while bypassing the problems associated with fibrillar samples that form macroscopic aggregates (20). The circular dichroism spectrum of CsgA fibrils, especially the negative ellipticity peak at 218 nm, is characteristic of protein with a high degree of β-sheet structure. Also relative to soluble CsgA, fibrillar CsgA exhibited markedly higher resistance to limited Proteinase K exposure as is typical of amyloid fibrils (Fig. 2C).

FIGURE 2.

FIGURE 2.

CsgA fibrils are rich in β-sheet and have increased protease resistance. A, x-ray diffraction of dried CsgA fibrils indicates atomic spacing of ∼0.48 (upper arrow) and ∼0.9 nm (lower arrow). B, dried CsgA fibrils were compared against dried BSA by circular dichroism spectroscopy. Based on curve fitting (20), the relative secondary structure contributions of CsgA fibrils are as follows: 16 ± 2% α-helix, 40 ± 2% β-sheet, 13 ± 2% β-turn, and 31 ± 2% remainder. C, equal concentrations of soluble and fibrous CsgA were exposed to Proteinase K for 0–20 min (′). mdeg, millidegrees.

CsgB Forms Amyloid Fibrils

Previously a recombinant and truncated CsgB protein was shown to form amyloid-like fibrils in vitro (9), but attempts to purify the full-length protein were reported to be unsuccessful. We attempted the same strategies reported above for CsgA to produce full-length CsgB. Two csgB constructs were made (Fig. 3A): one coding the entire open reading frame and another beginning at the second methionine residue (CsgB(Δ1–8)) with the presumption that the export signal would be disrupted and the protein could be more easily purified from a cell lysate. Each construct yielded proteins with the correct approximate mass as determined by electrophoresis (Fig. 3B), and the truncated signal sequence did disrupt secretion as judged by plating cells on Congo red indicator plates (supplemental Fig. 2). The full-length csgB construct yielded two species of slight mass difference, which we assume to be the result of imperfect post-translational processing using this expression system. Like CsgA, both CsgB proteins formed fibrils very quickly after removal from denaturing conditions (Fig. 3, C and D). Moreover the fibrils displayed detergent and urea insensitivity but could be depolymerized with formic acid (Fig. 3B).

FIGURE 3.

FIGURE 3.

CsgB and CsgB(Δ1–8) form SDS/urea-resistant fibrils. A, the primary sequence of CsgB as coded by plasmid pFPS186. A similar plasmid was constructed but with a disruption to the secretory sequence (pFPS184[CsgB(Δ1–8)]). Tyrosine residues (underlined) of CsgB were labeled with 13C at the carbonyl position for the PITHIRDS-CT experiments. B, fibrils were examined by SDS-PAGE: CsgB fibrils suspended in 8 m urea and 2% SDS without (lane 1) and with (lane 2) 88% formic acid pretreatment are shown; similarly CsgB(Δ1–8) is shown without (lane 3) and with (lane 4) pretreatment. C, electron micrograph (28,000× direct magnification) of CsgB fibers after 1 day in Tris-saline buffer. D, CsgB(Δ1–8) fibers after 1 day.

Fibrils formed by CsgB also have the properties of amyloid. The fluorescent dye thioflavin-T is commonly used to report amyloid formation. When CsgB fibrils were mixed with thioflavin-T, the characteristic increase in fluorescence intensity was observed (Fig. 4A). Likewise solid state circular dichroism spectroscopy yielded a spectrum consistent with high β-sheet content (Fig. 4B), and x-ray diffraction of CsgB fibrils also yielded the distinctive spacings (∼4.7 and ∼9 Å) of amyloid (Fig. 4C). However, high β-sheet content alone does not mean a protein aggregate is amyloid. Rather the cross-β structure, in which β-sheets are oriented such that the β-strands run perpendicular to the long axis of the fibril, is the structural hallmark of amyloid. Strong x-ray diffraction of non-aligned samples may indicate cross-β structure, but diffraction from aligned fibrils is more definitive. To this end, electron diffraction can be used to obtain such directional information about amyloid secondary structure (33, 34). The electron diffraction pattern of CsgB fibrillar aggregates on electron microscope grids shows the distinctive 4.7-Å atomic spacing of β-sheet (Fig. 5A). Moreover regions in which the CsgB fibrils are preferentially aligned in the same direction yield asymmetrical electron diffraction patterns indicative of repeated spacing of about 4.7 Å along the long axis of the fibrils (Fig. 5B) that is the signature of cross-β structure. A single slice extending from the center of the electron diffractogram indicates an increase in intensity at about 4.7 Å (Fig. 5C). The angular dependence of the diffraction intensity around the diffraction ring shows local maxima and minima corresponding, respectively, to the diffraction parallel and perpendicular to the direction of fibril alignment (Fig. 5D), indicating that the 4.7-Å spacing is along the axis of the fibrils.

FIGURE 4.

FIGURE 4.

CsgB fibrils have features characteristic of amyloid. A, thioflavin-T fluorescence assay comparing CsgB(Δ1–8) fibrils with fibrils of the prion protein Ure2p and soluble BSA. λex = 420 nm. B, dried CsgB fibrils were compared against dried BSA using solid state circular dichroism spectroscopy. The relative secondary structure contributions of CsgB fibrils are predicted as follows: 7 ± 1% α-helix, 46 ± 1% β-sheet, 12 ± 1% β-turn, and 35 ± 2% remainder. C, x-ray diffraction of dried fibrils indicates that both CsgB and CsgB(Δ1–8) show characteristic atomic spacing of ∼0.47 (upper arrow) and ∼0.9 nm (lower arrow). mdeg, millidegrees.

FIGURE 5.

FIGURE 5.

Electron diffraction with fibrils of CsgB. A, non-aligned CsgB fibrils yield an ∼4.7-Å electron diffraction pattern (left image) from the region inside the dashed circle on the regular electron microscope image (right image). B, CsgB fibrils in the dashed circle show a similar orientation along the vertical direction (indicated with 0° arrow) and produce the diffraction pattern shown in the inset. The arrow in the diffraction inset indicates the corresponding orientation on the diffraction pattern. C, intensity of a single slice through the diffraction pattern of B indicating the increase of intensity at ∼4.7Å. D, angular dependence of the diffraction ring of B (integrated from 4.6 to 4.8 Å with 20° angular bins). The peak at 180° and minima at 90° and 270° indicate that the ∼4.7-Å distance is along the axis of the fibrils (cross-β structure). The solid line is drawn to guide the eye.

Solid State NMR Chemical Shifts Indicate β-Sheet Structure for Curli Fibrils

Solid state NMR spectroscopy is perhaps the most direct and powerful method for acquiring detailed structural information about amyloids (35). For solid state NMR experiments, CsgA fibrils were prepared with 13C isotope labels at all of the carbonyl positions of leucine (6 residues), phenylalanine (3 residues), or valine (8 residues). These residues are relatively evenly distributed throughout the CsgA amino acid sequence as underlined in Fig. 1A. In addition, roughly 1.9 natural abundance 13C nuclei per CsgA molecule are distributed randomly at backbone or side chain carbonyl and carboxyl sites. The one-dimensional 13C MAS NMR spectra of lyophilized samples are presented in Fig. 6 along with dashed lines that indicate the frequencies of the respective residues when in random coil secondary structure (36) (two-component Gaussian fits of the spectra with chemical shifts and relative contributions are presented in supplemental Fig. 3A and supplemental Table 2). In the spectrum of leucine-labeled CsgA fibrils, the carbonyl 13C signal is asymmetric with the major component centered at 172.8 ppm (Fig. 6A). The random coil value for a leucine carbonyl is 175.9 ppm, and negative deviations from this value are indicative of β-strand secondary structure (37). For the valine-labeled sample, the carbonyl 13C signal is more symmetric with a peak at 172.5 ppm, which is in the range typical of β-strand secondary structure (Fig. 6C). For the phenylalanine-labeled sample, the carbonyl 13C signal is split with maxima at 174.2 and 173.1 ppm. In the spectrum of unlabeled CsgA fibrils, the carbonyl peak is centered at 172.2 ppm (Fig. 6E), which is lower than the predicted random coil frequencies for the constituent CsgA amino acids, confirming the general β-sheet character of the fibrils. Spectra in Fig. 6, A and B, suggest that certain leucine and phenylalanine residues may be in non-β-strand segments in CsgA fibrils. Data for uniformly 15N,13C-labeled CsgA fibrils described below also indicated non-β-strand conformations at certain residues.

FIGURE 6.

FIGURE 6.

Solid state 13C one-dimensional NMR spectra of carbonyl-labeled CsgA and CsgB samples. A–D, the carbonyl signal from each labeled CsgA and CsgB fibril sample is shown. The chemical shift values are relative to tetramethylsilane and are presented with a dashed line indicating the random coil value for the carbonyl of the respective amino acid (36). E and F, the full carbon chemical shift spectra of the [1-13C]valine and unlabeled CsgA samples.

CsgB was similarly labeled at the carbonyl position of its tyrosine residues. The 13C carbonyl signal is centered at 172.8 ppm (Fig. 6D), which is shifted from the random coil value in the direction of β-sheet structure. However, the signal is broad and suggests that the 6 tyrosine residues may be distributed into different structural environments.

Intermolecular Distance Measurements by Solid State NMR

PITHIRDS-CT measurements (25) were performed on the 13C carbonyl-labeled CsgA and CsgB samples to probe intermolecular carbonyl-carbonyl distances. The PITHIRDS-CT uses radiofrequency pulse sequences, comprising π pulses occupying one-third of the MAS rotor period, to restore (i.e. recouple) the nuclear magnetic dipole-dipole interactions that are otherwise averaged out by MAS in solid state NMR. The recoupling periods are of variable length during a time prior to signal acquisition; the constant time (CT) approach minimizes variation in signal decay due to T2 spin relaxation. The magnitudes of dipole-dipole interactions are proportional to 1/R3, so they can be used to report distances between nuclei. The time scale for decay of the 13C signal in a PITHIRDS-CT measurement as a result of the dipole-dipole interactions is related to the distance between 13C nuclei as shown by numerical simulations in Fig. 7.

FIGURE 7.

FIGURE 7.

Dipolar recoupling NMR experiments indicate curli amyloid is not based on parallel in-register β-sheet structure. The graphs show 13C signal decays of specifically labeled carbonyl positions using the PITHIRDS-CT method. All graphs include simulated PITHIRDS-CT curves for ideal chains of 13C with spacing of 4.7 Å. A, [1-13C]tyrosine-labeled Sup35NM fibrils adjacent to a representative schematic of their in-register parallel β-sheet amyloid structure. B, [1-13C]tyrosine-labeled HET-s(218–289) fibrils adjacent to a representative schematic of their β-helix amyloid structure. C, CsgA fibrils separately labeled at the carbonyl positions of valine, leucine, or phenylalanine. The contributions of the adjacent valines at the amino terminus of the CsgA sequence were subtracted from the valine signal values with the assumption that they have signal decays similar to the simulated 3-Å dipole-dipole distance. D, [1-13C]tyrosine-labeled CsgB fibrils and unlabeled CsgA fibrils.

Amyloid fibrils formed by the yeast prion proteins Sup35, Ure2p, and Rnq1 are based on in-register parallel β-sheets (1719, 27). In such structures, each residue is adjacent to the same residue in the neighboring polypeptide chains within the β-sheets, leading to intermolecular distances of roughly 5 Å. Fig. 7A shows a schematic of the Sup35 structure along with the signal decay of a [1-13C]tyrosine-labeled Sup35 sample in a PITHIRDS-CT experiment. Fig. 7B shows the signal decay for amyloid of [1-13C]-tyrosine-labeled HET-s(218–289), the fibril-forming domain of a fungal prion that forms a β-helix-like structure in which intermolecular carbonyl-carbonyl distances among like residues are generally greater than 9 Å (38). Accordingly there is a much slower decay of the 13C signal for HET-s(218–289). The PITHIRDS-CT data for CsgA and CsgB fibril samples (corrected for signal contributions from natural abundance 13C as described above) are shown in Fig. 7, C and D, respectively. None of the samples shows the rapid 13C carbonyl signal decay indicative of an in-register parallel β-sheet. Instead the PITHIRDS-CT data indicate average carbonyl-carbonyl distances of ∼7 Å or greater. Note that valine-labeled CsgA contains one sequential carbonyl 13C pair (expected 3.4-Å intramolecular distance in a β-strand; see Fig. 7C legend) and one pair separated by a single unlabeled residue (expected 7-Å intramolecular distance in a β-strand). Leucine-labeled CsgA also contains one carbonyl 13C pair separated by a single unlabeled residue. The hypothetical alignment of homologous segments in Fig. 1A would produce one pair of leucines and one pair of valines with approximate 5-Å intersegmental, intramolecular distances. These intramolecular proximities may explain the apparent average 13C-13C distances of ∼7 Å for valine-labeled and leucine-labeled samples. The hypothetical alignment in Fig. 1A also leads to 8–9-Å intramolecular 13C-13C distances for phenylalanine-labeled CsgA. We do not expect the PITHIRDS-CT technique to yield accurate interatomic distances when these distances exceed 7 Å because pulse sequence imperfections and spin relaxation processes may become dominant sources of signal decay in this regime.

Two-dimensional Solid State NMR Spectroscopy of 13C,15N-Labeled CsgA

Fibrils were prepared from uniformly 15N,13C-labeled CsgA and examined by two-dimensional solid state NMR. The two-dimensional 13C-13C NMR spectrum acquired with fpRFDR mixing shows strong cross-peaks that connect 13C chemical shifts of directly bonded carbon sites, allowing partial assignments to residue types based on their known chemical shift ranges in peptides and proteins (Fig. 8, A–C). Signals from threonine, serine, and glutamine/glutamate (unresolved) residues have chemical shifts characteristic of β-strands. The major signals from alanine and valine residues also have β-strand chemical shifts, but minor signal components for these residues have non-β-strand chemical shifts. Signals from asparagine/aspartate (unresolved) residues have chemical shifts that are not clearly indicative of β-strands, suggesting that the majority of these residues are in non-β-strand segments (Table 1).

FIGURE 8.

FIGURE 8.

Two-dimensional 13C-13C spectra of uniformly 15N,13C-labeled CsgA fibrils. A, full two-dimensional spectrum obtained with a 2.37-ms fpRFDR mixing period at 13.50-kHz MAS frequency and 150.7-MHz 13C NMR frequency. CO/Cα cross-peaks of glycine residues and cross-peaks to side chain CO sites of asparagines/aspartate and glutamine/glutamate sites are indicated by arrows. B, one-dimensional slices of the two-dimensional spectrum taken at positions indicated by the color coding. C, expansion of the aliphatic region with color-coded paths indicating residue type assignments of cross-peaks to threonine, serine, valine, alanine, and isoleucine residues. Dashed red and purple lines indicate valine and alanine signals with non-β-strand chemical shifts. Chemical shifts in all other assignment paths are consistent with β-strand secondary structure. Cα/Cβ cross-peaks for asparagines/aspartate and glutamine/glutamate residues and Cα diagonal peaks for glycine residues are indicated by arrows. Asparagines/aspartate chemical shifts suggest non-β-strand secondary structure. D, aliphatic region of two-dimensional spectrum obtained with a 200-ms RAD mixing period. Cross-peaks attributable to polarization transfers between residues that occur as sequential pairs in the CsgA sequence are indicated for threonine/glutamine, threonine/alanine, alanine/serine, asparagine/serine, and glycine/asparagine pairs. The order of the two residues in each pair cannot be determined from this spectrum.

TABLE 1.

13C NMR chemical shifts in CsgA fibrils determined from the two-dimensional spectrum in Fig. 8C

Random coil values (36) were converted to the tetramethylsilane chemical shift reference by subtraction of 1.7 ppm.

Residue type Cα chemical shift Cα random coil value (36) Cβ chemical shift Cβ random coil value (36) Predicted secondary structure
ppm ppm
Alanine (major) 48.9 50.8 20.8 17.4 β
Alanine (minor) 53.2 50.8 16.9 17.4 Non-β
Valine (major) 58.4 60.5 33.3 31.2 β
Valine (minor) 59.7 60.5 30.7 31.2 Non-β
Asparagine/aspartate 52.1 51.4/52.5 37.7 37.2/39.4 Non-β
Glutamine/glutamate 52.1 54.0 31.2 27.7 β
Serine 55.4 56.6 64.2l 62.1 β
Threoninea 59.9 60.1 68.8 68.1 β

a The threonine CO chemical shift is 171.2 ppm, whereas the random coil value is 173.0 ppm.

In solid state NMR experiments on HET-s(218–289) fibrils, Meier and co-workers (38, 39) observed that roughly half of the amino acid sequence does not contribute to solid state NMR spectra. The invisible segments correspond to highly flexible loops (40). Similar behavior has been reported by Helmus et al. (41) in solid state NMR studies of Y145Stop PrP fibrils. To search for such highly flexible segments in CsgA fibrils, we performed two-dimensional 13C-13C NMR measurements with RAD mixing (Fig. 8D) and two-dimensional 15N-13C NMR measurements (Fig. 9) from which cross-peak signals arising from pairs of residues that occur consecutively in the CsgA sequence can be identified. The two-dimensional 13C-13C RAD spectrum shows cross-peaks attributable to glycine/asparagine, asparagine/serine, threonine/alanine, alanine/serine, and threonine/glutamine pairs. The order of residues within these pairs cannot be determined from the two-dimensional 13C-13C RAD spectrum. The two-dimensional 15N-13C spectra allow the identification of backbone 15N chemical shifts for certain residue types (Fig. 9A) and cross-peaks attributable to glycine/glycine, asparagine/glycine, asparagine/serine, glycine/asparagine, threonine/glutamine, and threonine/alanine residue pairs in the specified order (Fig. 9B). These observations argue against the possibility that the glycine-rich segments of CsgA might be highly flexible. We do not have evidence that large segments of CsgA are invisible in solid state NMR measurements.

FIGURE 9.

FIGURE 9.

Two-dimensional 15N-13C spectra of uniformly 15N,13C-labeled CsgA fibrils. A, two-dimensional 15N-13Cα/13CX spectrum showing chemical shift correlations within individual residues. Residue type assignments to glycine, serine, alanine, valine, asparagine, and glutamine residues are shown based on their distinctive 13C NMR chemical shifts in Fig. 8. B, two-dimensional 15N-13CO/13CX spectrum showing correlations between backbone amide 15N chemical shifts of residue k and 13C chemical shifts of residue k − 1. Assignments to certain sequential residue pairs are based on 15N chemical shifts in A and 13C chemical shifts in Fig. 8. Intraresidue cross-peaks involving asparagine and glutamine side chain amides are also observed.

Electron Microscopy Reveals Narrow CsgA Fibrils

Electron microscopic comparisons of CsgA fibrils with tobacco mosaic virus, HET-s prion domain fibrils, and Ure2p fibrils indicate that CsgA forms a very narrow fibril similar in size to that of HET-s(218–289) (Fig. 10). The individual CsgA fibrils are ∼3 nm in diameter, are of various lengths, and frequently associate laterally to form larger fibril bundles (Fig. 10A). It is primarily these fibril bundles that are seen in Fig. 1. The single fibrils are similar in diameter to the fibrils formed by the HET-s prion domain that have diameters of 4 or 5 nm according to the dimensions of the NMR structure (38) or previous electron microscopy measurements (42). By comparison, a fiber composed of multiple CsgA fibrils is dwarfed by a single fibril of the full-length yeast prion protein Ure2p (Fig. 10C), which has a single polypeptide per 4.7 Å along the fibril axis (43) and thus has much more mass per unit length than fibers formed by HET-s(218–289).

FIGURE 10.

FIGURE 10.

CsgA fibers are composed of narrow fibrils. A, at high magnification in negatively stained transmission electron microscopy images, CsgA fibers are seen to form from the lateral association of very small fibrils, which are 3–4 nm in diameter. B, comparison of the fibrils (white arrows) of both CsgA and the prion protein HET-s(218–289) with tobacco mosaic virus (TMV; black arrows). C, the bundled fibrils of CsgA are dwarfed by a single fibril of the prion protein Ure2p that is ∼25 nm in diameter (43). The respective molecular masses of CsgA, HET-s prion domain, and Ure2p are ∼14, 9, and 40 kDa.

Mass per Length Measurements

An in-register parallel β-sheet structure has one molecular mass per 4.7 Å, whereas a single β-helix should have half (or less) this value. We used dark field transmission electron microscopy to measure the mass per unit length of unstained CsgA filaments by a method developed by Chen et al.3 Measurements on the thinnest visible filaments gave an average of 15.3 kDa/nm (Fig. 11). Given that the molecular mass of CsgA is 13.916 kDa, a mass per unit length value of 29.6 kDa/nm is expected for an in-register parallel β-sheet structure. The value of 15.3 kDa/nm is a maximum value because it is uncertain that single unbundled fibers are detected under the dark field conditions.

FIGURE 11.

FIGURE 11.

Mass per length measurements of CsgA fibrils. A and B, dark field electron microscopy images of CsgA fibers and tobacco mosaic virus (black arrows); CsgA fibers are of a variety of widths because of lateral association of individual fibrils. Examples of the thinnest discernible CsgA fibrils are indicated with the white arrows. The scale bars represent 100 nm, respectively. C, distribution of the mass per length values of the thinnest CsgA fibrils.

DISCUSSION

Proteins from a variety of organisms can form amyloid both in vitro and in vivo, and the propensity to form amyloid may be a common property of many proteins (2); there is no single amino acid sequence or defining sequence motif that determines amyloid-forming proteins except a relatively low amount of charged residues (35, 44). Because of its association with many diseases, amyloid is typically viewed as an aberrant conformation that is a consequence of protein misfolding. However, there is a precedent for some proteins to form functional amyloid in which the large filamentous aggregates represent the native and functional state; this is a departure from the traditional model that amyloid is solely a result of misfolding (for a review, see Ref. 3). Such proteins form biologically functional fibrils that have the biophysical hallmarks of other amyloids but are structurally distinct from other biological filaments formed by proteins such as actin or tubulin. Implicitly native state amyloids are assumed to be based upon structural elements similarly to disease-associated amyloids.

Curli fibrils represent a functional state and are not products of misfolding, and because they have many of the characteristics of amyloid, they have been proposed to be a functional amyloid (10). Given the high sequence identity between the curli proteins CsgA and CsgB and their ability to cross-polymerize (9, 11, 45), it is likely that they adopt similar protein folds in curli fibrils. Using homology modeling, individual CsgA and CsgB subunits have previously been predicted to form β-helices (7, 46, 47), which are common folds for proteins secreted by bacteria (48, 49). A β-helix (also known as β-solenoid or β-roll) is a rodlike structural motif formed by repetitive alternation of short β-strands and bends with β-strands running perpendicular to the rod and intramolecular hydrogen bonds between β-strands running parallel to the rod as in a cross-β structure. If curli fibrils are based on linearly arranged (stacked) β-helices, they are structurally distinct from most of the amyloids studied to date with the exception of fibrils formed by residues 218–289 of the HET-s fungal prion protein (38).

In this work, we attempt to further characterize curli structure using several biophysical methods, including solid state NMR. Full-length curli proteins CsgA and CsgB were expressed and purified using an E. coli expression system with no specific csg mutations. Both recombinant proteins form fibrils in vitro that, like in vivo curli, have many of the characteristics of amyloid. Moreover the x-ray and electron diffraction data presented here indicate that curli fibrils have a cross-β structure, which is often considered the definitive characteristic of amyloid. To date, most amyloid fibrils formed by peptides or proteins longer than 15 residues have been found to be based on in-register parallel β-sheet structures (for a review, see Ref. 50). Some important examples of in-register parallel β-sheet structures are the disease-associated amyloids formed by the Aβ peptide (Alzheimer disease) (26, 51), α-synuclein (Parkinson disease) (15), amylin (type II diabetes) (14, 23), and tau (Alzheimer disease) (16). Likewise three different yeast prions share a similar in-register parallel β-sheet structure (1719).

The PITHIRDS-CT data in Fig. 7 imply that CsgA and CsgB fibrils do not contain in-register parallel β-sheet structures as we found no evidence for the ∼5-Å intermolecular distances between 13C-labeled backbone carbonyl sites that would exist in such structures. Whether CsgA and CsgB fibrils contain β-helical structures or some other variant of a cross-β structure cannot be determined definitively from our current data. Nonetheless several features of the data merit discussion. First, 13C chemical shifts at certain residues do not follow the trends expected in β-strand segments, i.e. reduced chemical shift values for carbonyl and α-carbon sites and increased values for β-carbon sites (relative to random coil chemical shifts). The carbonyl line shapes for leucine and phenylalanine sites in Fig. 6 indicate that a fraction of these residues may be in non-β-strand segments. In Fig. 8C, a minor fraction of the valine and alanine residues exhibits non-β-strand chemical shifts for α- and β-carbons. Also in Fig. 8C, the strong Cα/Cβ cross-peak assigned to asparagine (16 copies in the CsgA sequence) and aspartate (eight copies) residues occurs at a non-β-strand position (Table 1). In contrast, we did not detect serine, threonine, or glutamine/glutamate NMR signals indicative of non-β-strand conformations. Thus, we have preliminary evidence for alternating β-strand and bend segments in CsgA fibrils. Second, the two-dimensional 13C-13C spectrum in Fig. 8D indicates that glycine/asparagine, asparagine/serine, alanine/serine, threonine/alanine, and threonine/glutamine pairs that occur consecutively in the CsgA sequence (in either order) are sufficiently rigid that they contribute to solid state NMR signals. The two-dimensional 15N-13C spectra in Fig. 9 indicate that glycine/glycine, asparagine/glycine, glycine/asparagine, asparagine/serine, threonine/glutamine, and threonine/alanine pairs (in this order) are rigid. The significance of these observations is that we did not find evidence for the existence of flexible loops that separate structurally ordered segments as has been reported for HET-s(218–289) fibrils (38, 39). Third, CsgA fibrils have an apparent diameter of 3–4 nm in negatively stained transmission electron microscopy images (Fig. 10). If ∼100 or more residues participate in the fibril core structure, this diameter seems inconsistent with any structure in which each 4.7-Å β-sheet spacing along the fibril axis contains an entire CsgA molecule. For example, 2-fold symmetric and 3-fold symmetric structures for 40-residue Aβ peptide fibrils (containing ∼60 and 90 residues per β-sheet spacing in the fibril core) have significantly larger diameters in negatively stained transmission electron microscopy images (26). The small diameter of CsgA fibrils would be consistent with a structure in which each CsgA molecule extends over several β-sheet spacings so that the structural repeat distance along the fibril axis is a multiple of 4.7 Å. Consistent with these estimates, our mass per unit length measurements place an upper limit of about one-half a molecular mass per 4.7 Å. A β-helix-like structure is one possibility but not the only one.

One impetus for studying curli fibrils was the possibility that uniformly 15N,13C-labeled CsgA would exhibit exceptionally sharp, well resolved solid state NMR lines, which would facilitate full structure determination as demonstrated previously for HET-s(218–289) fibrils (38, 39). The functional nature of curli fibrils suggests that the CsgA and CsgB sequences are evolved to adopt specific molecular structures without the structural variations or polymorphism observed in fibrils formed by Aβ, amylin, and yeast prion proteins (for reviews, see Refs. 35, 50, and 52). A single molecular structure without internal disorder might then be expected to result in sharp solid state NMR lines. The two-dimensional solid state NMR spectra in Figs. 8 and 9 do not support this expectation. Thus, the observation of exceptionally sharp solid state NMR lines in HET-s(218–289) fibrils (and in fibrils formed by a truncated version of mammalian prion protein (41)) may not be solely a consequence of an evolved biological function. Instead the solid state NMR line widths may also be affected by lateral association of fibrils (or the absence thereof), by the amplitudes and time scales of internal molecular motions, or by other factors not yet identified.

The definition of amyloid includes several architectures with cross-β-strands as a common feature. Functional amyloid may represent the cell's adoption of a pathology to do useful work, but there may prove to be a general structural distinction between pathologic and functional amyloids. The in-register parallel structure of yeast prions and amyloids is compatible with the existence of prion variants (for a review, see Ref. 53) and amyloid variants (26), but a functional amyloid probably should have only one structure.

Supplementary Material

Supplemental Data

Acknowledgments

We thank Ulrich Baxa for helpful suggestions, Eric Anderson for mass spectrometry, and Bo Chen for help with mass per length measurements.

*

This work was supported by the Intramural Program of the NIDDK, National Institutes of Health.

Inline graphic

The on-line version of this article (available at http://www.jbc.org) contains supplemental Figs. 1–3 and Tables 1 and 2.

3

B. Chen, K. R. Thurber, F. Shewmaker, R. B. Wickner, and R. Tycko, manuscript in preparation.

2
The abbreviations used are:
amyloid β
BSA
bovine serum albumin
Tricine
N-[2-hydroxy-1,1-bis(hydroxymethyl)ethyl]glycine
MAS
magic angle spinning
CT
constant time
rf
radiofrequency
fpRFDR
finite pulse radiofrequency-driven recoupling
RAD
rf-assisted diffusion.

REFERENCES

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES