Abstract

Carboxysomes are protein microcompartments that function in the bacterial CO2 concentrating mechanism (CCM) to facilitate CO2 assimilation. To do so, carboxysomes assemble from thousands of constituent proteins into an icosahedral shell, which encapsulates the enzymes Rubisco and carbonic anhydrase to form structures typically > 100 nm and > 300 megadaltons. Although many of the protein interactions driving the assembly process have been determined, it remains unknown how size and composition are precisely controlled. Here, we show that the size of α-carboxysomes is controlled by the disordered scaffolding protein CsoS2. CsoS2 contains two classes of related peptide repeats that bind to the shell in a distinct fashion, and our data indicate that size is controlled by the relative number of these interactions. We propose an energetic and structural model wherein the two repeat classes bind at the junction of shell hexamers but differ in their preferences for the shell contact angles, and thus the local curvature. In total, this model suggests that a set of specific and repeated interactions between CsoS2 and shell proteins collectively achieve the large size and monodispersity of α-carboxysomes.
Introduction
Spatial organization is a fundamental feature of biology delineating the boundaries between cells and the environment and segregating the internal metabolism.1 Cyanobacteria and diverse chemoautotrophic bacteria use specialized protein microcompartments called carboxysomes to provide a CO2-enriched microenvironment for the carboxylase Rubisco, which fixes CO2 into sugars.2−4 Carboxysomes are a striking example of emergent order, with thousands of individual copies of 10 different constituent proteins spontaneously assembling into quasi-icosahedral particles with well-defined size and composition.5 The mechanism of size control is particularly interesting, as the assembled particles extend far beyond the dimensions of any individual component. The assembly process also requires high fidelity with the potential for severe phenotypic penalty. Morphological defects to the carboxysome, such as by deletion of shell genes, break the CO2 concentrating mechanism and result in cell death.6−8
Carboxysomes occur in two evolutionarily distinct lineages: α, found in most marine cyanobacteria and several clades of bacterial chemoautotrophs; and β, found in certain freshwater cyanobacteria. β-Carboxysomes tend to be larger (100–400 nm in diameter) and are thought to assemble in an inside-out manner with the formation of a dense Rubisco kernel preceding encapsulation by the shell proteins.9−11 α-Carboxysomes are somewhat smaller (80–120 nm in diameter) and more regular in size. Their assembly is hypothesized to involve the concurrent formation of cargo and shells.12−14 In this work, we focus on α-carboxysomes to understand what factors drive the size and regularity of the particles.
α-Carboxysome assembly culminates in particles that are densely packed with cargo, free of shell defects to serve as a CO2 permeability barrier, and of uniform size with diameters of about 115 nm in the model species Halothiobacillus neapolitanus.4,15 The process by which carboxysome size is determined is not understood, though several models have been put forward.16−18 Structural studies on the shell proteins have revealed spontaneous assembly of small icosahedral complexes with triangulation numbers of T = 3, 4, or 9.19−21 By comparison, the H. neapolitanus carboxysome is significantly larger with T ∼ 75.22 This suggests that intrinsic curvature preferences of the shell proteins alone1 are not determinative of the carboxysome size—as is the case for many small icosahedral virus capsids—but that an interplay with internal components is ultimately responsible for particle size.
α-Carboxysomes are composed of a thin icosahedral shell made up of thousands of hexameric protein capsomers (CsoS1) and pentameric capsomers (CsoS4) forming the 12 vertices.22 Inside the shell is a dense cargo of enzymes: Rubisco and carbonic anhydrase; and the scaffolding protein CsoS2 (Figure 1).23−26 CsoS2 is unique among the carboxysome proteins as it is predicted to be largely structurally disordered along its entire length.4,13,27 It does, however, contain several distinct repeated sequence motifs that collectively define a three-part domain structure: the N-terminal domain (NTD) with N-peptide motifs, the middle region (MR) with M-peptide motifs, and the C-terminal domain (CTD) with C-peptide motifs and a conserved C-terminal peptide (CTP). Previous work has shown that CsoS2 is essential to growth and acts as a central node with interactions to both Rubisco and CsoS1.13,14,28 Specifically, the NTD binds to and encapsulates Rubisco via interactions with the N-peptides,14,25 and the CTD has been shown to associate with the shell.20,21 We recently showed that MR also interacts with the shell, though what distinguishes its functional role from the CTD is not yet clear.29
Figure 1.
(a) H. neapolitanus regulon with the 10 proteins making up the α-carboxysome. The callout shows the domain structure with repeated peptide motifs in the disordered scaffold protein CsoS2. Note that M7 has been reclassified as an M-peptide (see Figure S1). (b) Carboxysome protein interactome. Known protein–protein interactions are depicted with solid arrows, and cases where the contact structures are solved are shown in the popouts along with their corresponding PDB IDs. CsoS2 is an important hub in the network, bridging the cargo to shell connection. Cartoon schematics are shown roughly to scale.
Herein we describe how the sequence of the scaffold CsoS2 and, in particular, the relative numbers of M- and C-peptide motifs specify the size of the carboxysome. Furthermore, this size effect is largely independent of the primary cargo Rubisco and points toward a model for size control that is driven by the shell’s mechanical properties and, specifically, CsoS2’s modulation of these mechanical properties. Finally, in light of a recent cryoEM structure of portions of the CTD-shell interface,21 we propose a structural model of the M-peptide’s role.
Materials and Methods
Cloning and Expression
All recombinant carboxysome expression was based on the pHnCB10 plasmid which contains the full carboxysome regulon from H. neapolitanus composed of 10 proteins under the control of a lac operator, has a ColE1 origin, and carries a chloramphenicol resistance marker.30 UniProt protein IDs are as follows: CbbL (Rubisco large subunit) O85040; CbbS (Rubisco small subunit) P45686; CsoS2 O85041; CsoSCA (carbonic anhydrase) O85042; CsoS4A (pentameric capsomer) O85043; CsoS4B (pentameric capsomer) O85044; CsoS1C (hexameric capsomer): p45688; CsoS1A (hexameric capsomer): p45689; CsoS1B (hexameric capsomer) P45690; CsoS1D (pseudohexameric capsomer) D0KZ73.
For testing of different CsoS2 variants and truncations, a Golden Gate destination vector was created from pHnCB10, replacing the original csoS2 gene with a lacZ fragment bracketed by BsaI type II restriction sites. The different CsoS2 constructs were built by PCR amplification of the desired fragments with primers containing flanking BsaI sites and compatible recombination sites. Each construct encodes the full set of carboxysome genes but with a modified CsoS2. The polyproline II helix sequence was purchased as a gBlock from IDT. All plasmids were completed by Golden Gate Assembly.31 The coexpression plasmids for CsoS1ABC, CsoS2A, CsoS2B, and fragments of the MR fused to GFP were built on a pFA backbone with a p15a origin, kanamycin resistance, and tetracycline operator.32 All constructs are summarized in Table S1.
Plasmids were transformed into electrocompetent BW25113 Escherichia coli cells. For carboxysome expression, cells were grown at 37 °C to midlog (OD600 0.4–0.6) whereupon the temperature was reduced to 18 °C and inducer was added: 1 mM IPTG for all samples and a variable amount of anhydrotetracycline (aTc) for the variable coexpression. Cultures were grown overnight and then pelleted and frozen or used directly for carboxysome purification.
H. neapolitanus cells were grown at 30 °C in DSMZ 68 media. The construction of the strain expressing carboxysomes with only CsoS2B from a genomic neutral site under control of a lac operator is described in ref (33). To express carboxysomes and sustain atmospheric growth, we grew this strain in the presence of 1 mM IPTG.
Carboxysome Purification
Carboxysomes were purified as described previously.28 The cells from 1 L expression cultures were lysed with 25 mL of B-PER bacterial lysis reagent (Thermo Fisher) supplemented with 0.01 mg/mL DnaseI, 0.1 mg/mL lysozyme, 1 mM phenylmethylsulfonyl fluoride (PMSF), 10 mM MgCl2, and 20 mM NaHCO3. Lysate was clarified by centrifugation at 12 000 rcf for 30 min. Carboxysomes were pelleted from the supernatant by centrifugation at 40 000 rcf for 30 min. Crude pelleted carboxysomes were resuspended with 20 mL of TEMB buffer (10 mM Tris, 10 mM MgCl2, 1 mM EDTA, and 20 mM NaHCO3, pH 8.4) on ice and pelleted again at 40 000 rcf for 30 min. The pellet was again gently resuspended in 2 mL of TEMB and loaded on top of a sucrose gradient with 5 mL layers of 10, 20, 30, 40, and 50% sucrose (w/v) in TEMB. The gradient samples were spun in an ultracentrifuge for 35 min at 105 000 rcf. Then, the gradients were fractionated into 1 mL fractions and analyzed by sodium dodecyl-sulfate polyacrylamide gel electrophoresis (SDS-PAGE) to determine the carboxysome-containing fractions which also display bluish light scattering from the particles. Fractions with carboxysomes were pooled, centrifuged for 90 min at 105 000 rcf, resuspended with TEMB, and stored at 4 °C prior to negative stain transmission electron microscopy (TEM).
Transmission Electron Microscopy and Size Characterization
Carboxysome samples were diluted to a 280 nm absorbance of 0.05–0.1. Prior to sample application, Formvar/carbon-coated, copper EM grids were prepared by glow discharge. Five μL of sample was loaded on to each grid, allowed to sit for at least 2 min, washed, stained with continuous application of 10 μL of 1% (w/v) uranyl acetate solution, and blotted with filter paper. Exposure of the carboxysomes on the grid to air–water interfaces was minimized to prevent carboxysome breakage and to increase the accuracy of size analysis. All samples were imaged with either a FEI Tecnai 12 120 kV or JEOL 1200 EX 80 kV transmission electron microscope.
Particle size characterization was performed in ImageJ. Carboxysomes were manually located to avoid particle misidentification and to exclude significantly damaged carboxysomes. The images were first given a pixel offset of +4 counts to eliminate any zero count pixels. Then, a polyhedron was drawn around each carboxysome, and the pixel counts were zeroed within this region. Finally, the particles were selected with a zero count threshold and measured with automatic particle characterization utility. The minimum Feret diameter, i.e., the closest approach of two parallel lines in contact with the particle projection, was used for all size statistics.
MR Localization to Carboxysomes
A series of plasmids expressing MR truncations fused to Superfolder GFP34 were each cotransformed with the full carboxysome plasmid (pHnCB10). Carboxysome expression was induced with 1 mM IPTG and GFP fusion expression was induced with 100 nM aTc when the cells reached midlog phase (OD600 0.4–0.6). The temperature was reduced to 18 °C and the cells were grown overnight. Carboxysomes were purified as described above. Then, for each sample, the absorbance was measured at 340 nm and GFP fluorescence was measured with 485 nm excitation and 515 nm emission using a Tecan M1000 plate reader. Due to their size, carboxysomes strongly scatter at 340 nm, while soluble protein has low absorption at this wavelength. Thus, the GFP fluorescence intensity divided by the 340 nm absorbance was used as a proxy for the relative GFP loading per carboxysome.
Modified CsoS2 in H. neapolitanus
H. neapolitanus is naturally competent and will take up foreign DNA and specifically integrate it into its genome by homologous recombination if provided suitable flanking homology arms. We used this strategy to knock out CsoS2 at the native locus, replacing it with a spectinomycin resistance cassette and validating it by PCR. This intermediate strain is incapable of atmospheric growth due to its inability to form carboxysomes. We then knocked in a series of CsoS2 variants having variable M-peptide repeat numbers into the genomic neutral site NS1 along with a kanamycin resistance gene.33 All variants could grow under CO2-replete conditions with 5% CO2. Colony forming unit (CFU) counts were performed by serial dilution of liquid cultures grown at 5% CO2 and plated on DSMZ 68 agar plates with spectinomycin (20 μg/mL), kanamycin (10 μg/mL), and IPTG (1 mM) and placed in the corresponding CO2 environments: atmosphere (∼0.04), 0.5, and 5%.
Structural Modeling of M-Peptide Shell Interaction
The AlphaFold-predicted structures of four divergent CsoS2 proteins were downloaded from UniProt (RefIDs):35H. neapolitanus (O85041), Prochlorococcus marinus (Q7V2C8), Nitrosomonas eutropha (Q0AHV9), and Imhoffiella purpurea (W9VGM3). The coordinates matching the M-peptides were extracted and compared pairwise for structural similarity using TMAlign.36 They were commonly aligned to M4 from N. eutropha as the nearest structure to the centroid.
H. neapolitanus M1-peptide was docked to the luminal side of the junction of three CsoS1A hexamers. The coordinates of this shell trimer were obtained from symmetry expansion of the coordinates of PDB ID 2EWH.37 Rosetta was used to independently relax the structures of the M1-peptide and the shell trimer, and the best scoring structures were used for protein–protein docking. The M1-peptide was positioned above the 3-fold shell axis and roughly aligned with the M1-peptide pseudo-3-fold axis. Next, the Rosetta local docking routine was used with an initial translation randomization of 5 Å and orientational randomization of 12°. 500 docked structures were generated.38 The best scoring among these clearly fell into a common class of configurations, with the VTGs wedged near the symmetry-related CsoS1A-His79 residues. The protein–protein interface was further relaxed in Rosetta subject to an applied harmonic distance constraint bringing the VTG threonines into contact with the 3-fold His79 residues.
Bioinformatic Characterization of M- and C-Peptide Repeats
All sequences from the Integrated Microbial Genomes database (IMG) with the CsoS2 M pfam (PF12288) were downloaded in May 2020 for a total of 770 sequences. These were filtered to eliminate those with partially sequenced and unspecified residues. Next, the sequences were dereplicated to 95% identity using Usearch for a total of 272 unique sequences.39 MEME was used to identify peptide motifs and find, as previously reported, the main known motifs of the N-peptide, M-peptide, C-peptide, and CTP.40 MAST from the MEME suite was used to extract the positions of all matches to the M- and C-peptide motifs, and the amino acid sequences were extracted for each of these with a 15aa buffer zone on either side. These peptide sequences were aligned all together (i.e., both M- and C-peptides) using mafft.41 FastTree was used to generate a phylogenetic tree on the basis of this alignment.42 From the tree there are clearly distinguishable clades for the M- and C-peptide classes but some of the initial MAST hits were mislabeled (including H. neapolitanus M7). All sequences belonging to each clade were aligned against each other and Weblogo3 was used to generate sequence logos.43
Results
Size in self-assembling biological systems can be controlled by a number of mechanisms including intrinsic curvature built into the geometry of the constituent proteins,44 molecular rulers,45 excluded volume effects,18 and nonequilibrium race conditions between growth and termination.16 We hypothesized whether Rubisco, which is approximately 65% of the particle by mass and most of the cargo,5 might play a pivotal role in guiding the carboxysome size and overall morphology. However, several studies have shown that expression of carboxysome particles without Rubisco yields the same basic icosahedral architecture and size, including even in the native Halothiobacillus host.20,46−48 We repeated this experiment—by recombinantly expressing all carboxysome proteins except Rubisco in our E. coli heterologous expression system—and obtained similar results (Figure 2a). In particular, the particle size does not collapse to that of the much smaller complexes formed from shell proteins alone (Figure 2a, blue lines).
Figure 2.
(a) Comparison of recombinant carboxysomes with and without the primary cargo Rubisco. TEM negative stain micrographs highlight the same overall pseudo-icosahedral structure. All swarm plots show the minimum Feret diameter. The dark central line is mean diameter with whiskers to ±1 standard deviation. Particle counts and mean diameters are shown at the top. The blue lines mark the sizes of solved icosahedral complexes of shell capsomers, and the corresponding T-numbers are indicated.20,21 (b) Hexamer titration in E. coli carrying pHnCB10, a plasmid containing all 10 carboxysome proteins under control of lac operator, and a plasmid with the major hexameric shell proteins CsoS1ABC under tetracycline induction control. Cells were all induced with 1 mM ITPG and variable amounts of anhydrotetracycline (aTc) to change the shell expression levels. Swarm plots of particle sizes as in (a) with inset zoomed TEM micrographs showing normal morphology. (c) Polyproline helix. Carboxysomes were expressed with CsoS2B, i.e., CsoS2 with no frameshift, and CsoS2B with a 12 nm polyproline II helix inserted. Swarm plots of particle sizes as in (a) with inset TEM micrographs of representative particles.
We next turned to the major shell components: the hexameric proteins. An alternative hypothesis is that increasing shell protein concentration would alter the balance between cargo and shell growth in favor of the shell to enclose smaller particles, ultimately short-circuiting the assembly of full-sized particles.16 To test this idea, the three major hexameric proteins CsoS1ABC were coexpressed with variable induction alongside the full set of carboxysome proteins. The size and morphologies of the resulting particles were not appreciably different; however, the presence of extra shell proteins was deleterious to the yield of purified carboxysomes (Figure 2b, lysate gel, Figure S2), likely by diverting other carboxysome components into off-pathway assemblies and/or aggregates.
A third hypothesis is that CsoS2, also a major component at ∼15% by mass,5 is a possible governor of carboxysome size. To this point, it has previously been found that disordered scaffold proteins can play a role in the size specification of certain icosahedral virus capsids.49,50 A particularly striking case is PBCV-1 in which a disordered protein acts as a molecular ruler, effectively measuring the distance between vertices.45 We thus tested whether CsoS2 acts as a molecular ruler by inserting a sequence encoding a polyproline II helix—a rigid structural element—of approximately 12 nm between the M3 and M4 repeats.51 This construct also produced carboxysomes, but this structural modification of CsoS2 showed no discernible size effect (Figure 2c).
Given these negative results, we posited that other known biochemistries within the carboxysome could play a role in size control. In H. neapolitanus carboxysomes, CsoS2 appears as two distinct isoforms: a short form of ∼61 kDa called CsoS2A and a long form of ∼92 kDa called CsoS2B.27 We previously showed that the short form is produced as a result of a −1 programmed ribosomal frameshift (PRF) that leads to the premature termination of CsoS2 (Figure 3a).28 In native H. neapolitanus carboxysomes and ones produced recombinantly in E. coli, CsoS2A and CsoS2B are present in approximately equimolar quantities. Furthermore, CsoS2B alone can still sustain atmospheric growth in H. neapolitanus, while CsoS2A alone cannot due to its inability to assemble robust carboxysomes.28,33 We thus asked whether the ratio between CsoS2 isoforms has consequences for particle size. Carboxysomes from H. neapolitanus were purified from the wild-type strain (having both CsoS2A and B) and a mutant strain in which CsoS2 was knocked out at the native locus and complemented with a CsoS2B-only sequence lacking the frameshifting slippery sequence. Similarly, carboxysomes were expressed and purified from E. coli with either the wild-type (wtCsoS2) sequence or the CsoS2B-only sequence in pHnCB10. There was good agreement between the average diameters of carboxysomes obtained from H. neapolitanus and E. coli. However, a reduction of size was observed in both going from wtCsoS2 to CsoS2B carboxysomes with mean diameters changing from 114 to 93 nm for H. neapolitanus and 113 to 93 nm for E. coli (Figure 3b), similar to the effect of eliminating Rubisco (Figure 2a). A difference between H. neapolitanus and E. coli carboxysomes is the notable sharpness of the H. neapolinatus CsoS2B-only size distribution. We attribute this to two factors: there is only one CsoS2 form so that there is no particle-by-particle variation caused by the stoichiometry of isoforms, and native carboxysomes are seen to adopt more regular icosahedral geometries relative to their heterologous counterparts.30 In sum, the loss of the frameshift alone toggles a size shift in both H. neapolitanus and E. coli carboxysome that, while appearing relatively modest, represents a nearly 40% decrease of available cargo volume in carboxysomes lacking the CsoS2 frameshifting element.
Figure 3.
(a) Diagram of the programmed ribosomal frameshift (PRF) leading to ribosome slippage to −1 frame and premature termination of CsoS2 to generate the CsoS2A isoform. (b) Kernel density estimate distributions of carboxysome sizes from H. neapolitanus and E. coli expressing carboxysomes with wtCsoS2 (CsoS2A + CsoS2B) or only CsoS2B in which the PRF has been eliminated. White boxes span the inner quartiles, with the median as the central black line. Whiskers are from the 5th to the 95th percentile. Shown below the distributions are p-values for unpaired two-tailed t tests. Representative negative stain TEM images are shown for carboxysomes purified from H. neapolitanus and E. coli. (c) Scheme and distributions for titration of CsoS2A on a background of CsoS2B-only carboxysomes (red) and titration of CsoS2B on a background of wtCsoS2 carboxysomes. t test p values are shown below. (d) Mean particle diameters for titration conditions plotted versus the measured fraction of CsoS2 as CsoS2A from gel densitometry. aTc concentrations in nanomolar quantities are shown in italics next to each data point. Popouts are representative TEM micrographs of the extremes of the CsoS2B titration, showing substantially smaller carboxysomes with overexpression of CsoS2B.
To further explore the connection between CsoS2 isoforms and size, we employed a titratable coexpression system in E. coli. In the first, CsoS2A was titrated against a background of carboxysomes with CsoS2B-only. In the second, CsoS2B was titrated against a background of wild-type carboxysomes containing both CsoS2A and CsoS2B (Figure 3c,d, lysate gel, Figure S3). Broadly, the results indicate that the balance of CsoS2A and CsoS2B systematically alters the size of the resulting carboxysomes. That is, more CsoS2A correlates with larger particle sizes. Interestingly, overexpression of CsoS2B enhanced the recovered yield of carboxysomes while also reducing the mean particle sizes even lower than those of carboxysomes having only CsoS2B (Figure 3b,d). This suggests that whatever interaction is driving the size effect may not be fully saturated under normal assembly conditions.
CsoS2A and CsoS2B are distinguished by the absence or presence of the CTD, respectively, which we know to be an indispensable feature for carboxysome assembly.28 Wild-type and CsoS2B-only carboxysomes then differ in the relative amount of MR with respect to the CTD. To investigate this effect, we therefore built a series of CsoS2 variants in which we systematically truncated or augmented the MR to include different numbers of conserved M-peptide repeats. In all of these, we eliminated the PRF sequence such that the resulting proteins would be precisely defined with respect to the M-peptide-to-CTD ratio without the complicating effects of frameshifting.
The results showed that each CsoS2 variant produced purifiable carboxysomes that encapsulated Rubisco, and, across the series of M-peptide numbers, we observed a dramatic size trend (Figure 4, pooled carboxysome gel, Figure S4). The smallest carboxysomes were those with no M-peptide repeats and with an average diameter of just over 40 nm, a size which is highly similar to the single particle structure solved by Ni and Jiang et al.21 at T = 9 and 36.9 nm diameter. In contrast, augmenting CsoS2B with copies of M-peptides M1–M6 resulted in particles having substantially larger diameters, with an average of 130 versus 94 nm for unmodified CsoS2B.
Figure 4.
(a) Series of carboxysomes were produced containing all components—including Rubisco—and CsoS2 with variable numbers of M-peptide repeats going from 0 to 13 (CsoS2B has 7). Negative stained TEM images are shown for representative particles in the population. All images are at the same scale with 100 nm scale bars. Below is the distribution of the minimum particle Feret diameters. The middle black line indicates the median, the white box represents inner quartiles, and whiskers extend to the 5th–95th percentile. (b) Plot of mean particle diameters as a function of the number of M-peptides. Whiskers are ±1 standard deviation of the distributions. (c) Plate growth assay of CsoS2-deficient H. neapolitanus complemented with a subset of CsoS2 M-peptide variants. Colony forming unit (CFU) counts were quantified in biological triplicate for atmospheric (0.04), 0.5, and 5% CO2 concentrations. Whiskers represent ±1 geometric standard deviation.
In addition, we tested a subset of these variants for complementation in the wtCsoS2 knockout H. neapolitanus strain. While all strains could grow under 5% CO2, we found that the MR truncations negatively impacted survival at atmospheric CO2 concentrations (∼0.04%) with 1 and 4 M-peptide repeats unable to sustain growth, while 7 and 13 M-peptide repeats could. At 0.5% CO2, all but the 1 M-peptide strain could survive (Figure 4c).
The two C-peptides of the CTD are related to the M-peptides and share a common motif of three VTG triplets each separated by 8 residues (Figure S1). This similarity leads us to ask: do these C-peptides play a similar role as the M-peptides with respect to carboxysome size? To test this hypothesis, we created two CsoS2 variants: one with duplication of both C-peptide repeats and another with duplication of the entire CTD, including the CTP. In contrast to the MR augmentation which produced substantially larger carboxysomes, the C-peptide duplications did not result in larger particles and, instead, produced similarly sized or even slightly smaller carboxysomes compared to CsoS2B (Figure S5).
It is now well established that the CTD forms interactions with the carboxysome shell via both the C-peptides and CTP.20,21 Our recent study found that the MR also specifically interacts with the shell hexamers and that the VTG triplets are an essential component of that interaction.29 To further test this interaction in the context of carboxysome assembly, we designed a series of MR-GFP fusion proteins having different numbers of M-peptides. These were coexpressed alongside complete carboxysomes, and we quantified the relative GFP loading by measuring the ratio of GFP fluorescence to absorbance at 340 nm due to carboxysome light scattering (Figure S6). We found a strong dependence on the M-peptide repeat number with more repeats resulting in more loading. The quantity of any of these fusions in the carboxysomes as visualized by SDS-PAGE was far below that of CsoS2 such that they did not appreciably alter the balance of the M- and C-peptides. Together, these results imply that the MR interacts with the shell during assembly and does so in a multivalent fashion.
In summary, the M- and C-peptides both share similar sequence features and interact with the shell. However, they have divergent effects on the size: the number of M-peptides strongly dictates the particle size, while the C-peptides are absolutely required for carboxysome-like particle formation, but their relative numbers seemingly do not alter particle size.
Discussion
In this study, we have examined the factors determining the size of α-carboxysomes. We discovered that the particle size is strongly specified by the M-peptide repeat number and that this size preference is transmitted via its interaction with the shell. Below, we discuss the implications of these observations and put forward a size control model with a potential structural mechanism.
The dispensability of Rubisco to the formation of carboxysome-like particles implies that it is not an active participant in the assembly itself. Instead, it acts more as a client protein, partitioning into a phase-separated condensate with CsoS2 via the NTD on the surface of the growing shell.14 Thus, we are left with the interplay between CsoS2 and the shell from which to build a model. This model must take into account several observations: (1) the MR and CTD both interact with shell proteins;20,21,29 (2) their interactions are both multivalent in nature and due to the repeated M- and C-peptides, respectively21,29 (Figure S6); (3) the M- and C-peptides share a sequence motif—the [V/I]TG triplet—so that they likely have related modes of contacting the shell (Figure S1); and (4) the M- and C-peptides differ in their effects on particle size, i.e., M-peptides encourage larger particles while C-peptides are size neutral (Figures 4 and S5). Taken together, we believe that these factors point toward a model in which CsoS2 modifies the mechanical properties of the shell.
It has been previously shown that expressing shell proteins alone results in small icosahedral structures, revealing that, in the absence of other factors, the capsomers form stable high-curvature contacts.19,52,53 The introduction of CsoS2 into this mix frustrates the formation of these small particles and instead produces significantly larger particles with lower curvature at the shell–shell interfaces. We specifically propose that the essence of the size control lies in energetic differences in shell binding between the M- and C-peptides with respect to the angles at the junctions of shell hexamers. This energetic effect can be expressed as two related but distinct models we call “preferred curvature” and “flexible curvature” (Figure 5a). In the preferred curvature model, the C-peptide has an energy minimum at a greater shell contact angle than the M-peptide. In the flexible-curvature model, the C-peptide and M-peptide have the same energy minimum angle. However, the C-peptide has a broader energy function and can tolerate contact angles greater than those of the M-peptide. Note that the angle, while represented as a two-dimensional plane angle in the cartoon schematic (Figure 5a), is likely a three-dimensional solid angle. Finally, it should be emphasized that the model is not for C-peptide binding at shell junctions directly bordering the pentameric vertex capsomer—where Ni and Jiang et al. find the CTP21—but rather for stabilizing hexamer-only junctions in the vicinity of the vertex that must sustain higher curvature than those further away for which the M-peptide is favored (Figure 5b).54 It is possible that the CTP interaction plays a supporting role in size control. However, it is likely minor since carboxysomes made without pentameric capsomers form with typical diameters.6
Figure 5.
(a) Energetic models of M- and C-peptide shell interactions. Plots show the potential energy of shell-bound M- (green) and C-peptides (blue) as a function of shell angle. In the “preferred curvature” model, the C-peptide has an energy minimum for a high-curvature shell configuration. In the “flexible curvature” model, the C-peptide has the same preferred angle as the M-peptide but a shallower energy well. (b) Idealized map of carboxysome shell curvature. Higher-curvature regions are centered on vertices, while lower curvature prevails in between and on facets. The model predicts green regions will be more favorable for M-peptide binding and blue regions will be more favorable for C-peptide binding. (c) More M-peptide stabilizes particles with a greater share of low-curvature surface and thus greater diameter.
This model can be used to interpret several experimental observations. First, the M-peptide series in Figure 4 entails increasing M-peptide numbers on a constant background of two C-peptides. To a first approximation, the number of high-curvature sites will be roughly the same as particles increase in size, while the number of low-curvature sites will scale with the surface area. Thus, smaller particles are energetically better suited to a low M- to C-peptide ratio, while larger particles will benefit from a high M- to C-peptide ratio (Figure 5c).54 Second, the insufficiency of CsoS2A alone to form carboxysomes can be understood through the lens of the model. Having only M-peptides, it could perhaps knit together flat sheets of hexamers. Without the CTD, however, CsoS2A lacks the C-peptides necessary to stabilize the regions of highest curvature near the vertices and is thus incapable of producing closed particles. Finally, the larger size of H. neapolitanus carboxysomes with wtCsoS2 is explained by the extra abundance of the M-peptides, which extend the low curvature between vertices relative to CsoS2B-only particles.
At first glance, it appears improbable that one could make structural inferences on the basis of our phenomenological model given CsoS2’s high disorder score throughout the sequence.13,14,28 However, analysis of diverse CsoS2 sequences and their structural predictions from the AlphaFold UniProt database revealed that, while the overall structures are amorphous and have low confidence, the aligned M-peptide sequences give a remarkably consistent microdomain (Figure 6a). This structure has the VTG triplet motifs arranged into a triangular structure with pseudo-3-fold symmetry. Furthermore, the conserved pair of cysteine residues are predicted to be in close proximity to one another. This is notable because the carboxysome interior is thought to possess an oxidized environment, and thus one might expect a fold-stabilizing disulfide bridge to form.9,55,56
Figure 6.
(a) Sequence logo for M-peptide created from 1662 M-peptide sequences pulled from 272 CsoS2 sequences after dereplication to 95% identity. Aligned AlphaFold predictions for all M-peptides in CsoS2s from H. neapolitanus, P. marinus, N. eutropha, and I. purpurea. Key conserved residues are colored, matching the bars on the sequence logo. (b) Schematic of hexameric shell lattice indicating the site classes with 3-fold symmetry. (c) Surface representation of H. neapolitanus M1-peptide docked onto the 3-fold hexamer junction with Rosetta. Zoom in shows the VTG motifs in contact with the triangle defined by CsoS1A-His79.
The above structural prediction combined with the curvature model suggests that this pseudo-3-fold symmetric M-peptide ought to bind at a shell junction with 3-fold symmetry (Figure 6b, orange). We took the AlphaFold prediction for the H. neapolitanus CsoS2 M1-peptide and used Rosetta to dock the structure and relax the interface onto the 3-fold hexamer junction using the CsoS1A coordinates (PDB ID: 2EWH).37,38 The fit has good size and shape complementarity to the depression at the junction. A cryoEM structure with portions of the CTD resolved was very recently described by Ni and Jiang et al.21 Most interestingly, they identified CsoS1A-His79 as an important contact for CTD binding through the VTG motifs. His79 lies in close proximity to the triad of VTGs of the docked M1-peptide. Using harmonic distance constraints in Rosetta, the M1-peptide was relaxed into an idealized geometry with the VTG threonine hydroxyls each in hydrogen bonding contact with the His79 delta nitrogen while maintaining the same basic tertiary fold (Figure 6c).
What might structurally distinguish the C-peptide from the M-peptide and explain their energetic preferences for different shell curvatures? The M-peptide consensus sequence is essentially the C-peptide consensus plus several additional conserved positions, namely, the aforementioned cysteine pair, a tyrosine residue appearing three residues downstream of the last VTG motif, and a lysine-valine pair near the beginning (Figures 6a and S1). In a recent study of the MR, we observed significant effects of the conserved tyrosine on the biochemical interactions with the shell and sufficiency of resulting carboxysomes for atmospheric growth in H. neapolitanus, highlighting a distinct yet crucial role of the M-peptides.29 The AlphaFold predictions for the C-peptides are of low confidence and lack a common structure. The CTD-shell structure from Ni and Jiang et al. shows that even as the C-peptide and CTP residues are well resolved, particularly at the VTG-binding contacts, the overall structure threads its way along the hexamers without a clear tertiary structure of its own.21 We hypothesize, in contrast, that the M-peptide acts more as a microdomain with a characteristic fold (Figure 6a). Taken together, this suggests the M-peptide motif is more rigid and prefers to bind low-angle hexamer junctions, while the C-peptide motif is more flexible and can accommodate higher-angle junctions.
Given this model of size, we can make some inferences about the natural diversity of CsoS2 sequences and the attending size consequences. Broadly speaking, we predict that organisms with high M- to C-peptide ratios will produce on average larger carboxysomes, while those with low M- to C-peptide ratios will produce smaller carboxysomes. Furthermore, we also predict that all else being equal, CsoS2 sequences containing a PRF will afford larger carboxysomes than an otherwise identical sequence. It will be interesting to see whether these predictions map to environmental conditions. For example, we observed that CsoS2 truncated to 4 M-peptides could not sustain growth in H. neapolitanus under atmospheric CO2 concentrations but could at 0.5% CO2 (Figure 4c).
More broadly, how does this mechanism of size control compare to other bacterial microcompartments (BMCs)? β-Carboxysomes do not form large icosahedral shell structures in the absence of Rubisco52 nor do they possess an obvious analogue to CsoS2. Combined with the evidence for inside-out assembly, this suggests that the size is determined as a result of a close interplay between a growing shell stabilized around a dense cargo.9,16,17 Among other BMCs (e.g., Pdu and Eut), there is a wide variety of sizes and morphologies. It is not presently clear whether they fall under common assembly principles and size controls. Interestingly, some BMC shell proteins can assemble into diverse macrostructures such as nanotubes, extended sheets, and rosettes.57 A leading hypothesis for the BMC size control is that it may be driven by the relative stoichiometries of constituent shell capsomers that differ in their toleration of curvature.19,54,58,59 With additional structural and biochemical data, it will thus be interesting to see whether there is a broad unifying principle for the size control of microcompartments or if, as with viruses, there are multiple mechanistic routes to achieve similar outcomes.
Conclusions
In this study, we sought to uncover the determinants of particle size in α-carboxysomes. We discovered that the size is set primarily by the scaffold protein CsoS2 and its interactions with the shell. Through systematic modification of CsoS2, we observed that the size effect is mediated by the relative numbers of M-peptide repeats found in the MR and the C-peptide repeats found in the CTD. Additionally, we determined that a functional consequence of the PRF in H. neapolitanus CsoS2 is to increase particle size at no cost of additional sequence.
We propose an energetic model based on differential curvature preferences between the M- and C-peptides and connect this model to a mechanistic structural hypothesis on the basis of AlphaFold structural predictions and new experimental breakthroughs in carboxysome structure.
An increasingly comprehensive view of the α-carboxysome assembly is coming into focus as the structures and interactions linking the components together are elucidated. Only a few key gaps, most notably the CsoS2-MR shell interaction, remain before a complete account of the structure, spanning atomic detail to functional particles. With this depth of information, we think that the carboxysome is a useful paradigm for complex self-assembly and potentially a template, physically and conceptually, for future engineering efforts.
Acknowledgments
The authors thank Dr. Hanlun Jiang for assistance with Rosetta modeling. They acknowledge the staff at the UC Berkeley Electron Microscope Laboratory for training and assistance with TEM. This work was supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Chemical Sciences, Geosciences, & Biosciences Division, Physical Biosciences Program, under Award Number DE-SC0016240 to D.F.S. D.F.S. is an Investigator of the Howard Hughes Medical Institute.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.biochem.3c00403.
Sequence comparison and classification of M- and C-peptides; SDS-PAGE analysis of carboxysomes from CsoS1ABC titration, CsoS2A/B titration, and CsoS2 size variant series; carboxysome size distributions with CTD augmentation; targeting of GFP-MR fragment fusions to carboxysomes; and a table of all plasmids used for protein expression including the amino acid sequences for any CsoS2 variants or fragments (PDF)
Accession Codes
UniProt protein IDs for the H. neapolitanus regulon (Figure 1a) are as follows: CbbL (Rubisco large subunit) O85040; CbbS (Rubisco small subunit) P45686; CsoS2 O85041; CsoSCA (carbonic anhydrase) O85042; CsoS4A (pentameric capsomer) O85043; CsoS4B (pentameric capsomer) O85044; CsoS1C (hexameric capsomer) P45688; CsoS1A (hexameric capsomer) P45689; CsoS1B (hexameric capsomer) P45690; CsoS1D (pseudohexameric capsomer) D0KZ73.
Author Present Address
# Department of Biotechnology, Faculty of Science, Mahidol University, Rama VI Rd., Bangkok 10400, Thailand
Author Contributions
⊥ L.M.O. and A.W.C. contributed equally to this work.
The authors declare the following competing financial interest(s): D.F.S. is a co-founder of Scribe Therapeutics and a scientific advisory board member of Scribe Therapeutics and Mammoth Biosciences. All other authors declare no competing interests.
Footnotes
These studies with shell-only complexes use a subset of capsomer paralogs. It is unknown whether other combinations might change size characteristics.
Supplementary Material
References
- Lee H.; DeLoache W. C.; Dueber J. E. Spatial organization of enzymes for metabolic engineering. Metab. Eng. 2012, 14, 242–251. 10.1016/j.ymben.2011.09.003. [DOI] [PubMed] [Google Scholar]
- Mangan N. M.; Flamholz A.; Hood R. D.; Milo R.; Savage D. F. pH determines the energetic efficiency of the cyanobacterial CO2 concentrating mechanism. Proc. Natl. Acad. Sci. U.S.A. 2016, 113 (36), E5354–E5362. 10.1073/pnas.1525145113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Espie G. S.; Kimber M. S. Carboxysomes: cyanobacterial RubisCO comes in small packages. Photosynth. Res. 2011, 109, 7–20. 10.1007/s11120-011-9656-y. [DOI] [PubMed] [Google Scholar]
- Heinhorst S.; Cannon G. C.; Shively J. M.. Carboxysomes and Carboxysome-like Inclusions. In Complex Intracellular Structures in Prokaryotes; Shively J. M., Ed.; Springer Berlin Heidelberg: Berlin, Heidelberg, 2006; pp 141–165. [Google Scholar]
- Sun Y.; Harman V. M.; Johnson J. R.; Brownridge P. J.; Chen T.; Dykes G. F.; Lin Y.; Beynon R. J.; Liu L.-N. Decoding the Absolute Stoichiometric Composition and Structural Plasticity of α-Carboxysomes. MBio 2022, 13, e03629-21 10.1128/mbio.03629-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai F.; Menon B. B.; Cannon G. C.; Curry K. J.; Shively J. M.; Heinhorst S. The pentameric vertex proteins are necessary for the icosahedral carboxysome shell to function as a CO2 leakage barrier. PLoS One 2009, 4, e7521. 10.1371/journal.pone.0007521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dou Z.; Heinhorst S.; Williams E. B.; Murin C. D.; Shively J. M.; Cannon G. C. CO2 fixation kinetics of Halothiobacillus neapolitanus mutant carboxysomes lacking carbonic anhydrase suggest the shell acts as a diffusional barrier for CO2. J. Biol. Chem. 2008, 283, 10377–10384. 10.1074/jbc.M709285200. [DOI] [PubMed] [Google Scholar]
- Desmarais J. J.; Flamholz A. I.; Blikstad C.; Dugan E. J.; Laughlin T. G.; Oltrogge L. M.; Chen A. W.; Wetmore K.; Diamond S.; Wang J. Y.; Savage D. F. DABs are inorganic carbon pumps found throughout prokaryotic phyla. Nat. Microbiol. 2019, 4, 2204–2215. 10.1038/s41564-019-0520-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen A. H.; Robinson-Mosher A.; Savage D. F.; Silver P. A.; Polka J. K. The bacterial carbon-fixing organelle is formed by shell envelopment of preassembled cargo. PLoS One 2013, 8, e76127. 10.1371/journal.pone.0076127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cameron J. C.; Wilson S. C.; Bernstein S. L.; Kerfeld C. A. Biogenesis of a bacterial organelle: the carboxysome assembly pathway. Cell 2013, 155, 1131–1140. 10.1016/j.cell.2013.10.044. [DOI] [PubMed] [Google Scholar]
- Wang H.; Yan X.; Aigner H.; Bracher A.; Nguyen N. D.; Hee W. Y.; Long B. M.; Price G. D.; Hartl F. U.; Hayer-Hartl M. Rubisco condensate formation by CcmM in β-carboxysome biogenesis. Nature 2019, 566, 131–135. 10.1038/s41586-019-0880-5. [DOI] [PubMed] [Google Scholar]
- Rae B. D.; Long B. M.; Badger M. R.; Price G. D. Functions, compositions, and evolution of the two types of carboxysomes: polyhedral microcompartments that facilitate CO2 fixation in cyanobacteria and some proteobacteria. Microbiol. Mol. Biol. Rev. 2013, 77, 357–379. 10.1128/MMBR.00061-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai F.; Dou Z.; Bernstein S. L.; Leverenz R.; Williams E. B.; Heinhorst S.; Shively J.; Cannon G. C.; Kerfeld C. A. Advances in understanding carboxysome assembly in prochlorococcus and synechococcus implicate csos2 as a critical component. Life 2015, 5, 1141–1171. 10.3390/life5021141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oltrogge L. M.; Chaijarasphong T.; Chen A. W.; Bolin E. R.; Marqusee S.; Savage D. F. Multivalent interactions between CsoS2 and Rubisco mediate α-carboxysome formation. Nat. Struct. Mol. Biol. 2020, 27, 281–287. 10.1038/s41594-020-0387-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shively J. M.; Ball F.; Brown D. H.; Saunders R. E. Functional organelles in prokaryotes: polyhedral inclusions (carboxysomes) of Thiobacillus neapolitanus. Science 1973, 182, 584–586. 10.1126/science.182.4112.584. [DOI] [PubMed] [Google Scholar]
- Rotskoff G. M.; Geissler P. L. Robust nonequilibrium pathways to microcompartment assembly. Proc. Natl. Acad. Sci. U.S.A. 2018, 115, 6341–6346. 10.1073/pnas.1802499115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mohajerani F.; Hagan M. F. The role of the encapsulated cargo in microcompartment assembly. PLoS Comput. Biol. 2018, 14, e1006351. 10.1371/journal.pcbi.1006351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mohajerani F.; Sayer E.; Neil C.; Inlow K.; Hagan M. F. Mechanisms of Scaffold-Mediated Microcompartment Assembly and Size Control. ACS Nano 2021, 15, 4197–4212. 10.1021/acsnano.0c05715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sutter M.; Greber B.; Aussignargues C.; Kerfeld C. A. Assembly principles and structure of a 6.5-MDa bacterial microcompartment shell. Science 2017, 356, 1293–1297. 10.1126/science.aan3289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan Y. Q.; Ali S.; Xue B.; Teo W. Z.; Ling L. H.; Go M. K.; Lv H.; Robinson R. C.; Narita A.; Yew W. S. Structure of a Minimal α-Carboxysome-Derived Shell and Its Utility in Enzyme Stabilization. Biomacromolecules 2021, 22, 4095–4109. 10.1021/acs.biomac.1c00533. [DOI] [PubMed] [Google Scholar]
- Ni T.; Jiang Q.; Ng P. C.; Shen J.; Dou H.; Zhu Y.; Radecke J.; Dykes G. F.; Huang F.; Liu L.-N.; Zhang P. Intrinsically disordered CsoS2 acts as a general molecular thread for α-carboxysome shell assembly. Nat. Commun. 2023, 14, 5512 10.1038/s41467-023-41211-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanaka S.; Kerfeld C. A.; Sawaya M. R.; Cai F.; Heinhorst S.; Cannon G. C.; Yeates T. O. Atomic-level models of the bacterial carboxysome shell. Science 2008, 319, 1083–1086. 10.1126/science.1151458. [DOI] [PubMed] [Google Scholar]
- Blikstad C.; Dugan E. J.; Laughlin T. G.; Turnsek J. B.; Liu M. D.; Shoemaker S. R.; Vogiatzi N.; Remis J. P.; Savage D. F. Identification of a carbonic anhydrase−Rubisco complex within the alpha-carboxysome. Proc. Natl. Acad. Sci. U.S.A. 2023, 120 (43), e2308600120 10.1073/pnas.2308600120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Metskas L. A.; Ortega D.; Oltrogge L. M.; Blikstad C.; Lovejoy D. R.; Laughlin T. G.; Savage D. F.; Jensen G. J. Rubisco forms a lattice inside alpha-carboxysomes. Nat. Commun. 2022, 13, 4863 10.1038/s41467-022-32584-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ni T.; Sun Y.; Burn W.; Al-Hazeem M. M. J.; Zhu Y.; Yu X.; Liu L.-N.; Zhang P. Structure and assembly of cargo Rubisco in two native α-carboxysomes. Nat. Commun. 2022, 13, 4299 10.1038/s41467-022-32004-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kerfeld C. A.; Melnicki M. R. Assembly, function and evolution of cyanobacterial carboxysomes. Curr. Opin. Plant Biol. 2016, 31, 66–75. 10.1016/j.pbi.2016.03.009. [DOI] [PubMed] [Google Scholar]
- Baker S. H.; Lorbach S. C.; Rodriguez-Buey M.; Williams D. S.; Aldrich H. C.; Shively J. M. The correlation of the gene csoS2 of the carboxysome operon with two polypeptides of the carboxysome in thiobacillus neapolitanus. Arch. Microbiol. 1999, 172, 233–239. 10.1007/s002030050765. [DOI] [PubMed] [Google Scholar]
- Chaijarasphong T.; Nichols R. J.; Kortright K. E.; Nixon C. F.; Teng P. K.; Oltrogge L. M.; Savage D. F. Programmed Ribosomal Frameshifting Mediates Expression of the α-Carboxysome. J. Mol. Biol. 2016, 428, 153–164. 10.1016/j.jmb.2015.11.017. [DOI] [PubMed] [Google Scholar]
- Turnšek J. B.; Oltrogge L. M.; Savage D. F. Conserved and repetitive motifs in an intrinsically disordered protein drive α-carboxysome assembly. BioRxiv 2023, 2023-07. 10.1101/2023.07.08.548221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonacci W.; Teng P. K.; Afonso B.; Niederholtmeyer H.; Grob P.; Silver P. A.; Savage D. F. Modularity of a carbon-fixing protein organelle. Proc. Natl. Acad. Sci. U.S.A. 2012, 109, 478–483. 10.1073/pnas.1108557109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engler C.; Kandzia R.; Marillonnet S. A one pot, one step, precision cloning method with high throughput capability. PLoS One 2008, 3, e3647. 10.1371/journal.pone.0003647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flamholz A. I.; Dugan E.; Blikstad C.; Gleizer S.; Ben-Nissan R.; Amram S.; Antonovsky N.; Ravishankar S.; Noor E.; Bar-Even A.; Milo R.; Savage D. F. Functional reconstitution of a bacterial CO2 concentrating mechanism in Escherichia coli. Elife 2020, 9, e59882 10.7554/eLife.59882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chaijarasphong T.Towards an in Vitro Reconstitution of the α-Carboxysome, Dissertation or Thesis; University of California: Berkeley, 2016. [Google Scholar]
- Pédelacq J.-D.; Cabantous S.; Tran T.; Terwilliger T. C.; Waldo G. S. Engineering and characterization of a superfolder green fluorescent protein. Nat. Biotechnol. 2006, 24, 79–88. 10.1038/nbt1172. [DOI] [PubMed] [Google Scholar]
- Jumper J.; Evans R.; Pritzel A.; Green T.; Figurnov M.; Ronneberger O.; Tunyasuvunakool K.; Bates R.; Žídek A.; Potapenko A.; Bridgland A.; Meyer C.; Kohl S. A. A.; Ballard A. J.; Cowie A.; Romera-Paredes B.; Nikolov S.; Jain R.; Adler J.; Back T.; Petersen S.; Reiman D.; Clancy E.; Zielinski M.; Steinegger M.; Pacholska M.; Berghammer T.; Bodenstein S.; Silver D.; Vinyals O.; Senior A. W.; Kavukcuoglu K.; Kohli P.; Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y.; Skolnick J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 2005, 33, 2302–2309. 10.1093/nar/gki524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tsai Y.; Sawaya M. R.; Cannon G. C.; Cai F.; Williams E. B.; Heinhorst S.; Kerfeld C. A.; Yeates T. O. Structural analysis of CsoS1A and the protein shell of the Halothiobacillus neapolitanus carboxysome. PLoS Biol. 2007, 5, e144. 10.1371/journal.pbio.0050144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marze N. A.; Roy Burman S. S.; Sheffler W.; Gray J. J. Efficient flexible backbone protein-protein docking for challenging targets. Bioinformatics 2018, 34, 3461–3469. 10.1093/bioinformatics/bty355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 2010, 26, 2460–2461. 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- Bailey T. L.; Johnson J.; Grant C. E.; Noble W. S. The MEME Suite. Nucleic Acids Res. 2015, 43, W39–W49. 10.1093/nar/gkv416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K.; Standley D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price M. N.; Dehal P. S.; Arkin A. P. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One 2010, 5, e9490. 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crooks G. E.; Hon G.; Chandonia J. M.; Brenner S. E. WebLogo: a sequence logo generator. Genome Res. 2004, 14, 1188–1190. 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mannige R. V.; Brooks C. L. Geometric considerations in virus capsid size specificity, auxiliary requirements, and buckling. Proc. Natl. Acad. Sci. U.S.A. 2009, 106, 8531–8536. 10.1073/pnas.0811517106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fang Q.; Zhu D.; Agarkova I.; Adhikari J.; Klose T.; Liu Y.; Chen Z.; Sun Y.; Gross M. L.; Van Etten J. L.; Zhang X.; Rossmann M. G. Near-atomic structure of a giant virus. Nat. Commun. 2019, 10, 388 10.1038/s41467-019-08319-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Menon B. B.; Dou Z.; Heinhorst S.; Shively J. M.; Cannon G. C. Halothiobacillus neapolitanus carboxysomes sequester heterologous and chimeric RubisCO species. PLoS One 2008, 3, e3570. 10.1371/journal.pone.0003570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li T.; Jiang Q.; Huang J.; Aitchison C. M.; Huang F.; Yang M.; Dykes G. F.; He H.-L.; Wang Q.; Sprick R. S.; Cooper A. I.; Liu L.-N. Reprogramming bacterial protein organelles as a nanoreactor for hydrogen production. Nat. Commun. 2020, 11, 5448 10.1038/s41467-020-19280-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang Q.; Li T.; Yang J.; Aitchison C. M.; Huang J.; Chen Y.; Huang F.; Wang Q.; Cooper A. I.; Liu L.-N. Synthetic engineering of a new biocatalyst encapsulating [NiFe]-hydrogenases for enhanced hydrogen production. J. Mater. Chem. B 2023, 11, 2684–2692. 10.1039/D2TB02781J. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen D.-H.; Baker M. L.; Hryc C. F.; DiMaio F.; Jakana J.; Wu W.; Dougherty M.; Haase-Pettingell C.; Schmid M. F.; Jiang W.; Baker D.; King J. A.; Chiu W. Structural basis for scaffolding-mediated assembly and maturation of a dsDNA virus. Proc. Natl. Acad. Sci. U.S.A. 2011, 108, 1355–1360. 10.1073/pnas.1015739108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Medina E. M.; Andrews B. T.; Nakatani E.; Catalano C. E. The bacteriophage lambda gpNu3 scaffolding protein is an intrinsically disordered and biologically functional procapsid assembly catalyst. J. Mol. Biol. 2011, 412, 723–736. 10.1016/j.jmb.2011.07.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schuler B.; Lipman E. A.; Steinbach P. J.; Kumke M.; Eaton W. A. Polyproline and the “spectroscopic ruler” revisited with single-molecule fluorescence. Proc. Natl. Acad. Sci. U.S.A. 2005, 102, 2754–2759. 10.1073/pnas.0408164102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai F.; Bernstein S. L.; Wilson S. C.; Kerfeld C. A. Production and Characterization of Synthetic Carboxysome Shells with Incorporated Luminal Proteins. Plant Physiol. 2016, 170, 1868–1877. 10.1104/pp.15.01822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sutter M.; Laughlin T. G.; Sloan N. B.; Serwas D.; Davies K. M.; Kerfeld C. A. Structure of a Synthetic β-Carboxysome Shell. Plant Physiol. 2019, 181, 1050–1058. 10.1104/pp.19.00885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li S.; Matoz-Fernandez D. A.; Olvera de la Cruz M. Effect of mechanical properties on multicomponent shell patterning. ACS Nano 2021, 15, 14804–14812. 10.1021/acsnano.1c04795. [DOI] [PubMed] [Google Scholar]
- Peña K. L.; Castel S. E.; de Araujo C.; Espie G. S.; Kimber M. S. Structural basis of the oxidative activation of the carboxysomal gamma-carbonic anhydrase, CcmM. Proc. Natl. Acad. Sci. U.S.A. 2010, 107, 2455–2460. 10.1073/pnas.0910866107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heinhorst S.; Williams E. B.; Cai F.; Murin C. D.; Shively J. M.; Cannon G. C. Characterization of the carboxysomal carbonic anhydrase CsoSCA from Halothiobacillus neapolitanus. J. Bacteriol. 2006, 188, 8087–8094. 10.1128/JB.00990-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young E. J.; Burton R.; Mahalik J. P.; Sumpter B. G.; Fuentes-Cabrera M.; Kerfeld C. A.; Ducat D. C. Engineering the bacterial microcompartment domain for molecular scaffolding applications. Front. Microbiol. 2017, 8, 1441. 10.3389/fmicb.2017.01441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mallette E.; Kimber M. S. A complete structural inventory of the mycobacterial microcompartment shell proteins constrains models of global architecture and transport. J. Biol. Chem. 2017, 292, 1197–1210. 10.1074/jbc.M116.754093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ochoa J. M.; Yeates T. O. Recent structural insights into bacterial microcompartment shells. Curr. Opin. Microbiol. 2021, 62, 51–60. 10.1016/j.mib.2021.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






