Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2025 Nov 3;122(45):e2409566122. doi: 10.1073/pnas.2409566122

From sequence to scaffold: Computational design of protein nanoparticle vaccines from AlphaFold2-predicted building blocks

Cyrus M Haas a,b,c,1, Naveen Jasti b,d,1, Annie Dosey b,e, Joel D Allen f, Rebecca Gillespie g, Jackson McGowan b,e, Elizabeth M Leaf b,e, Max Crispin f, Cole A DeForest a,b,c,h,i,j, Masaru Kanekiyo g, Neil P King b,e,2
PMCID: PMC12626006  PMID: 41183183

Significance

Self-assembling protein nanoparticle vaccines have steadily gained traction in both academic and industry vaccine development over the last decade. Recent work has shown that computationally designing self-assembling proteins allows the structural and functional features of nanoparticle vaccines to be precisely tailored and that this can significantly affect the vaccine-elicited immune response. To date, designing such nanoparticle vaccines has required the use of known crystal structures as starting points. Here, we show how recently developed machine learning tools can be leveraged to design self-assembling protein nanoparticle immunogens in the absence of experimentally determined structures of the building blocks that elicit strong immune responses in mice.

Keywords: protein design, nanoparticles, vaccines, influenza, machine learning

Abstract

Self-assembling protein nanoparticles are being increasingly utilized in the design of next-generation vaccines due to their ability to induce antibody responses of superior magnitude, breadth, and durability. Computational protein design offers a route to nanoparticle scaffolds with structural and biochemical features tailored to specific vaccine applications. Although strategies for designing self-assembling proteins have been established, the recent development of powerful machine learning (ML)–based tools for protein structure prediction and design provides an opportunity to overcome several of their limitations. Here, we leveraged these tools to develop a generalizable method for designing self-assembling proteins starting from AlphaFold2 predictions of oligomeric protein building blocks. We used the method to generate six 60-subunit protein nanoparticles with icosahedral symmetry, and single-particle cryoelectron microscopy reconstructions of three of them revealed that they were designed with atomic-level accuracy. To transform one of these nanoparticles into a functional immunogen, we reoriented its termini through circular permutation, added a genetically encoded oligomannose-type glycan, and displayed a stabilized trimeric variant of the influenza hemagglutinin receptor-binding domain through a rigid de novo linker. The resultant immunogen elicited potent receptor-blocking and neutralizing antibody responses in mice. Our results demonstrate the practical utility of ML–based protein modeling tools in the design of nanoparticle vaccines. More broadly, by eliminating the requirement for experimentally determined structures of protein building blocks, our method dramatically expands the number of starting points available for designing self-assembling proteins.


The design of nanoparticle immunogens based on self-assembling protein scaffolds is a burgeoning area of technological innovation in vaccines (14). It has long been known that repetition of epitopes in three-dimensional space is a danger signal sensed by B cells through pathogen- or vaccine-driven clustering of their antigen receptors (5, 6). More recently, engineered protein nanoparticle immunogens and DNA nanotechnology objects have been used to explore in detail the effects of repetitive antigen display on B cell responses, highlighting the role of the detailed geometry of antigen presentation (713). In parallel, the advantageous trafficking of particulate immunogens in vivo and how they are sensed by the innate immune system is also becoming increasingly clear (14, 15). Together, these observations have motivated many efforts over the last decade in which antigens of interest are displayed on self-assembling protein scaffolds, beginning with two landmark studies in 2013 (16, 17). These studies generally show that protein nanoparticle immunogens elicit significantly more potent humoral immune responses than soluble antigen and in some cases have also demonstrated improved quality, breadth, and durability (9, 1820). Many of these efforts have used naturally occurring self-assembling proteins such as ferritin and lumazine synthase as scaffolds (16, 17), although computationally designed protein nanoparticles are being increasingly adopted by the field (2123). Importantly, a COVID-19 vaccine comprising the receptor-binding domain (RBD) of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike protein displayed on a computationally designed protein nanoparticle was recently licensed in multiple countries, clinically derisking this technology (2426).

All of the computationally designed protein nanoparticle vaccines studied to date have used protein nanoparticles generated by docking oligomeric building blocks in a target symmetric architecture and designing noncovalent protein–protein interfaces that drive assembly (2729). Since its introduction more than a decade ago, this method has been generalized to design a large number of self-assembling proteins with a wide variety of structural properties (3032). Nevertheless, except for a small number of custom oligomeric building blocks (21), this approach has otherwise been limited by a requirement for experimentally solved structures used as inputs to docking. Despite the large and growing Protein Data Bank [PDB; (33)], this remains a significant limitation, as available structures represent only a tiny fraction of the possible space of protein structures (3436). For example, if ten unique trimers from the PDB each yield five promising nanoparticle designs, the result would be 50 possible nanoparticles. However, if the number of building block candidates could be expanded to 200 by leveraging homologous sequences, the result would be 1,000 possible nanoparticles, a 20-fold increase. Even within protein families that are represented in the PDB, biotechnologically relevant properties such as stability, secretability, and posttranslational modifications can vary widely among homologs from different species (31, 37). Given that the structural and biophysical properties of an assembled nanoparticle are largely determined by the properties of its building blocks, methods that expand the number of protein building blocks available for design would facilitate the design of nanoparticles and nanoparticle vaccines with desired properties.

Machine learning (ML)-based methods are revolutionizing protein structure prediction and design (3843). Design methods like RFdiffusion for backbone generation and ProteinMPNN for amino acid sequence design have dramatically increased the success rate of many de novo design challenges, aided by the use of structure prediction methods such as AlphaFold2 (AF2) and RoseTTAFold as filters for high-quality designed proteins (4446). For example, in protein nanoparticle design specifically, two recent papers highlighted that using ProteinMPNN for interface design resulted in improved interfaces and nanoparticles compared to traditional Rosetta-based sequence design (47, 48). Although the ability of AF2 and RoseTTAFold to provide accurate models of protein structures from sequence alone provides an opportunity to dramatically expand the number of building blocks available for nanoparticle design, no designed self-assembling nanoparticles based on predicted structures from these tools have been reported yet.

Here, we develop a general method for designing self-assembling proteins that uses predicted structures of oligomeric proteins from thermophilic organisms as building blocks. We found that homomeric nanoparticles can be designed with atomic-level accuracy from sequence alone and can be engineered to elicit robust antibody responses against influenza by repetitively displaying antigens based on the RBD of the viral hemagglutinin (HA).

Results

Computational Nanoparticle Design.

We set out to leverage ML-based structure prediction tools to expand the number of building blocks available for nanoparticle design. Our general strategy was to i) predict the structures of homologs of oligomeric proteins of known structure, and ii) use the predictions as inputs to established pipelines for nanoparticle design based on symmetric docking and protein–protein interface design (49). We focused on the design of homomeric (“one-component”) nanoparticles, as the docking of such assemblies is severely constrained and would benefit most from the availability of a wider variety of building blocks. Based on our prior experiences designing one-component nanoparticles (28, 31, 49), we attempted to identify building blocks with attributes that may increase the likelihood they would assemble into nanoparticles. Specifically, we searched the PDB for homomeric proteins from bacteria or archaea that had threefold rotational symmetry (C3), high-resolution structures (≤ 2.4 Å), and significant (>50%) helical secondary structure (Fig. 1A). After clustering similar sequences, 95 structures (PDB seeds) passed these filters. We then screened the UniProt database for proteins from thermophilic organisms with >50% sequence identity to these 95 PDB seeds. Previous work has shown thermophilic homologs can be up to 10-fold more tolerant of mutations at individual residues than their mesophilic counterparts, allowing for the evolution of thermostable proteins into binders, receptor analogs, and P450 enzymes with novel activities (5056). By focusing on proteins from thermophiles, we hoped to identify starting structures that were robust, soluble, and tolerant to mutation. Of the 359 identified thermophilic trimer homologs, 191 yielded high-confidence AF2 predictions [predicted aligned error (pAE) < 5, predicted local distance difference test (pLDDT) > 90] (Fig. 1B). We docked these predicted structures in the so-called “I3” architecture (28) by aligning their threefold symmetry axes to the corresponding axes of icosahedral point group symmetry, using the Rosetta SymDofMover to sample the translational and rotational degrees of freedom along this shared axis [Fig. 1C, (27)]. We then designed the interfaces between the docked C3 building blocks using ProteinMPNN (Fig. 1D and SI Appendix, Table S1), which was recently shown to generate high-quality nanoparticle interfaces without requiring manual intervention [(47, 48); Methods). This pipeline produced ~430,000 designs, which were filtered using Rosetta scoring metrics [ddG < −10, buried Solvent Accessible Surface Area (SASA) > 300, Shape Complementarity > 0.5, and dGins, pred > 2.6; SI Appendix, Fig. S1] and by visual inspection to select 88 designs for experimental characterization.

Fig. 1.

Fig. 1.

Computational design of novel self-assembling proteins from AF2-predicted models. (A) The PDB was searched for structures that possess optimal attributes to form nanoparticles. (B) From these PDB seeds, homologous sequences were identified from thermophilic organisms and their structures predicted with AF2. (C) The best-scoring predictions were docked using RPXDock (40) and their translational (r) and rotational (ω) degrees of freedom were subsequently sampled using the Rosetta SymDofMover (27) to generate multiple structures for interface design. (D) The interfaces between docked nanoparticle components were designed using ProteinMPNN (42), and a subset of these designs was selected using Rosetta scoring metrics for experimental characterization.

Screening and Characterization of Designed Nanoparticles.

We ordered genes for expression in Escherichia coli encoding the 88 designs with hexahistidine tags appended to enable purification via immobilized metal affinity chromatography (IMAC). IMAC-purified samples were subsequently analyzed by size-exclusion chromatography (SEC), resulting in ten putative hits with elution peaks at the expected retention time (Fig. 2A). This initial success rate compares favorably to historical success rates for established design protocols starting from building blocks of known structure. Following large-scale expression and purification of these hits, SEC confirmed that six designs yielded predominant peaks at the expected elution volume, with three of the six also featuring substantial peaks at the void volume of the column (Fig. 2B). These six designs originated from three unique PDB seeds (6HS8: I3-A6, I3-B2, I3-B3, and I3-B5; 1QWG: I3-D12; 6HF7: I3-A7) and therefore have three distinct backbones (SI Appendix, Table S2). SEC fractions at the expected elution volume were pooled and analyzed by dynamic light scattering (DLS) and negative stain electron microscopy (nsEM), which revealed the presence of homogeneous nanoparticles for all six samples. Low-resolution three-dimensional (3D) reconstructions from nsEM were consistent with the computational design models. We measured the aggregation temperature (Tagg) for each nanoparticle by monitoring DLS while heating from 25 to 84 °C, which showed that all nanoparticles retained their intended assembly state until at least 60 °C, and I3-D12 did not aggregate in the measured range (Fig. 2C and SI Appendix, Fig. S2).

Fig. 2.

Fig. 2.

Screening and characterization of designed nanoparticles. (A) An overlay of HPLC traces for all 88 designs tested at small scale. The magnified region shows a large number of putative hits at the expected retention time for icosahedral nanoparticles. (B) From Left to Right, normalized SEC traces of IMAC eluent from large-scale expression, normalized DLS, and negative stain micrographs with 2D averages and 3D reconstructions for each of the six confirmed nanoparticles. Bars labeled “NP” indicate the SEC fractions containing nanoparticles that were pooled for further characterization. Scale bars represent 40 nm, 10 nm, and 5 nm for the micrographs, 2D averages, and 3D reconstructions, respectively. (C) Aggregation temperatures for the six confirmed nanoparticles. Data related to nanoparticles from PDB seeds 6HS8, 1QWG, and 6HF7 are colored in blue, yellow, and green, respectively.

High-Resolution Structural Characterization of Three Nanoparticles.

To evaluate the accuracy of our design method at high resolution, we used single-particle cryoelectron microscopy (cryo-EM) to determine the structures of one nanoparticle from each of the successful PDB seeds. Homogeneous and monodisperse particles were observed in the raw micrographs for all three samples, and 2D class averages exhibited multiple well-defined views that provided clear evidence of icosahedral symmetry (Fig. 3A). Icosahedral 3D reconstructions at 2.5 Å, 2.3 Å, and 3.5 Å resolution were obtained for I3-A6, I3-D12, and I3-A7, respectively, allowing us to build atomic models of each nanoparticle (Fig. 3B and SI Appendix, Fig. S3 and Table S3). Comparing the experimental cryo-EM structures to the computational design models demonstrated that AF2 structure prediction was quite accurate: The all-atom rmsd for a single monomer of I3-A6, I3-D12, and I3-A7 were 1.3 Å, 1.7 Å, and 1.4 Å, respectively. Aligning two subunits across the designed interfaces revealed that these were designed with high accuracy, yielding alpha-carbon rmsds of 0.7 Å, 1.1 Å, and 1.5 Å, respectively (Fig. 3 C and D). In each case, the rotameric states of the interdigitated hydrophobic side chains at the designed interfaces corresponded well with the design models, as did the polar residues lining the periphery of the I3-A6 and I3-D12 interfaces (Fig. 3D). Several of the polar residues at the I3-A7 interface adopted unintended rotamers, likely due to small shifts in the backbones of the loops that make up the interface. Nevertheless, essentially all of the polar and nonpolar contacts at each interface recapitulated those in the design models. Slight rigid body deviations between the building blocks added up to alpha-carbon rmsds of 1.6 Å, 2.2 Å, and 2.2 Å across the entire 60-subunit nanoparticle for I3-A6, I3-D12, and I3-A7, respectively. These values are comparable to those obtained from previous computational nanoparticle design campaigns that started from high-resolution crystal structures of the building blocks (27, 29, 47). These results establish that ML-based tools for structure prediction (AF2) and design (ProteinMPNN) enable the accurate design of self-assembling protein nanomaterials from predicted structures.

Fig. 3.

Fig. 3.

High-resolution structural characterization of three nanoparticles. (A) Processed micrographs and 2D class averages. Scale bars represent the diameter of each nanoparticle as labeled in panel B. (B) Cryo-EM reconstructions. (C) Two overlaid trimers across the twofold axis. Dashed lines represent residues that were not modeled due to insufficient density. (D) Close-up of the designed interface for each nanoparticle. Gray structures represent AF2 predictions while colored ones are the true cryo-EM structure.

Engineering I3-A7 to Function as a Nanoparticle Immunogen.

The application of computationally designed proteins as scaffolds for nanoparticle vaccines has advanced in recent years, with one such vaccine licensed [for SARS-CoV-2 (4, 24)] and several others in clinical trials for additional indications (8, 23). To test whether our designed assemblies could be functionalized as multivalent nanoparticle immunogens through genetic fusion of an antigen of interest, we set out to re-engineer several of their structural and biochemical properties. First, we evaluated whether they could be secreted as nanoparticles from mammalian cells, a prerequisite for display of many glycoprotein antigens that require posttranslational modifications found only in the secretory pathway (57). We found that only one of our six designed nanoparticles (I3-A7) successfully secreted from transfected Expi293F cells when an N-terminal signal peptide was appended, so our subsequent engineering efforts focused on this nanoparticle (SI Appendix, Fig. S4). Second, as described below, we circularly permuted the subunit of I3-A7 so that its N terminus was present on the exterior of the nanoparticle and available for genetic fusion. Third, we engineered an oligomannose-type N-linked glycan into the I3-A7 subunit, as these have been shown to improve the trafficking and immunogenicity of particulate immunogens (3, 14). Finally, we designed a de novo rigid genetic linker between the re-engineered I3-A7 subunit and a variant of the influenza HA RBD, stabilized in a native-like, closed trimeric conformation [“trihead”; (18)].

Display of the HA trihead via genetic fusion required the N terminus of I3-A7 to be accessible on the nanoparticle exterior. As the termini of I3-A7 were originally buried and therefore inaccessible, we circularly permuted the subunit, a genetic manipulation in which the original termini are fused together and new termini are created elsewhere (58). By visual inspection, we chose to “cut” the subunit between Asp133 and Ser134 (Genbank PKK86397 numbering), which would place the new N terminus on the exterior surface of each subunit (Fig. 4A). We then used RFdiffusion to generate de novo backbones for linkers connecting the two original termini (Gly2 and Asn183) up to four amino acids in length. The two adjacent amino acids on each side were allowed to move during diffusion in order to optimize backbone geometry throughout the linker. We then designed sequences for the partially diffused backbones using ProteinMPNN and predicted the structures of trimers of the circularly permuted subunits by AF2. We used visual inspection and AF2 prediction metrics for the diffused regions (pLDDT > 92, pAE < 5, IpTM > 0.9) to select four different linkers to test experimentally (Fig. 4B).

Fig. 4.

Fig. 4.

Engineering I3-A7 to function as a nanoparticle immunogen. (A) A schematic showing the process of circular permutation on a subunit of the trimer that forms the nanoparticle. (B) The four diffused linkers between the I3-A7 termini that were tested experimentally. The arrow indicates the directionality of the I3-A7cp backbone from the N to C terminus. (C) Four MPNN outputs at a central position in the intertrimer interface. (D) A representative electron micrograph of the circularly permuted nanoparticle. (Scale bar, 21 nm.) (E) A depiction of the two N-linked glycans added to each subunit trimer. Glycans were modeled as Man5GlcNAc2 using GlycoShape (59). An engineered sequon at position N111 was found to be unoccupied. (F) Complete characterization of each occupied glycosylation site via LC–MS. (G) A representative electron micrograph of the nanoparticle with glycans at positions N46 and N62. (Scale bar, 21 nm.)

Before validating the permutations, we noticed that our postpermutation termini were near the intratrimer interface of I3-A7, introduced by a break in the polypeptide chain that could potentially destabilize the trimer. We therefore used ProteinMPNN to evaluate stabilizing mutations at ten positions at the trimer interface selected by inspection of the parent PDB seed (PDB ID 6HF7). After filtering on AF2 prediction metrics (pLDDT > 90, pAE < 8, IpTM > 0.9), we selected five sets of mutations in the central helix near the threefold symmetry axis. For example, mutations at position Ala14 increased hydrophobic packing (A14Y and A14F) or introduced new hydrogen bonds (A14Y, A14S, and A14N) between the trimer subunits (Fig. 4C). All 20 combinations of the five redesigned trimeric interfaces with the four diffused linkers had strong AF2 prediction metrics, including low rmsd to the original trimer prediction of the thermophilic homolog. We identified the highest-expressing variant from Expi239F cells (called I3-A7cp) by sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS-PAGE) of cell culture supernatants and confirmed that it still formed the expected icosahedral nanoparticle using SEC and nsEM (Fig. 4D and SI Appendix, Fig. S4).

We then engineered N-linked glycans into I3-A7cp to further optimize its performance as a nanoparticle immunogen. Glycans have been shown to improve antigen secretion (60, 61), shield antibody responses against off-target epitopes (62), and—specifically in the case of oligomannose-type glycans—enhance vaccine trafficking and immunogenicity by engaging the innate immune system (3, 14). We introduced sequons encoding N-linked glycans at three solvent-exposed positions in I3-A7cp: N111, which is located on the edge of the subunit and points upward from within the pores of the assembled nanoparticle; N46, found at the apex of the subunit pointing out toward the exterior of the assembled nanoparticle; and N62, which is also located on the edge of the subunit but points into the pores of the assembled nanoparticle (Fig. 4E). Although N-linked glycans were not detectable at position 111 in nanoparticles purified from transfected cells, we observed 74% and 83% occupancy at N46 and N62, respectively (Fig. 4F). Glycosylation did not noticeably impact nanoparticle morphology: SEC, DLS, and nsEM of I3-A7cp-N46-N62 were essentially indistinguishable from I3-A7 and I3-A7cp (Fig. 4G and SI Appendix, Fig. S4). Furthermore, all three nanoparticles were highly thermostable, with Tagg ≥ 72 °C (SI Appendix, Fig. S4C). LC–MS analysis revealed that while N46 was occupied by fully processed complex-type glycans, N62 comprised 71% underprocessed oligomannose-type glycans. Oligomannose-type glycans are typically processed in the endoplasmic reticulum (ER) and Golgi to become complex-type glycans, but this can be prevented by chemical or steric inhibition of the enzymes involved (63). Because no chemical inhibitors of glycan processing were added during transfection, this result suggested that glycan processing at N62 was sterically constrained. Based on the location of N62 in the nanoparticle, we suspected that this steric constraint may have resulted from nanoparticle assembly early in the secretory pathway, prior to initial processing of oligomannose-type glycan precursors in the ER. To probe this, we expressed and purified a trimeric variant of I3-A7cp containing the glycan sequons but lacking the computationally designed interface that drives nanoparticle assembly. Analysis of this protein confirmed that N62 was occupied almost exclusively by complex-type glycans, comprising only 4% oligomannose-type glycans (SI Appendix, Fig. S5). These data demonstrate that we engineered a genetically encoded oligomannose-type glycan into I3-A7cp that depends on nanoparticle assembly early in the secretory pathway. We kept only the N62 glycan in the nanoparticle immunogens for influenza described below, reverting the sequons at positions 46 and 111.

Design and Immunogenicity of a Trihead-I3-A7 Nanoparticle Immunogen.

To evaluate glycosylated I3-A7cp (I3-A7cp-N62) as a nanoparticle vaccine scaffold, we chose to display a domain-based antigen derived from influenza HA. We recently reported the design of RBDs from several HAs stabilized in closed trimeric conformations that we refer to as triheads (10, 18). Previously, trihead stabilization required rigid integration into the trimeric component of a two-component protein nanoparticle via a coiled-coil linker comprising a disulfide bond. Displaying the trihead on a different nanoparticle therefore represents a challenging design task.

In parallel to the I3-A7 optimization steps described above, we used RFdiffusion and ProteinMPNN to generate de novo linkers between the N terminus of I3-A7cp-N62 and a trihead antigen based on the HA from influenza virus A/Michigan/45/2015 [MI15; (18)]. After using RPXdock to obtain an initial dock of the trihead above the I3-A7cp trimer, we provided inputs to RFdiffusion in which the antigen was translated between −6 and +15 Å along the shared threefold axis of symmetry (Fig. 5A). Between five and 80 de novo residues were used for linker generation, resulting in primarily alpha-helical structures that were often linked to the N terminus of I3-A7cp by short loops. After designing sequences for the de novo linkers using ProteinMPNN, we filtered approximately 2,200 designs based primarily on the accuracy of AF2 prediction of the entire TH-I3-A7 trimer and selected 13 for experimental characterization. Of these, 11 secreted from transfected cells, yielded SEC peaks near the expected elution volume, and bound monoclonal antibodies (mAbs) specific to HA (SI Appendix, Fig. S6). One design, named TH-I3-A7, had 60 residues diffused between the trihead antigen and the underlying nanoparticle, expressed particularly well, and had a nearly symmetric SEC elution peak (Fig. 5B). TH-I3-A7 strongly bound the receptor-binding site-specific mAb 5J8 (59), and minimally bound the trimer interface-specific mAb FluA-20 (64), indicating display of a closed trihead (Fig. 5C). Negative stain EM of TH-I3-A7 revealed fields of monodisperse nanoparticles with visible antigen, and 2D class averages indicated that the de novo linker was relatively rigid compared to previously described nanoparticle immunogens that used flexible glycine-serine linkers [Fig. 5D; (8)].

Fig. 5.

Fig. 5.

Design and immunogenicity of a trihead-I3-A7 nanoparticle immunogen. (A, Left) A schematic highlighting docking of the trihead antigen along the threefold symmetry axis of the I3-A7cp trimer. Docks with various distances r were provided as inputs to RFdiffusion for linker design. (Right) Examples of de novo linkers connecting the trihead antigen to the nanoparticle. (B) A model of the final nanoparticle vaccine candidate, TH-I3-A7. The I3-A7cp component is shown in blue, the trihead antigen in purple, the oligomannose-type glycan at N62 in green, and the diffused linker in dark gray. (C) BLI of the nanoparticle showing strong binding of 5J8 to the receptor-binding site and minimal binding of the trimer interface antibody FluA-20. (D) A representative electron micrograph and 2D class averages of TH-I3-A7. (Scale bar, 40 nm for the micrograph and 20 nm for the 2D class averages.) (E) TH-I3-A7 immunogenicity study design and groups. (F) ELISA-binding titers (n = 10), (G) HAI titers (n = 9) and (H) microneutralization (n = 10) titers in immune sera at week 6. All statistical significance between groups is shown. Significance was determined using the Kruskal–Wallis test corrected with Dunn’s multiple comparisons test (F) or the Mann–Whitney test (G and H); **P < 0.01; ***P < 0.001.

We then evaluated the immunogenicity of TH-I3-A7 in mice, benchmarking against our previously described two-component nanoparticle immunogen displaying the same MI15 trihead [TH-I53_dn5; (18)]. We included I3-A7cp-N62 lacking fused antigen as a negative control immunogen. All three immunogens formed monodisperse nanoparticles, and both of the trihead-bearing nanoparticles bound 5J8 but not FluA-20 as shown previously (SI Appendix, Fig. S7). Groups of 10 BALB/c mice were immunized with AddaVax-adjuvanted immunogens comprising 1 μg of trihead or a corresponding number of bare I3-A7cp-N62 nanoparticles at weeks 0 and 4, and sera were collected at week 6 (Fig. 5F). The geometric mean titers of HA-specific IgG measured by enzyme-linked immunosorbent assay (ELISA) were similar between the TH-I3-A7 and TH-I53_dn5 groups, although more variability was observed among the mice that received TH-I53_dn5 (Fig. 5F). I3-A7cp-N62 elicited essentially no HA-specific antibodies as expected. We also measured vaccine-matched hemagglutination inhibition (HAI) and neutralization to evaluate the functionality of antibodies elicited by each immunogen. Both assays again showed comparable activity induced by TH-I3-A7 and TH-I53_dn5, with more variability in the TH-I53_dn5 group (Fig. 5 G and H). Finally, we measured neutralization of the vaccine-mismatched H1N1 strain A/California/07/2009 and similarly found minimal differences between the two groups (Fig. 5H). Together, these data establish that the methodologies and design approaches we used were successful in generating an immunogenic nanoparticle vaccine starting from the predicted structure of a thermostable trimeric protein building block.

Discussion

Here, we developed a method that allowed us to accurately design self-assembling protein nanoparticles using only the amino acid sequences of homologs of oligomeric proteins. The present approach goes beyond previous methods for designing self-assembling proteins by eliminating their requirement for experimentally determined structures of candidate protein building blocks (2729). This advance was made possible by ML-based tools for structure prediction that can rival the accuracy of experimental structure determination (39, 42, 43). When used as part of the approach we describe here, these tools broaden the number of protein building blocks that can be used to design self-assembling proteins, improving the likelihood of obtaining protein assemblies with the desired structural or functional features. Remarkably, the success rate and accuracy of our method [6/88 designs (7%) and 60-subunit alpha-carbon rmsds of 1.6 to 2.2 Å, respectively] are comparable to previous reports that started from crystal structures of oligomeric building blocks (27, 29, 47), highlighting the accuracy of both AF2-based structure prediction and the “dock-and-design” methodology. One limitation of the present study is that we only targeted a single symmetric architecture: icosahedral assemblies constructed from trimeric building blocks. However, given that the dock-and-design strategy has successfully generalized to a wide variety of both finite and unbounded architectures (3032), we expect the present approach to be at least as widely applicable, if not more so due to the increased variety of candidate building blocks.

Several properties are required of a protein assembly in order for it to be a successful one-component nanoparticle vaccine scaffold. This fact was highlighted by our observation that I3-A7 was the only one out of six designed nanoparticles that secreted from mammalian cells, as well as the extensive re-engineering required to transform it into a functional vaccine candidate. We recently showed that the presence of cryptic transmembrane domains in designed protein assemblies can prevent secretion, which prompted the development of a computational protocol to eliminate them through sequence redesign (31). However, none of the six nanoparticles designed in the present study comprised cryptic transmembrane domains, indicating that other unidentified features must account for the failure of five of the six to secrete. As additional determinants of secretion are identified, they can be incorporated into computational design approaches to ensure optimization of this critical property. We then had to substantially alter the structure of I3-A7 to enable display of the trihead antigen by relocating its N terminus and designing a de novo linker between the antigen and nanoparticle scaffold. These alterations were necessary because we did not preselect building blocks with particular geometric features. Future efforts can now do so, a strategy that will be facilitated by the wider variety of building blocks now accessible for nanoparticle design. Nevertheless, we were able to successfully re-engineer I3-A7 to display the trihead, a challenging design task that was made possible because of recently developed tools for protein design including RFdiffusion, ProteinMPNN, and AF2. As these tools continue to improve, designing new nanoparticles and manipulating existing ones should become more straightforward. Finally, we engineered a genetically encoded oligomannose-type glycan into the I3-A7 subunit. A limitation of our approach is that we did not explicitly model the glycan during design, as available tools for glycan modeling were few and unwieldy. Versions of both AlphaFold and RoseTTAFold are now available that explicitly model nonprotein atoms including glycans (65, 66), and we anticipate that these tools will enable more sophisticated glycan modeling and design. Even without these tools, we succeeded in encoding an oligomannose-type glycan in the amino acid sequence of I3-A7, and showed that its composition depended on nanoparticle assembly. This result provides evidence suggesting that assembly of this secreted protein nanoparticle immunogen occurs during the early stages of the secretory pathway.

Immunogenicity studies in mice showed that TH-I3-A7 is immunogenic and induces robust levels of receptor-blocking and neutralizing antibodies. Notably, TH-I3-A7 is as immunogenic as the same antigen displayed on I53_dn5, the two-component protein nanoparticle on which the trihead was originally designed (10). Two-component nanoparticles like I53_dn5 and homomeric nanoparticles like I3-A7 each have advantages and disadvantages as vaccine platforms. The ability to assemble two-component nanoparticles in vitro yields superior control over the quality and homogeneity of the displayed antigen (11) and simplifies the generation of mosaic nanoparticle immunogens that codisplay multiple different antigens on the same nanoparticle (23, 6769). Furthermore, in vitro assembly has been derisked as a scalable manufacturing approach by several two-component nanoparticle immunogens that have advanced to clinical trials or licensure, including multivalent influenza vaccine candidates based on I53_dn5 [NCT06481579, NCT05968989, and NCT04896086; (4, 8, 23, 24)]. However, two-component nanoparticles have not yet been adapted for use in genetic immunization approaches based on DNA, messenger RNA (mRNA), or viral vectors. These can be faster to manufacture than protein-based vaccines, which is particularly relevant in indications like influenza where frequent strain updates are required due to antigenic drift and shift (70). Several homomeric protein nanoparticle immunogens have been delivered as either DNA or mRNA vaccines, mostly based on a small number of naturally occurring self-assembling proteins such as ferritin and lumazine synthase (7175). Computationally designing custom homomeric nanoparticles promises to enhance the performance of genetically delivered nanoparticle vaccines by enabling precise control over their size, shape, symmetry, valency, and other features. By broadening the set of oligomeric protein building blocks available for design, our method will facilitate efforts directed at designing self-assembling protein nanomaterials with structures and activities tailored for this and other applications in medicine.

Methods

Identifying and Filtering Sequences from Thermophiles.

To identify initial structures on which to base homology, the PDB was searched for homotrimers with C3 symmetry at a sufficient resolution (<2.4 Å). Additional filters ensured that selected proteins: 1) maintain >50% helical structure for easier implementation with current nanoparticle docking algorithms; 2) originate in Archaea or Bacteria so that they might have thermophilic homologs; and 3) do not include membrane proteins to reduce the risk of insoluble protein expression in vitro. The resulting list of PDB seeds was used to search the UniProt database for all sequences with >50% identity to each seed. For those sequences, the parent organisms were considered thermophiles if “therm” was in the name or if they had reported optimal growth temperatures above 55 °C on the BacDive database. Remaining hits were filtered to <70% identity so sequences were not too similar. Sequences with published high-resolution structures were also removed.

Computational Design of Protein Nanoparticles.

Thermophilic C3 sequences were predicted using AF2-ptm and predictions were filtered to keep only those with a pLDDT > 90 and a pAE < 5. Asymmetric units (asu) for each homotrimer were extracted and used to establish an initial nanoparticle docking conformation using RPXDock. Further sampling was introduced with Rosetta SymDofMover by translating trimers along the radial axis and rotating them about their symmetry axis. The interface residues were defined as those within 8 Å of each other at contact sites between components. Designs were eliminated if there were less than 12 interface residues. For all other designs, ProteinMPNN was implemented with a sampling temperature of 0.2 to engineer the interface residues for optimal self-assembly. After ten iterations of ProteinMPNN for each interface, the sequences were threaded onto the AF2 predictions with RosettaScripts, and Rosetta metrics were calculated for filtering purposes. The following filters eliminated most designs: ddG < −10, SASA > 300, Shape Complementarity > 0.5, and dGins, pred > 2.6. Prior to ordering constructs, the predicted sequences (which were derived from the originating PDB seeds and comprised only the regions modeled in the structures) were used as inputs in BLAST (76) to retrieve full-length genomic sequences, and these were used as the basis for construct generation.

Bacterial Protein Expression and Purification.

To screen initial designs, eBlocks gene fragments (Integrated DNA Technologies) were cloned into a pET29b+-based vector backbone via Golden Gate Assembly and transformed into chemically competent BL21 E. coli (New England Biolabs). Small-scale production occurred in 96 deep-well plates in autoinduction media containing TBII media (Mpbio) supplemented with 50 × 5052, 50 × M salts, 20 mM MgSO4, and a trace metal mix. Cultures grew under antibiotic selection at 37 °C for 6 h at 1,350 rpm in a Heidolph shaker. Cells were harvested by centrifugation at 4,000 × g and chemically lysed with BugBuster protein extraction reagent (Sigma-Aldrich) after resuspension in lysis buffer (50 mM Tris pH 8.0, 250 mM NaCl, 20 mM imidazole), followed by addition of bovine pancreatic DNaseI (Sigma-Aldrich) and protease inhibitors (Thermo Scientific). Clarified lysate supernatants were applied to equilibrated Ni-NTA resin (QIAGEN) columns. Washes were performed with 10 column volumes of lysis buffer, then eluted with 4 column volumes of the same buffer containing 500 mM imidazole. Eluant was analyzed for protein expression via SDS-PAGE and HPLC.

Large-scale nanoparticle expression (50 to 100 mL) was performed as described above, except using TBII media with IPTG induction at OD600 = 0.6 to 0.8 and 4 h of expression at 37 °C and 250 rpm. Concentrated or unconcentrated elution fractions were further purified using a Superose 6 Increase 10/300 GL (Cytiva) on an ÄKTA Pure (Cytiva) into 25 mM Tris pH 8.0, 150 mM NaCl. Instrument control and elution profiles analysis were performed with UNICORN 7 software (Cytiva).

Mammalian Protein Expression and Purification.

For purification of plasmid DNA for Expi293F transfection, bacteria were cultured and plasmids were harvested according to the QIAGEN Plasmid Plus Maxi Kit protocol (QIAGEN). For mammalian expression and purification of secreted nanoparticles, Expi293F cells at 3 million cells/mL were transfected with the following per mL of cell culture: 70 μL Opti-MEM (Thermo Scientific), 1 µg plasmid DNA and 3 µg PEI-MAX. Cells were harvested 3 to 5 d posttransfection by centrifugation for 5 min at 4,000 × g, incubation with polyDADMAC (Sigma-Aldrich) for 10 min, and a final centrifugation for 5 min at 4,000 × g. Sterile-filtered supernatant was adjusted to 50 mM Tris pH 8.0 and 500 mM NaCl, then batch bound to Ni Sepharose Excel (Cytiva) with agitation at 25 °C for 1 h or 4 °C for 15 h. Pelleted resin was washed with 20 column volumes of wash buffer (50 mM Tris pH 8.0, 500 mM NaCl, 30 mM imidazole), then eluted with 2 column volumes of the same buffer containing 300 mM imidazole. Concentrated elution fractions were purified by SEC as described above, with the addition of 100-200 mM L-Arginine as an excipient for I3-A7cp and constructs fused to antigen. For TH-I3-A7, the mutation T38E (I3-A7cp numbering) was made to improve secretion yields after using DeepTMHMM prediction (77) to identify potential cryptic transmembrane regions.

DLS and Thermal Melts.

DLS was performed using an UNcle (Unchained Labs) to determine the hydrodynamic radius of each nanoparticle and the polydispersity of the sample. SEC-purified samples at 0.1 mg/mL were loaded into an 8.8 μL quartz capillary cassette (UNi, UNchained Laboratories) and DLS measurements were obtained at 25 °C. Buffer components and protein compositions were accounted for in the UNcle software. To determine the aggregation temperature (Tagg) of the nanoparticles, DLS measurements were collected over a temperature range from 25 to 84 °C or 25 to 95 °C, with a ramp of approximately 2 °C/min. Tagg was defined as the temperature at which the absolute value of the first derivative of the Z-averaged diameter was greater than unity for all subsequent consecutive measurements while allowing for single nonconsecutive values below one. First derivatives were calculated after smoothing with four neighboring points using 2nd order polynomial settings in GraphPad Prism. Two replicates were performed for each nanoparticle.

Biolayer Interferometry (BLI).

BLI was performed using an Octet Red 96 system, at 25 °C and 1,000 rpm. Antibodies were diluted to 0.02 mg/mL (5J8) and 0.07 mg/mL (FluA-20) in kinetics buffer [1 × phosphate-buffered saline (PBS), 0.5% serum bovine albumin and 0.01% Tween]. ProA Octet Biosensors (Sartorius) were used to load antibodies for the assay which was run as follows: 60 s buffer, 100 s loading, 30 s buffer, 100 s association, and 100 s dissociation. Biosensors were regenerated using 10 mM Glycine pH 2.5 for 30 s, followed by 60 s in buffer.

nsEM.

300-mesh carbon coated grids (Electron Microscopy Sciences) were glow discharged for 30 s prior to application of protein samples (3 µL, 0.1 mg/mL) and blotting three times with 2% (w/v) uranyl formate. Data were collected using a 120 kV Talos L120C transmission electron microscope (Thermo Scientific) with a BM-Ceta camera. CryoSPARC was used for CTF correction, particle picking and extraction, and 2D classification.

Cryo-EM Sample Preparation, Data Collection, and Processing.

Cryo-EM samples were prepared on glow-discharged CFLAT 1.2/1.3 holey-carbon grids (Electron Microscopy Sciences). Protein sample at 1 to 4 mg/mL was applied to each grid, incubated for 30 s, and then blotted using Whatman filter paper; this process was performed twice, followed by a third application of sample and final blotting and plunge freezing in liquid-ethane using a Vitrobot (FEI). Data were acquired on a Titan Krios operating at 300 kV equipped with a K3 Summit Direct Detector and a Quantum GIF energy filter (Gatan) operating in zero-loss mode with a 20-eV slit width. Movies were collected in counting mode, collecting 50 frames with a total dose of 52 e−2, with a defocus range of −0.5 to −2 µm. Automated data collection was carried out using Leginon (78) at a nominal magnification of 105,000× with a pixel size of 0.843 Å. All data processing was carried out using cryoSPARC (79). Movie frame alignment and dose-weighting summation were performed using Patch Motion, and CTF estimation was done using patch CTF. Particles were first picked using Blob picker and these were used to generate initial 2D averages. A few of the best 2D averages representative of different particle views were then selected as templates for template picking. From these template picks, two rounds of 2D classification and selection were done to isolate well-aligned, single particles. Initial models were generated by ab initio reconstruction with C1 symmetry. Homogeneous 3D refinements were then carried out with icosahedral symmetry, with per-particle defocus refinement, and beamtilt and anisotropic magnification by image shift group enabled.

Model Building and Refinement.

AF2 models were docked into cryo-EM maps as rigid bodies using Chimera (80). An initial refinement of a nanoparticle asu into the cryo-EM density was performed with ISOLDE (81). A full icosahedron was then generated from the ISOLDE-refined asu, and then this entire icosahedral nanoparticle was relaxed into the cryo-EM density again in ISOLDE. A final real-space refinement with grid searching, Ramachandran and rotamer restraints all turned off, and “use starting model as reference” turned on, was then performed in Phenix (82). There were flexible loops composed of residues 17 to 23 and 106 to 113 in I3-A6, residues 123 to 129 in I3-A7, and residues 137 to 143 and 170 to 176 in I3-D12 that were not resolved in the cryo-EM density so these loops were not modeled.

Sample Preparation and Analysis by LC–MS.

Glycan analysis was performed as described previously (8385). Briefly, aliquots of each sample were denatured for 1 h in 50 mM Tris/HCl, pH 8.0 containing 6 M of urea. The polypeptides were buffer-exchanged into 50 mM Tris/HCl, pH 8.0 using Vivaspin columns (3 kDa) and digested separately overnight using trypsin/Lys-C (1 µg), chymotrypsin (1 µg), or alpha lytic protease (1 µg) (Mass Spectrometry Grade, Promega). The next day, the peptides were dried and extracted using the Oasis HLB 96-well plate (Waters). The peptides were analyzed by nanoLC-ESI MS with an Ultimate 3000 HPLC (Thermo Fisher Scientific) system coupled to an Orbitrap Eclipse mass spectrometer (Thermo Fisher Scientific) using stepped higher energy collision-induced dissociation (HCD) fragmentation (15%, 25%, 45%). Peptides were separated using an EasySpray PepMap RSLC C18 column (75 µm × 15 cm). A trapping column (PepMap 100 C18 3 μm particle size, 75 μm × 2 cm) was used in line with the LC prior to separation with the analytical column. The LC conditions were as follows: 60-min linear gradient consisting of 5-40% acetonitrile in 0.1% formic acid over 50 min followed by 10 min of 95% acetonitrile in 0.1% formic acid. The flow rate was set to 300 nL/min. The spray voltage was set to 2.5 kV and the temperature of the heated capillary was set to 55 °C. The ion transfer tube temperature was set to 275 °C. Precursor and fragment detection were performed using an Orbitrap at a resolution MS1 = 120,000, MS2 = 30,000. Glycopeptide fragmentation data were extracted from the raw file using Byos (version 5.5, Protein Metrics). The glycopeptide fragmentation data were evaluated manually for each glycopeptide; the peptide was scored as true-positive when the correct b and y fragment ions were observed along with oxonium ions corresponding to the glycan identified. The MS data were searched using the Protein Metrics N-glycan library. The relative amounts of each glycan at each site as well as the unoccupied proportion were determined by comparing the extracted chromatographic areas for different glycotypes with an identical peptide sequence. All charge states for a single glycopeptide were summed. The precursor mass tolerance was set at 4 ppm and 10 ppm for fragments. A 1% false discovery rate was applied. Glycans were categorized according to the composition detected. HexNAc(2)Hex(9−4) was classified as M9 to M4. Any of these compositions containing fucose were classified as fucosylated mannose. HexNAc(3)Hex(5−6)X was classified as Hybrid with HexNAc(3)Fuc(1)X classified as Fhybrid. Complex-type glycans were classified according to the number of processed antenna and fucosylation. Complex-type glycans were categorized according to the number of N-acetylhexosamine monosaccharides detected that do not fit in the previously defined categories.

Immunization.

For immunogenicity studies, female BALB/cAnNHsd were purchased from Envigo (order code 047) at 7 wk of age. Mice were housed in a specific-pathogen-free facility within the Department of Comparative Medicine at the University of Washington, Seattle, accredited by the Association for Assessment and Accreditation of Laboratory Animal Care. Animal studies were conducted in accordance with the University of Washington’s Institutional Animal Care and Use Committee under protocol 4470-01. For each immunization, low-endotoxin proteins were diluted to 20 μg/mL in buffer and mixed with 1:1 v/v AddaVax adjuvant (InvivoGen vac-adx-10) to obtain a final dose of 1 μg of immunogen per animal, per injection. At 8 wk of age, 10 mice per group were injected subcutaneously in the inguinal region with 100 μL of immunogen at weeks 0 and 4. Animals were bled using the submental route at week 6. Whole blood was collected in serum separator tubes (BD #365967) and rested for 30 min at room temperature for coagulation. Tubes were then centrifuged for 10 min at 2,000 × g and serum was collected and stored at −80 °C until use.

ELISA.

HA-foldon trimers were added to 96-well Nunc MaxiSorp plates (Thermo Scientific) at 2.0 μg/mL with 100 mL per well and incubated for 18 h. Blocking buffer [TBST: 25 mM Tris pH 8.0, 150 mM NaCl, and 0.05% (v/v) Tween 20, and 5% Nonfat milk] was then added at 200 mL per well and incubated for 1 h. Plates were then washed three times with TBST using a robotic plate washer (Biotek). Threefold serial dilutions of serum starting at 1:100 were made in blocking buffer and added to plates at 100 μL per well and incubated for 1 h. Plates were washed thrice again before addition of 100 μL per well of anti-mouse IgG horseradish peroxidase-linked secondary antibody (CellSignaling Technology) diluted 1:2,000 in blocking buffer and incubated for 30 min. All incubations were carried out with shaking at room temperature. Plates were washed a final time, and then 100 μL per well of TMB (3,30,50,5-tetramethylbenzidine, SeraCare) was added for 2 min, followed by quenching with 100 μL per well of 1 N HCl. Reading at 450 nm absorbance was done on an Epoch plate reader (BioTek).

HAI.

Serum was inactivated using receptor-destroying enzyme (RDE) II (Seiken) in PBS at a 3:1 ratio of RDE to serum for 18 h at 37 °C, followed by 40 min at 56 °C. Inactivated serum was serially diluted 2.5-fold in PBS in V-bottom plates at 25 μL per well. 25 μL HA-ferritin nanoparticles at 4 hemagglutinating units were then added to all wells and incubated at room temperature for 30 min. Finally, 50 μL of 10-fold diluted turkey red blood cells (Lampire) in PBS was added to each well. Hemagglutination was left to proceed for at least 1 h before recording HAI titer.

Reporter-Based Microneutralization assay.

Microneutralization assays were performed as described previously (86). Generation of the replication-restricted reporter (R3ΔPB1) virus H1N1 A/Michigan/45/2015 and A/California/07/2009 is described elsewhere (87). Briefly, to produce the R3 viruses, the viral genomic RNA encoding functional PB1 was replaced with a gene encoding the fluorescent protein (TdKatushka2), and the R3 viruses were rescued by reverse genetics and propagated in the complementary cell line which expresses PB1 constitutively. Each R3 virus stock was titrated by determining the fluorescent units per mL (FU/mL) prior to use in the experiments. For virus titration, serial dilutions of virus stock in OptiMEM were mixed with prewashed MDCK-SIAT1-PB1 cells (8 × 105 cells/mL) and incubated in a 384-well plate in quadruplicate (25 µL/well). Plates were incubated for 18 to 26 h at 37 °C with 5% CO2 humidified atmosphere. After incubation, fluorescent cells were imaged and counted by using a Celigo Image Cytometer (Nexcelom) with a customized red filter for detecting TdKatushka2 fluorescence.

For the microneutralization assay, serial dilutions of RDE treated serum were prepared in OptiMEM and mixed with an equal volume of R3 virus (~8 × 104 FU/mL) in OptiMEM. After incubation at 37 °C and 5% CO2 humidified atmosphere for 1 h, prewashed MDCK-SIAT1-PB1 cells (8 × 105 cells/well) were added to the serum-virus mixtures and transferred to 384-well plates in quadruplicate (25 µL/well). Plates were incubated and counted as described above. Target virus control range for this assay is 500 to 2,000 FU per well, and cell-only control is acceptable up to 30 FU per well. The percent neutralization was calculated for each well by constraining the virus control (virus plus cells) as 0% neutralization and the cell-only control (no virus) as 100% neutralization. A 7-point neutralization curve was plotted against serum dilution for each sample, and a four-parameter nonlinear fit was generated using Prism (GraphPad) to calculate the 50% (IC50) inhibitory concentrations.

Supplementary Material

Appendix 01 (PDF)

Acknowledgments

We thank Andrew Borst, Kenneth Carr, and Connor Weidle for assistance with electron microscopy; Suna Cheng, Craig Dobbins, Alexis Hand, and Adri Tran-Pearson for assistance with tissue culture; Lauren Carter and Kandise VanWormer for laboratory and administrative support; and Luki Goldschmidt and Patrick Vecchiato for building and maintaining the computing infrastructure at the Institute for Protein Design. This work was supported by a generous gift from Open Philanthropy; the Audacious Project at the Institute for Protein Design; the Bill & Melinda Gates Foundation (INV-010680 and INV-043758); the International AIDS Vaccine Initiative Neutralizing Antibody Center through the Collaboration for AIDS Vaccine Discovery grant OPP1196345/INV-008813 funded by the Bill & Melinda Gates Foundation; the National Institute of Allergy and Infectious Diseases (P01AI167966); the intramural research program of the Vaccine Research Center, National Institute of Allergy and Infectious Diseases, NIH; and the NSF Graduate Research Fellowships Program Award (DGE-2140004 to C.M.H.).

Author contributions

C.M.H., N.J., and N.P.K. designed research; C.M.H., N.J., A.D., J.D.A., R.G., J.M., and E.M.L. performed research; C.M.H., N.J., A.D., J.D.A., R.G., M.C., and M.K. contributed new reagents/analytic tools; C.M.H., N.J., A.D., J.D.A., R.G., M.C., C.A.D., M.K., and N.P.K. analyzed data; and C.M.H., N.J., J.D.A., and N.P.K. wrote the paper.

Competing interests

C.M.H., N.J., A.D., J.D.A., and N.P.K. have filed patent applications through the University of Washington related to the proteins described in this work. All other authors declare no competing interests.

Footnotes

This article is a PNAS Direct Submission.

Data, Materials, and Software Availability

Data deposition, atomic coordinates, and structure factors have been deposited in the Protein Data Bank, http://www.rcsb.org (PDB ID 9CLZ (88), 9CM1 (89), and 9CM0 (90)). Cryo-EM maps were deposited in the Electron Microscopy Data Bank (EMD-45734 (91), EMD-45736 (92), and EMD-45735 (93)).

Supporting Information

References

  • 1.López-Sagaseta J., Malito E., Rappuoli R., Bottomley M. J., Self-assembling protein nanoparticles in the design of vaccines. Comput. Struct. Biotechnol. J. 14, 58–68 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kanekiyo M., et al. , Rational design of an Epstein-Barr virus vaccine targeting the receptor-binding site. Cell 162, 1090–1100 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Tokatlian T., et al. , Innate immune recognition of glycans targets HIV nanoparticle immunogens to germinal centers. Science 363, 649–654 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Walls A. C., et al. , Elicitation of potent neutralizing antibody responses by designed protein nanoparticle vaccines for SARS-CoV-2. Cell 183, 1367–1382.e17 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bachmann M. F., et al. , The influence of antigen organization on B cell responsiveness. Science 262, 1448–1451 (1993). [DOI] [PubMed] [Google Scholar]
  • 6.Bachmann M. F., Zinkernagel R. M., Neutralizing antiviral b cell responses. Annu. Rev. Immunol. 15, 235–270 (1997). [DOI] [PubMed] [Google Scholar]
  • 7.Kato Y., et al. , Multifaceted effects of antigen valency on B cell response composition and differentiation in vivo. Immunity 53, 548–563.e8 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Marcandalli J., et al. , Induction of potent neutralizing antibody responses by a designed protein nanoparticle vaccine for respiratory syncytial virus. Cell 176, 1420–1431.e17 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ols S., et al. , Multivalent antigen display on nanoparticle immunogens increases B cell clonotype diversity and neutralization breadth to pneumoviruses. Immunity 56, 2425–2441.e14 (2023). [DOI] [PubMed] [Google Scholar]
  • 10.Ellis D., et al. , Antigen spacing on protein nanoparticles influences antibody responses to vaccination. Cell Rep. 42, 113552 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Brouwer P. J. M., et al. , Enhancing and shaping the immunogenicity of native-like HIV-1 envelope trimers with a two-component protein nanoparticle. Nat. Commun. 10, 4272 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Abbott R. K., et al. , Precursor frequency and affinity determine B cell competitive fitness in germinal centers, tested with germline-targeting HIV vaccine immunogens. Immunity 48, 133–146.e6 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wamhoff E.-C., et al. , Enhancing antibody responses by multivalent antigen display on thymus-independent DNA origami scaffolds. Nat. Commun. 15, 1–13 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Read B. J., et al. , Mannose-binding lectin and complement mediate follicular localization and enhanced immunogenicity of diverse protein nanoparticle immunogens. Cell Rep. 38, 110217 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Gause K. T., et al. , Immunological principles guiding the rational design of particles for vaccine delivery. ACS Nano 11, 54–68 (2017). [DOI] [PubMed] [Google Scholar]
  • 16.Kanekiyo M., et al. , Self-assembling influenza nanoparticle vaccines elicit broadly neutralizing H1N1 antibodies. Nature 499, 102–106 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Jardine J., et al. , Rational HIV immunogen design to target specific germline B cell receptors. Science 340, 711–716 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Dosey A., et al. , Combinatorial immune refocusing within the influenza hemagglutinin RBD improves cross-neutralizing antibody responses. Cell Rep. 42, 113553 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ou B. S., et al. , Broad and durable humoral responses following single hydrogel immunization of SARS-CoV-2 subunit vaccine. Adv. Healthc. Mater. 12, e2301495 (2023). [DOI] [PubMed] [Google Scholar]
  • 20.Arunachalam P. S., et al. , Durable protection against the SARS-CoV-2 Omicron variant is induced by an adjuvanted subunit vaccine. Sci. Transl. Med. 14, eabq4130 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ueda G., et al. , Tailored design of protein nanoparticle scaffolds for multivalent presentation of viral glycoprotein antigens. Elife 9, e57659 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bruun T. U. J., Andersson A.-M.C., Draper S. J., Howarth M., Engineering a rugged nanoscaffold to enhance plug-and-display vaccination. ACS Nano 12, 8855–8866 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Boyoglu-Barnum S., et al. , Quadrivalent influenza nanoparticle vaccines induce broad protection. Nature 592, 623–628 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Song J. Y., et al. , Immunogenicity and safety of SARS-CoV-2 recombinant protein nanoparticle vaccine GBP510 adjuvanted with AS03: Interim results of a randomised, active-controlled, observer-blinded, phase 3 trial. EClinicalMedicine 64, 102140 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Grigoryan L., et al. , Adjuvanting a subunit SARS-CoV-2 vaccine with clinically relevant adjuvants induces durable protection in mice. NPJ Vaccines 7, 55 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Arunachalam P. S., et al. , Adjuvanting a subunit COVID-19 vaccine to induce protective immunity. Nature 594, 253–258 (2021). [DOI] [PubMed] [Google Scholar]
  • 27.King N. P., et al. , Accurate design of co-assembling multi-component protein nanomaterials. Nature 510, 103–108 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hsia Y., et al. , Design of a hyperstable 60-subunit protein icosahedron. Nature 540, 150 (2016). [DOI] [PubMed] [Google Scholar]
  • 29.Bale J. B., et al. , Accurate design of megadalton-scale two-component icosahedral protein complexes. Science 353, 389–394 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Divine R., et al. , Designed proteins assemble antibodies into modular nanocages. Science 372, eabd9994 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wang J. Y. J., et al. , Improving the secretion of designed protein assemblies through negative design of cryptic transmembrane domains. Proc. Natl. Acad. Sci. U.S.A. 120, e2214556120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Yang E. C., et al. , Computational design of non-porous pH-responsive antibody nanoparticles. Nat. Struct. Mol. Biol. 31, 1404–1412 (2024), 10.1038/s41594-024-01288-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Berman H. M., et al. , The protein data bank. Nucleic Acids Res. 28, 235–242 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Steinegger M., Mirdita M., Söding J., Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold. Nat. Methods 16, 603–606 (2019). [DOI] [PubMed] [Google Scholar]
  • 35.Mitchell A. L., et al. , MGnify: The microbiome analysis resource in 2020. Nucleic Acids Res. 48, D570 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Huang P.-S., Boyken S. E., Baker D., The coming of age of de novo protein design. Nature 537, 320–327 (2016). [DOI] [PubMed] [Google Scholar]
  • 37.Wargacki A. J., et al. , Complete and cooperative in vitro assembly of computationally designed self-assembling protein nanomaterials. Nat. Commun. 12, 883 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wicky B. I. M., et al. , Hallucinating symmetric protein assemblies. Science 378, 56–61 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jumper J., et al. , Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Sheffler W., et al. , Fast and versatile sequence-independent protein docking for nanomaterials design using RPXDock. PLoS Comput. Biol. 19, e1010680 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Baek M., et al. , Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Dauparas J., et al. , Robust deep learning-based protein sequence design using ProteinMPNN. Science 378, eadd2187 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Watson J. L., et al. , De novo design of protein structure and function with RFdiffusion. Nature 620, 1089–1100 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Vázquez Torres S., et al. , De novo design of high-affinity binders of bioactive helical peptides. Nature 626, 435–442 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Yeh A.H.-W., et al. , De novo design of luciferases using deep learning. Nature 614, 774–780 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Sumida K. H., et al. , Improving protein expression, stability, and function with ProteinMPNN. J. Am. Chem. Soc. 146, 2054–2061 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.de Haas R. J., et al. , Rapid and automated design of two-component protein nanomaterials using ProteinMPNN. Proc. Natl. Acad. Sci. U.S.A. 121, e2314646121 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Meador K., et al. , A suite of designed protein cages using machine learning and protein fragment-based protocols. Structure 32, 751–765.e11 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.King N. P., et al. , Computational design of self-assembling protein nanomaterials with atomic level accuracy. Science 336, 1171–1174 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Finch A., Kim J., Thermophilic proteins as versatile scaffolds for protein engineering. Microorganisms 6, 97 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bloom J. D., Labthavikul S. T., Otey C. R., Arnold F. H., Protein stability promotes evolvability. Proc. Natl. Acad. Sci. U.S.A. 103, 5869–5874 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Traxlmayr M. W., et al. , Strong enrichment of aromatic residues in binding sites from a charge-neutralized hyperthermostable Sso7d scaffold library. J. Biol. Chem. 291, 22496–22508 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Shusta E. V., Holler P. D., Kieke M. C., Kranz D. M., Wittrup K. D., Directed evolution of a stable scaffold for T-cell receptor engineering. Nat. Biotechnol. 18, 754–759 (2000). [DOI] [PubMed] [Google Scholar]
  • 54.Besenmatter W., Kast P., Hilvert D., Relative tolerance of mesostable and thermostable protein homologs to extensive mutation. Proteins 66, 500–506 (2007). [DOI] [PubMed] [Google Scholar]
  • 55.McConnell A., Batten S. L., Hackel B. J., Determinants of developability and evolvability of synthetic miniproteins as ligand scaffolds. J. Mol. Biol. 435, 168339 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Teufl M., Zajc C. U., Traxlmayr M. W., Engineering strategies to overcome the stability-function trade-off in proteins. ACS Synth. Biol. 11, 1030–1039 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Watanabe Y., Bowden T. A., Wilson I. A., Crispin M., Exploitation of glycosylation in enveloped virus pathobiology. Biochim. Biophys. Acta Gen. Subj. 1863, 1480–1497 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Bliven S., Prlić A., Circular permutation in proteins. PLoS Comput. Biol. 8, e1002445 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Krause J. C., et al. , A broadly neutralizing human monoclonal antibody that recognizes a conserved, novel epitope on the globular head of the influenza H1N1 virus hemagglutinin. J. Virol. 85, 10905–10908 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Caplan S., Green R., Rocco J., Kurjan J., Glycosylation and structure of the yeast MF alpha 1 alpha-factor precursor is important for efficient transport through the secretory pathway. J. Bacteriol. 173, 627–635 (1991). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Sagt C. M. J., et al. , Introduction of an N-glycosylation site increases secretion of heterologous proteins in yeasts. Appl. Environ. Microbiol. 66, 4940–4944 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Duan H., et al. , Glycan masking focuses immune responses to the HIV-1 CD4-binding site and enhances elicitation of VRC01-class precursor antibodies. Immunity 49, 301–311.e5 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Behrens A.-J., et al. , Molecular architecture of the cleavage-dependent mannose patch on a soluble HIV-1 envelope glycoprotein trimer. J. Virol. 91, e01894-16 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Bangaru S., et al. , A site of vulnerability on the influenza virus hemagglutinin head domain trimer interface. Cell 177, 1136–1152.e18 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Abramson J., et al. , Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Krishna R., et al. , Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 384, eadl2528 (2024). [DOI] [PubMed] [Google Scholar]
  • 67.Walls A. C., et al. , Elicitation of broadly protective sarbecovirus immunity by receptor-binding domain nanoparticle vaccines. Cell 184, 5432–5447.e16 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Brinkkemper M., et al. , Co-display of diverse spike proteins on nanoparticles broadens sarbecovirus neutralizing antibody responses. iScience 25, 105649 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Sliepen K., et al. , Induction of cross-neutralizing antibodies by a permuted hepatitis C virus glycoprotein nanoparticle vaccine candidate. Nat. Commun. 13, 7271 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Carrat F., Flahault A., Influenza vaccine: The challenge of antigenic drift. Vaccine 25, 6852–6862 (2007). [DOI] [PubMed] [Google Scholar]
  • 71.Xu Z., et al. , In vivo assembly of nanoparticles achieved through synergy of structure-based protein engineering and synthetic DNA generates enhanced adaptive immunity. Adv. Sci. (Weinh.) 7, 1902802 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Mu Z., et al. , mRNA-encoded HIV-1 Env trimer ferritin nanoparticles induce monoclonal antibodies that neutralize heterologous HIV-1 isolates in mice. Cell Rep. 38, 110514 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Roier S., et al. , mRNA-based VP8* nanoparticle vaccines against rotavirus are highly immunogenic in rodents. NPJ Vaccines 8, 1–14 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Konrath K. M., et al. , Nucleic acid delivery of immune-focused SARS-CoV-2 nanoparticles drives rapid and potent immunogenicity capable of single-dose protection. Cell Rep. 38, 110318 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Edwards K. R., et al. , A gH/gL-encoding replicon vaccine elicits neutralizing antibodies that protect humanized mice against EBV challenge. NPJ Vaccines 9, 120 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Camacho C., et al. , BLAST+: Architecture and applications. BMC Bioinformatics 10, 421 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Hallgren J., et al. , DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. bioRxiv [Preprint] (2022). https://www.biorxiv.org/content/10.1101/2022.04.08.487609v1. Accessed 11 December 2023.
  • 78.Suloway C., et al. , Automated molecular microscopy: The new Leginon system. J. Struct. Biol. 151, 41–60 (2005). [DOI] [PubMed] [Google Scholar]
  • 79.Punjani A., Rubinstein J. L., Fleet D. J., Brubaker M. A., cryoSPARC: Algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017). [DOI] [PubMed] [Google Scholar]
  • 80.Pettersen E. F., et al. , UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004). [DOI] [PubMed] [Google Scholar]
  • 81.Croll T. I., ISOLDE: A physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr. D Struct. Biol. 74, 519–530 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Liebschner D., et al. , Macromolecular structure determination using X-rays, neutrons and electrons: Recent developments in Phenix. Acta Crystallogr. D Struct. Biol. 75, 861–877 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Del Moral-Sánchez I., et al. , Triple tandem trimer immunogens for HIV-1 and influenza nucleic acid-based vaccines. NPJ Vaccines 9, 74 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Kumar S., et al. , Neutralizing antibodies induced by first-generation gp41-stabilized HIV-1 envelope trimers and nanoparticles. mBio 12, e0042921 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Zhang Y.-N., et al. , Single-component multilayered self-assembling protein nanoparticles presenting glycan-trimmed uncleaved prefusion optimized envelope trimers as HIV-1 vaccine candidates. Nat. Commun. 14, 1–29 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Yang Y. R., et al. , Immune memory shapes human polyclonal antibody responses to H2N2 vaccination. Cell Rep. 43, 114171 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Creanga A., et al. , A comprehensive influenza reporter virus panel for high-throughput deep profiling of neutralizing antibodies. Nat. Commun. 12, 1–12 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Haas C. M., et al. , Novel designed icosahedral nanoparticle I3-A6. Protein Data Bank. https://www.rcsb.org/structure/9CLZ. Deposited 12 July 2024.
  • 89.Haas C. M., et al. , Novel designed icosahedral nanoparticle I3-D12. Protein Data Bank. https://www.rcsb.org/structure/9CM1. Deposited 12 July 2024.
  • 90.Haas C. M., et al. , Novel designed icosahedral nanoparticle I3-A7. Protein Data Bank. https://www.rcsb.org/structure/9CM0. Deposited 12 July 2024.
  • 91.Haas C. M., et al. , Novel designed icosahedral nanoparticle I3-A6. Electron Microscopy Data Bank. https://www.ebi.ac.uk/emdb/EMD-45734. Deposited 23 July 2024.
  • 92.Haas C. M., et al. , Novel designed icosahedral nanoparticle I3-D12. Electron Microscopy Data Bank. https://www.ebi.ac.uk/emdb/EMD-45736. Deposited 23 July 2024.
  • 93.Haas C. M., et al. , Novel designed icosahedral nanoparticle I3-A7. Electron Microscopy Data Bank. https://www.ebi.ac.uk/emdb/EMD-45735. Deposited 12 July 2024.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix 01 (PDF)

Data Availability Statement

Data deposition, atomic coordinates, and structure factors have been deposited in the Protein Data Bank, http://www.rcsb.org (PDB ID 9CLZ (88), 9CM1 (89), and 9CM0 (90)). Cryo-EM maps were deposited in the Electron Microscopy Data Bank (EMD-45734 (91), EMD-45736 (92), and EMD-45735 (93)).


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES