Abstract
Encapsulins are a recently discovered class of prokaryotic self-assembling icosahedral protein nanocompartments measuring between 24 and 42 nm in diameter, capable of selectively encapsulating dedicated cargo proteins in vivo. They have been classified into four families based on sequence identity and operon structure, and thousands of encapsulin systems have recently been computationally identified across a wide range of bacterial and archaeal phyla. Cargo encapsulation is mediated by the presence of specific targeting motifs found in all native cargo proteins that interact with the interior surface of the encapsulin shell during self-assembly. Short C-terminal targeting peptides (TPs) are well documented in Family 1 encapsulins, while more recently, larger N-terminal targeting domains (TDs) have been discovered in Family 2. The modular nature of TPs and their facile genetic fusion to non-native cargo proteins of interest has made cargo encapsulation, both in vivo and in vitro, readily exploitable and has therefore resulted in a range of rationally engineered nano-compartmentalization systems. This review summarizes current knowledge on cargo protein encapsulation within encapsulins and highlights select studies that utilize TP fusions to non-native cargo in creative and useful ways.
Graphical Abstract
Encapsulins are protein nanocompartments that selectively encapsulate cargo proteins via specific peptide targeting motifs. Genetic fusion of these motifs to non-native cargo proteins results in the facile engineering of rationally designed nano-compartmentalization systems.
Introduction
All cells employ different methods of regulating their metabolism in time and space.1 Doing so allows compartmentalization and dynamic spatial control of otherwise incompatible chemical reactions or metabolic pathways.1–3 As opposed to the lipid-based organelles used by eukaryotes, prokaryotes mainly employ various protein-based modalities to establish and maintain discrete subcellular compartmental-ization.1–5 These strategies range from small homo-multimeric protein cages natively lacking sequestered protein cargo, to more complex, multi-component shell systems capable of housing multiple functional enzymes. A unifying theme for all prokaryotic protein compartments and cages is their ability to self-assemble and create sequestered spaces within cells. Through various biological and chemical engineering strategies, diverse nano-sized protein cages have been repurposed as programmable molecular containers for use in biocatalysis, biomedicine, and bionanotechnology.6, 7
Examples of prokaryotic protein cages include the small 8–12 nm ferritins, hollow shells comprised of identical protein subunits with the shell serving as a diffusion barrier and ferroxidase at the same time, allowing effective iron storage within, without the need to sequester proteinaceous cargo.8, 9 In comparison, the much larger 40–200 nm bacterial microcompartments (BMCs) consist of multi-component cages sequestering multiple enzymes that act together to yield a complex metabolic organelle-like compartment.4, 10 The more recently discovered encapsulin nanocompartments occupy the space between these two examples with respect to size, complexity, and ability to encapsulate cargo proteins, and will be the focus of this review.11–13
Encapsulins are icosahedral protein nanocages that range from 24 to 42 nm in size with varying triangulation numbers (T1, T3, or T4) formed via self-assembly of 60 to 240 subunits of the same shell protein exhibiting the HK97 (Hong Kong 97) phage-like fold (Fig. 1A–C).13–15 Notably, the eponymous feature of encapsulins is their ability to encapsulate specific cargo proteins during shell self-assembly using selective cargo loading mechanisms based on targeting domains (TDs) or targeting peptides (TPs) present at the N- or C-terminus of each cargo protein. This native, efficient, and modular cargo loading modality makes encapsulins excellent protein cargo carriers with potential broad applications as targeted drug delivery vehicles, vaccine platforms, and bionanoreactors, among others.14, 16–22 Recent genome datamining studies have led to the grouping of encapsulins into four separate families that vary in sequence, operon configuration, overall structure, and encapsulation mechanism.23–25 Family 1 encapsulins are the most extensively studied, with experimental information available for multiple systems based on their shell structures, associated cargo function, and respective cargo loading process. Similar studies pertaining to Family 2 encapsulins have recently begun to emerge, though these studies remain nascent in comparison to the data available for Family 1 encapsulins. Family 3 and Family 4 encapsulins remain putative and currently lack experimental validation.
As experimental details only exist for Family 1 and Family 2 encapsulins, this review will focus on the current understanding and use of encapsulin cargo loading of these two families. The native in vivo mechanisms involved in cargo loading as well as efforts undertaken to date to manipulate those mechanisms will be discussed. This review will also detail the practical application of encapsulin cargo loading as it pertains to recent bioengineering efforts as well as recent studies dissecting the mechanism of cargo encapsulation. Lastly, this review will discuss potential future challenges and directions, including prospective studies that may help further elucidate or manipulate encapsulin cargo loading, as well as the future potential that such rational manipulation of encapsulin cargo loading may hold for biocatalysis, biomedicine, and biomaterials research.
Family 1 Encapsulins - Targeting Peptides and Cargo Loading
As Family 1 encapsulins (Pfam ID: PF04454) were the first discovered family, most of the published encapsulin-based research has been focused on them. Originally identified in the 1990s as high-molecular-weight aggregates found in the supernatant of Brevibacterium linens M18, the first encapsulin was initially misidentified as a possible antibacterial bacteriocin dubbed linocin M18.26 Additional homologs were soon discovered in Mycobacterium species and Thermotoga maritima, some of which were thought to display proteolytic activity.27, 28 However, additional studies in the 2000s resulted in researchers being unable to replicate the previously perceived proteolytic effects while concurrent structural studies proved encapsulins to be homomultimeric, self-assembling capsid-like nanocompartments sequestering enzymes in their interior.13, 29–31
Family 1 encapsulins are mainly found in the bacterial phyla Actinobacteria, Proteobacteria, and Firmicutes, and are sorted into several operon types according to their native enzyme cargo.23 The operon arrangement for Family 1 encapsulins generally follows a layout comprised of an upstream gene encoding for the respective cargo protein followed by the gene encoding for the encapsulin shell protein, with or without flanking co-regulated accessory proteins (Fig. 1D).23, 32 In order for cargo encapsulation to occur, a targeting peptide (TP), sometimes also referred to as a cargo-loading peptide (CLP), is strictly necessary and is found at the C-terminus of all cargo proteins (Fig. 1D). TPs are usually separated from the catalytically active folded domain of the cargo by a flexible linker with high glycine and proline content of ca. 10–50 residues in length (Fig. 2A). This arrangement likely minimizes steric clashes between adjacent cargo proteins within the shell, thus maximizing cargo loading capacity. A corollary of this feature is that cargo proteins are generally poorly resolved in encapsulin structures due to their high mobility caused by being flexibly tethered to the shell interior. Cargo protein loading is not necessary for shell assembly. Encapsulin shells generally self-assemble very efficiently even in the absence of any cargo. This implies a cargo loading mechanism where co-expression of cargo and shell – as insured by a tight operon structure – allows efficient TP-shell interactions during shell self-assembly.
Several examples now exist in the literature providing reliable structural data illustrating TP-shell interactions (Fig. 2). Of these examples, the TPs of two ferritin-like protein (Flp) cargos bound to the interior surface of their respective T1 shells have been resolved – GGDLGIRK in the T. maritima system (Fig. 2B), and GSLGIGSLR in the Haliangium ochraceum system determined in both the “closed” and “open” pentameric conformations based on a shift in the encapsulin A-domain (Fig. 2C).13, 33 Further, two Flp TP-shell interactions were resolved for the T3 Myxococcus xanthus system showing the TPs to be SHPLTVGSLRR for the EncB cargo (Fig. 2D) and PEKRLTVGSLRR for the EncC cargo, both found to bind to all available binding sites in pentameric and hexameric shell facets (Fig. 2E).34 Lastly, the TP of an iron-mineralizing encapsulin-associated firmicute (IMEF) cargo protein bound to the hexamers of its native T4 shell from Quasibacillus thermotolerans was determined to be TVGSLIQ (Fig. 2F).14 Based on the cumulative structural data, the TP binding site has been determined to reside on the luminal surface of each Family 1 encapsulin shell protein subunit in a conserved cleft between the N-terminal helix and the P-domain (Fig. 1A and Fig. 2). TP lengths range from 7 to 12 residues that rigidly interact with the binding site. In many cargo proteins, additional C-terminal residues, usually less than 10, can be found after the rigidly interacting binding motif. However, they do not seem to be important for TP-shell interaction based on their absence in the structural data available at this time, though further research is warranted. Because the TP binding pocket completely resides within a single shell protein subunit and does not cross subunit boundaries, the maximal cargo loading is set by the total number of shell protein subunits – 60 for T1, 180 for T3, and 240 for T4 shells. However, cargo loading capacity is likely further determined by cargo protein size and oligomerization state. Bioinformatic analyses have provided further evidence that Family 1 TPs are often comprised of 10 to 20 C-terminal residues containing GSL or double GSL motifs –with exceptions as exemplified by the T. maritima system – often with an immediately subsequent positively charged residue.23, 35 Based on structural information and the consensus TP sequences of the main cargo types (Fig. 2G), a general TP binding mode can be derived where two or three hydrophobic residues (isoleucine, leucine, or valine) spaced one or two residues apart – the spacers often containing glycines for flexibility – specifically interact with hydrophobic patches within the binding pocket. In many instances, positively charged residues (lysines or arginines) follow this motif and seem to interact less specifically with negatively charged surface patches of the shell protein (Fig. 2H).
In sum, cargo loading in Family 1 encapsulin systems results from a combination of mass action based on the relative expression levels of cargo and shell proteins, and specific TP-mediated protein-protein interactions with the final number of encapsulated cargo proteins being additionally determined by the relative size of the shell and cargo as well as the cargo oligomerization state.
Family 2 Encapsulins - Targeting Domains and Cargo Loading
Family 2 encapsulins (Pfam ID: PF19307) can be distinguished from other encapsulin families by sequence similarity, distinct insertion domains within the shell protein, as well as their operon structure.23, 36 In Family 2 encapsulin systems, most putative cargo proteins are encoded by genes found immediately downstream of the encapsulin gene. This family is further subdivided into Family 2A and Family 2B, differentiated by the absence (2A) or presence (2B) of an insertion domain, annotated as a cyclic nucleoside monophosphate (cNMP)-binding domain, within the E-loop of the encapsulin shell protein. In contrast to the short C-terminal TPs found in Family 1 encapsulin systems, Family 2 cargo proteins contain long (20–260 residues), unannotated, intrinsically disordered targeting domains (TDs) generally located at their N-terminus (Fig. 3A).23, 36, 37
So far, little experimental evidence for Family 2 encapsulin systems has been published. However, the N-terminal targeting domain hypothesis has recently been confirmed for one cysteine desulfurase-encapsulating Family 2A system found in Synechococcus elongatus.36 Using in vitro assays, it was shown that the N-terminal 255 residue long domain found in the desulfurase cargo is necessary and sufficient for cargo encapsulation (Fig. 3B and Fig 3C). Furthermore, structural analysis of the cargo-loaded shell highlighted a resolvable, low-resolution density close to the 3-fold symmetry axis of the shell, suggesting a potential binding region for TDs on the shell interior (Fig. 3D). One caveat of this analysis is the fact that cargo-loading was carried out in vitro using a protein refolding procedure which could have resulted in a non-native mode of cargo encapsulation. Computational analysis of the N-terminal TD within the desulfurase cargo identified 20–30 residue long conserved motifs of high sequence identity (Fig. 3E), separated by long stretches of divergent, mostly hydrophobic residues.23, 36 To explore the contributions of each of the conserved motifs to cargo loading, different parts of the TD, containing different combinations of motifs, were N-terminally fused to a fluorescent reporter (GFP) followed by co-expression and purification. The results did not clearly identify a single motif or sub-region sufficient for maximal cargo loading. Instead, it seems that the full-length TD is needed to mediate optimal cargo encapsulation (Fig. 3C). This may have important mechanistic implications for the Family 2 cargo loading process which seems to be quite different compared to Family 1, relying on potentially multiple specific discontiguous interactions based on conserved sequence motifs separated by long flexible and hydrophobic linker regions which themselves might possess affinity for the interior of the encapsulin shell.
For other Family 2 cargo types besides desulfurases, recent bioinformatic analyses have shown similar motif-containing N-terminal domains annotated as mostly disordered.23 So far, one other putative cargo type, a 2-methylisoborneol (2-MIB) synthase with a similarly long, disordered N-terminal domain has been confirmed as a Family 2 cargo protein.38 However, no detailed structural or mechanistic analysis of this system is currently available in the literature.36, 38
Of additional note regarding Family 2 encapsulins is the putative existence of two-component shells, so far only bioinformatically predicted, where the Family 2 operon encodes two distinct encapsulin shell genes.23 As experimental data for these systems is currently lacking, it is not yet known how these encapsulins might assemble. However, if the gene products do assemble into functional encapsulin shells with two different types of subunits, there is a possibility that these systems can natively encapsulate defined stoichiometries of two distinct cargo proteins into the same two-component nanocompartment based on specific interactions of two distinct TDs with two distinct shell protein binding sites. If confirmed, such systems may hold significant potential for novel more complex bioengineering applications, beyond what is currently possible with engineered Family 1 systems.
Encapsulin Engineering and Applications
Protein-based nanocages have gained significant popularity for various engineering applications including for the delivery of therapeutics, as diagnostics, as small molecule and materials nanoreactors, and more.19, 20, 39–43 As opposed to lipid-based compartments, protein-based cages can be genetically engineered, generally self-assemble into defined 3D architectures, and can be easily chemically functionalized. Therefore, they represent excellent platforms for rational bioengineering. This has inspired a wide array of protein cage engineering using viral capsids, BMCs, ferritins, and de novo designed protein cages.4, 21, 42–50 Similar efforts at engineering Family 1 encapsulins have recently been undertaken making use of the key advantage of encapsulins, namely, their modular native in vivo cargo loading mechanism. With respect to non-native cargo encapsulation, a protein of interest can be easily genetically functionalized with the respective TP leading to efficient in vivo cargo encapsulation upon co-expression with the cognate encapsulin shell protein.48, 51 In addition, in vitro assembly of enzyme-loaded encapsulin nanoreactors is also possible based on in vitro disassembly of the encapsulin shell via exposure to extreme pH or denaturants followed by co-assembly of both shell and separately purified cargo after exchange into physiological buffer, yielding cargo-loaded protein cages.52
Potential benefits of encapsulating non-native cargo proteins are abundant and include improving the stability of cargo proteins under harsh conditions like elevated temperature, extreme pH, or exposure to proteases; controlling or improving catalysis; delivering a therapeutic or diagnostic payload; or a combination of the above. Below, we will first highlight efforts towards engineering TPs and modulating their targeting strength, followed by a discussion of select recent studies showcasing the progress made in employing encapsulins as bioengineering tools. Particular emphasis is placed on examples that improve cargo stability, add control over chemical reactions, or show therapeutic or diagnostic application potential (Table 1).
Table 1.
Targeting peptide engineering
So far, the prevailing strategy for encapsulating non-native cargo proteins inside encapsulins has been to genetically fuse the known native TP of a given system to the C-terminus of the cargo of interest. Relatively few reports have tried to change or optimize TP sequences for modulating cargo loading efficiency. To rationally and reliably mutate TPs for controlling cargo loading extent, or relative stoichiometry when encapsulating multiple cargos at once, TP-shell interactions need to be understood in detail. So far, only six TP-shell interactions have been structurally resolved, always of native TPs and their cognate encapsulin shell (Fig. 2). However, rational changes to TP sequences for altering TP-shell binding affinity will likely require further structural analyses of systematically mutated TPs. So far, only a few studies have systematically probed TP-shell interactions using experimental and computational approaches (Fig. 4).
One study utilized the T1 encapsulin from T. maritima and its native TP fused to a fluorescent reporter (sfGFP) in order to assess the minimal TP needed to attain maximal cargo loading (Fig. 4A and B).35 Different truncations of the C-terminal 30 residues found in the native T. maritima cargo protein were appended to the reporter. Bulk fluorescence after purification was used as a readout of cargo loading extent. Results showed that the 15 C-terminal residues of the native cargo were sufficient for optimal cargo loading. These include the 8 rigidly bound residues that could be structurally resolved (Fig. 2). It is likely that only these residues are needed for binding. In a similar study focused on the T1 encapsulin from Mycobacterium smegmatis, different truncations of the 19 C-terminal residues found in the native cargo were appended to an eGFP reporter.53 The results indicated that the 12 C-terminal residues were required to attain maximal cargo loading levels. Even though no structural information for the M. smegmatis TP-shell interaction is available, the 12 C-terminal residues contain a double GSL motif confirming its importance for cargo encapsulation.
In another recent study, a combined computational and experimental approach was taken to probe the influence of single residue substitutions within the TPs of the T1 T. maritima and T3 M. xanthus encapsulin systems.54 Rosetta-based force-field modelling was employed to predict the influence of mutations within the two TPs (Fig. 4C). Select TP mutants were then experimentally characterized. Computational prediction and experiment were found to generally agree. This approach led to further interesting insights, including the fact that most mutations were computationally predicted to have a negative effect on binding strength, in particular, the mostly conserved GSL-like motifs (Fig. 4D and E). The cumulative results of these studies highlight the fact that TPs vary with respect to native specificity and binding strength, and that TP-shell binding is significantly influenced by specific hydrophobic and ionic interactions, as well as TP flexibility.
Improved cargo stability under harsh conditions
Improved stability of cargo proteins upon encapsulation is an often-observed phenomenon in encapsulin systems and encompasses thermal stability, increased catalytic lifetime of sequestered enzymes, and protease resistance (Fig. 5A). Increased thermal stability is generally found in encapsulins originating from thermotolerant or thermophilic organisms. For example, encapsulation of IMEF cargo inside its native Q. thermotolerans T4 shell increased its melting temperature by 10°C.14 This improved thermal stability is not only seen for native cargo proteins, but also when non-native cargo is encapsulated. Multiple non-native enzymes showed prolonged catalytic activity at elevated temperatures when encapsulated inside the T1 protein shell of the thermophile M. hassiacum. This included an encapsulated dye-decolorizing peroxidase (DyP) cargo which was active at 40°C for 25 hours while the free DyP lost activity at that temperature after 30 minutes.57 Additionally, the carbohydrate oxidase mChitO lost activity after 90 minutes at 50°C whereas when encapsulated, mChitO still retained 50% activity after four hours. The same enzymes in their encapsulated forms also exhibited prolonged resistance against digestion with proteinase K. Similarly, another recent study showed that protein cargo encapsulated within a T1 encapsulin from the acidophile Acidipropionibacterium acidipropionici was protected from pepsin digest at pH 3 as well as trypsin and chymotrypsin degradation at pH 7.5.65 In general, the encapsulin shell is able to protect cargo from proteolytic degradation due to the physical sequestration of the cargo within a stable protein shell which is itself often highly protease resistant. The often observed increased thermal stability of encapsulated cargo might be a result of high local cargo molarity, tethering, and confinement within the tight encapsulin shell that leads to a favorable molecular crowding effect, where protein-protein interactions are optimized to prevent complete cargo unfolding, multimer dissociation, or aggregation and encourage rapid protein re-folding or re-association when partial denaturation or dissociation occurs.57, 69, 70 A similar mechanism might be responsible for the increased catalytic lifetime reported for many encapsulated enzymes.30, 36 Increased stability of proteins of interest upon encapsulation could be exploited for numerous applications, such as producing encapsulated biocatalysts that exhibit long-term stability under harsh conditions or acting as robust delivery vehicles for biological therapeutic or diagnostic proteins.
Encapsulation of metal catalysts and control of chemical reactions
Encapsulins have recently been employed to create metal catalyst-loaded nanocages for biocatalysis applications. An encapsulin-based nanoreactor containing the organometallic ruthenium catalyst [CpRu(HQ)(allyl)]PF6 was assembled by encapsulating a proteinaceous TP-fused HaloTag in vivo which was then purified and covalently modified with the ruthenium catalyst in vitro via a specific chloroalkane spacer.53 The resulting encapsulated catalyst was capable of de-N-allylation of a coumarin pro-fluorophore with a yield and kinetics similar to that of free PEGylated catalyst (Fig. 5B). The nanoreactor was also shown to be active in cultured mammalian cells. This successfully demonstrates the use of engineered encapsulins to conduct bio-orthogonal transition-metal catalysis in live cells. Furthermore, it demonstrates the possibility of using encapsulins for the delivery of catalysts that convert pro-drugs to their bioactive forms in situ. This is especially noteworthy as it opens up the possibility of delivering catalysts with low biocompatibility in an encapsulated, biocompatible, and active form. Other secondary tags besides HaloTag have recently been employed for similar purposes, including simple avidin tags. In general, this approach is promising for creating modular nanoreactors containing potentially multiple co-localized transition metal organocatalysts and enzymes in the same compartment leading to novel chemoenzymatic nanoreactor capabilities.71
Engineered encapsulins as therapeutics and diagnostics
Nanoscale encapsulation systems – including protein nanocages – have long been used as engineering platforms for nano-medicine applications.43, 72–76 Different platform types, e.g., lipid- vs protein-based compartments, offer different advantages and disadvantages, including accessible size range, stability, and ease of functionalization.77 As any therapeutics or diagnostics delivery system is required to be biocompatible, large enough to carry a cargo of interest, and targetable to a specific biological site, encapsulins in particular have recently shown significant promise.20, 39, 78, 79
A recent innovative study involved the use of non-UV light to control the production of toxic reactive oxygen species (ROS) inside cancer cells.58 A TP was genetically fused to a minimal singlet oxygen generator (mSOG), a flavoprotein that produces mainly singlet oxygen ROS upon exposure to blue light. The TP-fused mSOG was then encapsulated inside the T. maritima T1 encapsulin shell via co-expression yielding a nanocage-based platform for the delivery of photodynamic therapeutics (Fig. 5C). Encapsulated mSOG was shown to generate higher ROS levels than either the free mSOG or the encapsulin control due to the encapsulin’s additive O2 generative effect when combined with the free mSOG, which in turn is likely due to the observed non-specific adsorption of endogenous flavin molecules such as flavin mononucleotide (FMN), flavin adenine dinucleotide (FAD), and riboflavin to the encapsulin from T. maritima.52, 56 Furthermore, the use of encapsulated mSOG led to increased cell death in a lung cancer cell culture model, being attributed to the fact that encapsulated mSOG was shown to be taken up by cells while free mSOG was not internalized, which was consistent with previous reports that show free mSOG to be incapable of penetrating tumor cells.80 This system provides a novel, highly controllable method for the light-triggered generation of toxic ROS without the use of potentially harmful UV light or the need for additional small-molecule substrates. As ROS generation can be used as both a therapeutic modality and effective bioimaging signal, the platform represents a theranostic encapsulin-based delivery system.
This mSOG encapsulin platform was further engineered to display the Designed Ankyrin repeat protein (DARPin), an antibody mimic, on the exterior of the shell through genetic fusion of DARPin to the surface-exposed C-terminus of the encapsulin shell protein.19 This enabled the targeted delivery of mSOG to Human Epidermal growth factor Receptor 2 (HER2)-positive breast cancer cells and the subsequent light-triggered induction of toxic ROS leading to apoptosis. This platform self-assembles in a single step when expressed in Escherichia coli and offers specific targeting, photodynamic therapeutic ROS generation, and potential modular functionalization with different DARPin molecules selected to specifically bind to other targets.
Another innovative example of using encapsulins as reporters or diagnostic platforms was based on the in vivo encapsulation of the enzyme tyrosinase, able to polymerize melanin inside the encapsulin protein shell.60 The sequestered and concentrated melanin could then be used for imaging and tracking purposes due to its strong near-infrared absorption. Further, cells expressing this system did not show any growth defect based on melanin toxicity which is usually observed for non-encapsulated melanin.
Production of inorganic materials using encapsulins
Encapsulins have recently been used to create inorganic-organic biomaterials platforms. To introduce non-organic cargo into encapsulin shells, a recent study showed that TP-functionalized inorganic nanoparticles could be encapsulated by the T. maritima T1 encapsulin by following an in vitro co-assembly protocol.52 Gold nanoparticles (AuNPs) were surface-modified with TP-containing synthetic peptides consisting of an N-terminal cysteine residue for direct binding to the gold surface, followed by four residues ending in a glutamate able to electrostatically interact with the positive charge of an (11‐ mercaptoundecyl)‐N,N,N‐trimethylammonium bromide (MUTAB) shell around the AuNP, a short four-residue hinge motif ending in a flexible double glycine, and culminating in the seven residue native T. maritima TP. This allowed AuNPs to be encapsulated following denaturation of the encapsulin shell protein at pH 2, mixing of the functionalized TP-containing AuNPs, and refolding and reassembly of the encapsulin shell at pH 7 (Fig. 5D). The process was proven to be extremely efficient with 99% AuNP encapsulation as confirmed by TEM. This approach may be useful in the creation of protein-coated inorganic nanoparticles for use in antimicrobials and anti-cancer applications.
Engineered encapsulins as catalytic enzyme nanoreactors
Engineered encapsulin-based nanoreactors aim to emulate the advantages observed for naturally-occurring protein organelles and compartments.81–83 These include the ability to co-localize multiple enzymes which can improve intermediate channeling and pathway flux; increasing the local molarities of enzymes, substrates, and intermediates; preventing unwanted side reactions and toxic or reactive intermediate leakage; and generally improving the stability and performance of a sequestered enzymatic process .24, 84 Recent progress in utilizing encapsulins as enzyme nanoreactors include the encapsulation of the pyruvate dehydrogenase Aro10p in yeast to produce and protect an intermediate of the high-value opioid precursor norcoclaurine.59 When TP-fused Aro10p was encapsulated inside the Myxococcus xanthus T3 encapsulin shell, the Aro10p-generated reactive intermediate 4-hydroxyphenylacetaldehyde (4-HPPA) was protected from endogenous detoxification enzymes and could undergo spontaneous reaction with dopamine yielding norcoclaurine, whereas overexpression of free Aro10p did not lead to detectable levels of the opioid precursor.
Another recent example of employing encapsulins as biocatalytic enzyme nanoreactors is the use of the DyP-peroxidase-loaded M. hassiacum T1 encapsulin together with free eugenol oxidase in an enzyme cascade, yielding lignin-like crosslinked reaction products.57 The main challenge for all protein-based nanoreactors encapsulating non-native enzymes is to overcome the often observed decrease in catalytic activity upon enzyme encapsulation.57 This is generally due to the protein shell not being optimized for the influx of the particular substrates and cofactors needed by a specific non-native sequestered enzyme. Future efforts aimed at pore engineering to improve molecular flux across encapsulin shells will likely be able to address this problem and result in fully catalytically active nanoreactors.22, 64, 85, 86
Conclusions and Future Challenges
Encapsulins offer advantages relative to other nanocages used in bioengineering, including their exclusively proteinaceous nature, biophysical robustness, genetic engineerability, and facile in vivo cargo loading that negates the need for additional methods like cargo-scaffold or cargo-capsid genetic fusions, covalent conjugation, or harsh refolding procedures.7, 40, 87, 88 Encapsulin research has made substantial progress over the past decade, generating novel insights into shell structure and dynamics, cargo encapsulation mechanisms, biological function, and engineering applications.24 This progress has been almost exclusively confined to Family 1 systems. However, recent bioinformatic analyses have unveiled thousands of novel encapsulin systems across numerous bacterial and archaeal phyla.23 Many of these systems likely contain novel useful features for future bioengineering and synthetic biology applications including dynamic and controllable pores, shell insertion domains, and two-component protein shells.38, 65 Recent projects aimed at engineering encapsulins have highlighted a range of promising application areas, many of which have been highlighted in this review. However, some of these studies have also revealed a number of outstanding challenges to be overcome, including limitations associated with the various techniques used for determining cargo loading, such as the potential imprecision of gel densitometry, the difficulty of deconvoluting shell and cargo UV/Vis signals, and the cost and nascence of mass photometry.89, 90 Notably, recent studies have also highlighted the current insufficient understanding of the cargo encapsulation mechanism which at the moment prevents the rational design of stoichiometrically defined cargo-loaded encapsulins. Therefore, systematic studies probing the effects of TP mutations on cargo loading and TP binding mode should be prioritized in future studies, along with elucidating the mechanistic details of pore dynamics and molecular flux across the protein shell, and investigating potential two-component shell assembly, all potentially highly useful features for various engineering applications. We envision encapsulins as a modular and robust platform technology that, once their basic biophysical and biochemical characteristics are thoroughly understood, will find many applications in biomedicine, biocatalysis, biomaterials research, and bionanotechnology.
Acknowledgements
This study was supported by the NIH (R35GM133325). Molecular graphics and analyses performed with UCSF ChimeraX, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from the National Institutes of Health R01‐GM129325 and the Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases.
Footnotes
Conflicts of interest
There are no conflicts to declare.
References
- 1.Cornejo E, Abreu N and Komeili A, Curr Opin Cell Biol, 2014, 26, 132–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Diekmann Y and Pereira-Leal JB, Biochem J, 2013, 449, 319–331. [DOI] [PubMed] [Google Scholar]
- 3.Gabaldón T and Pittis AA, Biochimie, 2015, 119, 262–268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Aussignargues C, Pandelia ME, Sutter M, Plegaria JS, Zarzycki J, Turmo A, Huang J, Ducat DC, Hegg EL, Gibney BR and Kerfeld CA, J Am Chem Soc, 2016, 138, 5262–5270. [DOI] [PubMed] [Google Scholar]
- 5.Kerfeld CA, Heinhorst S and Cannon GC, Annu Rev Microbiol, 2010, 64, 391–408. [DOI] [PubMed] [Google Scholar]
- 6.Edwardson TGW, Levasseur MD, Tetter S, Steinauer A, Hori M and Hilvert D, Chem Rev, 2022, 122, 9145–9197. [DOI] [PubMed] [Google Scholar]
- 7.Jones JA and Giessen TW, Biotechnol Bioeng, 2020. [DOI] [PMC free article] [PubMed]
- 8.Rivera M, Acc Chem Res, 2017, 50, 331–340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Honarmand Ebrahimi K, Hagedoorn PL and Hagen WR, Chem Rev, 2015, 115, 295–326. [DOI] [PubMed] [Google Scholar]
- 10.Kerfeld CA, Sawaya MR, Tanaka S, Nguyen CV, Phillips M, Beeby M and Yeates TO, Science, 2005, 309, 936–938. [DOI] [PubMed] [Google Scholar]
- 11.Giessen TW, Curr Opin Chem Biol, 2016, 34, 1–10. [DOI] [PubMed] [Google Scholar]
- 12.McHugh CA, Fontana J, Nemecek D, Cheng N, Aksyuk AA, Heymann JB, Winkler DC, Lam AS, Wall JS, Steven AC and Hoiczyk E, EMBO J, 2014, 33, 1896–1911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sutter M, Boehringer D, Gutmann S, Günther S, Prangishvili D, Loessner MJ, Stetter KO, Weber-Ban E and Ban N, Nat Struct Mol Biol, 2008, 15, 939–947. [DOI] [PubMed] [Google Scholar]
- 14.Giessen TW, Orlando BJ, Verdegaal AA, Chambers MG, Gardener J, Bell DC, Birrane G, Liao M and Silver PA, Elife, 2019, 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tso DJ, Hendrix RW and Duda RL, J Mol Biol, 2014, 426, 2112–2129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Giessen TW and Silver PA, Nat Microbiol, 2017, 2, 17029. [DOI] [PubMed] [Google Scholar]
- 17.Chen AH and Silver PA, Trends Cell Biol, 2012, 22, 662–670. [DOI] [PubMed] [Google Scholar]
- 18.Gabashvili AN, Chmelyuk NS, Sarkisova VA, Melnikov PA, Semkina AS, Nikitin AA and Abakumov MA, Int J Mol Sci, 2022, 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Van de Steen A, Khalife R, Colant N, Mustafa Khan H, Deveikis M, Charalambous S, Robinson CM, Dabas R, Esteban Serna S, Catana DA, Pildish K, Kalinovskiy V, Gustafsson K and Frank S, Synth Syst Biotechnol, 2021, 6, 231–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lagoutte P, Mignon C, Stadthagen G, Potisopon S, Donnat S, Mast J, Lugari A and Werle B, Vaccine, 2018, 36, 3622–3628. [DOI] [PubMed] [Google Scholar]
- 21.Lua LH, Connors NK, Sainsbury F, Chuan YP, Wibowo N and Middelberg AP, Biotechnol Bioeng, 2014, 111, 425–440. [DOI] [PubMed] [Google Scholar]
- 22.Jenkins MC and Lutz S, ACS Synth Biol, 2021, 10, 857–869. [DOI] [PubMed] [Google Scholar]
- 23.Andreas MP and Giessen TW, Nat Commun, 2021, 12, 4748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Giessen TW, Annu Rev Biochem, 2022. [DOI] [PMC free article] [PubMed]
- 25.Tracey JC, Coronado M, Giessen TW, Lau MCY, Silver PA and Ward BB, Sci Rep, 2019, 9, 20122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Valdés-Stauber N and Scherer S, Appl Environ Microbiol, 1994, 60, 3809–3814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Rosenkrands I, Rasmussen PB, Carnio M, Jacobsen S, Theisen M and Andersen P, Infect Immun, 1998, 66, 2728–2735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hicks PM, Rinker KD, Baker JR and Kelly RM, FEBS Lett, 1998, 440, 393–398. [DOI] [PubMed] [Google Scholar]
- 29.Akita F, Chong KT, Tanaka H, Yamashita E, Miyazaki N, Nakaishi Y, Suzuki M, Namba K, Ono Y, Tsukihara T and Nakagawa A, J Mol Biol, 2007, 368, 1469–1483. [DOI] [PubMed] [Google Scholar]
- 30.Rahmanpour R and Bugg TD, FEBS J, 2013, 280, 2097–2104. [DOI] [PubMed] [Google Scholar]
- 31.Contreras H, Joens MS, McMath LM, Le VP, Tullius MV, Kimmey JM, Bionghi N, Horwitz MA, Fitzpatrick JA and Goulding CW, J Biol Chem, 2014, 289, 18279–18289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH and Ferrin TE, Protein Sci, 2018, 27, 14–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ross J, McIver Z, Lambert T, Piergentili C, Bird JE, Gallagher KJ, Cruickshank FL, James P, Zarazúa-Arvizu E, Horsfall LE, Waldron KJ, Wilson MD, Mackay CL, Baslé A, Clarke DJ and Marles-Wright J, Sci Adv, 2022, 8, eabj4461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Eren E, Wang B, Winkler DC, Watts NR, Steven AC and Wingfield PT, Structure, 2022, 30, 551–563.e554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Cassidy-Amstutz C, Oltrogge L, Going CC, Lee A, Teng P, Quintanilla D, East-Seletsky A, Williams ER and Savage DF, Biochemistry, 2016, 55, 3461–3468. [DOI] [PubMed] [Google Scholar]
- 36.Nichols RJ, LaFrance B, Phillips NR, Radford DR, Oltrogge LM, Valentin-Alvarado LE, Bischoff AJ, Nogales E and Savage DF, Elife, 2021, 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Jones DT and Cozzetto D, Bioinformatics, 2015, 31, 857–863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Andreas MP and Giessen TW, MethodsX, 2022, 9, 101787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Moon H, Lee J, Min J and Kang S, Biomacromolecules, 2014, 15, 3794–3801. [DOI] [PubMed] [Google Scholar]
- 40.Giessen TW and Silver PA, Chembiochem, 2016, 17, 1931–1935. [DOI] [PubMed] [Google Scholar]
- 41.Putri RM, Allende-Ballestero C, Luque D, Klem R, Rousou KA, Liu A, Traulsen CH, Rurup WF, Koay MST, Castón JR and Cornelissen JJLM, ACS Nano, 2017, 11, 12796–12804. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Putri RM, Fredy JW, Cornelissen JJ, Koay MS and Katsonis N, Chemphyschem, 2016, 17, 1815–1818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ma Y, Nolte RJ and Cornelissen JJ, Adv Drug Deliv Rev, 2012, 64, 811–825. [DOI] [PubMed] [Google Scholar]
- 44.Schoonen L, Nolte RJ and van Hest JC, Nanoscale, 2016, 8, 14467–14472. [DOI] [PubMed] [Google Scholar]
- 45.Schoonen L and van Hest JC, Adv Mater, 2016, 28, 1109–1128. [DOI] [PubMed] [Google Scholar]
- 46.Michel-Souzy S, Hamelmann NM, Zarzuela-Pura S, Paulusse JMJ and Cornelissen JJLM, Biomacromolecules, 2021, 22, 5234–5242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Snijder J, Kononova O, Barbu IM, Uetrecht C, Rurup WF, Burnley RJ, Koay MS, Cornelissen JJ, Roos WH, Barsegov V, Wuite GJ and Heck AJ, Biomacromolecules, 2016, 17, 2522–2529. [DOI] [PubMed] [Google Scholar]
- 48.Rurup WF, Snijder J, Koay MS, Heck AJ and Cornelissen JJ, J Am Chem Soc, 2014, 136, 3828–3832. [DOI] [PubMed] [Google Scholar]
- 49.Sainsbury F, Saxena P, Aljabali AA, Saunders K, Evans DJ and Lomonossoff GP, Methods Mol Biol, 2014, 1108, 139–153. [DOI] [PubMed] [Google Scholar]
- 50.McConnell SA, Cannon KA, Morgan C, McAllister R, Amer BR, Clubb RT and Yeates TO, ACS Synth Biol, 2020, 9, 381–391. [DOI] [PubMed] [Google Scholar]
- 51.Tamura A, Fukutani Y, Takami T, Fujii M, Nakaguchi Y, Murakami Y, Noguchi K, Yohda M and Odaka M, Biotechnol Bioeng, 2015, 112, 13–20. [DOI] [PubMed] [Google Scholar]
- 52.Künzle M, Mangler J, Lach M and Beck T, Nanoscale, 2018, 10, 22917–22926. [DOI] [PubMed] [Google Scholar]
- 53.Lohner P, Zmyslia M, Thurn J, Pape JK, Gerasimaitė R, Keller-Findeisen J, Groeer S, Deuringer B, Süss R, Walther A, Hell SW, Lukinavičius G, Hugel T and Jessen-Trefzer C, Angew Chem Int Ed Engl, 2021, 60, 23835–23841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Altenburg WJ, Rollins N, Silver PA and Giessen TW, Sci Rep, 2021, 11, 4951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Xiong X, Sun C, Vago FS, Klose T, Zhu J and Jiang W, Biomolecules, 2020, 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Künzle M, Mangler J, Lach M and Beck T, Nanoscale, 2019, 11, 6497. [DOI] [PubMed] [Google Scholar]
- 57.Lončar N, Rozeboom HJ, Franken LE, Stuart MCA and Fraaije MW, Biochem Biophys Res Commun, 2020, 529, 548–553. [DOI] [PubMed] [Google Scholar]
- 58.Diaz D, Vidal X, Sunna A and Care A, ACS Appl Mater Interfaces, 2021, 13, 7977–7986. [DOI] [PubMed] [Google Scholar]
- 59.Lau YH, Giessen TW, Altenburg WJ and Silver PA, Nat Commun, 2018, 9, 1311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sigmund F, Massner C, Erdmann P, Stelzl A, Rolbieski H, Desai M, Bricault S, Wörner TP, Snijder J, Geerlof A, Fuchs H, Hrabĕ de Angelis M, Heck AJR, Jasanoff A, Ntziachristos V, Plitzko J and Westmeyer GG, Nat Commun, 2018, 9, 1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Snijder J, van de Waterbeemd M, Damoc E, Denisov E, Grinfeld D, Bennett A, Agbandje-McKenna M, Makarov A and Heck AJ, J Am Chem Soc, 2014, 136, 7295–7299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jones JA, Cristie-David AS, Andreas MP and Giessen TW, Angew Chem Int Ed Engl, 2021, 60, 25034–25041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Gabashvili AN, Chmelyuk NS, Sarkisova VA, Melnikov PA, Semkina AS, Nikitin AA and Abakumov MA, Int J Mol Sci, 2022, 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Williams EM, Jung SM, Coffman JL and Lutz S, ACS Synth Biol, 2018, 7, 2514–2517. [DOI] [PubMed] [Google Scholar]
- 65.Jones JA, Andreas MP and Giessen TW, BioRxiv, 2022.
- 66.Ebensperger P, Zmyslia M, Lohner P, Braunreuther J, Deuringer B, Becherer A, Süss R, Fischer A, Jessen HJ and Jessen-Trefzer C, ChemRxiv, 2022. [DOI] [PubMed]
- 67.Xiong X, Sun C, Vago FS, Klose T, Zhu J and Jiang W, Biomolecules, 2020, 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Klem R, de Ruiter MV and Cornelissen JJLM, Mol Pharm, 2018, 15, 2991–2996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Rodríguez JM, Allende-Ballestero C, Cornelissen JJLM and Castón JR, Nanomaterials (Basel), 2021, 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Zhou HX, Rivas G and Minton AP, Annu Rev Biophys, 2008, 37, 375–397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ebensperger P, Zmyslia M , Lohner P, Braunreuther J , Deuringer BB , Anita R. Süss , Fischer A and Trefzer J−, ChemRxiv, 2022.
- 72.Allen TM and Cullis PR, Adv Drug Deliv Rev, 2013, 65, 36–48. [DOI] [PubMed] [Google Scholar]
- 73.Duncan R, Nat Rev Drug Discov, 2003, 2, 347–360. [DOI] [PubMed] [Google Scholar]
- 74.Kedar U, Phutane P, Shidhaye S and Kadam V, Nanomedicine, 2010, 6, 714–729. [DOI] [PubMed] [Google Scholar]
- 75.Maham A, Tang Z, Wu H, Wang J and Lin Y, Small, 2009, 5, 1706–1721. [DOI] [PubMed] [Google Scholar]
- 76.Khoshnejad M, Parhiz H, Shuvaev VV, Dmochowski IJ and Muzykantov VR, J Control Release, 2018, 282, 13–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Brigger I, Dubernet C and Couvreur P, Adv Drug Deliv Rev, 2002, 54, 631–651. [DOI] [PubMed] [Google Scholar]
- 78.Lee TH, Carpenter TS, D’haeseleer P, Savage DF and Yung MC, Biotechnol Bioeng, 2020, 117, 603–613. [DOI] [PubMed] [Google Scholar]
- 79.Choi B, Moon H, Hong SJ, Shin C, Do Y, Ryu S and Kang S, ACS Nano, 2016, 10, 7339–7350. [DOI] [PubMed] [Google Scholar]
- 80.Mironova KE, Proshkina GM, Ryabova AV, Stremovskiy OA, Lukyanov SA, Petrov RV and Deyev SM, Theranostics, 2013, 3, 831–840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Kennedy NW, Mills CE, Nichols TM, Abrahamson CH and Tullman-Ercek D, Curr Opin Microbiol, 2021, 63, 36–42. [DOI] [PubMed] [Google Scholar]
- 82.Giessen TW and Silver PA, J Mol Biol, 2016, 428, 916–927. [DOI] [PubMed] [Google Scholar]
- 83.Kirst H and Kerfeld CA, BMC Biol, 2019, 17, 79. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Stewart AM, Stewart KL, Yeates TO and Bobik TA, Trends Biochem Sci, 2021, 46, 406–416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Adamson LSR, Tasneem N, Andreas MP, Close W, Jenner EN, Szyszka TN, Young R, Cheah LC, Norman A, MacDermott-Opeskin HI, O’Mara ML, Sainsbury F, Giessen TW and Lau YH, Sci Adv, 2022, 8, eabl7346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Tasneem N, Szyszka TN, Jenner EN and Lau YH, ACS Nano, 2022, 16, 8540–8556. [DOI] [PubMed] [Google Scholar]
- 87.Uchida M, Manzo E, Echeveria D, Jiménez S and Lovell L, Curr Opin Virol, 2022, 52, 250–257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Patterson DP, LaFrance B and Douglas T, Chem Commun (Camb), 2013, 49, 10412–10414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Wu D, Hwang P, Li T and Piszczek G, Gene Ther, 2022, 29, 691–697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Gassmann M, Grenacher B, Rohde B and Vogel J, Electrophoresis, 2009, 30, 1845–1855. [DOI] [PubMed] [Google Scholar]