Skip to main content
iScience logoLink to iScience
. 2021 Aug 25;24(9):103032. doi: 10.1016/j.isci.2021.103032

Herding cats: Label-based approaches in protein translocation through nanopore sensors for single-molecule protein sequence analysis

Keisuke Motone 1, Nicolas Cardozo 1, Jeff Nivala 1,
PMCID: PMC8433247  PMID: 34527891

Summary

Proteins carry out life's essential functions. Comprehensive proteome analysis technologies are thus required for a full understanding of the operating principles of biological systems. While current proteomics techniques suffer from limitations in sensitivity and/or throughput, nanopore technology has the potential to enable de novo protein identification through single-molecule sequencing. However, a significant barrier to achieving this goal is controlling protein/peptide translocation through the nanopore sensor for processive strand analysis. Here, we review recent approaches that use a range of techniques, from oligonucleotide conjugation to molecular motors, aimed at driving protein strands and peptides through protein nanopores. We further discuss site-specific protein conjugation chemistry that could be combined with these translocation approaches as future directions to achieve single-molecule protein detection and sequencing of native proteins.

Subject areas: Bioorganic chemistry, Biochemistry, Proteomics

Graphical abstract

graphic file with name fx1.jpg


Bioorganic chemistry; Biochemistry; Proteomics

Introduction

Proteins are the major functional molecules involved in essentially every biological process, such as building the cellular structure, regulating gene expression, and powering the immune system. Although genomics and transcriptomics provide fundamental information about cellular history and basal activity, proteomics plays a crucial role in filling the gap between genotype and phenotype as protein activities are more directly related to phenotype (Figure 1). Thus, protein analysis provides valuable information for understanding biological phenomena and disease. Unfortunately, unlike the remarkable technological improvements in DNA and RNA sequencing in recent years, the development of highly sensitive, high-throughput protein sequencing techniques have not yet been realized. There are two principal methods currently available for protein sequencing/identification that do not use affinity reagents such as antibodies: Edman degradation and mass spectrometry (Edman et al., 1950; Steen and Mann, 2004). Edman degradation is a useful technique for de novo sequencing, but it is limited to the analysis of homogenous protein samples and read lengths typically <50 amino acids, which are far shorter than the median protein length of eukaryotic (361 amino acid long), bacterial (267 amino acid long), and archaeal organisms (247 amino acid long) (Brocchieri, 2005). Mass spectrometry allows the analysis of protein mixtures and currently dominates proteomics research. Mass spectrometry has undergone significant improvements in instrumentation and sample preparation over the decades (Budnik et al., 2018; Specht et al., 2021; Zhu et al., 2018a, 2018b), although it still faces limitations in terms of detection sensitivity, dynamic range, analytical throughput, and instrumentation cost (Zubarev, 2013).

Figure 1.

Figure 1

Current analytical techniques for omics studies

(Left) The proteome is orders of magnitude more complex than the genome and transcriptome (Aebersold et al., 2018). (Right) Second and third-generation sequencing techniques can be used for high throughput genomic and transcriptomic analysis, while protein sequencing/identification is performed by relatively-low throughput methods such as mass spectrometry or Edman degradation.

To address these challenges, complementary or potentially disruptive platforms for next-generation protein analysis and sequencing have been envisioned recently (Alfaro et al., 2021; Asandei et al., 2020; Cressiot et al., 2020; Hu et al., 2021; Restrepo-Pérez et al., 2018; Timp and Timp, 2020). These emerging techniques include tunneling currents, single-molecule fluorescence, and nanopores. In 2014, Ohshiro et al. identified 12 different amino acids and phosphotyrosine using tunneling currents measured as the individual molecules threaded through a nanoscale electrode gap (Ohshiro et al., 2014). Tunneling currents have shown the remarkable sensitivity to discriminate even enantiomers and isobaric amino acids (Zhao et al., 2014). In more recent years, several fluorescence-based studies have demonstrated experimental proof-of-principle toward single-molecule protein fingerprinting (de Lannoy et al., 2021; Swaminathan et al., 2018; van Ginkel et al., 2018). For example, Swaminathan et al. established a fluorosequencing technology that couples Edman degradation with fluorescent labeling of specific residues (e.g., cysteines and lysines). Total internal reflection fluorescence (TIRF) microscopy allows for millions of fluorescently labeled peptide molecules to be identified in parallel based on the sparse fluorescent sequence (i.e., fingerprint) of each molecule determined by recurrent rounds of Edman degradation (Swaminathan et al, 2015, 2018). Thus, those innovative approaches based on tunneling currents or fluorescence have opened new avenues to revolutionize conventional proteomic technologies. However, they may face limitations compared to nanopore technology in terms of the potential to achieve de novo full-length protein sequencing. Approaches using tunneling currents, for instance, lack experimental demonstrations of controlling translocation of polypeptides through the gap and are currently limited to the analysis of amino acids or short peptides. Fluorescence-based methods are so far incompatible with de novo sequencing owing to constraint in the chemical repertoire to orthogonally label the 20 amino acids and the number of uniquely distinguishable fluorescent labels. In the following section, we present an overview of protein analysis and sequencing using nanopore technology.

Recent advances and challenges in nanopore sequencing

Nanopore technology relies on a nanometer-sized pore (i.e., nanopore) within an insulating membrane that separates two electrolyte-filled wells. A voltage applied across the membrane drives ionic current flow through the nanopore sensor between the cis and trans wells. As individual molecules translocate through or otherwise interact with the nanopore, they can rectify this ionic current flow and cause an observable signal change that provides structural insights into the molecules (Deamer and Akeson, 2000). Although nanopore sensing was initially proposed as a technique for the sequencing of nucleic acid strands (Kasianowicz et al., 1996) (Figure 2), it also has great potential for protein analysis. Single-molecule sensitivity, full-length readout, real-time measurement, and device portability is just as, if not more, crucial for proteomics than it is for genomics and transcriptomics (Shi et al., 2017). Towards this end, nanopore sensors have been used for discrimination of peptides and proteins (Asandei et al., 2017; Cardozo et al., 2021; Huang et al, 2017, 2019; Nivala et al., 2014; Piguet et al., 2018; Robertson and Reiner, 2018), real-time measurement of protein–protein (Thakur and Movileanu, 2019) and protein–ligand interactions (Harrington et al., 2013; Movileanu et al., 2000), antigen−antibody binding assays (Han et al., 2008; Madampage et al., 2010; Sexton et al., 2007), and aptamer-mediated protein detection (Rotem et al., 2012; Soskine et al., 2012; Sze et al., 2017). Moreover, protein nanopores have shown promise in identifying amino acids and post-translational modifications (PTMs), taking a major step toward single-molecule protein sequencing. For example, Ouldali et al. have recently shown that 13 of the 20 standard amino acids are distinguishable based on their current signals using an aerolysin nanopore (Ouldali et al., 2020). The study also has proposed a way toward the identification of the remaining 7 amino acids by instrumentation advances and nanopore engineering. The detection of PTMs such as phosphorylation and glycosylation, which serve as biomarkers of cell states and diseases (Aebersold et al., 2018; Pagel et al., 2015), has also been achieved with protein nanopore sensors (Restrepo-Pérez et al., 2019b; Rosen et al., 2014b; Wloka et al., 2017; Ying et al., 2019; Zhang et al., 2021b).

Figure 2.

Figure 2

Schematic representation of single-molecule sequencing with a nanopore

In nanopore sequencing, a nanometer-sized protein pore is embedded in an insulating membrane that separates two electrolyte-filled wells. Voltage is applied between the wells, causing ionic current flow through the pore. As single biopolymer molecules translocate through the channel, they generate sequence-specific ionic current signals that are diagnostic of the polymer sequence. Nanopore sequencing has been established as a commercial technology to read single DNA and RNA molecules (Garalde et al., 2018; Jain et al., 2016).

Despite these promising results, protein sequencing of intact, full-length protein strands using nanopores has been hindered, in part, because of the difficulty in controlling protein translocation through the sensor. This challenge exists because of two major reasons (Figure 3): First, the polypeptide backbone is neutrally charged and amino acid side chains can vary in charge state. Thus, electrophoresis-driven unidirectional translocation of peptides or proteins through nanopores cannot be as effectively employed, as it can for uniformly negatively-charged polymers like nucleic acids. Second, most proteins adopt a stable 3-dimensional fold. Thus, disruption of this tertiary structure is required for proteins to translocate through a narrow nanopore constriction for primary sequence analysis. In this review, we summarize the recent advances and remaining obstacles in controlling protein translocation through a nanopore and highlight label-based approaches that potentially address these challenges.

Figure 3.

Figure 3

Challenges in protein translocation through a nanopore

(Left) While DNA/RNA is uniformly negatively charged, (Middle) proteins can contain both negatively-charged (glutamate and aspartate) and positively-charged (arginine, lysine, and histidine) residues at physiological pH. Unidirectional translocation of proteins in the electric field must be achieved despite their nonuniform charge. (Right) The diameters of folded proteins are typically larger than the constriction of protein nanopores that would be suitable for protein sequencing application (∼1–2 nm) (Steinbock et al., 2014). For processive strand analysis, proteins must be unfolded to allow the denatured protein strand to thread through the nanopore with amino acid residues in single-file order.

Approaches for protein/peptide translocation through a nanopore

In 2004, peptide translocation through a nanopore was demonstrated for the first time (Sutherland et al., 2004). This study showed the translocation of short repeats of the collagen-like sequence (GPP) through an α-hemolysin nanopore. Another pioneering study investigated the interactions of an α-hemolysin nanopore with helical peptides containing the (AAKAA)n sequence (Movileanu et al., 2005). These works laid the foundation for peptide analysis using nanopores, but general approaches for translocation of native peptides/proteins through nanopores are required for the development of single-molecule protein sequencing.

First efforts for protein translocation involved physical and chemical denaturants. Several groups have shown that the use of high temperature (Payet et al., 2012), chaotropic agents (Cressiot et al, 2012, 2015; Merstorf et al., 2012; Oukhaled et al., 2007; Pastoriza-Gallego et al., 2011; Talaga and Li, 2009), or detergents (Kennedy et al., 2016; Restrepo-Pérez et al., 2017), facilitates protein translocation through nanopores. For example, Timp and colleagues have demonstrated protein translocation through solid-state pores using sodium dodecyl sulfate (SDS) as a denaturant (Kennedy et al., 2016). SDS further provides a near-uniform negative charge to denatured proteins and promotes the electrical control of the translocation kinetics, though it is unclear if the protein-bound SDS could interfere with the nanopore signal's sensitivity to amino acid sequence. Although such denaturation methods are compatible with solid-state nanopores and show great promise for protein translocation, they cannot be as readily applied to protein nanopore systems that include lipid or lipid-like membranes, which are susceptible to harsh conditions required to completely unfold stable proteins (e.g. high temperature or a high concentration of denaturants). To overcome this barrier, several label-based translocation strategies that are compatible with protein nanopores have been explored (Figure 4).

Figure 4.

Figure 4

Approaches for protein/peptide translocation through a nanopore

(A) Electrophoresis-driven translocation. Left) The negative charges of oligonucleotides enable electrophoresis-driven unidirectional movement of proteins. Right) Positively-charged (arginine, lysine, and histidine) or negatively-charged (glutamate and aspartate) residues promote unidirectional protein translocation under a negative or positive voltage bias, respectively.

(B) Motor-driven translocation. Left) An unfoldase present in the trans solution binds a recognition motif on the threaded protein (e.g., ssrA tag) and generates mechanical force sufficient to denature and processively pull the protein through the pore. Right) A DNA-processive motor bound to an oligonucleotide strand pulls the oligonucleotide-peptide conjugate from the trans to cis side against electrophoretic force. This allows for the peptide strand to pass through the nanopore constriction with a regular stepping size and yields sequence-dependent signals derived from the peptide strand in the pore's sensitive region.

Oligonucleotide-assisted translocation

In nanopore-based DNA and RNA sensing, nucleic acid strands can be electrophoretically-driven into and through a nanopore unidirectionally by an applied voltage as their phosphodiester backbone is intrinsically negatively charged. Hence, attaching an oligonucleotide strand to a protein is a straightforward way to facilitate electrophoresis-driven protein translocation (Figure 4A, Left). Using this approach, Bayley and colleagues have demonstrated controlled directional translocation of proteins through an α-hemolysin nanopore (Rodriguez-Larrea and Bayley, 2013, 2014; Rosen et al., 2020). The group used genetically engineered thioredoxin as a model protein and linked a C-terminal cysteine side chain to a 30-mer oligo(dC) nucleotide via a disulfide bond. The study demonstrated a co-translocational unfolding mechanism; first, the protein-tethered oligonucleotide is captured by the nanopore. Second, the oligonucleotide is pulled through the nanopore and the C-terminus of the protein locally unfolds because of the pulling force. Third, the remaining protein structure unfolds spontaneously after being critically destabilized as a result of the C-terminal local structure unfolding. Fourth, the unfolded protein completely translocates through the nanopore. A recent follow-up work to this has also shown that simple neural network-based classifiers can distinguish single point mutations in the protein that result in altered unfolding characteristics (Rodriguez-Larrea, 2021). Other groups have also demonstrated successful translocation of proteins and peptides using tethered oligonucleotides (Biswas et al., 2015; Pastoriza-Gallego et al., 2014), confirming the effectiveness of this translocation approach. This method, however, generates fast translocation events (<1 ms) that may cause poor signal-to-noise ratios and thus make this method less sensitive to protein sequence-level changes. This method may also be ineffective at translocating larger, multi-domain proteins, as the electrophoretic pulling force is largely absent after the oligonucleotide has completely translocated through the nanopore. As molecular motors and nanopore engineering have been employed to reduce the velocity of translocating DNA and RNA in nanopore nucleic acid sequencing (Cherf et al., 2012; Garalde et al., 2018; Manrao et al., 2012; Rincon-Restrepo et al., 2011), techniques to regulate the rate of protein translocation may be required for the acquisition of well-resolved and reproducible current signals.

Charged peptide-assisted translocation

Similar to the oligonucleotide-assisted approach, the fusion of charged peptides promotes unidirectional protein translocation (Figure 4A, Right). Ouldali et al. have demonstrated translocation of XRRRRRRR peptides through an aerolysin nanopore, where X represents the 20 standard amino acids (Ouldali et al., 2020). The translocation experiment was run under a −50 mV bias applied to the trans side to enable the polycationic peptide to move toward the trans compartment by electrophoretic force. Moreover, Restrepo-Pérez et al. have shown translocation of 30 amino acid long peptides, containing 10 glutamates at the N-terminus and 10 arginines at the C-terminus (Restrepo-Pérez et al, 2019a, 2019b). The “tug-of-war” state created by the terminal charged residues enabled slow translocation (>1 ms) of peptides through a FraC nanopore to read out PTMs and 6 distinct chemical tags on the peptides. Our group has recently shown that an Smt3 protein genetically tagged at its C-terminus with a polyGSD peptide can promote capture of the protein in a CsgG nanopore but does not readily drive unfolding and complete translocation (Cardozo et al., 2021). This observation supports the conclusion that a more highly charged peptide or the addition of other mechanisms would be necessary to generate enough force to completely unfold and translocate stably folded proteins through a narrow nanopore.

Molecular motor-assisted translocation

Another approach for protein translocation is based on an unfoldase that enables enzyme-mediated unfolding and translocation of tagged proteins (Figure 4B, Left). We previously employed the AAA+ unfoldase ClpX, which specifically unfolds proteins bearing a C-terminal ssrA peptide tag (AANDENYALAA), for processive unfolding of large proteins (Nivala et al, 2013, 2014). ClpX generates sufficient mechanical force (∼20 pN) to denature stable protein folds and translocates proteins at a rate suitable for nanopore sequencing (up to 80 amino acids per second) (Maillard et al., 2011). This approach has demonstrated ClpX-mediated translocation of proteins over 700 amino acids in length, including a variety of protein domains, that are genetically fused with the ssrA tag and a polyanion peptide linker designed to promote protein capture and retention in the nanopore electric field. Distinct protein domains as well as specific point mutations, proteolytic cleavage, and sequence rearrangements in those domains resulted in detectable ionic current pattern changes and single-molecule classification accuracies of 86–99% (Nivala et al., 2014).

While ClpX is capable of unfolding many different types of proteins even with very high stabilities (Kenniston et al., 2004), it likely does not generate sufficient force for some protein folds (Hoskins et al., 2002). In addition, disulfide bonds, which commonly occur in proteins (Bosnjak et al., 2014; Wiedemann et al., 2020), may interrupt protein translocation. The force exerted on protein strands by ClpX or by an electric field (tens to several hundred pN) in a typical experimental setup (Ouldali et al., 2020) is not able to break the covalent disulfide bond (Baldus and Gräter, 2012; Wiita et al., 2007). Although ClpX's ring-like structure is flexible enough to translocate a disulfide-linked beta hairpin into the proteolytic chamber of ClpP (Burton et al., 2001), it is unlikely that a more narrow, rigid nanopore protein would accommodate such a structure. Thus, the use of reducing agents would assist the linear translocation of proteins with disulfide bonds. Another consideration to the unfoldase approach is the large and variable translocation step size of ClpX. Although the fundamental step size of ClpX is ∼1 nm, this distance corresponds to an irregular number of amino acids that is dependent on the confirmation of the peptide backbone (typically 5–8 amino acids per 1 nm step). ClpX stepping can also occur in quick bursts of up to 4 nm (Cordova et al., 2014). Although this bursting activity is critical to its unfolding activity, it could complicate sequencing with single-amino-acid resolution. Efforts to explore alternative unfoldase motors that have a more well-defined step size, such as ClpA (Miller et al., 2013; Olivares et al., 2014), or proteasome systems (Olszewski et al., 2019; Zhang et al., 2020) may be necessary for building a more robust translocation system with optimal resolution.

As an alternative to the unfoldase approach, recent work has explored controlled peptide translocation through nanopores using DNA processive motors. This was accomplished by conjugating peptides to DNA oligonucleotides. A DNA processive motor can then ratchet the peptide-oligonucleotide conjugate through the pore (Figure 4B, Right). This technique has been demonstrated with two different types of DNA processive motors, a helicase (Brinkerhoff et al., 2021) or a polymerase (Yan et al., 2021). The process yields a series of discrete ionic current steps derived from enzyme-mediated peptide-oligonucleotide strand translocation. As the motors are not able to move along the peptide strand itself, the read length is limited to less than the ∼30 amino acids that typical nanopore vestibules, such as MspA, can accommodate. This length is still longer than the typical peptide analyzed in bottom-up mass spectrometry and could be further extended by nanopore engineering. While this novel approach overcomes the irregular stepping of ClpX and allows identification of peptide variants in some contexts with single-amino-acid resolution, applications are currently limited to the analysis of synthetic peptide strands with highly negative charges responsive to electrophoretic forces that stretch and trap the peptides within the nanopore (Brinkerhoff et al., 2021); otherwise peptide strands would escape from the nanopore (Yan et al., 2021). A sandwich structure with a negatively-charged strand linked to the other free terminus of a peptide may allow for the peptide to remain captured in the nanopore regardless of its charge distribution.

In summary, key advances have been made toward facilitated translocation of proteins and peptides through narrow protein nanopore sensors for the realization of single-molecule protein sequencing. The development of protein translocation systems has been an exciting and active research area and further improvements are anticipated in the near future.

Site-specific protein modification and conjugation chemistry

Nanopore sequencing is an interdisciplinary technology where a wide variety of fields intersect, from protein science and biophysics to chemistry and computer science (Alfaro et al., 2021; Robertson et al., 2021). As the translocation methods discussed above involve the conjugation of translocation-assisting molecules (e.g., oligonucleotides and peptides) to a protein terminus, protein modification and conjugation chemistry are a key step to allow analysis of native proteins, analogous to how nanopore DNA/RNA sequencing requires sample preparation steps such as adapter ligation (Garalde et al., 2018; Jain et al., 2016). To this end, robust bioconjugation molecules such as the azide and alkyne groups can be incorporated into proteins in a site-specific manner (Shadish and DeForest, 2020; Sletten and Bertozzi, 2009). This section reviews conventional amine-specific N-hydroxysuccinimide (NHS) ester chemistry, emerging terminal-specific protein modification chemistry, and protein bioconjugation with translocation-assisting molecules (Figure 5).

Figure 5.

Figure 5

Schematic representation of site-specific protein modification and conjugation

In the site-specific modification step, proteins are modified with functional groups for subsequent bioconjugation. While N-hydroxysuccinimide (NHS) esters and 2-pyridinecarboxaldehyde derivatives target the protein N-terminus, photoredox-catalyzed decarboxylation enables C-terminal specific reaction. In the bioconjugation step, the modified proteins are conjugated with nanopore-targeting molecules such as oligonucleotides or peptides via click chemistry.

NHS ester

NHS esters are reagents that target primary amines and form stable amide bonds with both the N-terminal amine and lysine side chains. As the basicity of N-terminal α-amines (pKa = 6–8) is lower than that of lysine ε-amines (pKa = ∼10.5), a lower pH offers a higher degree of selectivity toward the N-terminal amine at the cost of decreased reactivity (Rosen and Francis, 2017; Sereda et al., 1993). NHS esters are widely used to modify proteins with a variety of functional molecules, such as an azide group and HaloTag ligands (Los et al., 2008), that are then used in subsequent conjugation steps. By preparing translocation-assisting molecules linked to the NHS ester, it would be also possible to directly conjugate proteins with them. The main drawback of NHS esters for N-terminal specific labeling is minimizing the cross reactivity with lysines. For example, RNase A, which has 10 lysine residues, had 2 and 5 simultaneous modifications at pH 6.5 and 7.5, respectively, when reacted with 2-azidoacetic acid NHS ester (Inoue et al., 2019).

N-terminal specific modification

Because it is challenging to achieve complete selectivity toward the N-terminus using NHS esters, much effort has been expended to explore more selective reagents and a growing number of studies are now demonstrating efficient targeting (Hoyt et al., 2019; Rosen and Francis, 2017; Shadish and DeForest, 2020). Notably, the emergence of facile and versatile N-terminal specific modification methods have enabled protein modification at an excellent conversion rate, relying solely on commercially available reagents and thereby circumventing the requirement of complex chemical synthesis steps. In 2015, Francis and colleagues reported one-step modification using 2-pyridinecarboxaldehyde (2PCA) (MacDonald et al., 2015). They found that 2PCA specifically reacts in moderate to excellent conversion rates (43–96%) with the N-termini of peptides and proteins in physiological conditions (37°C, pH 7.5), and is compatible with most N-terminal sequences. The study further demonstrated incorporation of functional molecules such as biotin and fluorescein onto the N-terminus. The reaction depends on the characteristics of 2PCA, which specifically reacts with the N-terminus through the formation of cyclic imidazolidinone. The ε-amino groups of lysine residues are unreactive in this pathway owing to their higher basicity compared to the α-amino group of the N-terminus and the lack of a neighboring amide group suitable for cyclization, yielding no detectable side reaction on the ε-amino groups of lysines. One drawback of this method is that proteins with a proline in their second N-terminal position cannot be modified because of cyclization blocking.

More recently, other groups have also performed similar or revised approaches for N-terminal specific modification (Chen et al., 2017; Deng et al., 2020; Inoue et al., 2019; Li et al., 2018; Onoda et al., 2020). In 2019, Inoue et al. reported one-step N-terminal specific modification using 6-(azidomethyl)-2-pyridinecarbaldehyde (6AMPC) derivatives (Inoue et al., 2019). This study showed that 6AMPC provides excellent conversions toward various peptides and proteins (e.g., 95% for Angiotensin I and >90% for RNase A). The azide group added to the N-terminus of RNase A was subsequently used for conjugation with fluorescein via copper-catalyzed azide–alkyne cycloaddition (CuAAC; 92% conversion in 60 min) and strain-promoted azide–alkyne cycloaddition (SPAAC; 59% in 60 min), demonstrating the feasibility of linking alkyne-modified molecules to native proteins without the requirement of genetic engineering. In 2020, Onoda et al. developed a more general strategy for one-step modification of the protein N-terminus using triazolecarbaldehyde reagents (Onoda et al., 2020). In principle, this method is applicable for protein modification with any functional molecule containing a primary amine. Indeed, the N-terminus of RNase A was modified with a variety of functional molecules such as biotin, polyethylene glycol, azide, and alkyne groups with conversion rates ranging from 50% to 85%.

C-terminal specific modification

C-terminal specific modification methods have also been explored, though not as thoroughly as N-terminus targeting reactions. In 2018, MacMillan and co-workers developed a method based on light-mediated single-electron transfer as a mechanism enabling site- and chemoselective bioconjugation (Bloom et al., 2018). This approach exploited the innate difference in oxidation potentials between side chains (glutamate and aspartate) and C-terminal α-amino carboxylates (Galicia and González, 2002; Zuo and MacMillan, 2014). In their work, insulin was modified at its C-terminus with an alkyne group with a conversion rate of 41% in 8 h incubation at room temperature and pH 3.5. Although this approach did not show a comparable conversion rate to the N-terminal modification strategies, C-terminal specific modification is useful for targeting native proteins with N-terminals that have been post-translationally modified. While N-terminal acetylation rarely occurs in prokaryotes or archaea, 80–90% of cytosolic mammalian proteins and 50% of yeast proteins are N-terminally acetylated (Polevoda and Sherman, 2003). Thus, N-terminal specific modification needs an additional step for removal of the blocked N-terminus (Gheorghe et al., 1997) when targeting post-translationally N-terminally modified proteins, whereas the C-terminal modification approach can circumvent this requirement.

Protein conjugation chemistry

Protein conjugation is followed by site-specific modification in order to prepare protein conjugates linked to translocation-assisting biomolecules such as oligonucleotides and peptides (Figure 5). SPAAC is one of the most widely used click chemistry reactions for biomolecule conjugation (Khatwani et al., 2012). SPAAC utilizes a pair of reagents, azides and cyclooctynes, that exclusively and efficiently react with each other while remaining inert to naturally occurring functional groups such as amines. SPAAC enables protein labeling with a wide variety of biomolecules without any auxiliary reagents in an aqueous and low temperature (e.g. 4°C) environment that is suitable for protein stability. Using the SPAAC chemistry, Biswas et al. have demonstrated translocation of three different peptides ranging between 8 and 12 amino acids long through a solid-state nanopore (Biswas et al., 2015). In their work, protein N-termini were selectively modified with an azide group and subsequently conjugated with a dibenzocyclooctyne-modified 20-mer oligo(dT) nucleotide via SPAAC for oligo-mediated nanopore translocation. In a similar way, CuAAC is also useful for such tagging reactions. The CuAAC reaction requires the addition of a copper catalyst and a reductive reagent such as sodium ascorbate (Inoue et al., 2019). In addition, there are a variety of other toolkits available for bioconjugation (Hatlem et al., 2019; Rosen et al., 2014a; Rosen and Francis, 2017; Shadish and DeForest, 2020; Sletten and Bertozzi, 2009; Stephanopoulos and Francis, 2011). These bioconjugation approaches are customizable by simply changing chemical groups introduced into proteins in the site-specific modification step. HaloTag, for instance, is a mutant haloalkane dehalogenase that specifically binds synthetic ligands comprising a chloroalkane linker attached to various functional molecules (Los et al., 2008). Covalent bond formation between HaloTag and the chloroalkane linker is fast and essentially irreversible under physiological conditions, enabling versatile protein conjugation with a variety of molecules (England et al., 2015; Yazaki et al., 2020). Thus, these current, and potentially future, conjugation strategies will facilitate protein conjugation with translocation-assisting molecules, enabling controlled protein translocation through a nanopore without the need of genetic engineering.

Conclusion and perspective

This review highlighted the intersection of nanopore sensing, protein chemistry, and bioconjugation strategies. The site-specific bioconjugation chemistries presented here are useful for protein sequencing and identification using nanopore technology as well for other single-molecule approaches (de Lannoy et al., 2021; Howard et al., 2020; Zhang et al., 2021a). Single-molecule protein sequencing is a highly interdisciplinary endeavor and will require the integration of a wide array of techniques on top of bioconjugation chemistry for successful realization. While advances in bioconjugation chemistry hold great promise to resolve the critical issue of driving protein translocation in nanopore sequencing, the grand challenge to be addressed is accurate decoding of the ionic current signals into amino acid sequences. Indeed, unfoldase-mediated linear translocation of full-length proteins through an α-hemolysin nanopore did not demonstrate reading of amino acids with single-residue resolution (Nivala et al, 2013, 2014). However, recent progress in synergistic areas of research has increased the feasibility of nanopore-based protein sequencing. First, signal-to-noise ratios can be improved by exploring and engineering nanopores suitable for the detection of amino acids. As the average amino acid residue is around half the size of a monophosphate nucleotide, high sensitivity and resolution would likely be achieved by a nanopore with a sharp and narrow constriction, for example, recently developed CsgG or MspA rather than α-hemolysin (Van der Verren et al., 2020; Wang et al., 2018). Second, protein fingerprinting, in which a small subset of residues are orthogonally labeled with chemical tags that produce readily distinguishable nanopore signals, could improve the accuracy of amino acid read-out and protein identification (Ohayon et al., 2019; Yao et al., 2015). This strategy has been adopted in efforts toward nanopore-based protein/peptide identification (Restrepo-Pérez et al., 2019a). Third, the emergence of commercial nanopore sequencers has enabled highly parallel nanopore analysis and high-throughput data collection on a nanopore array (Cardozo et al., 2021; Zhang et al., 2021b), which otherwise would not be realized by single-channel experiments. A large set of training data, coupled with state-of-the-art machine learning models, would boost accurate translation of complex raw signals into amino acid sequences and PTMs, analogous to the example of highly improved basecalling accuracy in nanopore DNA sequencing using deep learning (Rang et al., 2018; Teng et al., 2018; Wick et al., 2019).

Nanopore protein sequencing is a challenging frontier that has yet to be realized, but there is great potential for this nascent technology to revolutionize current proteomics studies, which often suffer from limited sensitivity and throughput in contrast to genomics and transcriptomics technologies. As nanopore sensors provide single-molecule sensitivity, the development of nanopore-based de novo protein sequencing would enable comprehensive identification and quantification of proteins having low abundance or heterogenous PTMs, opening the door to single-molecule proteomics, novel protein biomarker and drug target discovery, and early diagnosis of disease. Further, the low cost and portability of nanopore sensing platforms, in contrast to mass spectrometry instrumentation, open new avenues for more ubiquitous proteomic analyses, for example in the field or in other low resource environments.

Acknowledgments

K.M. thanks the Japan Society for the Promotion of Science for the Overseas Research Fellowships and BioRender for helping create figures. This work was supported in part by NSF EAGER Award no. 1841188, NSF CCF Award no. 2006864, NIH/NCI Cancer Center Support Grant (no. P30 CA015704) Pilot Award and NSF Award 2021552 to J.N.

Author contributions

K.M. and N.C. wrote the first article draft. K.M., N.C., and J.N. edited the article and conceived of the review topic.

Declaration of interests

J.N. is a consultant to Oxford Nanopore Technologies. K.M. and N.C. declare no competing interests.

References

  1. Aebersold R., Agar J.N., Amster I.J., Baker M.S., Bertozzi C.R., Boja E.S., Costello C.E., Cravatt B.F., Fenselau C., Garcia B.A. How many human proteoforms are there? Nat. Chem. Biol. 2018;14:206–214. doi: 10.1038/nchembio.2576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alfaro J.A., Bohländer P., Dai M., Filius M., Howard C.J., van Kooten X.F., Ohayon S., Pomorski A., Schmid S., Aksimentiev A. The emerging landscape of single-molecule protein sequencing technologies. Nat. Methods. 2021;18:604–617. doi: 10.1038/s41592-021-01143-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Asandei A., Di Muccio G., Schiopu I., Mereuta L., Dragomir I.S., Chinappi M., Luchian T. Nanopore-based protein sequencing using biopores: current achievements and open challenges. Small Methods. 2020;4:1900595. doi: 10.1002/smtd.201900595. [DOI] [Google Scholar]
  4. Asandei A., Rossini A.E., Chinappi M., Park Y., Luchian T. Protein nanopore-based discrimination between selected neutral amino acids from polypeptides. Langmuir. 2017;33:14451–14459. doi: 10.1021/acs.langmuir.7b03163. [DOI] [PubMed] [Google Scholar]
  5. Baldus I.B., Gräter F. Mechanical force can fine-tune redox potentials of disulfide bonds. Biophys. J. 2012;102:622–629. doi: 10.1016/j.bpj.2011.12.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Biswas S., Song W., Borges C., Lindsay S., Zhang P. Click addition of a DNA thread to the N-termini of peptides for their translocation through solid-state nanopores. ACS Nano. 2015;9:9652–9664. doi: 10.1021/acsnano.5b04984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bloom S., Liu C., Kölmel D.K., Qiao J.X., Zhang Y., Poss M.A., Ewing W.R., MacMillan D.W.C. Decarboxylative alkylation for site-selective bioconjugation of native proteins via oxidation potentials. Nat. Chem. 2018;10:205–211. doi: 10.1038/nchem.2888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bosnjak I., Bojovic V., Segvic-Bubic T., Bielen A. Occurrence of protein disulfide bonds in different domains of life: a comparison of proteins from the Protein Data Bank. Protein Eng. Des. Sel. 2014;27:65–72. doi: 10.1093/protein/gzt063. [DOI] [PubMed] [Google Scholar]
  9. Brinkerhoff H., Kang A.S.W., Liu J., Aksimentiev A., Dekker C. Infinite re-reading of single proteins at single-amino-acid resolution using nanopore sequencing. bioRxiv. 2021 doi: 10.1101/2021.07.13.452225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Brocchieri L. Protein length in eukaryotic and prokaryotic proteomes. Nucl. Acids Res. 2005;33:3390–3400. doi: 10.1093/nar/gki615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Budnik B., Levy E., Harmange G., Slavov N. SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation. Genome Biol. 2018;19:161. doi: 10.1186/s13059-018-1547-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Burton R.E., Siddiqui S.M., Kim Y.I., Baker T.A., Sauer R.T. Effects of protein stability and structure on substrate processing by the ClpXP unfolding and degradation machine. EMBO J. 2001;20:3092–3100. doi: 10.1093/emboj/20.12.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cardozo N., Zhang K., Doroschak K., Nguyen A., Siddiqui Z., Bogard N., Strauss K., Ceze L., Nivala J. Multiplexed direct detection of barcoded protein reporters on a nanopore array. Nat. Biotechnol. 2021 doi: 10.1038/s41587-021-01002-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chen D., Disotuar M.M., Xiong X., Wang Y., Chou D.H.-C. Selective N-terminal functionalization of native peptides and proteins. Chem. Sci. 2017;8:2717–2722. doi: 10.1039/C6SC04744K. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cherf G.M., Lieberman K.R., Rashid H., Lam C.E., Karplus K., Akeson M. Automated forward and reverse ratcheting of DNA in a nanopore at 5-Å precision. Nat. Biotechnol. 2012;30:344–348. doi: 10.1038/nbt.2147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cordova J.C., Olivares A.O., Shin Y., Stinson B.M., Calmat S., Schmitz K.R., Aubin-Tam M.-E., Baker T.A., Lang M.J., Sauer R.T. Stochastic but highly coordinated protein unfolding and translocation by the ClpXP proteolytic machine. Cell. 2014;158:647–658. doi: 10.1016/j.cell.2014.05.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cressiot B., Bacri L., Pelta J. The promise of nanopore technology: advances in the discrimination of protein sequences and chemical modifications. Small Methods. 2020;4:2000090. doi: 10.1002/smtd.202000090. [DOI] [Google Scholar]
  18. Cressiot B., Braselmann E., Oukhaled A., Elcock A.H., Pelta J., Clark P.L. Dynamics and energy contributions for transport of unfolded pertactin through a protein nanopore. ACS Nano. 2015;9:9050–9061. doi: 10.1021/acsnano.5b03053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Cressiot B., Oukhaled A., Patriarche G., Pastoriza-Gallego M., Betton J.-M., Auvray L., Muthukumar M., Bacri L., Pelta J. Protein transport through a narrow solid-state nanopore at high voltage: experiments and theory. ACS Nano. 2012;6:6236–6243. doi: 10.1021/nn301672g. [DOI] [PubMed] [Google Scholar]
  20. de Lannoy C., Filius M., van Wee R., Joo C., de Ridder D. Evaluation of FRET X for single-molecule protein fingerprinting. bioRxiv. 2021 doi: 10.1101/2021.06.30.450512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Deamer D.W., Akeson M. Nanopores and nucleic acids: prospects for ultrarapid sequencing. Trends Biotechnol. 2000;18:147–151. doi: 10.1016/S0167-7799(00)01426-8. [DOI] [PubMed] [Google Scholar]
  22. Deng J.-R., Lai N.C.-H., Kung K.K.-Y., Yang B., Chung S.-F., Leung A.S.-L., Choi M.-C., Leung Y.-C., Wong M.-K. N-terminal selective modification of peptides and proteins using 2-ethynylbenzaldehydes. Commun. Chem. 2020;3:67. doi: 10.1038/s42004-020-0309-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Edman P., Högfeldt E., Sillén L.G., Kinell P.-O. Method for determination of the amino acid sequence in peptides. Acta Chem. Scand. 1950;4:283–293. doi: 10.3891/acta.chem.scand.04-0283. [DOI] [Google Scholar]
  24. England C.G., Luo H., Cai W. HaloTag technology: a versatile platform for biomedical applications. Bioconjug. Chem. 2015;26:975–986. doi: 10.1021/acs.bioconjchem.5b00191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Galicia M., González F.J. Electrochemical oxidation of tetrabutylammonium salts of aliphatic carboxylic acids in acetonitrile. J. Electrochem. Soc. 2002;149:D46. doi: 10.1149/1.1450616. [DOI] [Google Scholar]
  26. Garalde D.R., Snell E.A., Jachimowicz D., Sipos B., Lloyd J.H., Bruce M., Pantic N., Admassu T., James P., Warland A. Highly parallel direct RNA sequencing on an array of nanopores. Nat. Methods. 2018;15:201–206. doi: 10.1038/nmeth.4577. [DOI] [PubMed] [Google Scholar]
  27. Gheorghe M.T., Jörnvall H., Bergman T. Optimized alcoholytic deacetylation of N-acetyl-blocked polypeptides for subsequent Edman degradation. Anal. Biochem. 1997;254:119–125. doi: 10.1006/abio.1997.2380. [DOI] [PubMed] [Google Scholar]
  28. Han A., Creus M., Schürmann G., Linder V., Ward T.R., de Rooij N.F., Staufer U. Label-free detection of single protein molecules and protein−protein interactions using synthetic nanopores. Anal. Chem. 2008;80:4651–4658. doi: 10.1021/ac7025207. [DOI] [PubMed] [Google Scholar]
  29. Harrington L., Cheley S., Alexander L.T., Knapp S., Bayley H. Stochastic detection of Pim protein kinases reveals electrostatically enhanced association of a peptide substrate. Proc. Natl. Acad. Sci.U S A. 2013;110:E4417–E4426. doi: 10.1073/pnas.1312739110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hatlem D., Trunk T., Linke D., Leo J.C. Catching a SPY: using the SpyCatcher-SpyTag and related systems for labeling and localizing bacterial proteins. Int. J. Mol. Sci. 2019;20:2129. doi: 10.3390/ijms20092129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hoskins J.R., Yanagihara K., Mizuuchi K., Wickner S. ClpAP and ClpXP degrade proteins with tags located in the interior of the primary sequence. Proc. Natl. Acad. Sci. U S A. 2002;99:11037–11042. doi: 10.1073/pnas.172378899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Howard C.J., Floyd B.M., Bardo A.M., Swaminathan J., Marcotte E.M., Anslyn E.V. Solid-phase peptide capture and release for bulk and single-molecule proteomics. ACS Chem. Biol. 2020;15:1401–1407. doi: 10.1021/acschembio.0c00040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Hoyt E.A., Cal P.M.S.D., Oliveira B.L., Bernardes G.J.L. Contemporary approaches to site-selective protein modification. Nat. Rev. Chem. 2019;3:147–171. doi: 10.1038/s41570-019-0079-1. [DOI] [Google Scholar]
  34. Hu Z., Huo M., Ying Y., Long Y. Biological nanopore approach for single-molecule protein sequencing. Angew. Chem. 2021;133:14862–14873. doi: 10.1002/ange.202013462. [DOI] [PubMed] [Google Scholar]
  35. Huang G., Voet A., Maglia G. FraC nanopores with adjustable diameter identify the mass of opposite-charge peptides with 44 dalton resolution. Nat. Commun. 2019;10:835. doi: 10.1038/s41467-019-08761-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Huang G., Willems K., Soskine M., Wloka C., Maglia G. Electro-osmotic capture and ionic discrimination of peptide and protein biomarkers with FraC nanopores. Nat. Commun. 2017;8:935. doi: 10.1038/s41467-017-01006-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Inoue N., Onoda A., Hayashi T. Site-specific modification of proteins through N-terminal Azide labeling and a chelation-assisted CuAAC reaction. Bioconjug. Chem. 2019;30:2427–2434. doi: 10.1021/acs.bioconjchem.9b00515. [DOI] [PubMed] [Google Scholar]
  38. Jain M., Olsen H.E., Paten B., Akeson M. The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol. 2016;17:239. doi: 10.1186/s13059-016-1103-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kasianowicz J.J., Brandin E., Branton D., Deamer D.W. Characterization of individual polynucleotide molecules using a membrane channel. Proc. Natl. Acad. Sci. U S A. 1996;93:13770–13773. doi: 10.1073/pnas.93.24.13770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kennedy E., Dong Z., Tennant C., Timp G. Reading the primary structure of a protein with 0.07 nm3 resolution using a subnanometre-diameter pore. Nat. Nanotechnol. 2016;11:968–976. doi: 10.1038/nnano.2016.120. [DOI] [PubMed] [Google Scholar]
  41. Kenniston J.A., Burton R.E., Siddiqui S.M., Baker T.A., Sauer R.T. Effects of local protein stability and the geometric position of the substrate degradation tag on the efficiency of ClpXP denaturation and degradation. J. Struct. Biol. 2004;146:130–140. doi: 10.1016/j.jsb.2003.10.023. [DOI] [PubMed] [Google Scholar]
  42. Khatwani S.L., Kang J.S., Mullen D.G., Hast M.A., Beese L.S., Distefano M.D., Taton T.A. Covalent protein–oligonucleotide conjugates by copper-free click reaction. Bioorg. Med. Chem. 2012;20:4532–4539. doi: 10.1016/j.bmc.2012.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Li D., Han B., Wei R., Yao G., Chen Z., Liu J., Poon T.C.W., Su W., Zhu Z., Dimitrov D.S. N-terminal α-amino group modification of antibodies using a site-selective click chemistry method. mAbs. 2018;10:712–719. doi: 10.1080/19420862.2018.1463122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Los G.V., Encell L.P., McDougall M.G., Hartzell D.D., Karassina N., Zimprich C., Wood M.G., Learish R., Ohana R.F., Urh M. HaloTag: a novel protein labeling technology for cell imaging and protein analysis. ACS Chem. Biol. 2008;3:373–382. doi: 10.1021/cb800025k. [DOI] [PubMed] [Google Scholar]
  45. MacDonald J.I., Munch H.K., Moore T., Francis M.B. One-step site-specific modification of native proteins with 2-pyridinecarboxyaldehydes. Nat. Chem. Biol. 2015;11:326–331. doi: 10.1038/nchembio.1792. [DOI] [PubMed] [Google Scholar]
  46. Madampage C.A., Andrievskaia O., Lee J.S. Nanopore detection of antibody prion interactions. Anal. Biochem. 2010;396:36–41. doi: 10.1016/j.ab.2009.08.028. [DOI] [PubMed] [Google Scholar]
  47. Maillard R.A., Chistol G., Sen M., Righini M., Tan J., Kaiser C.M., Hodges C., Martin A., Bustamante C. ClpX(P) generates mechanical force to unfold and translocate its protein substrates. Cell. 2011;145:459–469. doi: 10.1016/j.cell.2011.04.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Manrao E.A., Derrington I.M., Laszlo A.H., Langford K.W., Hopper M.K., Gillgren N., Pavlenok M., Niederweis M., Gundlach J.H. Reading DNA at single-nucleotide resolution with a mutant MspA nanopore and phi29 DNA polymerase. Nat. Biotechnol. 2012;30:349–353. doi: 10.1038/nbt.2171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Merstorf C., Cressiot B., Pastoriza-Gallego M., Oukhaled A., Betton J.-M., Auvray L., Pelta J. Wild type, mutant protein unfolding and phase transition detected by single-nanopore recording. ACS Chem. Biol. 2012;7:652–658. doi: 10.1021/cb2004737. [DOI] [PubMed] [Google Scholar]
  50. Miller J.M., Lin J., Li T., Lucius A.L. E. coli ClpA catalyzed polypeptide translocation is allosterically controlled by the protease ClpP. J. Mol. Biol. 2013;425:2795–2812. doi: 10.1016/j.jmb.2013.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Movileanu L., Howorka S., Braha O., Bayley H. Detecting protein analytes that modulate transmembrane movement of a polymer chain within a single protein pore. Nat. Biotechnol. 2000;18:1091–1095. doi: 10.1038/80295. [DOI] [PubMed] [Google Scholar]
  52. Movileanu L., Schmittschmitt J.P., Martin Scholtz J., Bayley H. Interactions of peptides with a protein pore. Biophys. J. 2005;89:1030–1045. doi: 10.1529/biophysj.104.057406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Nivala J., Marks D.B., Akeson M. Unfoldase-mediated protein translocation through an α-hemolysin nanopore. Nat. Biotechnol. 2013;31:247–250. doi: 10.1038/nbt.2503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Nivala J., Mulroney L., Li G., Schreiber J., Akeson M. Discrimination among protein variants using an unfoldase-coupled nanopore. ACS Nano. 2014;8:12365–12375. doi: 10.1021/nn5049987. [DOI] [PubMed] [Google Scholar]
  55. Ohayon S., Girsault A., Nasser M., Shen-Orr S., Meller A. Simulation of single-protein nanopore sensing shows feasibility for whole-proteome identification. PLOS Comput. Biol. 2019;15:e1007067. doi: 10.1371/journal.pcbi.1007067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Ohshiro T., Tsutsui M., Yokota K., Furuhashi M., Taniguchi M., Kawai T. Detection of post-translational modifications in single peptides using electron tunnelling currents. Nat. Nanotechnol. 2014;9:835–840. doi: 10.1038/nnano.2014.193. [DOI] [PubMed] [Google Scholar]
  57. Olivares A.O., Nager A.R., Iosefson O., Sauer R.T., Baker T.A. Mechanochemical basis of protein degradation by a double-ring AAA+ machine. Nat. Struct. Mol. Biol. 2014;21:871–875. doi: 10.1038/nsmb.2885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Olszewski M.M., Williams C., Dong K.C., Martin A. The Cdc48 unfoldase prepares well-folded protein substrates for degradation by the 26S proteasome. Commun. Biol. 2019;2:29. doi: 10.1038/s42003-019-0283-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Onoda A., Inoue N., Sumiyoshi E., Hayashi T. Triazolecarbaldehyde reagents for one-step N-terminal protein modification. ChemBioChem. 2020;21:1274–1278. doi: 10.1002/cbic.201900692. [DOI] [PubMed] [Google Scholar]
  60. Oukhaled G., Mathé J., Biance A.-L., Bacri L., Betton J.-M., Lairez D., Pelta J., Auvray L. Unfolding of proteins and long transient conformations detected by single nanopore recording. Phys. Rev. Lett. 2007;98:158101. doi: 10.1103/PhysRevLett.98.158101. [DOI] [PubMed] [Google Scholar]
  61. Ouldali H., Sarthak K., Ensslen T., Piguet F., Manivet P., Pelta J., Behrends J.C., Aksimentiev A., Oukhaled A. Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore. Nat. Biotechnol. 2020;38:176–181. doi: 10.1038/s41587-019-0345-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Pagel O., Loroch S., Sickmann A., Zahedi R.P. Current strategies and findings in clinically relevant post-translational modification-specific proteomics. Expert Rev. Proteomics. 2015;12:235–253. doi: 10.1586/14789450.2015.1042867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Pastoriza-Gallego M., Breton M.-F., Discala F., Auvray L., Betton J.-M., Pelta J. Evidence of unfolded protein translocation through a protein nanopore. ACS Nano. 2014;8:11350–11360. doi: 10.1021/nn5042398. [DOI] [PubMed] [Google Scholar]
  64. Pastoriza-Gallego M., Rabah L., Gibrat G., Thiebot B., van der Goot F.G., Auvray L., Betton J.-M., Pelta J. Dynamics of unfolded protein transport through an aerolysin pore. J. Am. Chem. Soc. 2011;133:2923–2931. doi: 10.1021/ja1073245. [DOI] [PubMed] [Google Scholar]
  65. Payet L., Martinho M., Pastoriza-Gallego M., Betton J.-M., Auvray L., Pelta J., Mathé J. Thermal unfolding of proteins probed at the single molecule level using nanopores. Anal. Chem. 2012;84:4071–4076. doi: 10.1021/ac300129e. [DOI] [PubMed] [Google Scholar]
  66. Piguet F., Ouldali H., Pastoriza-Gallego M., Manivet P., Pelta J., Oukhaled A. Identification of single amino acid differences in uniformly charged homopolymeric peptides with aerolysin nanopore. Nat. Commun. 2018;9:966. doi: 10.1038/s41467-018-03418-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Polevoda B., Sherman F. N-terminal acetyltransferases and sequence requirements for N-terminal acetylation of eukaryotic proteins. J. Mol. Biol. 2003;325:595–622. doi: 10.1016/S0022-2836(02)01269-X. [DOI] [PubMed] [Google Scholar]
  68. Rang F.J., Kloosterman W.P., de Ridder J. From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biol. 2018;19:90. doi: 10.1186/s13059-018-1462-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Restrepo-Pérez L., Huang G., Bohländer P.R., Worp N., Eelkema R., Maglia G., Joo C., Dekker C. Resolving chemical modifications to a single amino acid within a peptide using a biological nanopore. ACS Nano. 2019;13:13668–13676. doi: 10.1021/acsnano.9b05156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Restrepo-Pérez L., Wong C.H., Maglia G., Dekker C., Joo C. Label-free detection of post-translational modifications with a nanopore. Nano Lett. 2019;19:7957–7964. doi: 10.1021/acs.nanolett.9b03134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Restrepo-Pérez L., John S., Aksimentiev A., Joo C., Dekker C. SDS-assisted protein transport through solid-state nanopores. Nanoscale. 2017;9:11685–11693. doi: 10.1039/C7NR02450A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Restrepo-Pérez L., Joo C., Dekker C. Paving the way to single-molecule protein sequencing. Nat. Nanotechnol. 2018;13:786–796. doi: 10.1038/s41565-018-0236-6. [DOI] [PubMed] [Google Scholar]
  73. Rincon-Restrepo M., Mikhailova E., Bayley H., Maglia G. Controlled translocation of individual DNA molecules through protein nanopores with engineered molecular brakes. Nano Lett. 2011;11:746–750. doi: 10.1021/nl1038874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Robertson J.W.F., Ghimire M.L., Reiner J.E. Nanopore sensing: a physical-chemical approach. Biochim. Biophys. Acta BBA Biomembr. 2021:183644. doi: 10.1016/j.bbamem.2021.183644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Robertson J.W.F., Reiner J.E. The utility of nanopore technology for protein and peptide sensing. Proteomics. 2018;18:1800026. doi: 10.1002/pmic.201800026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Rodriguez-Larrea D. Single-aminoacid discrimination in proteins with homogeneous nanopore sensors and neural networks. Biosens. Bioelectron. 2021;180:113108. doi: 10.1016/j.bios.2021.113108. [DOI] [PubMed] [Google Scholar]
  77. Rodriguez-Larrea D., Bayley H. Protein co-translocational unfolding depends on the direction of pulling. Nat. Commun. 2014;5:4841. doi: 10.1038/ncomms5841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Rodriguez-Larrea D., Bayley H. Multistep protein unfolding during nanopore translocation. Nat. Nanotechnol. 2013;8:288–295. doi: 10.1038/nnano.2013.22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Rosen C.B., Bayley H., Rodriguez-Larrea D. Free-energy landscapes of membrane co-translocational protein unfolding. Commun. Biol. 2020;3:160. doi: 10.1038/s42003-020-0841-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Rosen C.B., Francis M.B. Targeting the N terminus for site-selective protein modification. Nat. Chem. Biol. 2017;13:697–705. doi: 10.1038/nchembio.2416. [DOI] [PubMed] [Google Scholar]
  81. Rosen C.B., Kodal A.L.B., Nielsen J.S., Schaffert D.H., Scavenius C., Okholm A.H., Voigt N.V., Enghild J.J., Kjems J., Tørring T. Template-directed covalent conjugation of DNA to native antibodies, transferrin and other metal-binding proteins. Nat. Chem. 2014;6:804–809. doi: 10.1038/nchem.2003. [DOI] [PubMed] [Google Scholar]
  82. Rosen C.B., Rodriguez-Larrea D., Bayley H. Single-molecule site-specific detection of protein phosphorylation with a nanopore. Nat. Biotechnol. 2014;32:179–181. doi: 10.1038/nbt.2799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Rotem D., Jayasinghe L., Salichou M., Bayley H. Protein detection by nanopores equipped with aptamers. J. Am. Chem. Soc. 2012;134:2781–2787. doi: 10.1021/ja2105653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Sereda T.J., Mant C.T., Quinn A.M., Hodges R.S. Effect of the α-amino group on peptide retention behaviour in reversed-phase chromatography determination of the pKa values of the α-amino group of 19 different N-terminal amino acid residues. J. Chromatogr. A. 1993;646:17–30. doi: 10.1016/S0021-9673(99)87003-4. [DOI] [PubMed] [Google Scholar]
  85. Sexton L.T., Horne L.P., Sherrill S.A., Bishop G.W., Baker L.A., Martin C.R. Resistive-pulse studies of proteins and protein/antibody complexes using a conical nanotube sensor. J. Am. Chem. Soc. 2007;129:13144–13152. doi: 10.1021/ja0739943. [DOI] [PubMed] [Google Scholar]
  86. Shadish J.A., DeForest C.A. Site-selective protein modification: from functionalized proteins to functional biomaterials. Matter. 2020;2:50–77. doi: 10.1016/j.matt.2019.11.011. [DOI] [Google Scholar]
  87. Shi W., Friedman A.K., Baker L.A. Nanopore sensing. Anal. Chem. 2017;89:157–188. doi: 10.1021/acs.analchem.6b04260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Sletten E.M., Bertozzi C.R. Bioorthogonal chemistry: fishing for selectivity in a sea of functionality. Angew. Chem. Int. Ed. 2009;48:6974–6998. doi: 10.1002/anie.200900942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Soskine M., Biesemans A., Moeyaert B., Cheley S., Bayley H., Maglia G. An engineered ClyA nanopore detects folded target proteins by selective external association and pore entry. Nano Lett. 2012;12:4895–4900. doi: 10.1021/nl3024438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Specht H., Emmott E., Petelski A.A., Huffman R.G., Perlman D.H., Serra M., Kharchenko P., Koller A., Slavov N. Single-cell proteomic and transcriptomic analysis of macrophage heterogeneity using SCoPE2. Genome Biol. 2021;22:50. doi: 10.1186/s13059-021-02267-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Steen H., Mann M. The abc’s (and xyz’s) of peptide sequencing. Nat. Rev. Mol. Cell Biol. 2004;5:699–711. doi: 10.1038/nrm1468. [DOI] [PubMed] [Google Scholar]
  92. Steinbock L.J., Krishnan S., Bulushev R.D., Borgeaud S., Blokesch M., Feletti L., Radenovic A. Probing the size of proteins with glass nanopores. Nanoscale. 2014;6:14380–14387. doi: 10.1039/C4NR05001K. [DOI] [PubMed] [Google Scholar]
  93. Stephanopoulos N., Francis M.B. Choosing an effective protein bioconjugation strategy. Nat. Chem. Biol. 2011;7:876–884. doi: 10.1038/nchembio.720. [DOI] [PubMed] [Google Scholar]
  94. Sutherland T.C., Long Y.-T., Stefureac R.-I., Bediako-Amoa I., Kraatz H.-B., Lee J.S. Structure of peptides investigated by nanopore analysis. Nano Lett. 2004;4:1273–1277. doi: 10.1021/nl049413e. [DOI] [Google Scholar]
  95. Swaminathan J., Boulgakov A.A., Hernandez E.T., Bardo A.M., Bachman J.L., Marotta J., Johnson A.M., Anslyn E.V., Marcotte E.M. Highly parallel single-molecule identification of proteins in zeptomole-scale mixtures. Nat. Biotechnol. 2018;36:1076–1082. doi: 10.1038/nbt.4278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Swaminathan J., Boulgakov A.A., Marcotte E.M. A theoretical justification for single molecule peptide sequencing. PLOS Comput. Biol. 2015;11:e1004080. doi: 10.1371/journal.pcbi.1004080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Sze J.Y.Y., Ivanov A.P., Cass A.E.G., Edel J.B. Single molecule multiplexed nanopore protein screening in human serum using aptamer modified DNA carriers. Nat. Commun. 2017;8:1552. doi: 10.1038/s41467-017-01584-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Talaga D.S., Li J. Single-molecule protein unfolding in solid state nanopores. J. Am. Chem. Soc. 2009;131:9287–9297. doi: 10.1021/ja901088b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Teng H., Cao M.D., Hall M.B., Duarte T., Wang S., Coin L.J.M. Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning. GigaScience. 2018;7:giy037. doi: 10.1093/gigascience/giy037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Thakur A.K., Movileanu L. Real-time measurement of protein–protein interactions at single-molecule resolution using a biological nanopore. Nat. Biotechnol. 2019;37:96–101. doi: 10.1038/nbt.4316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Timp W., Timp G. Beyond mass spectrometry, the next step in proteomics. Sci. Adv. 2020;6:eaax8978. doi: 10.1126/sciadv.aax8978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Van der Verren S.E., Van Gerven N., Jonckheere W., Hambley R., Singh P., Kilgour J., Jordan M., Wallace E.J., Jayasinghe L., Remaut H. A dual-constriction biological nanopore resolves homonucleotide sequences with high fidelity. Nat. Biotechnol. 2020;38:1415–1420. doi: 10.1038/s41587-020-0570-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. van Ginkel J., Filius M., Szczepaniak M., Tulinski P., Meyer A.S., Joo C. Single-molecule peptide fingerprinting. Proc. Natl. Acad. Sci. U S A. 2018;115:3338–3343. doi: 10.1073/pnas.1707207115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Wang S., Zhao Z., Haque F., Guo P. Engineering of protein nanopores for sequencing, chemical or protein sensing and disease diagnosis. Curr. Opin. Biotechnol. 2018;51:80–89. doi: 10.1016/j.copbio.2017.11.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Wick R.R., Judd L.M., Holt K.E. Performance of neural network basecalling tools for Oxford Nanopore sequencing. Genome Biol. 2019;20:129. doi: 10.1186/s13059-019-1727-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Wiedemann C., Kumar A., Lang A., Ohlenschläger O. Cysteines and disulfide bonds as structure-forming units: insights from different domains of life and the potential for characterization by NMR. Front. Chem. 2020;8:280. doi: 10.3389/fchem.2020.00280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Wiita A.P., Perez-Jimenez R., Walther K.A., Gräter F., Berne B.J., Holmgren A., Sanchez-Ruiz J.M., Fernandez J.M. Probing the chemistry of thioredoxin catalysis with force. Nature. 2007;450:124–127. doi: 10.1038/nature06231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Wloka C., Van Meervelt V., van Gelder D., Danda N., Jager N., Williams C.P., Maglia G. Label-free and real-time detection of protein ubiquitination with a biological nanopore. ACS Nano. 2017;11:4387–4394. doi: 10.1021/acsnano.6b07760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Yan S., Zhang J., Wang Y., Guo W., Zhang S., Liu Y., Cao J., Wang Y., Wang L., Ma F. Single molecule ratcheting motion of peptides in a Mycobacterium smegmatis Porin A (MspA) nanopore. Nano Lett. 2021 doi: 10.1021/acs.nanolett.1c02371. [DOI] [PubMed] [Google Scholar]
  110. Yao Y., Docter M., van Ginkel J., de Ridder D., Joo C. Single-molecule protein sequencing through fingerprinting: computational assessment. Phys. Biol. 2015;12:055003. doi: 10.1088/1478-3975/12/5/055003. [DOI] [PubMed] [Google Scholar]
  111. Yazaki J., Kawashima Y., Ogawa T., Kobayashi A., Okoshi M., Watanabe T., Yoshida S., Kii I., Egami S., Amagai M. HaloTag-based conjugation of proteins to barcoding-oligonucleotides. Nucl. Acids Res. 2020;48:e8. doi: 10.1093/nar/gkz1086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Ying Y.-L., Yang J., Meng F.-N., Li S., Li M.-Y., Long Y.-T. A nanopore phosphorylation sensor for single oligonucleotides and peptides. Research. 2019;2019:1–8. doi: 10.34133/2019/1050735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Zhang L., Floyd B.M., Chilamari M., Mapes J., Swaminathan J., Bloom S., Marcotte E.M., Anslyn E.V. Photoredox-catalyzed decarboxylative C-terminal differentiation for bulk and single molecule proteomics. bioRxiv. 2021 doi: 10.1101/2021.07.08.451692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. Zhang L., Gardner M.L., Jayasinghe L., Jordan M., Aldana J., Burns N., Freitas M.A., Guo P. Detection of single peptide with only one amino acid modification via electronic fingerprinting using reengineered durable channel of Phi29 DNA packaging motor. Biomaterials. 2021:121022. doi: 10.1016/j.biomaterials.2021.121022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Zhang S., Huang G., Versloot R., Herwig B.M., de Souza P.C.T., Marrink S.-J., Maglia G. Bottom-up fabrication of a multi-component nanopore sensor that unfolds, processes and recognizes single proteins. bioRxiv. 2020 doi: 10.1101/2020.12.04.411884. [DOI] [Google Scholar]
  116. Zhao Y., Ashcroft B., Zhang P., Liu H., Sen S., Song W., Im J., Gyarfas B., Manna S., Biswas S. Single-molecule spectroscopy of amino acids and peptides by recognition tunnelling. Nat. Nanotechnol. 2014;9:466–473. doi: 10.1038/nnano.2014.54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Zhu Y., Clair G., Chrisler W.B., Shen Y., Zhao R., Shukla A.K., Moore R.J., Misra R.S., Pryhuber G.S., Smith R.D. Proteomic analysis of single mammalian cells enabled by microfluidic nanodroplet sample preparation and ultrasensitive NanoLC-MS. Angew. Chem. Int. Ed. 2018;57:12370–12374. doi: 10.1002/anie.201802843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Zhu Y., Piehowski P.D., Zhao R., Chen J., Shen Y., Moore R.J., Shukla A.K., Petyuk V.A., Campbell-Thompson M., Mathews C.E. Nanodroplet processing platform for deep and quantitative proteome profiling of 10–100 mammalian cells. Nat. Commun. 2018;9:882. doi: 10.1038/s41467-018-03367-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Zubarev R.A. The challenge of the proteome dynamic range and its implications for in-depth proteomics. Proteomics. 2013;13:723–726. doi: 10.1002/pmic.201200451. [DOI] [PubMed] [Google Scholar]
  120. Zuo Z., MacMillan D.W.C. Decarboxylative arylation of α-amino acids via photoredox catalysis: a one-step conversion of biomass to drug pharmacophore. J. Am. Chem. Soc. 2014;136:5257–5260. doi: 10.1021/ja501621q. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from iScience are provided here courtesy of Elsevier

RESOURCES