Abstract
Disulfide bonds in proteins have a substantial impact on protein structure, stability, and biological activity. Localizing disulfide bonds is critical for understanding protein folding and higher-order structure. Conventional top-down mass spectrometry (TD-MS), where only terminal fragments are assigned for disulfide intact proteins, can access disulfide information, but suffers from low fragmentation efficiency, thereby limiting sequence coverage. Here, we show that assigning internal fragments generated from TD-MS enhances the sequence coverage of disulfide-intact proteins by 20-60% by returning information from the interior of the protein sequence, which cannot be obtained by terminal fragments alone. The inclusion of internal fragments can extend the sequence information of disulfide-intact proteins to near complete sequence coverage. Importantly, the enhanced sequence information that arise from the assignment of internal fragments can be used to determine the relative position of disulfide bonds and the exact disulfide connectivity between cysteines. The data presented here demonstrates the benefits of incorporating internal fragment analysis into the TD-MS workflow for analyzing disulfide-intact proteins, which would be valuable for characterizing biotherapeutic proteins such as monoclonal antibodies and antibody-drug conjugates.
Introduction
Disulfide bonds are among the most important posttranslational modifications (PTMs) in proteins, as they have a substantial impact on protein structure, stability, and biological activity.1–4 Determining disulfide bonding patterns is critical for understanding protein folding and higher-order structure as non-native disulfide bridges and aggregates can have detrimental effects on a protein’s three-dimensional structure and consequently their function.5, 6 The advancement of biotherapeutics such as monoclonal antibodies and antibody-drug conjugates have further driven the development of more efficient and accurate experimental strategies including mass spectrometry (MS) and ion mobility-MS to characterize disulfide bond linkages,7–13 as disulfide connectivity, which ensures its proper folding and consequently biological function and immunogenicity, is considered as a critical quality attribute during antibody manufacturing.14, 15 Mass spectrometry has established itself as a frontrunner for these characterizations owing to its exceptional sensitivity, low sample requirements, and the ability to be coupled with chromatographic separations to generate and detect diagnostic fragment ions possessing various disulfide connectivities,16–19 which cannot be achieved easily by conventional methods such as nuclear magnetic resonance (NMR) and X-ray crystallography.20, 21
Conventional “bottom-up” MS approaches employ chemical reduction and alkylation to cleave disulfide bonds and cap the free cysteines, followed by enzymatic digestion of the protein prior to liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis.22, 23 Although protein sequence usually can be unambiguously determined using this approach, information on disulfide bond locations and connectivities can be lost.18, 19 To compensate for this limitation, alternative strategies including proteolysis without prior reduction or with partial reduction have been utilized to generate disulfide-linked peptides for LC-MS/MS measurements.24–29 This allows for the elucidation of disulfide bonding patterns by comparing the peptides resulting from the reduced regions with the peptides from constrained regions to identify disulfide-linked peptides. However, it is difficult to control the amount of disulfide reduction using this approach, which results in complex mixtures of peptides with differing amounts of capped cysteines, making data analysis challenging.30 Moreover, with limited disulfide reduction, protein sequence coverage may not be sufficient to capture all disulfide linkage information. This problem will be exacerbated with increasing protein size and/or proteins that contain a large number of disulfide bonds.31
Top-down mass spectrometry (TD-MS), where direct mass measurement and subsequent fragmentation of intact gas-phase protein ions in the mass spectrometer to obtain the primary sequence information, has gained more popularity in recent years for interrogating proteins with various PTMs, including but not limited to disulfide bonds.32–36 TD-MS bypasses the time-consuming digestion and separation steps, allowing for all disulfide information to be preserved. By comparing the accurate measured mass with the theoretical sequence mass of disulfide intact proteins, the number of disulfide bonds can be readily determined. The modification sites can be further identified by subsequent fragmentation of the intact protein ions with high sequence coverage. However, challenges still remain. Accessing disulfide bond information usually requires concurrent fragmentation of the protein backbone and disulfide bonds to gain extensive sequence coverage, which is important for localizing disulfide bridges, whereas TD-MS suffers from low relative fragmentation efficiency, limiting sequence coverage.37–39 To increase sequence coverage, various fragmentation methods (alternative to the traditionally employed collision-based techniques) have been employed to characterize disulfide-intact peptides and proteins including electron-based dissociation (ExD),9, 31, 40–43 photon-based dissociation (PD),19, 44–48 and their hybrid methods with varying success.12, 30, 49, 50 An additional approach to increase TD-MS sequencing efficiency is to incorporate the assignment of internal fragments,51, 52 generated by multiple gas-phase cleavages of the polypeptide backbone, into the data analysis workflow.53
While the analysis of internal fragment ions has been largely ignored by the TD-MS community due to the general lack of software tools to accurately and reliably assign them, the concept of the formation of internal fragment ions in TD-MS spectra is not novel. Previous studies have shown that the inclusion of internal fragments results in much richer sequence information of small peptides,51, 54–57 intact proteins,58–64 protein complexes,65, 66 and aid the identification of ambiguous proteoforms in mammalian cell lysates by top-down ptoteomics.67 In addition, a recent study by Chin et al. demonstrated the utility of internal fragments to enhance sequence coverage and to decipher disulfide bonds of disulfide-rich peptides.68 Schmitt et al. also applied internal fragments to determine sequence motifs located within a disulfide constrained loop of SOD1 protein that could not be achieved by terminal fragments alone.63 The benefits of including internal fragments for characterizing disulfide-intact proteins are two-fold. Identifiable internal fragment ions within disulfide constrained regions can be generated without the need to cleave the disulfide bond,68 lowering the barrier to obtaining more sequence information. Second, by including internal fragments, the chance of identifying product ions that result from cleavage of disulfide bonds to access disulfide linkage information is higher than analyzing terminal fragments alone.
Here, we show that assigning internal fragments generated from collisionally activated dissociation (CAD) and ExD can increase the sequence coverage of disulfide-intact proteins by accessing the interior of the protein sequence constrained by multiple disulfide bonds. Importantly, by correlating the number of disulfide bonds cleaved by internal fragments to their sequence positions, the relative locations of disulfide bonds can be determined. By specifically analyzing internal fragments with disulfide bonds remaining intact, disulfide connectivity can be determined. This study demonstrates the benefits of considering internal fragments when analyzing these heavily constrained proteins, which would be valuable for characterizing biotherapeutic proteins that contain a large number of disulfide bonds.
Experimental
Materials and Sample Preparation.
The proteins β-lactoglobulin from bovine milk, ribonuclease A from bovine pancreas, α-lactalbumin from bovine milk, trypsin inhibitor from glycine max, and m-nitrobenzyl alcohol (mNBA) were purchased from Sigma-Aldrich (St. Louis, MO, USA). Lysozyme from chicken white egg was acquired from EMD Millipore (Darmstadt, Germany). LC/MS-grade water, methanol and formic acid were obtained from Fisher Chemical (Hampton, NH). All proteins were used without further purification. Protein samples were prepared in 49.5:49.5:1 water/methanol/formic acid to a final concentration of 10 or 20 μM. Supercharging agent mNBA was added to the ribonuclease A and α-lactalbumin solutions at a 0.25% (v/v) concentration.
Mass Spectrometry.
All samples were measured with a 15-Tesla solariX Fourier transform ion cyclotron resonance (FTICR)-MS instrument equipped with an infinity ICR cell (Bruker Daltonics, Billerica, MA, USA). The protein solutions were loaded into in-house pulled capillaries coated with gold, and electrosprayed by applying a voltage between 0.7 and 1.5 kV on the ESI capillary. Individual charge states of each multiply-protonated protein (11+ to 15+ for β-lactoglobulin, 8+ to 12+ for lysozyme, 8+ to 14+ for ribonuclease A, and 11+ to 14+ for α-lactalbumin) were isolated in the quadrupole, with an isolation window of 10 m/z before fragmentation. Three fragmentation methods including CAD, electron capture dissociation (ECD), and electron induced dissociation (EID) were applied to each isolated ion. For CAD fragmentation, collision energies were adjusted to achieve the same lab-frame energy for different charge states of each protein. The lab-frame energy is defined as the multiplication product of charge state and collision energy. The lab-frame energies used for each protein are: β-lactoglobulin, 336V; lysozyme, 438V; ribonuclease A, 330V; α-lactalbumin, 286V to achieve optimal fragmentation. For ECD fragmentation, the pulse length was set at 0.02s, with a lens voltage at 50 V and bias voltage at 2 V. For EID fragmentation, the pulse length was set at 0.02s, with a lens voltage at 50 V and bias voltage ranging from 26 to 30V.
CAD-MS/MS of trypsin inhibitor (TI) was done by isolating [TI + 17H]17+ with an isolation window of 10 m/z. The CAD energy was set at 20V which reduced the precursor ion signal to ~40% of the mass spectral level.
ECD-MS/MS of β-lactoglobulin and lysozyme were also performed on a Waters SELECT SERIES™ Cyclic IMS Q-ToF mass spectrometer (Waters, Milford, MA, USA) with the electromagnetostatic ExD cell (e-MSion Inc., Corvallis, OR) mounted before the cyclic ion mobility cell to allow for pre-cIMS ECD fragmentation. All ECD parameters were optimized to achieve the best fragmentation.
Data Analysis.
Data Processing and Fragment Assignment.
Raw MS/MS spectra acquired on FTICR were deconvoluted using Bruker Data Analysis software (SNAP algorithm). Mass spectra acquired on the Waters Cyclic IMS Q-ToF instrument was deconvoluted using Waters BayesSpray algorithm. Deconvoluted mass lists were uploaded into the ClipsMS (2.0) program53 for fragment ion matching. The mass tolerance was set at 2 ppm for FTICR data and 5 ppm for Waters Q-ToF data and the smallest internal fragment size was set at 5 amino acids. For sequence coverage and disulfide bond cleavage analyses, to account for all disulfide-containing fragment ions, modifications considering all possible disulfide cleavage positions (S-S and C-S cleavage) were imported as an unlocalized modification file for fragment matching. Up to 2 water and ammonia losses were included in the unlocalized modification file for CAD fragmentation. No localized modifications were imported for these analyses. For disulfide connectivity analysis, modifications applying one hydrogen loss on each cysteine to suggest the integrity of the disulfide bond were imported as a localized modification file for fragment matching. No unlocalized modifications were imported for this analysis. All localized and unlocalized modification files for fragment matching are available in the Supporting Information (Table S1 to S7). All six terminal fragment types including a, b, c, x, y, z were searched for all three fragmentation methods, while only by internal fragments were searched for CAD and cz internal fragments for ECD/EID spectra. All terminal fragments were assigned first (i.e., given first priority) before considering internal fragments, and all overlapping internal fragments due to the arrangement and/or frameshift ambiguity63 were removed. After fragment matching and duplicates removal, all assigned internal fragments were further verified by manually examining their isotopic profiles against the raw MS/MS spectra to eliminate uncertain assignments.
Protein Sequence Coverage.
Protein sequence coverage is calculated by the number of observed inter-residue cleavage sites divided by the total number of possible inter-residue cleavage sites on the protein backbone.
Results and discussion
Internal fragments can access the interior protein sequence constrained by multiple disulfide bonds.
To demonstrate that internal fragments can enhance sequence information of disulfide intact proteins, three fragmentation methods were applied, CAD, ECD, and EID on various isolated precursor charge states of four disulfide-intact proteins, including β-lactoglobulin (2 disulfide bonds), lysozyme (4 disulfide bonds), ribonuclease A (4 disulfide bonds), and α-lactalbumin (4 disulfide bonds). The disulfide connectivity of these proteins is shown in Scheme 1. EID fragmentation of β-lactoglobulin, [B-lac + 14H]14+ generated rich mass spectra filled with informative peaks (Fig. 1A). Many of the peaks in the spectra that were not assigned as terminal fragments can be assigned as internal fragments (Fig. 1A inset), demonstrating that more information can be extracted from a single MS/MS spectrum when considering internal fragments. Importantly, the location of all the assigned fragments for B-lac demonstrates that internal fragments span much of the interior sequence enclosed by multiple disulfide bonds, providing complementary sequence information to terminal fragments (Fig. 1B). Similar results were also observed for EID of lysozyme, [Lys + 10H]10+ (Fig. 1C and 1D). In both cases, the extent of information extracted from a single mass spectrum can be enhanced significantly when including internal fragments. Further, ECD and CAD of the same isolated precursor ions show similar fragmentation patterns, although ECD is less energetic than EID and CAD, and generated significantly less internal fragments (Fig. S1 and S2).
Scheme 1.
Disulfide bond connectivities of the four proteins examined, A. β-lactoglobulin (2 disulfide bonds), B. lysozyme (4 disulfide bonds), C. ribonuclease A (4 disulfide bonds), D. α-lactalbumin (4 disulfide bonds).
Fig. 1.
Representative EID MS/MS spectra of A. β-lactoglobulin, [B-lac + 14H]14+ and C. lysozyme, [Lys + 10H]10+. Fragment location maps indicating the region of the protein sequence covered by terminal fragments (blue) and internal fragments (orange) for B. EID of β-lactoglobulin, [B-lac + 14H]14+ (spectrum in A) and D. EID of lysozyme, [Lys + 10H]10+ (spectrum in C). Vertical dashed lines in panels B and D represent cysteines positions, with the same color indicating a disulfide bond is formed between those two cysteines.
To compare sequence information obtained from terminal fragments with internal fragments, all assigned unique fragments generated from every charge state for each protein were integrated. Assigning internal fragments generated from CAD, ECD, and EID increases the sequence coverage by 20-60% for all proteins examined. For example, sequence coverage increases from 43% to 83% for EID of β-lactoglobulin (Fig. 2D), 37% to 84% for EID of lysozyme (Fig. 3F), 40% to 87% for EID of ribonuclease A (Fig. S3F), and 36% to 90% for EID of α-lactalbumin (Fig. S4F) after including internal fragments. Incorporating internal fragments can cover almost every single inter-residue site to achieve near complete sequence coverage (99%) for CAD of lysozyme (Fig. 3F), with CAD of α-lactalbumin also close to 100% sequence coverage (96%, Fig. S4F). This is primarily due to the fact that the generation of terminal fragments beyond regions enclosed by disulfide bonds is difficult (vide infra); most often, an S-S bond would need to be cleaved in order to release the terminal fragment. This is further discussed below.
Fig. 2.
The extent of sequence information obtained by terminal and internal fragments for β-lactoglobulin at different sequence regions after integrating data from all five charge states (11+ to 15+) and for all three fragmentation methods (CAD, ECD, and EID) examined, A. sequence not enclosed by disulfide bond, B. sequence enclosed by one disulfide bond, C. sequence enclosed by two disulfide bonds, D. whole sequence. Cross marks in each panel indicate the sequence coverage after combing terminal and internal fragments.
Fig. 3.
The extent of sequence information obtained by terminal and internal fragments for lysozyme at different sequence regions after combining data from all five charge states (8+ to 12+) and for all three fragmentation methods (CAD, ECD, and EID) examined, A. sequence not enclosed by disulfide bond, B. sequence enclosed by one disulfide bond, C. sequence enclosed by two disulfide bonds, D. sequence enclosed by three disulfide bonds. E. sequence enclosed by four disulfide bonds. F. whole sequence. Cross marks in each panel indicate the sequence coverage after combing terminal and internal fragments.
The sequence of these proteins can be classified into different regions depending on the number of disulfide bonds enclosed. For example, β-lactoglobulin has two disulfide bonds with a connectivity of Cys66-Cys160 and Cys106-Cys119 (Scheme 1A), thus the β-lactoglobulin sequence can be classified into three regions: i) sequence not enclosed by disulfide bond (residues 1-66, 160-162), ii) sequence enclosed by one disulfide bond (residues 66-106, 119-160), and iii) sequence enclosed by two disulfide bonds (residues 106-119). Similarly, the sequence of lysozyme, which possesses four disulfide bonds (Scheme 1B) can be classified into five regions including sequence not enclosed disulfide bond (residues 1-6, 127-129), sequence enclosed by one disulfide bond (residues 6-30, 115-127), sequence enclosed by two disulfide bonds (residues 30-64, 94-115), sequence enclosed by three disulfide bonds (residues 64-76, 80-94), and sequence enclosed by four disulfide bonds (residues 76-80). For the other two proteins with four disulfide bonds, the primary protein sequence can also be separated into specific regions (ribonuclease A, Scheme 1C, and α-lactalbumin, Scheme 1D). To investigate the utility of internal fragments for accessing highly disulfide constrained regions, the extent of sequence information obtained from terminal and internal fragments at different sequence regions were compared and a clear trend can be observed. Generally, most internal fragments originate from the interior of the sequence within disulfide bonded regions, while terminal fragments originate from the outermost sequence. For example, for CAD of β-lactoglobulin, terminal fragments cover more sequence not enclosed by disulfide bond than internal fragments (64% vs. 60%, Fig. 2A), corresponding to a change of +4%, while no terminal fragments and only internal fragments cover the sequence enclosed by two disulfide bonds (0% vs. 54%, Fig. 2C), corresponding to a change of −54%. Similarly, for CAD of lysozyme with four disulfide bonds and five distinct sequence regions, the sequence coverage change when comparing terminal vs. internal fragments are +43%, −14%, −47%, −65%, and −20% when going deeper into the middle of the sequence (Fig. 3A to E). This data clearly demonstrates that internal fragments significantly enhances sequence information of the regions constrained by multiple disulfide bonds. A similar trend was observed for ECD and EID of these two proteins (Fig. 2 and 3) and the other two proteins possessing four disulfide bonds (ribonuclease A, Fig. S3, and α-lactalbumin, Fig. S4), with the relative sequence coverage decreasing for terminal fragments while increasing for internal fragments when reaching the interior protein sequence (Fig. 2A to C, 3A to E, S3A to E, and S4A to E). Notably, some specific sequence regions can only be accessed by internal fragments, such as the sequence enclosed by two disulfide bonds of β-lactoglobulin (Fig. 2C) and sequence enclosed by four disulfide bonds of ribonuclease A (Fig. S3E) and α-lactalbumin (Fig. S4E), highlighting the ability of internal fragments to cover sequence regions that cannot be reached by terminal fragments. The data shown here shows promise for the inclusion of internal fragments in obtaining more comprehensive sequence information for disulfide-intact proteins.
Internal fragments can determine the relative position of disulfide bonds.
To determine the position of disulfide bonds, the number of disulfide bond cleavages were analyzed for these proteins. We show here that terminal fragments cleave disulfide bonds located on the exterior of the protein while internal fragments cleave disulfide bonds within the interior of the protein. For example, terminal fragments generated by EID of β-lactoglobulin (2 disulfide bonds) induced more cleavages at the outermost disulfide bond (Cys66-Cys160) than internal fragments (38 vs. 11, Fig. 4A), while only internal fragments cleaved the interior disulfide bond (9 times at the Cys106-Cys119 bond, Fig. 4B). This trend is more pronounced for proteins with a greater number of disulfide bonds. For example, EID of lysozyme (4 disulfide bonds) showed that the Cys6-Cys127 bond was cleaved 62 times by terminal fragments but only 6 times by internal fragments (Fig. 5A). For the Cys30-Cys115 bond, located more interior of the protein sequence, the difference between disulfide cleavages from terminal and internal fragments was reversed, 10 vs. 16, respectively (Fig. 5B). For the Cys64-Cys80 bond and the Cys76-Cys94 bond, the disulfide cleavages comparison is 0 vs. 17 and 0 vs. 19 (terminal vs. internal, Fig. 5C and D). This trend was also observed for CAD and ECD of β-lactoglobulin and lysozyme, and the other two disulfide bonded proteins (ribonuclease A, Fig. S5, and α-lactalbumin, Fig. S6). Surprisingly, for disulfide bonds buried within the protein, their cleavages were only explained by internal fragments (Fig. 4B, 5C and D, S5D, S6C and D), highlighting the use of internal fragments to access disulfide bond information that cannot be obtained by terminal fragments.
Fig. 4.
Number of disulfide bonds cleaved by terminal and internal fragments for β-lactoglobulin after integrating data from all five charge states (11+ to 15+) for all three fragmentation methods (CAD, ECD, and EID) examined, A. Cys66-Cys160 bond, B. Cys106-Cys119 bond. Cross marks in each panel indicate the disulfide bond cleavage counts after combing terminal and internal fragments.
Fig. 5.
Number of disulfide bonds cleaved by terminal and internal fragments for lysozyme after combining data from all five charge states (11+ to 15+) for all three fragmentation methods (CAD, ECD, and EID) examined, A. Cys6-Cys127 bond, B. Cys30-Cys115 bond, C. Cys64-Cys80 bond, D. Cys76-Cys94 bond. Cross marks in each panel indicate the disulfide bond cleavage counts after combing terminal and internal fragments.
These data indicate that by correlating the relative number of disulfide cleavages explained by internal fragments to their sequence positions, the relative locations of disulfide bonds can be determined. The outermost disulfide bonds are explained more by terminal fragments, as their formation usually only require one backbone cleavage in addition to one disulfide bond cleavage. In contrast, in order for internal fragments to explain these outermost disulfide bond cleavages, simultaneous cleavages of one disulfide bond and multiple protein backbone bonds are required, raising the energy barrier compared to terminal fragments. When going deeper into the protein sequence, more internal fragments result from cleavage of innermost disulfide bonds. In these highly constrained regions, simultaneous cleavages of multiple disulfide bonds and one protein backbone bond are needed to generate terminal fragments, while the formation of internal fragments still only require one disulfide bond cleavage in addition to multiple protein backbone cleavages. These results can be rationalized by considering the relative energies required to cleave the protein backbone (~10-15 kcal/mol) compared to the disulfide bond (~45-60 kcal/mol).38, 69 Because the energy barrier of cleaving a disulfide bond is higher than cleaving a protein backbone bond, the energy requirement of forming internal fragments in the interior protein sequence could be lower than for terminal fragments, and thus internal fragments could be more easily to cleave disulfide bonds buried within the protein. To support our data, ECD of β-lactoglobulin and lysozyme were conducted using a different mass spectrometry system (Waters Select Series Cyclic IMS Q-TOF). Similar trends for both sequence coverages and disulfide bond cleavages were observed (Fig. S7 and S8), further demonstrating the utility of internal fragments to cover the interior protein sequence and determine the relative positions of disulfide bonds.
Internal fragments retaining intact disulfide bonds can determine disulfide connectivity.
To determine the disulfide connectivity between cysteines, we focus on fragments that only result from protein backbone cleavages and retain the intact disulfide bonds. Fragments that arise from these types of cleavages can be divided into Type I fragments and Type II fragments (Scheme 2). Type I fragments correspond to fragments (terminal and internal) that traverse an even number of dehydrocysteine residues (e.g., 2, 4, 6) and contain mass shifts associated with the multiplication product of the number of disulfide bonds and dehydrocysteines (no. of disulfide bonds × −2 Da, Scheme 2). Type II fragments correspond to internal fragments formed between adjacent cysteine residues; thus no disulfide bonds are involved (Scheme 2). Type I fragment suggests that intact disulfide bonds are maintained within the cysteines involved, while Type II fragment suggests that those two adjacent cysteines are highly unlikely to be connected.
Scheme 2.
The two types of fragments retaining intact disulfide bonds to determine disulfide connectivity. A hydrogen loss (−1 Da) was applied on every cysteine residue to suggest the integrity of disulfide bonds involved. Type I fragment traverses an even number of dehydrocysteines (2, 4, 6 etc.), suggesting that intact disulfide bonds are formed within the cysteines involved. Type II fragment is generated between adjacent cysteines with no disulfide bonds involved, suggesting that those two adjacent cysteines are highly unlikely to be connected.
To determine disulfide connectivity using Type I and Type II fragments, CAD fragmentation of trypsin inhibitor (181 residues, 20.1 kDa, 2 disulfide bonds, Fig. S9A), [TI + 17H]17+ (Fig. 6A) was investigated, as the non-overlapping feature of the two disulfide bonds of trypsin inhibitor makes it a good test example. Type I fragments can be used to determine the disulfide connectivity of the two disulfide bonds of trypsin inhibitor. For example, the two dehydrocysteines (Cys39 and Cys86) located close to the N-terminus were traversed by 9 Type I terminal fragments and 70 Type I internal fragments, and the two dehydrocysteines (Cys136 and Cys145) located closer to the C-terminus were traversed by 8 Type I terminal fragments and 7 Type I internal fragments, which strongly suggests that the connectivity between these cysteines should be “Cys39-Cys86” and “Cys136-Cys145” for these two disulfide bonds (Fig. 6A). Four examples of Type I internal fragments traversing these two disulfide bonds are shown (Fig. S9). It should be noted that fragments traversing an even number of dehydrocysteines do not guarantee the integrity of disulfide bonds involved; however, the likelihood of them being cleaved is much lower. For example, only one internal fragment (by42-137) traversed the middle two dehydrocysteines (Cys86 and Cys136), whereas the formation of 3 Type II fragments between Cys86 and Cys136 (by96-115, by100-115, by125-132) indicates that these two cysteines are not likely to be connected.
Fig. 6.
Fragment location maps after importing a hydrogen loss localized modification on every cysteine, suggesting the integrity of every disulfide bond of A. CAD of trypsin inhibitor, [TI + 17H]17+, and B. CAD of α-lactalbumin after integrating data from all four charge states examined (11+ to 14+). Vertical dashed lines represent cysteines positions, with the same color indicating a disulfide bond is formed between those two cysteines. Internal fragments traversing an even number of dehydrocysteines (Type I fragments) suggest that intact disulfide bonds are formed within those cysteines, while internal fragments formed between adjacent cysteines (Type II fragments) suggest that those two cysteines are not likely to relate to each other.
Similar results could also be gleaned when α-lactalbumin (123 residues, 14.2 kDa, 4 disulfide bonds, Scheme 1D), which possesses a more complicated disulfide linkage was analyzed (Fig. 6B). Disulfide connectivity of α-lactalbumin was determined by interrogating the innermost disulfide bonds, and expanding to the outermost disulfide bonds. The middle four dehydrocysteines (Cys61, Cys73, Cys77, Cys91) were traversed by 4 Type I internal fragments (by50-97, by51-106, by53-97, by60-106, Fig. S10), indicating that two disulfide bonds are formed within these four cysteines (Fig. 6B). Type II internal fragments were then used to aid the assignment of the exact connectivity within these four cysteines. The formation of 8 Type II internal fragments between Cys61 and Cys73, and 12 Type II internal fragments between Cys77 and Cys91 strongly suggests that the connectivity of “Cys61-Cys73” and “Cys77-Cys91” is not likely. In addition, the lack of Type I internal fragments traversing the middle two dehydrocysteines (Cys73 and Cys77) indicates that the “Cys73-Cys77” connectivity is not likely either. Should Cys73 and Cys77 be connected, Type I internal fragments traversing the dehydro form of these two cysteines would have been generated, as demonstrated for CAD of trypsin inhibitor (Fig. 6A). Therefore, the only possible connectivity of these four cysteines is “Cys61-Cys77” and “Cys73-Cys91”. Expanding to the outermost cysteines, the formation of 1 Type I internal fragment traversing the middle six dehydrocysteines (by20-113), and 8 Type I terminal fragments traversing all eight dehydrocysteines indicates that two more disulfide bonds are formed between the four cysteines located on the exterior protein sequence. The presence of Type I internal fragment by20-113 determines the connectivity of “Cys28-Cys111”, provided that the middle four cysteines are associated with each other. This is further supported by the fact that 28 Type II internal fragments are formed between Cys6 and Cys28, 40 Type II internal fragments are formed between Cys28 and Cys61, 28 Type II internal fragments are formed between Cys91 and Cys111, and 1 Type II internal fragment is formed between Cys111 and Cys120. These Type II internal fragments rule out the possibility of “Cys6-Cys28”, “Cys28-Cys61”, “Cys91-Cys111”, and “Cys111-Cys120” connectivities. Therefore, the two outermost disulfide bond connectivities can be determined as “Cys28-Cys111” and “Cys6-Cys120”. It is noteworthy that only internal fragments can access the middle four cysteines and determine their connectivities, demonstrating again the value of analyzing internal fragments to obtain comprehensive disulfide bond information. The disulfide connectivity of lysozyme can be elucidated and determined in a similar way using these two types of fragments (Fig. S11).
Conclusions
Here we report the utility of internal fragments to enhance information obtained from disulfide intact proteins. We demonstrate that internal fragments can access the interior protein sequence constrained by multiple disulfide bonds that cannot be reached by terminal fragments, resulting in a sequence coverage increase of 20-60% to cover nearly the complete sequence of disulfide-intact proteins. We show that terminal fragments result from cleavage of disulfide bonds located on the exterior of the protein while internal fragments represent cleavage of more disulfide bonds buried within the interior of the protein. By correlating the relative number of internal fragments that result in disulfide cleavages to their sequence positions, the relative positions of disulfide bonds can be determined. Lastly, we show that internal fragments retaining intact disulfide bonds, which are traditionally overlooked, can be used to determine the disulfide connectivity. By analyzing internal fragments, it is possible to gain more sequence information and elucidate disulfide linkage patterns for proteins with unknown disulfide connectivities, which would be valuable for characterizing biotherapeutic proteins that contain many disulfide bonds.
Supplementary Material
Acknowledgements
Support from the US National Institutes of Health (R01GM103479, R35GM145286, S10RR028893), the US National Science Foundation (NSF) (CHE1808492), and the US Department of Energy (DEFC02-02ER63421) are gratefully acknowledged. C. L. acknowledges support from the Ruth L. Kirschstein National Research Service Award program (GM007185).
Footnotes
Conflicts of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Data Availability
All experimental supporting data associated with this article are available in the main manuscript and in the ESI.
Notes and references
- 1.Matsumura M, Signor G and Matthews BW, Nature, 1989, 342, 291–293. [DOI] [PubMed] [Google Scholar]
- 2.Wedemeyer WJ, Welker E, Narayan M and Scheraga HA, Biochem, 2000, 39, 4207–4216. [DOI] [PubMed] [Google Scholar]
- 3.Hogg PJ, Trends Biochem. Sci, 2003, 28, 210–214. [DOI] [PubMed] [Google Scholar]
- 4.Trivedi MV, Laurence JS and Siahaan TJ, Curr. Protein Pept. Sci, 2009, 10, 614–625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Glocker MO, Arbogast B and Deinzer ML, J. Am. Soc. Mass Spectrom, 1995, 6, 638–643. [DOI] [PubMed] [Google Scholar]
- 6.Feige MJ, Braakman I and Hendershot LM, in Oxidative Folding of Proteins: Basic Principles, Cellular Regulation and Engineering, The Royal Society of Chemistry, 2018, DOI: 10.1039/9781788013253-00001, pp. 1–33. [DOI] [Google Scholar]
- 7.Zhang W, Marzilli LA, Rouse JC and Czupryn MJ, Anal. Biochem, 2002, 311, 1–9. [DOI] [PubMed] [Google Scholar]
- 8.Wypych J, Li M, Guo A, Zhang Z, Martinez T, Allen MJ, Fodor S, Kelner DN, Flynn GC, Liu YD, Bondarenko PV, Ricci MS, Dillon TM and Balland A, J. Biol. Chem, 2008, 283, 16194–16205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wu S-L, Jiang H, Lu Q, Dai S, Hancock WS and Karger BL, Anal. Chem, 2009, 81, 112–122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bagal D, Valliere-Douglass JF, Balland A and Schnier PD, Anal. Chem, 2010, 82, 6751–6755. [DOI] [PubMed] [Google Scholar]
- 11.Badescu G, Bryant P, Bird M, Henseleit K, Swierkosz J, Parekh V, Tommasi R, Pawlisz E, Jurlewicz K, Farys M, Camper N, Sheng X, Fisher M, Grygorash R, Kyle A, Abhilash A, Frigerio M, Edwards J and Godwin A, Bioconjug. Chem, 2014, 25, 1124–1136. [DOI] [PubMed] [Google Scholar]
- 12.Shaw JB, Liu W, Vasil Ev YV, Bracken CC, Malhan N, Guthals A, Beckman JS and Voinov VG, Anal. Chem, 2020, 92, 766–773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Deslignière E, Botzanowski T, Diemer H, Cooper-Shepherd DA, Wagner-Rousset E, Colas O, Béchade G, Giles K, Hernandez-Alba O, Beck A and Cianférani S, J. Am. Soc. Mass Spectrom, 2021, DOI: 10.1021/jasms.1c00151. [DOI] [PubMed] [Google Scholar]
- 14.Zhang L, Chou CP and Moo-Young M, Biotechnol. Adv, 2011, 29, 923–929. [DOI] [PubMed] [Google Scholar]
- 15.Góngora-Benítez M, Tulla-Puche J and Albericio F, Chem. Rev, 2014, 114, 901–926. [DOI] [PubMed] [Google Scholar]
- 16.Gorman JJ, Wallis TP and Pitt JJ, Mass Spectrom. Rev, 2002, 21, 183–216. [DOI] [PubMed] [Google Scholar]
- 17.Tsai PL, Chen S-F and Huang SY, Rev Anal Chem, 2013, 32. [Google Scholar]
- 18.Wiesner J, Resemann A, Evans C, Suckau D and Jabs W, Expert Rev. Proteom, 2015, 12, 115–123. [DOI] [PubMed] [Google Scholar]
- 19.Quick MM, Crittenden CM, Rosenberg JA and Brodbelt JS, Anal. Chem, 2018, 90, 8523–8530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Walewska A, Skalicky JJ, Davis DR, Zhang M-M, Lopez-Vera E, Watkins M, Han TS, Yoshikami D, Olivera BM and Bulaj G, J. Am. Chem. Soc, 2008, 130, 14280–14286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Poppe L, Hui JO, Ligutti J, Murray JK and Schnier PD, Anal. Chem, 2012, 84, 262–266. [DOI] [PubMed] [Google Scholar]
- 22.Shevchenko A, Wilm M, Vorm O and Mann M, Anal. Chem, 1996, 68, 850–858. [DOI] [PubMed] [Google Scholar]
- 23.Ong SE and Mann M, Nat. Chem. Biol, 2005, 1, 252–262. [DOI] [PubMed] [Google Scholar]
- 24.Foley SF, Sun Y, Zheng TS and Wen D, Anal. Biochem, 2008, 377, 95–104. [DOI] [PubMed] [Google Scholar]
- 25.Zhang Y, Dewald HD and Chen H, J. Proteome Res, 2011, 10, 1293–1304. [DOI] [PubMed] [Google Scholar]
- 26.Zhao DS, Gregorich ZR and Ge Y, Proteomics, 2013, 13, 3256–3260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Nicolardi S, Giera M, Kooijman P, Kraj A, Chervet J-P, Deelder AM and van der Burgt YEM, J. Am. Soc. Mass Spectrom, 2013, 24, 1980–1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cramer CN, Haselmann KF, Olsen JV and Nielsen PK, Anal. Chem, 2016, 88, 1585–1592. [DOI] [PubMed] [Google Scholar]
- 29.Cramer CN, Kelstrup CD, Olsen JV, Haselmann KF and Nielsen PK, Anal. Chem, 2017, 89, 5949–5957. [DOI] [PubMed] [Google Scholar]
- 30.Wongkongkathep P, Li H, Zhang X, Ogorzalek Loo RR, Julian RR and Loo JA, Int. J. Mass Spectrom, 2015, 390, 137–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zhang J, Loo RRO and Loo JA, Int. J. Mass Spectrom, 2015, 377, 546–556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Siuti N and Kelleher NL, Nat. Methods, 2007, 4, 817–821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhang Y, Cui W, Zhang H, Dewald HD and Chen H, Anal. Chem, 2012, 84, 3838–3842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Catherman AD, Skinner OS and Kelleher NL, Biochem. Biophys. Res. Commun, 2014, 445, 683–693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lermyte F, Tsybin YO, O’Connor PB and Loo JA, J. Am. Soc. Mass Spectrom, 2019, 30, 1149–1157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Toby TK, Fornelli L and Kelleher NL, Annu Rev Anal Chem (Palo Alto Calif), 2016, 9, 499–519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kleinnijenhuis AJ, Duursma MC, Breukink E, Heeren RMA and Heck AJR, Anal. Chem, 2003, 75, 3219–3225. [DOI] [PubMed] [Google Scholar]
- 38.Lioe H and OrsHair RAJ, J. Am. Soc. Mass Spectrom, 2007, 18, 1109–1123. [DOI] [PubMed] [Google Scholar]
- 39.Chen J, Shiyanov P, Zhang L, Schlager JJ and Green-Church KB, Anal. Chem, 2010, 82, 6079–6089. [DOI] [PubMed] [Google Scholar]
- 40.Gammelgaard SK, Petersen SB, Haselmann KF and Nielsen PK, J. Am. Soc. Mass Spectrom, 2020, DOI: 10.1021/jasms.0c00257. [DOI] [PubMed] [Google Scholar]
- 41.Roman NAK Zubarev A, Fridriksson Einar K., Lewis Mark A., Horn David M., Carpenter Barry K., and McLafferty Fred W., J. Am. Chem. Soc, 1999, 121, 2857–2862. [Google Scholar]
- 42.Ganisl B and Breuker K, ChemistryOpen, 2012, 1, 260–268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Cole SR, Ma X, Zhang X and Xia Y, J. Am. Soc. Mass Spectrom, 2012, 23, 310–320. [DOI] [PubMed] [Google Scholar]
- 44.Wen D, Xiao Y, Vecchi MM, Gong BJ, Dolnikova J and Pepinsky RB, Anal. Chem, 2017, 89, 4021–4030. [DOI] [PubMed] [Google Scholar]
- 45.Fung YME, Kjeldsen F, Silivra OA, Chan TWD and Zubarev RA, Angew. Chem. Int. Ed, 2005, 44, 6399–6403. [DOI] [PubMed] [Google Scholar]
- 46.Agarwal A, Diedrich JK and Julian RR, Anal. Chem, 2011, 83, 6455–6458. [DOI] [PubMed] [Google Scholar]
- 47.Bonner J, Talbert LE, Akkawi N and Julian RR, Analyst, 2018, 143, 5176–5184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Talbert LE and Julian RR, J. Am. Soc. Mass Spectrom, 2018, 29, 1760–1767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gammelgaard SK, Petersen SB, Haselmann KF and Nielsen PK, ACS Omega, 2020, 5, 7962–7968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Rush MJP, Riley NM, Westphall MS and Coon JJ, Anal. Chem, 2018, 90, 8946–8953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ballard KD and Gaskell SJ, Int. J. Mass Spectrom. Ion Processes, 1991, 111, 173–189. [Google Scholar]
- 52.Wysocki VH, Resing KA, Zhang Q and Cheng G, Methods, 2005, 35, 211–222. [DOI] [PubMed] [Google Scholar]
- 53.Lantz C, Zenaidee MA, Wei B, Hemminger Z, Ogorzalek Loo RR and Loo JA, J. Proteome Res, 2021, 20, 1928–1935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Barran PE, Polfer NC, Campopiano DJ, Clarke DJ, Langridge-Smith PRR, Langley RJ, Govan JRW, Maxwell A, Dorin JR, Millar RP and Bowers MT, Int. J. Mass Spectrom, 2005, 240, 273–284. [Google Scholar]
- 55.Michalski A, Neuhauser N, Cox J and Mann M, J. Proteome Res, 2012, 11, 5479–5491. [DOI] [PubMed] [Google Scholar]
- 56.Lyon YA, Riggs D, Fornelli L, Compton PD and Julian RR, J. Am. Soc. Mass Spectrom, 2018, 29, 150–157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Wei B, Zenaidee MA, Lantz C, Ogorzalek Loo RR and Loo JA, Anal. Chim. Acta, 2022, 1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Cobb JS, Easterling ML and Agar JN, J. Am. Soc. Mass Spectrom, 2010, 21, 949–959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Durbin KR, Skinner OS, Fellers RT and Kelleher NL, J. Am. Soc. Mass Spectrom, 2015, 26, 782–787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Chen J, Shiyanov P and Green KB, J. Mass Spectrom, 2019, 54, 527–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Griaud F, Denefeld B, Kao-Scharf C-Y, Dayer J, Lang M, Chen J-Y and Berg M, Anal. Chem, 2019, 91, 8845–8852. [DOI] [PubMed] [Google Scholar]
- 62.Zenaidee MA, Lantz C, Perkins T, Jung W, Loo RRO and Loo JA, J. Am. Soc. Mass Spectrom, 2020, 31, 1896–1902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Schmitt ND, Berger JM, Conway JB and Agar JN, Anal. Chem, 2021, 93, 6355–6362. [DOI] [PubMed] [Google Scholar]
- 64.Zenaidee MA, Wei B, Lantz C, Wu HT, Lambeth TR, Diedrich JK, Ogorzalek Loo RR, Julian RR and Loo JA, J. Am. Soc. Mass Spectrom, 2021, 32, 1752–1758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Li H, Sheng Y, Mcgee W, Cammarata M, Holden D and Loo JA, Anal. Chem, 2017, 89, 2731–2738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Li H, Nguyen HH, Ogorzalek Loo RR, Campuzano IDG and Loo JA, Nat. Chem, 2018, 10, 139–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Rolfs Z and Smith LM, J. Proteome Res, 2021, DOI: 10.1021/acs.jproteome.1c00599. [DOI] [Google Scholar]
- 68.Chin S, Chen T, Hannoush RN and Crittenden CM, J. Pharm. Biomed. Anal, 2021, 195, 113893. [DOI] [PubMed] [Google Scholar]
- 69.Paizs B and Suhai S, J. Am. Soc. Mass Spectrom, 2004, 15, 103–113. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All experimental supporting data associated with this article are available in the main manuscript and in the ESI.