Abstract
The combined use of electrospray ionization run in so-called “native mode” with top-down mass spectrometry (nTDMS) is enhancing both structural biology and discovery proteomics by providing three levels of information in a single experiment: the intact mass of a protein or complex, the masses of its subunits and non-covalent cofactors, and fragment ion masses from direct dissociation of subunits that capture the primary sequence and combinations of diverse post-translational modifications (PTMs). While intact mass data are readily deconvoluted using well-known software options, the analysis of fragmentation data that result from a tandem MS experiment – essential for proteoform characterization – is not yet standardized. In this tutorial, we offer a decision-tree for the analysis of nTDMS experiments on protein complexes and diverse bioassemblies. We include an overview of strategies to navigate this type of analysis, provide example data sets, and highlight software for the hypothesis-driven interrogation of fragment ions for localization of PTMs, metals, and cofactors on native proteoforms. Throughout we have emphasized the key features (deconvolution, search mode, validation, other) that the reader can consider when deciding upon their specific experimental and data processing design using both open-access and commercial software.
Introduction
An emerging trend in structural biology involves using tools and methods from proteomics, particularly mass spectrometry (MS), to investigate the composition, stoichiometry, assembly, and topology of protein complexes [1–8]. One technique, top-down MS run in “native mode” (nTDMS), employs a three-tiered strategy for compositional analysis that employs non-denaturing electrospray ionization (ESI) to preserve non-covalent protein assemblies and determine their intact mass, followed by subunit disassembly and further primary sequence characterization by tandem MS [9–12] (Fig. 1A). There are three levels of mass information operative in the nTDMS analysis of a protein complex — the intact complex (MS1), its subunits (MS2), and the subunits’ N/C-terminal fragment ions, such as b/y and c/z ion types (MS3) [9, 13]. The fragmentation step uniquely provides “proteoform” resolution that reveals subtle changes to a protein’s canonical sequence that may be critical to structure and function (e.g., coding single nucleotide polymorphisms, mutations, splice variants, post-translational modifications (PTMs), endogenous proteolysis, or allosteric modulators and co-factors) [10]. Importantly, MSn is an instrument-centric term that describes the levels of tandem MS performed on the analyte. In the case of a monomeric protein, the MS2 describes the fragmentation step; in contrast, the fragmentation of a subunit ejected from a complex is denoted by the 3 in MS3 to denote the third stage of MS analysis.
Figure 1.

Depictions of nTDMS data processing (A) and of the Bioinformatics and Validation of the data (B).
A recently published resource [14] provides extensive details on how to perform intact and tandem MS experiments on a standardized set of native-state proteins and complexes, including discussion on key parameters related to the data acquisition and the interpretation of the MS1 and MS2 data using readily available deconvolution software, such as Unidec and MagTran [15, 16]. Recently, progress has also been made on the integration of intact mass data (MS1 and MS2 of protein complexes) with fragmentation (MS3 of ejected subunit), or alternative omics approaches (e.g., bottom-up), for a comprehensive “native-proteomics” pipeline for large-scale characterization of native-state protein complexes [17, 18]. For example, MetaOdysseus, is an R-based software tool that integrates bottom-up with native top-down protein analysis for the characterization of metal–protein complexes (i.e., metal–protein stoichiometry, identification of metal-binding sites, or characterization of metal-coupled folding events)[18]. Similarly, the Search Engine for Multi-Proteoform Complexes was created as an online search strategy for identification of monomeric proteins and complexes with proteoform resolution. This platform [19, 20] utilizes search routines that combine protein interaction information from the CORUM database of human protein complexes [21] with comprehensive PTM, isoform, and cofactor information from the UniProtKB database (e.g., ligands, metals, etc.) to allow precise identification and scoring of subunits, along with bound non-covalent cofactors and covalent PTMs.
Subunit fragment data emulates MS/MS proteomics datasets while retaining the heterogeneity introduced by variable complexation to proteins, cofactors, and adducts. This retained heterogeneity makes traditional data processing challenging, as in conventional TDMS software, all modifications are considered as covalent and not transient. For example, a metal cofactor may or may not be retained post-fragmentation, resulting in traditional software annotating two separate “proteoforms” despite only the bound form being present (Fig. 1B) [8, 14, 17, 22, 23]. Moreover, subunits and their fragment ions can partially retain non-covalent cofactors in combination with other PTMs, complicating the analysis. For these reasons, the assessment of both raw and processed fragmentation data, along with manual validation of results, is critical for confident proteoform assertions and relies on steps that are highly time-consuming and often incompatible with the automated search engines used in proteomics [14]. More advanced bioinformatics analysis for specialized hypothesis-testing of fragments most often relies on the creative re-purposing of tools made for other proteomics applications, presenting a significant barrier to entry for new researchers [24–28]. To help address this issue, we outline strategies for the analysis of fragmentation data with examples for zinc-bound Carbonic Anhydrase (CA), a modified histone ejected from a nucleosome (Nuc), and copper-bound and oxidized amyloid beta (Aβ). This report deploys a decision-tree (Fig. 2) to guide the analysis of this data using open-access, freeware and commercial resources (Table S1) to (1) identify the subunits of protein-complexes in mixtures, (2) derive proteoform-level insights (e.g., sequence variants or PTMs), and (3) map non-covalent cofactors. Throughout, we highlight many shared features of existing open-access and commercial software (e.g., deconvolution approaches, informatic search modality, validation resources) to inform the reader how to best design their unique experimental and data processing pipelines (Table S1). The examples provided were obtained on Orbitrap hybrid instruments in high mass modes; however, the software resources discussed are also amenable to mass spectrometers that selectively activate targets, such as quadrupole time-of-flight (QTOF) or Fourier Transform ion cyclotron resonance (FT-ICR) [29, 30]. Finally, in a supplemental discussion we briefly introduce a new nTDMS software resource, ProSight Native, that is built for seamless navigation across the various levels of tandem MS through the integration of mass deconvolution, native fragmentation data, proteoform identification, and stoichiometry calculations into a single resource (see Supplemental Discussion 4).
Figure 2.

Decision tree to guide the identification and characterization of unknown proteoforms, including those harboring multiple post-translational modifications or non-covalent cofactors. The name of select software tools used at each step are in italics, with an expanded list of common open-source and commercial resources detailed in Table S1. Much of this decision tree has been incorporated into a new nTDMS resource, ProSight Native, that provides a platform for the analysis of native proteoforms and for determining the composition of protein complexes (see Supplemental Discussion 4 (SD.4)). Other topics discussed in more detail in the supplement include: (SD.1) Deconvolution strategies of fragmentation data; (SD.2) the calculation of a precise mass shift from the fragmentation data; (SD.3) interrogation of histone H3 datasets for fragments containing and acetylation site; (SD.5) and introductions to the discovery/validation resource, TDVALIDATOR; (SD.6) an introduction to a proteoform database generation tool, PROTEIN ANNOTATOR; (Table S1) and an overview of existing open-access, freeware and commercial resources commonly used for nTDMS.
An important distinction within the described workflow is the concept of subunit “identification” vs. “characterization”. Here, different confidence metrics can often be used to describe the quality of the identified protein in a database (e.g., P-score, E-score, Q-score) [31] and describe the selectivity of the proteoform characterized (e.g., C-score) [26]. While the goal of any informatics approach is to arrive at a solution that achieves absolute confidence via automated search engines independent of user intervention, asserting the existence of a modified proteoform is challenging in practice. In our experience, many datasets need researcher intervention to manage the complexities of the fragmentation data quality, such as the simultaneous fragmentation of isobaric proteoforms, the complete or partial ejection of labile PTMs or non-covalent cofactors (e.g., phosphorylation, glycosylation, metals), charge-state dependence of protein-metal ion interactions [32], or possible rearrangement of binding sites upon activation in the gas-phase [33].
Protein Identification and Proteoform Characterization
Fragmentation datasets are made up of distinct – though often overlapping – isotopic distributions for the fragment ions in the mass-to-charge (m/z) domain. Typically the observed fragment m/z values are converted into monoisotopic masses by various deconvolution algorithms that are available as part of the manufacturer’s software packages (e.g., Xtract, FreeStyle, etc.) or through open-source/free tools like mMass [34], Mash Explorer, etc.; Table S1). These tools use assorted algorithmic strategies with varied mass determination approaches for resolved isotopic distributions that result in subtly different fragment mass and intensity outputs (Supplemental Discussion 1). The most important variable at this level is the signal-to-noise ratio (SNR) thresholds for deconvoluting fragment ions, wherein a lower value often enables detection of low-abundant fragments that can improve characterization metrics above acceptable threshold limits (e.g., C-score ≥ 40) [26]. However, improved characterization can come at the expense of generating a noisier output that can generate false positive hits that negatively influence the identification confidence scores typically employed (e.g., P-Score ≤ 0.01) [35]. Therefore, for identification of unknown subunits in a complex, emphasis must first be placed on improving identification scores; this often means that searches should ideally be performed with robust, high-quality fragment ion populations (e.g., SNR ≥ 3 is a good starting point, but can vary from 10:1 down to 1.2:1 depending on the application).
Automated identification of unknown proteins in complexes can be achieved through the repurposing of top-down MS search engines (Fig. 2 and Table S1; e.g., ProSight PC, ProSight PD, BIGMASCOT, TopPIC, TDPortal [36, 37]). These tools produce a list of protein candidates, graphical fragment maps showing sequence coverage, and various statistics associated with the confidence of the identification between candidate proteins vs. candidate proteoforms for a given protein. In most cases, these resources utilize knowledge of the subunit mass to preselect candidate proteins to test against the collected fragmentation data [13, 24, 26, 35]. In cases where the identity of the protein is known, tools like ProSight Lite or Mash Explorer can be useful for visualization of the fragment ions directly on the sequence. This type of analysis can be used to derive confidence protein/proteoform scores (e.g., P-score, C-score) independent from database searches [24, 25]. These tools also allow the user to determine if observed fragments support user-defined sequences containing PTMs, cofactors, or sequence variants (e.g., SNPs, mutations, or proteolytic truncations) at different positions [23].
As with any -omics approach, fidelity of the search is highly dependent upon the content of the database. Resources such as UniProt [38] provide prepopulated databases for an individual organism’s proteome that can be added to search engines. Moreover, these databases can be further annotated to account for common events closely tied to gene-translation (e.g., cleavage of initiator methionine, signal peptide removal, or N-terminal acetylation). To support accurate proteoform assignment, Protein Annotator (Fig. S7) helps amend databases to include candidate proteoforms that result from removal of signal peptides, known sequence variations, PTMs and cofactors available in UniProt or UniMod for a more targeted approach. However, unlimited restrictions on the size of proteoform families in a database can have deleterious consequences both on search speed as well as confidence metrics linked to database size (e.g., expectation scores). In such instances, it is best to separate the identification from the characterization step.
Characterization of Unknown Events
Asserting the existence of a proteoform necessitates a high level of informatic rigor to discern the true form of the protein present in a complex. Specific sequence events (i.e., sequence variants, PTMs, or cofactors) can affect the intact mass shift and create a confusing fragmentation landscape (e.g., bound-complex detected by MS1 and/or MS2) [14]. For example, proteins with intra-molecular disulfide bridges – treated as covalent PTMs – can fragment in unconventional ways given their rigid structure and inaccessible termini [17]. Additionally, proteolysis or modifications at one or both termini, affinity tags [39], and the generation of internal fragments can complicate the analysis and shroud the presence of added PTMs or cofactors. To overcome these challenges, different fragmentation techniques may provide complementary results (e.g., HCD, CID, EThcD, and UVPD) that corroborate and improve sequence coverage [40]. However, not all fragmentation techniques are universally supported by available software, such as UVPD [41]. While the annotation of internal fragments can significantly improve sequence coverage, this is not commonly supported in existing tools. Testing for diverse internal fragments (ax, ay, az, bx, by, bz, cx, cy, and cz) exponentially increases search times and adversely impacts most scoring metrics via high false discovery rates [42]. However, TDValidator and the new tool ClipsMS can assign both terminal and internal fragments for TDMS [43].
While there are no established baseline criteria for confidently asserting the existence of a proteoform, a general recommendation is to consider a proteoform “characterized” if the observed and theoretical masses match and there is adequate fragmentation coverage to localize unique proteoform-specific features. Various statistical metrics (C-score) can also help to establish confidence in identified proteoforms. If the observed and theoretical masses for a given protein are not in agreement, this is a clear indication that the analyte is an unknown proteoform or contains a cofactor. To elucidate the unknown, an informative first approach is to perform a de novo PTM identification; this entails shortlisting the chemical formula of a modification based on the precise mass shift — preferably within hundreds of a Dalton — directly calculated from the fragmentation evidence. For additional information on how to calculate the exact mass shift from fragmentation data, see the Supplemental Discussion 2 on this topic.
Another useful approach to interrogating unresolved mass shifts is to identify gaps in fragmentation coverage that may exist across the sequence. As a starting point, the mass shift is added to the N- or C-terminus using a sequence gazer (e.g., ProSight Lite), revealing the nearest terminus to the sequence event in terms of sequence coverage – more fragments may support a site near the correct terminus (the “delta-M” search in ProSight PC automates this function). To find the exact location, the PTM or cofactor can be iteratively added to possible sites downstream of the “correct” terminus as the fragmentation quality is monitored (i.e., with metrics like the numbers of matched fragments and characterization scores).
In the challenging case where the mass residual entails a combination of smaller modifications, there is currently no straightforward solution. If the fragmentation data is deemed to be of high quality, then one can justify the effort put into splitting the mass shift into smaller “hypothetical” modifications and testing different parts of the sequence with the new hypothetical masses. Relying on prior knowledge to define the most logical explanations for each protein will facilitate such hypothesis-driven proteoform interrogation. Here, tools such as mMass [34], Mash Explorer [44], MetaOdysseus [18], or TDValidator [28] can then help determine if additional fragmentation coverage due to multiple modifications stems from bona fide signals in the data.
Given the labile nature of many PTMs (e.g., glycosylation) and non-covalent cofactors (e.g., metals), the intact mass data on the complex or ejected subunit may reveal a mass shift for which the fragmentation data shows only weak (or no) evidence (reflected in poor fragmentation or scoring metrics) [8, 23, 29, 30]. This can be the case when the collisional energy ejects the labile species prior to subunit dissociation or during backbone cleavage, or because multiple residues from disparate parts of the protein are involved in the binding/coordination of a cofactor/metal. In the latter case, it is important to clarify that localizing labile cofactors, such as metals, actually involves finding evidence that they are retained on a subset of fragment ions [33, 45].
If the fragmentation data has sufficient SNR, it can be helpful to first treat the metal as a “known PTM” and manually iterate through the protein sequence on a sequence gazer (e.g., ProSight Lite) or automatically using Protein Annotator and ProSight PC to find possible binding sites. Anecdotally, larger fragments at higher charge states seem likelier to contain labile cofactors, perhaps because they sterically shield the cofactor/binding region from disruptive collisions. Given that these tools rely on the deconvoluted data, it is important to return to the raw mass spectrum and manually validate the quality of the fragmentation.
Validation
The final analytical step – the validation of the fragment assignments – constitutes a rigorous determination of data quality for a given characterized proteoform (see NOTE 1). While validation can be time-consuming, it is highly complementary to any informatics workflow and bolsters researchers’ confidence in both protein identification and proteoform characterization. Once a suspected sequence event has been found and its position narrowed down to a single (or few) residue(s), it is important to assess (and often report) the spectral quality of the diagnostic ions critical to validating a proteoform’s identity within the broadband mass spectrum. Similarly, comparing the intensities of the diagnostic fragments is useful to determine the ratio of the positional isomers that were fragmented simultaneously [46].
Various software tools available support the validation process that typically involves generating theoretical fragments that can be qualitatively compared to the original broadband spectrum in the m/z domain (Table S1) (Fig. 1B). For example, TDValidator automatically searches the m/z spectrum for fragments corresponding to a proteoform of interest; the researcher can then manually compare the raw data to the theoretical isotopologue distributions for validation. A complementary approach that is useful for hypothesis-driven fragment interrogation – i.e., looking for evidence that a specific sequence event is present – involves fragmenting the proteoform in silico and comparing observed and theoretical fragment ions directly within the spectrum [17, 23]. For this purpose, mMass can generate a list of all possible fragment ions and their charge states for a given proteoform with a defined modification state or cofactors. The user can then select a fragment of interest and overlay its theoretical isotopic distribution on the raw data to confirm its presence.
There are some recommended strategies for choosing which fragments to investigate: (1) structural and spectroscopic data in the literature can reveal potential sites/residues in similar proteins or for the specific metal ion; (2) D/E/G/P sites are highly prone to HCD fragmentation and generate high signal intensity fragments [47, 48]; and (3) if intense but unbound fragment ions are detected in the data, these fragments may have unassigned satellite peaks with a mass shift corresponding to a cofactor.
To put this framework to practice and walk readers through the data analysis workflow, we provide our analyses of three protein systems: (1) localizing zinc-binding on Carbonic Anhydrase (CA); (2) determining the acetylation site on a histone ejected from a whole nucleosome (Nuc); and (3) localizing PTMs and copper-binding on the amyloid beta peptide (Aβ).
Example Datasets
Metal-bound Carbonic Anhydrase
Protein structure is often reliant upon binding of small ligands (such as metals) whose conserved interactions contribute to stability of tertiary and quaternary structures. Though an active area of research, it is possible to localize ligand binding sites using nTDMS data by asserting either the regions or the specific residues that make up the binding pocket. We highlight this process using bovine Carbonic Anhydrase II (CA, 29 kDa) as an example, a well-studied zinc metalloenzyme that catalyzes the reversible hydration of carbon dioxide in vivo [49]. Given that a published crystal structure clearly captures CA binding to Zn(II) via residues H93, 95, and 118 [49], we chose it as a model system for developing tips and best practices in nTDMS analysis related to localizing ligands bound through multiple positions via non-covalent bonding [14, 23, 33, 50–52].
First, the MS1 analysis demonstrates that the predominant species is N-terminally acetylated CA (theoretical average mass of 29,024.32 Da) complexed with a Zn(II) ion (63.18 Da) at 1:1 stoichiometry (Fig. 3A, see NOTE 2), and shows a minor signal for sodium-bound CA + Zn(II) as well as non-covalent adducts present at low relative abundance (e.g., bicarbonate) [14]. To maximize sequence coverage for localization of binding sites, we used multiple fragmentation techniques to characterize CA and its metal binding site (i.e., HCD, EThcD and UVPD, Fig. 3B). We show iterative hypothesis testing of the metal site by placing it on multiple candidate residues throughout the protein, including the termini, and monitoring for the sites that yield the greatest number of matched fragments and ultimately the lowest P-score (Fig. 3C). In particular, there is an extensive number of unbound-fragments on the N- and C-terminal ends of the protein that are not sufficiently represented in a bound-form, indicating that the zinc modification is likely in the core of the primary sequence where bound-form coverage is maximized and unbound-form coverage is minimized (for a visualization of unbound- vs. bound-fragment intensities as a function of the CA sequence, see Fig. S5 and Supplemental Discussion 4 on ProSight Native).
Figure 3.

nTDMS analysis of the bovine Carbonic Anhydrase II in complex with Zn(II). (A) Broadband mass spectrum indicates CA is complexed with a Zn(II) ion at 1:1 stoichiometry. (B) Representative HCD, EThcD, and UVPD fragmentation spectra for CA+Zn(II) complex. (C) ProSight Lite fragment maps showing P-score minimization strategy as Zn(II) cofactor is placed at N- and C-termini, at H63, and at H93. (D) UVPD and EThcD fragment maps for Zn(II) at H93. (E) TDValidator helps localize the Zn(II) ion to H93. Color boxes highlight diagnostic fragments depicted in (F). (F) Examples of theoretical fragment isotopic distributions (red triangles) overlaid on top of experimental fragment isotopic distributions. The spectral window is wide enough to visualize Zn-bound and unbound fragments.
We tested Zn(II) on H65 and H93 as two possible binding resides using the HCD data and found that their P-scores are comparable. Alternative fragmentation techniques like EThcD and UVPD (Fig. 3D) were also used to obtain additional fragments that support metal localization at or near H93 with their respective P-scores. Nonetheless, the P-score metric alone on ProSight Lite is not sufficient to confidently assert the Zn(II) site given the labile nature of metal cofactors. Specifically, the P-score indicates the probability of matching ‘n’ ions (or more) by chance alone. Because the proteoform [CA + Zn(II)] can formally be composed of positional isomers with different metal binding, “chance alone” is not the only factor to consider when comparing P-scores. Therefore, examination of the raw data using tools such as TDValidator, Mash Explorer or mMass is imperative in order to evaluate the quality of the “localizing” fragments identified by ProSight Lite and determine the likeliest Zn-coordinating residues.
Given that the metal or a given non-covalent cofactor may not always be retained upon fragmentation, it is important to manually validate both the unbound- and bound-fragments that occur at the same fragmentation site (Fig. 3E). To do this, we used TDValidator to carefully inspect and validate the spectral signals that give rise to the “fragments” in the sequence maps of Zn-bound CA (Fig. S6). This software aligns the raw data with theoretical isotopic distributions for Zn-bound and unbound fragments, producing a list of “validated” fragment ions and the corresponding scoring metrics plus a graphical map. The examination of each fragment in turn enabled examination of their metal content, painting a more complex picture than expected (Fig. 3F). First, the b40 and y67 ions revealed little Zn(II) binding, confirming that the major binding site is internal. The y179 fragment showed clear evidence that Zn(II) binding occurs at or near H93/95; nonetheless, the b80 and b70 fragments indicate that there may be other Zn(II) binding sites N-terminal to Asp80 (such as H63). Looking up the CA entry in Uniprot (P00921) as well as examination of the crystal structure confirms that the Zn(II) cofactor binds at H93/95 [49]. Interestingly, H63 is found in the secondary coordination sphere of the Zn(II), consistent with the fragmentation evidence indicating metal binding near that residue.
Finally, there is moderate fragmentation evidence of Zn(II) binding near H118 but the localizing y-type fragments are too low in intensity to fully differentiate this event from binding at H93/95. To manually investigate these localizing y-type fragments in a hypothesis-driven fashion, mMass produces the theoretical isotopic distributions of any desired fragment and overlays them on the raw spectrum. In contrast, if the TDValidator algorithm does not detect a target fragment due to low SNR, the software will not enable visualization of that fragment’s theoretical isotopic distribution. Hence, mMass can provide additional flexibility to examine the presence of low intensity fragments that help localize a given cofactor or metal.
H3K14ac ejected from a nucleosome
Eukaryotic chromatin is comprised of repeating structural units called nucleosomes (Nucs), which are involved in gene regulation and compaction of genetic material [54, 55]. The Nucs consist of globular protein octamers that contain two copies each of histones H2A, H2B, H3, and H4 that are wrapped around by 147 bp of DNA [55]. The N-terminal tails of the histones undergo various PTMs, which serve as epigenetic marks and drivers of critical cellular processes [56–60]. Recently, we reported the development of Nuc-MS, the three-tiered tandem MS approach based on the native top-down workflow that measures intact nucleosome particles and displays the modification status of their constituent histones in the same spectrum [61]. Here, we demonstrate how a Nuc-MS experiment can assert the positions of a specific histone PTM on synthetic Nucs acetylated by a recombinant acetyl transferase in vitro (pCAF) [62, 63].
Deconvolution results from the MS1 analysis of synthetic Nucs yielded an intact mass (199,947.8 Da) that was within Δm = 4.1 Da (20.5 ppm) of the theoretical mass (Fig. 4A). The MS2 results on the histones ejected from the intact Nuc indicated the presence of a mixture of H3.1 proteoforms that were either unmodified, acetylated and/or oxidized (Fig. 4B). To characterize the site of acetylation, the acetylated H3 was isolated from the unmodified H3 and fragmented by HCD. In order to first narrow down the sites of acetylation out of the many possible lysine residues within the sequence of H3.1 (Fig. 4C), a database of 16,384 H3.1 “candidate proteoforms” was created in Protein Annotator (Fig. S7) and subsequently used in a search of the H3-acetyl fragmentation data in ProSightPC. For the four top-scoring H3.1 proteoforms, the mass accuracy observed between the theoretical intact mass and the observed precursor mass (−0.21 ppm) supports a single acetylation event on the target H3.1 proteoform. Even though the top-scoring proteoforms indicate that the acetylation was localized near the N-terminus, the C-score = 0 highlights that there was insufficient fragmentation data from the Xtract deconvolution process to differentiate the four possible acetylation sites (Fig. 4D).
Figure 4. The new process of ‘Nuc-MS’ uses nTDMS to measure Histone H3 acetylated at K14 ejected from an intact nucleosome (Nuc) [53].

(A) Mass spectrum of Nuc complex containing up to 2 acetylations and a smaller Nuc species denoted by an asterisk that contains a shorter DNA strand. (B) Deconvoluted mass spectrum of ejected histone H3 and acetylated proteoform. (C) A sequence map of histone H3 with all theoretical sites of acetylation highlighted. The map serves as an input for Protein Annotator which results in a database of 16,384 differentially acetylated proteoforms of H3. (D) ProSightPC output demonstrating automated localization using curated H3 proteoform database of acetylation event near N-terminus (aa 1–17). (E) TDValidator helps to localize acetylation to K14. (F) Selected fragments are shown with theoretical isotopic distributions (red triangles) demonstrating confident localization. (G) Examples of fragments not detected by TDValidator but manually identified using mMass.
To overcome this challenge, a tool like TDValidator can be used to search for additional low intensity fragments that help to successfully localize the acetylation on K14 (Fig. 4E). Specifically, by analyzing the raw data directly we successfully assigned seven low SNR fragments that were not picked up by the Xtract algorithm (b7 through b13), four of which clearly distinguished the acetylation position at K14 (Fig. S3, Fig. 4F). To further bolster confidence in the localization, mMass was used to manually validate the fragmentation found by TDValidator, and to search for additional fragmentation not detected by TDValidator in a “hypothesis-driven fashion” (Fig. 4G). Through the combination of various tools, we are able to fully characterize the acetylated histone proteoform as H3.1K14ac, consistent with the enzymatic activity of pCAF, and can assert that no other positional isomer exists at >5% abundance [53].
Amyloid-Beta
nTDMS is emerging as a powerful approach to characterize the presence of covalent PTMs and non-covalent ligands on amyloidogenic proteins, such as the 42 amino acid peptide amyloid beta (Aβ1–42) [64]. Aβ1–42 is an intrinsically disordered protein commonly found in the brain of Alzheimer’s disease patients where it has propensity to form aggregates thought to disrupt synaptic pathways [65–67]. Aggregation of the peptide follows the formation of oligomeric species and structurally rich β-sheet fibrils, the latter of which will continue to develop into thick plaques [68–70]. In recent years, native ion mobility MS has been used to investigate the amyloidogenic assemblies of synthetic Aβ1–42 and its fragments to help gain an understanding of the misfolding pathway [71–77]. Interestingly, emerging clinical proteomics data suggests that canonical Aβ1–42 represents only one of many Aβ proteoforms in the AD brain that have been linked to toxic functions (e.g., pyroglutamylation, di-tyrosine crosslinking, or modification by pleiotropic effectors) [78, 79].
In addition to PTMs, the Aβ landscape in vivo is further muddied through non-covalent interactions between Aβ proteoforms (i.e., aggregation) and non-covalent cofactors that can contribute to formation of toxic reactive oxygen species targeting Aβ [80]. In the example shown here, native-mode ESI of a synthetic, monomeric Aβ1–42 peptide revealed the presence of multiple proteoforms, with the two dominant species each exhibiting at least two putative oxidations events (+16 and +32 Da) (Fig. 5A, inset). Fragmentation data from the most abundant Aβ proteoform (m/z = 1128.8) followed by sequence interrogation with ProSight Lite confidently identified Aβ1–42 with 29 matched b/y-ions and 70% sequence coverage (P-score = 2.4x10−46). Similar analysis of the tandem MS dataset on the +16 Da proteoform (m/z = 1132.8) also showed that Aβ1–42 was oxidized at M35 (P-score = 4.4x10−39, Fig. 5A–C). In this case, overlay of theoretical and observed isotopologues in mMass for the fragments y8 (1+), and y6 (1+) confirms the localization of the oxidation at M35 (Fig. 5F, see NOTE 3). Interestingly, the presence of methionine sulphone was not detected in the fragmentation data, indicating that the +32 Da proteoform (m/z = 1136.8) derives from a more complicated mixture of oxidation positional isomers. To confirm this, we searched for other positional isomers in ProSight Lite through a termini Delta mass test by manually adding +16 Da to both the N- and C-termini. This test on the tandem MS dataset for the mono-oxidized proteoform revealed oxidation on 11 diagnostic b-ions ending at F19/20 (Fig. 5C). Again, the results were manually confirmed via mMass and TDValidator, including the highly diagnostic fragments b21 and b26 (Fig. 5F). While the mechanism of Aβ oxidation could be attributed to sample handling, nTDMS may provide an avenue to establish evidence for further hypothesis testing related to biologically meaningful oxidation (e.g., metal-induced oxidation) targeting Aβ [80]. For example, many of the remaining Aβ proteoforms detected in the intact spectrum indicated that the proteoforms Aβ1–42, Aβ1–42ox, Aβ1–42ox(x2) each had a +60.93 Da shift. This Δm was found to correspond to a Cu(II) (monoisotopic mass: 60.92 Da; average mass: 61.5 Da) —a known Aβ ligand in vivo and in vitro [81]. A Delta mass test in TDValidator confirmed the presence of the cofactor near the N-terminus which harbors histidine residues at positions 6, 13, 14. Iterative hypothesis testing – i.e., examination of each residue in turn for copper binding – localized the cofactor to H13/14, a known binding site for Cu(II) on Aβ [81] (Fig. 5D). To bolster confidence in this localization, b13, b14, b21 and b27 bound-fragments were manually validated using TDValidator (Fig. 5E, see NOTE 4).
Figure 5.

nTDMS analysis of Aβ1-42. (A) Mass spectrum of native Aβ1-42. Highlighted are masses corresponding to Aβ 1-42 (yellow), Aβ1-42+Ox (blue) and Aβ1-42+Cu (green). (B) MS/MS spectra of the corresponding selected m/z. (C) ProSight Lite fragment maps of base Aβ1-42 (1128.8 m/z) and oxidized Aβ1-42 (1132.8 m/z) with the oxidation placed at M35 or F19. (D) Δmass of theoretical and observed b ions of Aβ1-42+Cu (II). ProSight Lite graphical map of fragments relating to Aβ1-42+Cu(II) and Aβ1-42+Cu(I) are overlaid. Fragments seen only for Aβ1-42+Cu(I) are indicated (*). (E) TDValidator detection of b13 (2+), b14 (2+), b21 (2+), and b27 (2+). Monoisotopic peaks relating to bound Cu(II) (green triangles) and Cu(I) (purple squares) are highlighted. (F) Fragment masses not found through ProSight Lite and TDValidator were manually searched through mMass. Monoisotopic peaks relating to Aβ1-42 oxidized at M35 (blue circles) and oxidized at F19 (red stars) are highlighted.
Conclusion
We offer a framework and some tips for analyzing and reporting fragmentation data that result from nTDMS analysis on proteins and their complexes. The landscape of strategies and software tools for fragment analysis is evolving and complex, with no apparent one-size-fits-all, fully automated approach currently available. Our hope is that the decision tree, the example datasets and publicly available raw data in this tutorial will provide a baseline of guidance for new and advanced researchers endeavoring to fragment and characterize native proteoforms that may be relevant in broader structural biology investigations of composition, stoichiometry, assembly, interactions, and topology of proteins and their complexes.
Methods
Carbonic Anhydrase experiments
The specific MS experimental conditions and analytical workflows for MS1 and MS2 data on CA can be found in our recent protocol for performing native mass spectrometry and nTDMS experiments [14]. High-resolution fragmentation data were processed using Xtract (Signal-to-Noise threshold ranging from 1-30, Thermo Fisher Scientific), mMass 5.5.0 (www.mmass.org), ProSight Lite 1.4 [25] (precursor mass type: average; fragmentation method: HCD; fragmentation tolerance: 10-15 ppm), and TDValidator 1.0 [82] (Proteinaceous, max ppm tolerance: 25 ppm; cluster tolerance: 0.35; charge range: 1-10; minimum score: 0.5; SNR cutoff: 3; BRAIN algorithm; minimum size: 2) to assign recorded fragment ions to the primary sequence of the protein.
Histone experiments
The nucleosome synthesis and modification protocol, the specific MS experimental conditions, and analytical workflows for MS1 and MS2 data, can be found in the Nuc-MS publication [53]. High-resolution fragmentation data were processed using Xtract (Signal-to-Noise threshold ranging from 1-30, Thermo Fisher Scientific), mMass 5.5.0 (www.mmass.org), ProSight Lite 1.4 [25] (precursor mass type: average; fragmentation method: HCD; fragmentation tolerance: 10-15 ppm), and TDValidator 1.0 [82] (Proteinaceous, max ppm tolerance: 25 ppm; cluster tolerance: 0.35; charge range: 1-10; minimum score: 0.5; SNR cutoff: 3; BRAIN algorithm; minimum size: 2) to assign recorded fragment ions to the primary sequence of the subunits.
Amyloid-beta experiments
Aβ1-42 (California Peptide) was resuspended in dymethylsulfoxide, diluted to 100 µM in phosphate-buffered saline and then exchanged into 150 ammonium acetate at a final concentration of 10 µM. The sample was directly infused into a Q Exactive-HF Extended Mass Range (Thermo Fisher Scientific) with a nano ESI source [9]. All resulting spectra were deconvoluted with Xtract (S/N:3, Fit factor:45, Remainder:80) in Xcalibur (Thermo Fisher Scientific) followed by a manual calibration (12 ppm). Calibrated masses were input into ProSight Lite and TDValidator (Proteinaceous, max ppm tolerance: 10 ppm; cluster tolerance: 0.35; charge range: 1-5; minimum score: 0.5; SNR cutoff: 3; BRAIN algorithm; minimum size: 2), with an average mass error of 0.97±1.496 Da.
MS Data Availability
Fragmentation spectra presented in the manuscript are available online in the MassIVE database under accession code MSV000086404 as .RAW files and can be visualized with Thermo Qual Browser.
Data Acquisition
Data were collected on either a custom Thermo Fisher Q Exactive Orbitrap HF MS with Extended Mass Range (QE-EMR) [8] or a commercially available Thermo Fisher Q Exactive Orbitrap MS with Ultra High Mass Range (UHMR). Critical details of nTDMS data collection were previously outlined in our manuscript on the development of standardized best-practices for collection of nTDMS data. This standard operating procedure included extensive discussion on recommended reagents, materials, suggested values for key instrument tuning and data acquisition parameters, and step by step instructions on the collection of MS1, MS2, and MS3 data [14].
Supplementary Material
Acknowledgments
This work was supported by the National Institute of General Medical Sciences P41 GM108569 for the National Resource for Translational and Developmental Proteomics at Northwestern University and NIH grants S10 OD025194, RF1 AG063903, R01 GM115739, and P30 DA018310. LFS is a Gilliam Fellow of the Howard Hughes Medical Institute. Research in this publication was also supported by Thermo Fisher Scientific and a fellowship associated with the Chemistry of Life Processes Predoctoral Training Grant T32GM105538 at Northwestern University.
Footnotes
While standards for “validating” fragment ion data have not been established within the field, the following are some considerations and tips for implementing the methods described here on a system of interest:
• At least two fragment ion matches underlie the assignment of a mass shift (e.g., sequence variant or PTM);
• The part-per-million error is consistent among fragment ions;
• The instrument resolving power is sufficient to differentiate the target isotopologue distribution from other signal (e.g., spectral noise or neighboring distributions);
• There are two or more charge states for the specific fragment ion >7 kDa;
• The fragment ion results from a backbone cleavage site with known high cleavage propensity (D/E/G/P).
According to UniMod, two viable modifications at around 63.18 Da are the replacement of 2 protons by zinc and the replacement of 1 proton by copper (Zn(II): 63.39 Da; Cu(I): 62.53 Da). In this case, we make the preliminary hypothesis that native Carbonic Anhydrase contains a Zn (II) ion – a well calibrated instrument can typically differentiate these two metallated forms.
Initially these specific fragment ions were not detected by ProSight Lite, suggesting that they are lost in the Xtract deconvolution process due to low SNR. Manual inspection of the raw data using TDValidator and mMass increases the sensitivity to bona fide but low abundance fragments. This instance shows that it is generally worthwhile to double-check data in a manual fashion when looking for specific low abundance targets.
A retrospective analysis of the tandem MS datasets revealed that a significant portion of the identified holo-fragment isotopologues showed a 1 Da shift relative to the predicted Aβ+Cu(II). This mass shift suggests that there is a 1 H+ difference and indicates a mixture of Cu(II) and Cu(I) oxidation states, as previously shown and believed to result from gas-phase reduction of Cu(II) to Cu(I) during the tandem MS event (Fig. 5D). The presence of isotopic distributions for both Cu(I) and Cu(II) forms in the tandem MS datasets highlights the importance of manual inspection of complicated data.
References
- 1.Kirshenbaum N, Michaelevski I, and Sharon M, Analyzing large protein complexes by structural mass spectrometry. J Vis Exp, 2010(40). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sharon M, Biochemistry. Structural MS pulls its weight. Science, 2013. 340(6136): p. 1059–60. [DOI] [PubMed] [Google Scholar]
- 3.Heck AJR and van den Heuvel RHH, Investigation of intact protein complexes by mass spectrometry. Mass Spectrometry Reviews, 2004. 23(5): p. 368–389. [DOI] [PubMed] [Google Scholar]
- 4.Loo JA, Studying noncovalent protein complexes by electrospray ionization mass spectrometry. Mass Spectrom Rev, 1997. 16(1): p. 1–23. [DOI] [PubMed] [Google Scholar]
- 5.Kafader JO, et al. , Native vs Denatured: An in Depth Investigation of Charge State and Isotope Distributions. Journal of the American Society for Mass Spectrometry, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Benesch JLP, et al. , Tandem Mass Spectrometry Reveals the Quaternary Organization of Macromolecular Assemblies. Chemistry & Biology, 2006. 13(6): p. 597–605. [DOI] [PubMed] [Google Scholar]
- 7.Benesch JLP, et al. , Protein Complexes in the Gas Phase: Technology for Structural Genomics and Proteomics. Chemical Reviews, 2007. 107(8): p. 3544–3567. [DOI] [PubMed] [Google Scholar]
- 8.Kenney GE, et al. , The biosynthesis of methanobactin. Science, 2018. 359(6382): p. 1411–1416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Belov ME, et al. , From protein complexes to subunit backbone fragments: a multi-stage approach to native mass spectrometry. Anal Chem, 2013. 85(23): p. 11163–73. [DOI] [PubMed] [Google Scholar]
- 10.Smith LM, Kelleher NL, and P. Consortium for Top Down, Proteoform: a single term describing protein complexity. Nat Methods, 2013. 10(3): p. 186–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Fornelli L, et al. , Top-down proteomics: Where we are, where we are going? Journal of Proteomics, 2018. 175: p. 3–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Li H, et al. , An integrated native mass spectrometry and top-down proteomics method that connects sequence to structure and function of macromolecular complexes. Nature Chemistry, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Skinner OS, et al. , An informatic framework for decoding protein complexes by top-down mass spectrometry. Nature Methods, 2016. 13(3): p. 237-+. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schachner LF, et al. , Standard Proteoforms and Their Complexes for Native Mass Spectrometry. Journal of The American Society for Mass Spectrometry, 2019. 30(7): p. 1190–1198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhang Z and Marshall AG, A universal algorithm for fast and automated charge state deconvolution of electrospray mass-to-charge ratio spectra. Journal of the American Society for Mass Spectrometry, 1998. 9(3): p. 225–233. [DOI] [PubMed] [Google Scholar]
- 16.Marty MT, et al. , Bayesian Deconvolution of Mass and Ion Mobility Spectra: From Binary Interactions to Polydisperse Ensembles. Analytical Chemistry, 2015. 87(8): p. 4370–4376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Skinner OS, et al. , Top-down characterization of endogenous protein complexes with native proteomics. Nat Chem Biol, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Peris-Díaz MD, et al. , Mass Spectrometry-Based Structural Analysis of Cysteine-Rich Metal-Binding Sites in Proteins with MetaOdysseus R Software. Journal of Proteome Research, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Skinner OS, et al. , An informatic framework for decoding protein complexes by top-down mass spectrometry. Nat Methods, 2016. 13(3): p. 237–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Skinner OS, et al. , Top-down characterization of endogenous protein complexes with native proteomics. Nat Chem Biol, 2018. 14(1): p. 36–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ruepp A, et al. , CORUM: the comprehensive resource of mammalian protein complexes--2009. Nucleic Acids Res, 2010. 38(Database issue): p. D497–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Park YJ, et al. , Repurposed HisC Aminotransferases Complete the Biosynthesis of Some Methanobactins. Biochemistry, 2018. 57(25): p. 3515–3523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ro SY, et al. , Native top-down mass spectrometry provides insights into the copper centers of membrane-bound methane monooxygenase. Nature Communications, 2019. 10(1): p. 2675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.DeHart CJ, et al. , Bioinformatics Analysis of Top-Down Mass Spectrometry Data with ProSight Lite, in Protein Bioinformatics: From Protein Modifications and Networks to Proteomics, Wu CH, Arighi CN, and Ross KE, Editors. 2017, Springer New York: New York, NY. p. 381–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fellers RT, et al. , ProSight Lite: graphical software to analyze top-down mass spectrometry data. Proteomics, 2015. 15(7): p. 1235–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.LeDuc RD, et al. , The C-Score: A Bayesian Framework to Sharply Improve Proteoform Scoring in High-Throughput Top Down Proteomics. Journal of Proteome Research, 2014. 13(7): p. 3231–3240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Skinner OS, Schachner LF, and Kelleher NL The Search Engine for Multi-Proteoform Complexes: An Online Tool for the Identification and Stoichiometry Determination of Protein Complexes. Current protocols in bioinformatics, 2016. 56, 13.30.1–13.30.11 DOI: 10.1002/cpbi.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Fornelli L, et al. , Accurate Sequence Analysis of a Monoclonal Antibody by Top-Down and Middle-Down Orbitrap Mass Spectrometry Applying Multiple Ion Activation Techniques. Analytical Chemistry, 2018. 90(14): p. 8421–8429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Laganowsky A, et al. , Membrane proteins bind lipids selectively to modulate their structure and function. Nature, 2014. 510: p. 172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Robinson CV, et al. , Probing the Nature of Noncovalent Interactions by Mass Spectrometry. A Study of Protein−CoA Ligand Binding and Assembly. Journal of the American Chemical Society, 1996. 118(36): p. 8646–8653. [Google Scholar]
- 31.LeDuc RD, et al. , Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics. Molecular & Cellular Proteomics, 2019. 18(4): p. 796–805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zheng S, et al. , Charge-dependent modulation of specific and nonspecific protein-metal ion interactions in nanoelectrospray ionization mass spectrometry. Rapid Commun Mass Spectrom, 2019. 33(19): p. 1502–1511. [DOI] [PubMed] [Google Scholar]
- 33.Wongkongkathep P, et al. , Native Top-Down Mass Spectrometry and Ion Mobility MS for Characterizing the Cobalt and Manganese Metal Binding of α-Synuclein Protein. Journal of The American Society for Mass Spectrometry, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Niedermeyer THJ and Strohalm M, mMass as a Software Tool for the Annotation of Cyclic Peptide Tandem Mass Spectra. PLOS ONE, 2012. 7(9): p. e44913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tran JC, et al. , Mapping intact protein isoforms in discovery mode using top-down proteomics. Nature, 2011. 480(7376): p. 254–258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kou Q, Xun L, and Liu X, TopPIC: a software tool for top-down mass spectrometry-based proteoform identification and characterization. Bioinformatics, 2016. 32: p. 3495–3497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Toby TK, et al. , A comprehensive pipeline for translational top-down proteomics from a single blood draw. Nat Protoc, 2019. 14(1): p. 119–152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Consortium TU, UniProt: a worldwide hub of protein knowledge. Nucleic Acids Research, 2018. 47(D1): p. D506–D515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Geoghegan KF, et al. , Spontaneous α-N-6-Phosphogluconoylation of a “His Tag” inEscherichia coli:The Cause of Extra Mass of 258 or 178 Da in Fusion Proteins. Analytical Biochemistry, 1999. 267(1): p. 169–184. [DOI] [PubMed] [Google Scholar]
- 40.Greisch J-F, et al. , Expanding the mass range for UVPD-based native top-down mass spectrometry. Chemical Science, 2019. 10(30): p. 7163–7171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Srzentić K, et al. , Interlaboratory Study for Characterizing Monoclonal Antibodies by Top-Down and Middle-Down Mass Spectrometry. Journal of the American Society for Mass Spectrometry, 2020. 31(9): p. 1783–1802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zenaidee MA, et al. , Internal Fragments Generated by Electron Ionization Dissociation Enhance Protein Top-Down Mass Spectrometry. J Am Soc Mass Spectrom, 2020. 31(9): p. 1896–1902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lantz C, et al. , ClipsMS: An Algorithm for Analyzing Internal Fragments Resulting from Top-Down Mass Spectrometry. J Proteome Res, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wu Z, et al. , MASH Explorer: A Universal Software Environment for Top-Down Proteomics. Journal of proteome research, 2020. 19(9): p. 3867–3876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Li H, et al. , Revealing Ligand Binding Sites and Quantifying Subunit Variants of Non-Covalent Protein Complexes in a Single Native Top-Down FTICR MS Experiment. Journal of the American Society for Mass Spectrometry, 2014. 25(12): p. 2060–2068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Pesavento JJ, Mizzen CA, and Kelleher NL, Quantitative analysis of modified proteins and their positional isomers by tandem mass spectrometry: human histone H4. Anal Chem, 2006. 78(13): p. 4271–80. [DOI] [PubMed] [Google Scholar]
- 47.Haverland NA, et al. , Defining Gas-Phase Fragmentation Propensities of Intact Proteins During Native Top-Down Mass Spectrometry. Journal of The American Society for Mass Spectrometry, 2017. 28(6): p. 1203–1215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Ives A, et al. , Using 10,000 fragment ions to inform scoring in native top-down proteomics. Journal of the American Society for Mass Spectrometry, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Saito R, et al. , Structure of bovine carbonic anhydrase II at 1.95 A resolution. Acta Crystallogr D Biol Crystallogr, 2004. 60(Pt 4): p. 792–5. [DOI] [PubMed] [Google Scholar]
- 50.O’Connor PB, et al. , Tandem mass specatrometry of carbonic anhydrase (29 kDa). Journal of Mass Spectrometry, 1995. 30(1): p. 88–93. [Google Scholar]
- 51.Senko MW, Beu SC, and McLafferty FW, High-resolution tandem mass spectrometry of carbonic anhydrase. Analytical Chemistry, 1994. 66(3): p. 415–417. [DOI] [PubMed] [Google Scholar]
- 52.Sze SK, et al. , Top-down mass spectrometry of a 29-kDa protein for characterization of any posttranslational modification to within one residue. Proceedings of the National Academy of Sciences, 2002. 99(4): p. 1774–1779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Schachner LF, et al. , Decoding the Protein Composition of Whole Nucleosomes with Nuc-MS. bioRxiv, 2020: p. 2020.09.08.287656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Campos EI and Reinberg D, Histones: annotating chromatin. Annu Rev Genet, 2009. 43: p. 559–99. [DOI] [PubMed] [Google Scholar]
- 55.Luger K, et al. , Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature, 1997. 389(6648): p. 251–60. [DOI] [PubMed] [Google Scholar]
- 56.Patel DJ and Wang Z, Readout of epigenetic modifications. Annual review of biochemistry, 2013. 82: p. 81–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ruthenburg AJ, et al. , Multivalent engagement of chromatin modifications by linked binding modules. Nat Rev Mol Cell Biol, 2007. 8(12): p. 983–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Suganuma T and Workman JL, Crosstalk among Histone Modifications. Cell, 2008. 135(4): p. 604–7. [DOI] [PubMed] [Google Scholar]
- 59.Voigt P, et al. , Asymmetrically modified nucleosomes. Cell, 2012. 151(1): p. 181–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Zheng Y, Huang X, and Kelleher NL, Epiproteomics: quantitative analysis of histone marks and codes by mass spectrometry. Curr Opin Chem Biol, 2016. 33: p. 142–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Schachner LF, et al. , Decoding the protein composition of whole nucleosomes with Nuc-MS. Nature Methods, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kornacki JR, Stuparu AD, and Mrksich M, Acetyltransferase p300/CBP associated Factor (PCAF) regulates crosstalk-dependent acetylation of histone H3 by distal site recognition. ACS chemical biology, 2015. 10(1): p. 157–164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kuo Y-M and Andrews AJ, Quantitating the Specificity and Selectivity of Gcn5-Mediated Acetylation of Histone H3. PLOS ONE, 2013. 8(2): p. e54896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Lermyte F, et al. , Metal ion binding to the amyloid β monomer studied by native top-down FTICR mass spectrometry. Journal of the American Society for Mass Spectrometry, 2019. 30: p. 2123–2134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lorenzo A and Yankner BA, Amyloid fibril toxicity in Alzheimer’s disease and diabetes. Annals of the New York Academy of Sciences, 1996. 777: p. 89–95. [DOI] [PubMed] [Google Scholar]
- 66.Verma M, Vats A, and Taneja V, Toxic species in amyloid disorders: oligomers or mature fibrils. Annals of Indian Academy of Neurology, 2015. 18: p. 138–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Sipe JD and Cohen AS, Review: history of the amyloid fibril. Journal of Structural Biology, 2000. 130: p. 88–98. [DOI] [PubMed] [Google Scholar]
- 68.Benilova I, Karran E, and Strooper BD, The toxic Aβ oligomer and Alzheimer’s disease: an emperor in need of clothes. Nature Neuroscience, 2012. 15: p. 349–357. [DOI] [PubMed] [Google Scholar]
- 69.Cline EN, et al. , The amyloid-β oligomer hypothesis: beginning of the third decade. Journal of Alzheimer's Disease, 2018. 64: p. S567–S610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Stefani M and Dobson CM, Protein aggregation and aggregate toxicity: new insights into protein folding, misfolding diseases and biological evolution. Journal of Molecular Medicine, 2003. 81: p. 678–699. [DOI] [PubMed] [Google Scholar]
- 71.Bernstein SL, et al. , Amyloid-β protein oligomerization and the importance of tetramers and dodecamers in the aetiology of Alzheimer’s disease. Nature Chemistry, 2009. 1: p. 326–331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Bleiholder C and Bowers MT, The solution assembly of biological molecules using ion mobility methods: from amino acids to amyloid β-protein. Annual Review of Analytical Chemistry, 2017. 10: p. 365–386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Bleiholder C, et al. , Ion mobility-mass spectrometry reveals a conformational conversion from random assembly to beta-sheet in amyloid fibril formation. Nature Chemistry, 2011. 3: p. 172–177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Hoffmann W, von Helden G, and Pagel K, Ion mobility-mass spectrometry and orthogonal gas-phase techniques to study amyloid formation and inhibition. Current Opinion in Structural Biology, 2017. 46: p. 7–15. [DOI] [PubMed] [Google Scholar]
- 75.Kłoniecki M, et al. , Ion mobility separation coupled with MS detects two structural states of Alzheimer’s disease Aβ1–40 peptide oligomers. Journal of Molecular Biology, 2011. 407: p. 110–124. [DOI] [PubMed] [Google Scholar]
- 76.Österlund N, et al. , Native ion mobility-mass spectrometry reveals the formation of β-barrel shaped amyloid-β hexamers in a membrane-mimicking environment. Journal of the American Chemical Society, 2019. 141: p. 10440–10450. [DOI] [PubMed] [Google Scholar]
- 77.Woods LA, Radford SE, and Ashcroft AE, Advances in ion mobility spectrometry-mass spectrometry reveal key insights into amyloid assembly. Biochimica et Biophysica Acta, 2013. 1834: p. 1257–1268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Esparza TJ, et al. , Soluble amyloid-beta aggregates from human Alzheimer’s disease brains. Scientific Reports, 2016. 6: p. 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Wildburger NC, et al. , Diversity of Amyloid-beta Proteoforms in the Alzheimer’s Disease Brain. Sci Rep, 2017. 7(1): p. 9520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.De Benedictis CA, Vilella A, and Grabrucker AM, The Role of Trace Metals in Alzheimer’s Disease, in Alzheimer’s Disease, Wisniewski T, Editor. 2019, Codon Publications Copyright: The Authors.: Brisbane (AU). [PubMed] [Google Scholar]
- 81.Borghesani V, Alies B, and Hureau C, Cu(II) binding to various forms of amyloid-β peptides. Are they friends or foes? European journal of inorganic chemistry, 2018. 2018(1): p. 7–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Fornelli L, et al. , Accurate sequence analysis of a monoclonal antibody by top-down and middle-down orbitrap mass spectrometry applying multiple Ion activation techniques. Anal. Chem, 2018. 90(14): p. 8421–8429. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
