Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2024 Jul 29;121(32):e2409676121. doi: 10.1073/pnas.2409676121

Fragment correlation mass spectrometry: Determining the structures of biopolymers in a complex mixture without isolating individual components

Yangjie Li a, Guy Cavet b, Richard N Zare a,1, Taran Driver c,1
PMCID: PMC11317569  PMID: 39074273

Abstract

Fragment correlation mass spectrometry correlates ion pairs generated from the same fragmentation pathway, achieved by covariance mapping of tandem mass spectra generated with an unmodified linear ion trap without preseparation. We enable the identification of different precursors at different charge states in a complex mixture from a large isolation window, empowering an analytical approach for data-independent acquisition. The method resolves and matches isobaric fragments, internal ions, and disulfide bond fragments. We suggest that this method represents a major advance for analyzing structures of biopolymers in mixtures.

Keywords: fragment correlation, covariance mapping, tandem mass spectrometry, proteomics, data-independent acquisition


Consider a complex biopolymer mixture in which the goal is to determine the molecular structure of each biopolymer by tandem mass spectrometry (1, 2). Whether employing bottom–up or top–down approaches (3), the first step is to separate individual components. If the sample is limited in amount or if the constituents have posttranslational modifications (4), this task is often challenging, especially when compounds coelute in the separation step. Methods based on Fourier-transform ion cyclotron resonance (5) and linear ion trap (6) have been developed to resolve mixtures in two dimensions (2-D): mass-to-charge (m/z: Th) of precursor and product ions. However, when mixtures contain isobaric or multiply charged precursors, these ions cannot be separated by such a 2-D method. We describe here the extension of an approach (79) involving the coincident detection of fragment ion pairs generated from the same pathway by correlating their signal intensities.

Recently, data-independent acquisition (DIA) has emerged as a powerful technology for proteomics by cofragmenting all precursor ions in a large isolation window, instead of selecting the most abundant ones for further fragmentation as in data-dependent acquisition (DDA) (10). However, complex fragment ion spectra increase the challenges of data analysis. We demonstrate fragment correlation mass spectrometry for determining the structures of seven peptides, coisolated at different charge states and cofragmented using collision-induced dissociation (CID). This approach does not require mixture components to be preseparated; assigns identical internal ions specifically to their different structures by their different companion ions; and is not hindered by isobaric precursor or fragment peaks.

We exploit covariance mapping techniques to correlate fragment ions generated from the same pathway of the same precursor, whose signals fluctuate in synchrony (11). Mapping of simple covariance is effective to study fragmentation of small molecules (12), but for analysis of biomolecules it becomes overwhelmed by spurious correlations arising from signal fluctuations because of, e.g., ionization source instability (13). Previously, a partial-covariance method was used for DDA of both peptides (7) and proteins (8) including a database search engine (9). In this work, we generalize this technique using contingent covariance, which can additionally suppress undesired correlations produced by nonlinear coupling to fluctuating experimental parameters.

Results

We report a universal method for fragment correlation mass spectrometry using a large isolation window in an unmodified linear ion trap without prior separation. This method exploits intrinsic fluctuations of fragment signals resulting from the stochastic nature of the fragmentation process and inherent to any tandem mass spectra (MS2) measurement. We mixed seven peptides, several of which are structural analogs that are challenging to resolve, including therapeutic cyclic peptides (14) containing disulfides (15). We cofragmented all peptides using a 50-Th isolation window and continuously collected MS2 spectra. We calculated contingent covariance between ion intensities, followed by peak centroiding and jackknife resampling to further evaluate the covariance signal-to-noise (S/N) (further details in SI Appendix). Correlation peaks with robust signals, matched to peptide precursor structures, are shown in Fig. 1A as a correlation plot. We show an example in the orange box of Fig. 1B: [b6]+ and [y3]+ signal intensities from desmopressin are correlated (orange), while they are not correlated with [y3]+ of oxytocin (green). Signals of the fragment ion pairs fluctuate synchronously, generating a positive covariance between their intensities. We reconstitute spectra in Fig. 1B as a fingerprint for each precursor structure, the so-called “extracted covariance tandem mass spectrum.” This is achieved by projecting the symmetric correlation plot onto one m/z axis with the added signals using the sum of correlation peak intensities at a certain m/z location.

Fig. 1.

Fig. 1.

Fragment correlation mass spectrometry. (A) a correlation plot generated from positive contingent covariance between signal intensities in tandem mass spectra. Each data point in the correlation plot indicates a pair of correlated fragments generated by the same pathway from their color-coded precursor. (B) Extracted covariance tandem mass (MS2) spectra were generated by projecting the symmetric correlation plot along the axes for each precursor classification. An example in the orange box indicates synchronizing measurements of fragment pairs in orange. (C) Coverage maps show terminal ions assigned to specific structures (in black slashes) with additional sequence cleavages from correlated isobaric ions, internal ions, and/or disulfides fragments (pink).

We plotted in Fig. 2 the correlated peaks in the covariance maps prior to jackknife resampling. The method can directly measure disulfide bond fragments (octreotide in purple in Fig. 2A as an example), resolve isobaric fragment ions, and match internal ions (Fig. 2 B and C). By comparing covariance peaks (colored peaks on the covariance maps) with the averaged tandem mass spectra peaks (in black traces at the back of the maps), we find that the present method can not only correlate fragments but also uncover more peaks from their nearby chemical noise.

Fig. 2.

Fig. 2.

Covariance maps generated by calculating contingent covariance matrix from fluctuating intensities of each fragment pair from MS2 scans, with averaged mass spectra shown on the back. (A) The generated covariance map at a large m/z range, showing four peaks with the highest covariance S/N ratios after jackknife resampling assigned to four different fragment pairs from four different precursors in the mixture (with pathways in orange and green used in Fig. 1B); (B) isobaric fragments resolved by correlating with companion ions that have different m/z values; (C) internal ions assigned by correlating with terminal ions generated from a pathway that leads to three fragments including an undetected neutral terminal fragment.

Discussion

In the mixture of seven peptides, the structure of each analyte of interest is successfully elucidated in the presence of matrix effects from the other seven peptides. In Fig. 1A, among all 91 peaks in the MS2 spectrum of the mixture, 36% of fragment ions are isobaric with at least one ion of a different fragment structure. As shown in the coverage maps, fragment correlation mass spectrometry significantly improves sequence coverage of each compound in the mixture compared to standard tandem mass analysis using terminal ions alone. This happens especially when the structures are difficult to analyze, such as cyclic peptides with disulfide bond connectivity and structures with identical b ions (e.g., dynorphin with its analog).

Note that the internal ion, [bi(4-5)-H2O]+, is assigned unambiguously to oxytocin in green in Fig. 2B. Although the same internal ion exists in all three compounds in the mixture, the other two compounds have different companion ions correlated with that internal ion (see desmopressin in orange and oxytocin analog in brown).

Our method selectively amplifies previously missing structural signals over neighboring signals generated by chemical or electrical noise. When the fragmentation pathway yields only two fragment ions, then the sum of the two fragment masses equals the precursor mass (Fig. 2B). When it yields three fragments of which one is neutral, it generates two correlated fragment ions, as shown in Fig. 2C. In this case, mass of the neutral loss was deduced by subtracting the sum of the fragment ion pair masses from the mass of the precursor that was matched. As demonstrated here, our method can be immediately applied to resolve mixed samples and improve sequence coverage in MS2 measurements of multiply charged precursors. Future developments should leverage high scan rate instruments, such as time-of-flight instrumentation, for tandem mass spectrometry to reduce the required collection time for each correlation map to several seconds. This will enable our method to be coupled to a liquid chromatography (LC-MS2) workflow.

This study empowers DIA by resolving and matching isobaric fragments, internal ions, and disulfide bond fragments without resorting to a modified instrument, high resolution, prior separation, or derivatization.

Materials and Methods

A solution (10 to 100 µM) of seven different peptides was prepared without the need for sample workup. Aliquots of 10 to 30 µL were nanoelectrosprayed into a linear ion trap mass spectrometer in which MS2 spectra were obtained in positive mode using CID with a 50-Th isolation width centered at m/z 520. Samples were sprayed continuously until 10,000 MS2 scans were collected in less than 28 min in each trial for calculation of a covariance matrix. See SI Appendix for more details.

Supplementary Material

Appendix 01 (PDF)

pnas.2409676121.sapp.pdf (88.9KB, pdf)

Acknowledgments

Y.L. and R.N.Z. acknowledge the support from the Air Force Office of Scientific Research through the Multidisciplinary University Research Initiative program (AFOSR FA9550-21-1-0170). Figures were created with BioRender.com.

Author contributions

Y.L., G.C., and T.D. designed research; Y.L. and T.D. performed research; T.D. contributed new analytic tools; Y.L. analyzed data; and Y.L., G.C., R.N.Z., and T.D. wrote the paper.

Competing interests

T.D. and G.C. are co-inventors on US Patent Application #18/479,114.

Contributor Information

Richard N. Zare, Email: rnz@stanford.edu.

Taran Driver, Email: tdriver@stanford.edu.

Data, Materials, and Software Availability

Datasets generated in this study are available via the Stanford Digital Repository (https://purl.stanford.edu/gs029dv3773) (16). The codes to implement the contingent covariance calculation are available via https://github.com/TaranDriver/FCMS (17).

Supporting Information

References

  • 1.Domon B., Aebersold R., Mass spectrometry and protein analysis. Science 312, 212–217 (2006). [DOI] [PubMed] [Google Scholar]
  • 2.Mann M., Hendrickson R. C., Pandey A., Analysis of proteins and proteomes by mass spectrometry. Annu. Rev. Biochem. 70, 437–473 (2001). [DOI] [PubMed] [Google Scholar]
  • 3.Donnelly D. P., et al. , Best practices and benchmarks for intact protein analysis for top-down mass spectrometry. Nat. Methods 16, 587–594 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Aebersold R., et al. , How many human proteoforms are there? Nat. Chem. Biol. 14, 206–214 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Van Agthoven M. A., Lam Y. P. Y., O’Connor P. B., Rolando C., Delsuc M.-A., Two-dimensional mass spectrometry: New perspectives for tandem mass spectrometry. Eur. Biophys. J. 48, 213–229 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Le M. T., et al. , Two-dimensional tandem mass spectrometry for biopolymer structural analysis. Angew. Chem. Int. Ed. Engl. 63, e202315904 (2024). [DOI] [PubMed] [Google Scholar]
  • 7.Driver T., et al. , Two-dimensional partial-covariance mass spectrometry of large molecules based on fragment correlations. Phys. Rev. X 10, 041004 (2020). [Google Scholar]
  • 8.Driver T., Averbukh V., Frasiński L. J., Marangos J. P., Edelson-Averbukh M., Two-dimensional partial covariance mass spectrometry for the top-down analysis of intact proteins. Anal. Chem. 93, 10779–10788 (2021). [DOI] [PubMed] [Google Scholar]
  • 9.Driver T., et al. , Proteomic database search engine for two-dimensional partial covariance mass spectrometry. Anal. Chem. 93, 14946–14954 (2021). [DOI] [PubMed] [Google Scholar]
  • 10.Venable J. D., Dong M.-Q., Wohlschlegel J., Dillin A., Yates J. R., Automated approach for quantitative analysis of complex peptide mixtures from tandem mass spectra. Nat. Methods 1, 39–45 (2004). [DOI] [PubMed] [Google Scholar]
  • 11.Frasinski L. J., Covariance mapping techniques. J. Phys. B: At. Mol. Opt. Phys. 49, 152004 (2016). [Google Scholar]
  • 12.Frasinski L. J., Codling K., Hatherly P. A., Covariance mapping: A correlation method applied to multiphoton multiple ionization. Science 246, 1029–1031 (1989). [DOI] [PubMed] [Google Scholar]
  • 13.Fenn J. B., Mann M., Meng C. K., Wong S. F., Whitehouse C. M., Electrospray ionization for mass spectrometry of large biomolecules. Science 246, 64–71 (1989). [DOI] [PubMed] [Google Scholar]
  • 14.Li Y., et al. , Accelerated forced degradation of therapeutic peptides in levitated microdroplets. Pharm. Res. 37, 138 (2020). [DOI] [PubMed] [Google Scholar]
  • 15.Li Y., Kolasinski K. W., Zare R. N., Silica particles convert thiol-containing molecules to disulfides. Proc. Natl. Acad. Sci. U.S.A. 120, e2304735120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Li Y., Cavet G., Zare R. N., Driver T., Data for FCMS manuscript. Stanford Digital Repository. https://purl.stanford.edu/gs029dv3773. Deposited 10 May 2024.
  • 17.Li Y., Cavet G., Zare R. N., Driver T., FCMS. GitHub. https://github.com/TaranDriver/FCMS. Deposited 17 June 2024.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix 01 (PDF)

pnas.2409676121.sapp.pdf (88.9KB, pdf)

Data Availability Statement

Datasets generated in this study are available via the Stanford Digital Repository (https://purl.stanford.edu/gs029dv3773) (16). The codes to implement the contingent covariance calculation are available via https://github.com/TaranDriver/FCMS (17).


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES