Abstract

Hydrogen–deuterium exchange mass spectrometry (HDX/MS) is increasingly used to study the dynamics of protein conformation. Coupled to native MS, HDX can also characterize the conformations of oligonucleotides and their binding to cations, small molecules, and proteins. Data processing and visualization of native HDX/MS of oligonucleotides requires dedicated software solutions. OligoR is a web-browser-based application that addresses the specific needs of DNA HDX/MS and native MS experiments from raw data in an open format to visualization and export of results. Whole experiments spanning many time points can be processed in minutes for several mass-separated species. To access valuable folding dynamics information, we have developed a simple and robust approach to deconvolute bimodal isotope distributions, even when they are highly overlapping. This approach is based on modeling physically possible isotope distributions determined from chemical formulae and could be extended to any type of analyte (proteins, peptides, sugars, and small molecules). All results are presented in interactive data tables, and publication-quality figures can be generated, customized, and exported.
Introduction
Hydrogen–deuterium exchange mass spectrometry (HDX/MS) is a technique of choice to study the structural dynamics of proteins.1−5 Incubation of a protein in a deuterated buffer results in the exchange of amide hydrogen by deuterium isotopes at a rate that strongly depends on the hydrogen-bonding status and solvent accessibility.6−8 Therefore, the deuteration yields inform on the structure of proteins. HDX/MS is increasingly used both in academic and industrial contexts to compare different states of a protein in, e.g., epitope mapping,2,9,10 batch-to-batch comparisons,11 biosimilarity evaluation,2,12 and stress testing.13
Most HDX/MS experiments share a similar bottom-up approach: the protein is first incubated in a deuterium-rich buffer for a defined time, then the reaction is quenched, the protein denatured and digested with a protease, and the resulting peptides are finally analyzed by LC/MS.8 The deuterium content is therefore quantified for many peptides, over several time points, generating a large amount of data.14,15 Consequently, several programs have been developed to assist in HDX/MS raw data processing (peptide detection and identification, determination of their deuterium content, isotopic distribution modeling, and fitting),16−26 post-processing (comparison of deuterium uptake across samples and assessment of statistical significance), and visualization.19,22,24,27−35
We have recently demonstrated that HDX/MS can also apply to DNA oligonucleotides.36 Such an approach can characterize the secondary structures of oligonucleotides (e.g., genomic and viral sequences, aptamers), investigate complicated folding pathways, and probe oligonucleotide complexes with cations, small molecules, and proteins. The analytical workflow we propose differs from that of routine protein HDX/MS: the analytes are injected in the mass spectrometer with no quenching or denaturation steps and thus without chromatographic separation. We have employed the following two approaches.
Real-time monitoring of the exchange (called “RT-HDX”): the exchange is triggered by manual mixing of a pre-deuterated DNA solution with a non-deuterated buffer. The exchanging solution is infused in the mass spectrometer and analyzed in a time-dependent manner: each MS scan corresponds to a different deuteration time point.
Continuous-flow mixing (called “CF-HDX”): the pre-deuterated analytes are exchanged for a defined time by mixing with a non-deuterated buffer from a mixing tee to the mass spectrometer. The duration of the exchange is controlled by the volume and the flow rate between the mixing tee and the mass spectrometer. All MS scans correspond to the same deuteration time point.
This has several consequences, which are listed below.
-
(i)
The MS measurement can be performed in native conditions, and therefore, several species (including non-covalent complexes) can be mass-separated, and their deuteration quantified concurrently.
-
(ii)
The number of analytes in each injection is reduced compared to bottom-up approaches (1–10 vs hundreds), but all analytes to be considered are present in all scans (no chromatographic separation).
-
(iii)
The number of time points is increased compared to bottom-up approaches (thousands for RT-HDX vs 1–10 for bottom-up).
-
(iv)
Theoretical isotopic envelopes must be calculated for DNA analytes, whose number of exchangeable sites (one for dT, two for dA and dC, and three for dG) differs from that of peptides (one per amide, except prolines). Non-covalent binding of cations and organic molecules must also be considered.
An application specifically designed to process oligonucleotide HDX/MS raw data is therefore necessary to handle these constraints. Further analysis of the processed data is also desirable: fitting of the exchange kinetics is necessary to determine exchange rates and unlock underlying folding kinetics and equilibrium constants. Some oligonucleotide conformations exchange simultaneously through different kinetics regimes (the EX1 and EX2 mechanisms). EX1 kinetics and multiple conformers interconverting slower than their respective exchange rates yield bimodal distributions.21,37 We want to deconvolute these distributions to access the exchange rates of individual populations, as well as the rate of unfolding of the analyte to less-folded EX1-competent conformations. Deconvolution of deuterated isotopic distributions has only been achieved in a few of the available HDX/MS software programs and is only functioning for peptides.
We introduce here an open-source R-based application, termed as OligoR, dedicated to data processing of HDX/MS data of oligonucleotides, spanning all steps from raw MS import to the generation of publication-quality figures. We demonstrate here the use of OligoR with both RT- and CF-HDX/MS data, including bimodal distribution deconvolution. We also exemplify the use of OligoR for native MS kinetics experiments.
Materials and Methods
Materials and experimental methods are given in the Supporting Information.
Raw Data File Handling
Raw MS data files in the .raw format (proprietary file format from Thermo) or the .d format (Agilent) were converted to mzML,38−40 using the MSConvert utility (Version: 3.0.20175),41,42 using the following parameters: 64-bit binary encoding precision, write index, TPP compatibility, zlib compression, and packaged in gzip. All data files are available as demo data alongside the source code; their m/z and time ranges were filtered with MSConvert to limit their size. Import of raw data in the mzML format in OligoR relies on the mzR R package.43
HDX Calculations
Data were processed in R using the tidyverse44 and data.table45,46 packages.
Deuterated Isotopic Distribution Calculation
The calculation of deuterated isotopic distributions of an oligonucleotide requires the convolution of its natural isotopic distribution with the deuteron distribution. OligoR extracts the chemical composition of the analyte from its sequence and charge state and adds user-supplied atoms (cations and small molecules). The isotopic masses and abundances are those of the last IUPAC Technical Report.47 The number of exchangeable sites nX of a given oligonucleotide anion is a constant given by eq 1, where ndX is the number of deoxynucleotides X. By default, phosphates are not counted because they are completely deprotonated in the solution and then partially protonated (but not deuterated) by water vapor in the MS source, regardless of the deuterium content in the solution.36 Terminal hydroxyl groups are also not accounted for by default.36 However, users can choose to account for phosphates and/or hydroxyl groups as needed and can also manually set up nX for applications with non-standard oligonucleotides or other types of analytes.
| 1 |
The deuterium distribution follows the binomial law, where the number of exchangeable sites (here, amino and imino protons from nucleobases) is the number of trials and the deuterium molar fraction the probability of success. The convolution of natural isotopic and deuteration distributions was performed by fast Fourier transform.48 All calculations above were implemented in OligoR based on the code we described previously.36
Number of Protected Sites and Exchange Rate Determination
The centroid mass-to-charge ratio m/zcentroid is computed from the n data points of each distribution using eq 2, where m/zi and Ii are, respectively, the mass-to-charge and intensity of data point i.
| 2 |
Centroids are converted to the number of unexchanged sites NUS as a function of time t, calculated using eq 3, where m/zt is the centroid at an exchange time t, m/z∞ is the centroid of the fully exchanged reference, and z is the charge state of the analyte. The apparent isotopic mass shift for a single site Δms is defined in eq 4, where DC0 and DC∞ are the deuterium content (expressed as molar fractions) in the bulk medium before (t = 0) and after (t = ∞) mixing, and mD and mH are the isotopic masses of hydrogen (1.007825 u) and deuterium (2.0141018 u), respectively. The fully exchanged reference can be determined in OligoR from experimental data or the computed theoretical distribution (see above).
| 3 |
| 4 |
The determination of apparent exchange rates by non-linear fitting is performed with eq 5, where NUS∞ is the offset, Ni is the number of exchanging sites, ki is the exchange rate, and j is the number of groups of sites with similar rates.
| 5 |
The fitting parameters are initialized by linearization of the data and linear fitting, or manually by the user.
Modeling and Deconvolution of Isotopic Distributions
The modeling of isotopic distributions is carried out by optimizing for each time point the apparent deuterium content DC of theoretical isotopic distributions (as described above) to the user-selected isotopic distributions. The number of exchangeable sites nX is known and kept constant to ensure that the position and width of the optimized isotopic distributions are correct.49 The algorithm systematically performs the modeling with one or two distributions; the determination of the effective number of distributions is done afterward by visual inspection and statistical testing.
Theoretical isotopic distribution calculation generates a list of i model peaks with their respective m/zi and intensity Imodel(i) for a given DC. User-selected distributions are peak-picked with a function based on the peakPick R package,50 also generating a list of i peaks of intensity Iexp(i) (Figure S1). The m/z values are rounded to the nearest multiple of 1/z to account for slight differences between the m/z scale of the experimental and model data.
To fit the modeled isotopic distributions to the peak-picked experimental data, the DC variable in the model is optimized by least-squares minimization with the box-constrained L-BFGS-B method.51 For each optimization iteration, the sum of square residuals S is calculated from the modeled intensities of peaks i using eq 6. For bimodal distributions, the optimization is performed using two model distributions j, from which a single overall peak intensity Ibimodal(i, j) is calculated for each peak i for the optimization steps (eq 7). Two optimized DCj and two abj values are therefore obtained, and two individual distributions are generated from these optimized parameters. The upper and lower DC bounds are the actual initial (DCi, e.g., 90%) and final (DCf, e.g., 9%) experimental DC in the experimental bulk medium, thus avoiding physically impossible results (Figure S2).
| 6 |
| 7 |
There is a risk that significantly overlapped bimodal distributions look monomodal.37 Significant overlap happens when EX2 and EX1 occur at the same time, particularly if the number of sites involved in EX1 is small,37 and/or if significant exchange through EX2 already occurred (see longer mixing times in Figure S3). Visual inspection alone may not be sufficient to assess this, leading to false-negative detections. Conversely, in the cases where a single population is sufficient, bimodal distributions may still appear to fit marginally better (i.e., lower RSS) because of the larger number of parameters. This could lead to false-positive detections. Therefore, the statistically better model is identified in OligoR by calculation of the p-value of the F-statistic defined in eq 8, where df is the degree of freedom and mono and bi refer to the mono- and bimodal models, respectively. The null hypothesis is that the bimodal model does not explain the variance better than the monomodal model. The significance level α is set to 0.05 by default and can be changed by the user.
![]() |
8 |
Derived Parameters
Several HDX parameters are derived from the deconvoluted bimodal distributions. The NUSj(t) values at a time t for a population j (1 or 2) are obtained from eq 9, where DCj(t) is the optimized DC for the corresponding time and population. This gives access to the “pure” EX2 contributions of each population. These NUS as a function of the exchange time can be fitted using eq 5, similarly to the non-deconvoluted data.
| 9 |
Plotting the abundance of each population as a function of the exchange time gives access to the EX1 contribution.37 The apparent rate of this contribution, which approximates that of the opening rate of the analyte, is obtained using eq 10, where ab∞ is the final abundance and abΔ the amplitude of the change of abundance.
| 10 |
Application Infrastructure, User Interface, and Data Output
Application Infrastructure
OligoR is a Shiny application that runs in a web browser.52−55 Both the user interface (UI) and server were developed in R. OligoR can be run directly from the source code: it requires the installation of R (version >4.0.5), while package dependencies are automatically installed the first time the application is run. OligoR is also available as a Docker container running R 4.2.1 on Ubuntu 20.04.4 LTS, which only requires to install Docker desktop. This approach allows running OligoR in a predictable way, without potential dependency issues, on any host local computer or cloud.
User Interface
The interface is divided into six interconnected modules dedicated to specific tasks.
-
1.
OligoRef computes the chemical formula, electrospray series (monoisotopic and average masses and m/z as a function of z), and (deuterated) isotopic distributions of oligonucleotide anions. To do so, the user inputs the sequence, molecularity, and charge state of the analyte. It is also possible to manually add atoms and charges accounting for cation adducts (K+ and NH4+), ligands, or modified nucleotides. The number of exchangeable protons is determined automatically or can be manually specified, which is useful for, e.g., non-canonical nucleotides. Users must also specify the final deuterium content in the solution DC∞, which OligoR will automatically take into account when calculating the isotopic distributions. In HDX experiments, DC∞ is a critical parameter to (i) constrain the fitting algorithm and (ii) calculate derived parameters (eqs 3, 4, and 9). Experimental isotopic distributions (imported from MSxploR) can be compared to the calculated ones to assess the accuracy of the measurement. OligoRef also allows determining the number of exchange sites from experimental isotopic distributions, provided that the deuterium solution content is known and the exchange reaction is complete.
-
2.
MSxploR imports raw MS data from mzML files, an open format proposed as a community standard.40 Users can sum scans, select species/isotopic distributions of interest, and send data to other modules for further processing. For CF-HDX experiments, the scans are combined on a time range wherein the exchange time remains constant and then sent to MSstackR. For RT-HDX experiments, wherein each scan corresponds to a different exchange time, all scans are sent to TimeR.
-
3.
MSstackR displays spectra from discrete exchange time points. It performs isotopic peak picking and mono-/bimodal isotopic distribution modeling and calculates the overlap coefficient Δ. Users can modify the optimization parameters and constraints to fit the experiment and perform statistical testing on the number of distributions. Previously processed data can be loaded as .xlsx files and merged with new data where necessary.
-
4.
TimeR filters and plots time-dependent data from RT-HDX experiments or conventional kinetics. It calculates the raw intensity, standard-corrected intensity (if an internal standard is used), and centroid for each scan. Scans can be averaged across a user-supplied scan range if the signal intensity from single scans is deemed insufficient.
-
5.
HDXplotR plots HDX data from MSstackR and TimeR and performs non-linear fitting of exchange kinetics. If bimodal distributions were deconvoluted in MSstackR, HDXplotR plots the deconvoluted NUS and isotopic population abundances and performs non-linear fitting on both.
-
6.
TitratR determines amounts of bound and unbound species in equilibrium titrations, from species selected in MSxploR, and calculates response factors and dissociation constants by internal standardization. It is a direct implementation of a previously published method that will not be discussed here.56
Data Output
Figures are produced using ggplot2 and add-ons.57−62 All figures can be customized (e.g., dimensions and colors). Figures can be exported as png or pdf files (for vectorial graphic post-processing). All plots in Figures 1 and 4 were produced with OligoR.
Figure 1.
OligoR workflow with main parameters (solid lines with square endpoints) and visualization outputs (dotted lines withcircle endpoints). Input data are shown in brown; dashed input is pre-processed data exported as an .xlsx file from a previous session. Processed HDX data are made of two files (peak-picked and deconvoluted MS spectra). Only selected time points are shown for deconvoluted mass spectra (see Figure S4 for all time points). Visualization outputs were generated with OligoR from CF-HDX data of 23TAG and the binding kinetics of K+ with 23TAG (time-dependent intensity output).
Figure 4.
Examples of the use of OligoR to process and visualize HDX/MS results. Left: CF-HDX experiment with two mass-separated species of interest. (A) Empirical determination of the fully exchanged reference centroid (blue line) and comparison to the theory (orange line and points) for 23TAG·PhenDC3 (C264H303O141N100P22K1, z = 4- with 51 exchangeable protons, 9%D; 23TAG alone is shown in Figure 1). (B) Collected mass spectra for 23TAG and 23TAG·PhenDC3 for different mixing time points (gray lines; only selected time points shown, see Figure S4 for all time points). Automated peak picking is shown with pink points. (C) Corresponding apparent exchange kinetics (points) and non-linear fitting with eq 5 (lines). Right: RT-HDX experiment acquired for 90 min (dead time: 40 s). (D) Same as (A) for T30177-TT binding two K+ (C190H230O119N74P18K2, z = 4- with 43 exchangeable protons, 9%D). (E) Calculations of m/z centroids for the species of interest for each of the 3867 scans. (F) Corresponding apparent exchange kinetics for each scan (points) and non-linear fitting with eq 5 (lines).
All raw and processed data are displayed in tables built with the datatables plug-in for the jQuery JavaScript library. They feature search fields, column sorting and filtering, column and row re-ordering, and export buttons. Data tables can be copied into the clipboard or exported as csv and Excel files. Processed data in the .xlsx format can be re-imported in OligoR. Processed data files for 23TAG, T30177-TT, and VEGF were all made available as demo data alongside the OligoR source code.
Results
Isotopic Distribution Deconvolution
The isotopic distribution modeling algorithm was evaluated using CF-HDX data from three oligonucleotides, namely, T30177-TT (5′-T2GTG2(TG3)3T), VEGF (5′-CG4CG3C2T2G3CG4T), and 23TAG (5′-TA(G3T2A)3G3). T30177-TT is a stable G-quadruplex, whereas VEGF and 23TAG fold into less stable (i.e., more prone to unfold) structures, in our experimental conditions.63
T30177-TT exchanges through an EX2 kinetics characterized by a single isotopic distribution shifting progressively with the exchange time. Thus, a single distribution suffices to fit all time points of T30177-TT (Figure 2). The mean relative difference between the experimental and modeled centroids across the 20 time points is 13 ppm. Conversely, VEGF and 23TAG visibly exhibit bimodal isotopic distributions for most of the time points. At longer mixing times, however, it is not possible to conclude from visual inspection alone whether a single or two highly overlapped isotopic distributions are present. Fitting was achieved with two distributions for all time points, with a mean relative difference in centroid m/z of 12 and 10 ppm, respectively. The p-values of the F-statistic indicate that the use of two isotopic distributions is indeed necessary for all time points, for both VEGF and 23TAG, with a high level of confidence (α = 0.05).
Figure 2.

Example of isotopic distribution modeling with OligoR on monomodal and bimodal distributions. The experimental data are shown in gray, peak picking in blue, the overall fit in orange, and individual populations in green and purple (z = 4-, K+ = 2, time labeled in seconds above panels). The vertical dashed lines show the position of their respective centroids. All time points are shown in Figures S4 and S5. In the case of 23TAG, the abundance of the highly exchanged isotopic population increases, suggesting the presence of EX1 kinetics.21,37 On the contrary, there is no change in the abundances of isotopic populations of VEGF; thus, the bimodal distribution probably results from distinct conformations in the solution.
Fitting with Gaussians is a simpler approach that has already been used for protein and peptide isotopic distributions.20,64Figures 3 and S3 illustrate the differences between our approach and Gaussian fitting. The latter generally provides an overall fit of the data similar to that of OligoR. However, OligoR automatically provides constraints on the distribution widths based on experimental parameters, whereas a simple Gaussian fitting does not (it would require the determination of peak widths in m/z units for any given centroid). Consequently, the individual distributions obtained by Gaussian fitting are wider or narrower compared to isotopic distributions modeled by OligoR, resulting in both incorrect centroids and abundances of the deconvoluted populations. The error increases as the two isotopic distributions become more overlapped, consistent with a previous report.37 For instance, for 23TAG, the error is small at 0.02 min but becomes clear at 1.02 min. At 2.33 min, the overlap becomes too important for successful fitting with two Gaussians (i.e., it did not converge) but was achieved with OligoR. A single Gaussian actually fits the experimental data well but is unrealistic: the distribution is too wide for this analyte at this deuteration level, as clearly evidenced by OligoR.
Figure 3.

Comparison of fitting methods of increasingly overlapped experimental isotopic distributions (black dots: peak-piked data from 23TAG at three different time points). Gaussian distributions are shown in solid green and blue lines (overall fit in solid orange), and optimized isotopic distributions obtained in OligoR are shown as green and blue areas (overall fit in dotted black). At 2.33 min, only a single Gaussian could be fitted. The coefficient of overlap between the isotopic distributions is given by Δ. The Gaussian fitting and coefficient-of-overlap calculation methods are given in the Supporting Information.
To assess the accuracy of the deconvoluted centroids and abundances as a function of the overlap coefficient, we quantified the fit error on a set of 2181 bimodal spectra generated from predefined deuterium contents for the three sequences T30177-TT, 23TAG, and VEGF at z = 4- (Figure S6). Noise was added to better mimic the experimental data, and the relative abundances of the isotopic populations were randomized. The OligoR fitting routine was applied to all spectra (Figures S7–S9). Very accurate centroids were obtained regardless of the overlap coefficient (mean relative error = 0.9%; Figure S10B), with no significant increase in the mean squared error (Figure S10A). The accuracy of the abundances is excellent (mean relative error = 0.9%) for overlaps below 0.5 (Figure S10C), which is the case for all spectra shown in Figure 1. For very large overlap values, however, the relative error increases sharply (4% for 0.5 ≤ Δ < 0.6, 10% for 0.6 ≤ Δ < 0.7, and up to 29% for Δ > 0.7). This is particularly evident for low-abundance populations. Users should therefore be cautious with values obtained from low-abundance, highly overlapping isotope distributions. Such cases are most likely to occur at the very end of exchange kinetics. More generally, users can use Figure S10 as a benchmark to estimate expected accuracies.
Application of the Workflow
Here, we briefly illustrate the results of CF-HDX, RT-HDX, and binding kinetics experiments using OligoR for data processing and visualization.
CF-HDX Processing
We explore here CF-HDX data from experiments involving two species, namely, 23TAG (binding two K+) and 23TAG·PhenDC3, its 1:1 complex with the ligand PhenDC3 binding a single K+.65,66
Figures 1 and 4A show the determination of the fully exchanged reference centroids of 23TAG and 23TAG·PhenDC3, respectively. This is necessary for the calculation of NUS values following eq 3. The OligoRef module also compares those experimental data with the theory, which is useful to verify its validity and quantify experimental errors.
Figure 4B displays the isotopic distributions selected in MSxploR for each species across seven time points (here, the 4- charge state of 23TAG binding two K+, or PhenDC3 and only one K+). Peak picking of the isotopic peak, which is necessary for deconvolution, is systematically performed on all collected spectra in MSstackR. Note that users can tweak the peak-picking algorithm parameters (e.g., intensity threshold), but default parameters proved very effective in our hands. 23TAG displays bimodal distributions, which are successfully deconvoluted in MSstackR (see Figure 1).
HDXplotR computes and plots the apparent centroids against the exchange time of both species (Figure 4C), using the references processed in OligoRef (Figures 1 and 4A) and experimental parameters provided by the user (e.g., deuterium content and charge states). Non-linear fitting using eq 5 can be toggled on, as shown in Figure 4C.
In the case of unbound 23TAG, HDXplotR also plots the deconvoluted exchange and isotopic population abundance as a function of time (Figure 1: low- and high-exchange populations). Non-linear fitting of these two plots, respectively, yields the pure apparent EX2 exchange rate of 23TAG (k1 = 0.148 s–1 and k2 = 0.014 s–1) and the rate of cooperative unfolding undergone by 23TAG to access EX1-capable conformers (kop = 0.05 s–1).
RT-HDX Processing
To illustrate the RT-HDX treatment, we used previously published T30177-TT exchange data.36 The fully exchanged reference was obtained in OligoRef, as above (Figure 4D). After selection of the m/z range of the isotopic distribution of interest (here, the 4- charge state of the oligonucleotide binding two K+) in MSxploR, TimeR computes the centroid mass for every single scan in a few seconds (Figure 4E). As for CF-HDX, HDXplotR converts the kinetics into NUS units and performs non-linear fitting with eq 5 (Figure 4F).
Binding Kinetics Processing
We exemplify here the use of OligoR for processing binding kinetics using time-dependent data on K+ cations binding to 23TAG. This system was first reported by our team to study the folding pathways of G-quadruplexes.67
Three species of interest, namely, 23TAG binding 0, 1, and 2K+ at the 5- charge state, were selected in MSxploR. TimeR then readily computes their absolute and relative intensities (shown in Figure 1) as a function of time by processing all scans independently. Here, the abundance of the unfolded strand 23TAG·0K+ decreases with time, while the expected equilibrium product, the 3-tetrad G-quadruplex 23TAG·2K+, is formed. The 23TAG·1K+ ensemble, which includes both off-pathway conformers (2-tetrad G-quadruplex) and folding intermediates, is formed in the first few minutes and is then increasingly converted to the 2K+ species.
Conclusions
HDX/MS is a method of choice to study protein conformational dynamics. Several commercial and academic programs are available for data processing and visualization of the results. None, however, are adapted to the specifics of DNA HDX/MS experiments that we recently introduced. We present here OligoR as an answer to the specific needs of DNA HDX/MS and native MS experiments.
OligoR is divided into six interconnected modules dedicated to specific tasks, from raw data to result visualization and export. Processing of entire experiments spanning many time points can be performed in minutes. Furthermore, we implemented a simple and robust approach to deconvolute bimodal distributions, determine whether they result from different conformers or EX1 kinetics, and access pure EX1 and EX2 exchange rates where relevant. This approach yields physically possible results only and could be expanded to any type of analyte (protein, peptides, sugars, and small molecules) as it relies on chemical formulae, provided that the number of exchangeable sites is known. OligoR provides a means of experimentally determining this number.
All results are presented in data tables that can be exported in several formats to save/reimport data and for other software. Publication-quality figures can be produced, customized, and exported as png or pdf files. Finally, OligoR runs in a simple web browser either directly from the R source code or a Docker container. The latter runs consistently across machines without risks of package dependency or update issues and can be deployed on a server.
The source code and demo data are available on GitHub (https://github.com/EricLarG4/OligoR) and are archived on Zenodo (doi.org/10.5281/zenodo.7691330). The Docker image can be retrieved from the GitHub container registry (ghcr.io/ericlarg4/oligor:master) and Docker Hub (https://hub.docker.com/r/ericlarg4/oligor).
Acknowledgments
We thank Dr. Valérie Gabelica for her feedback during the development of the software and for proofreading the manuscript. This work was supported by the Agence Nationale de la Recherche (ANR-21-CE29-0004 “DNA-HDXMS” to E.L.).
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.3c01321.
Additional materials, experimental methods (HDX/MS and native MS kinetics), computational methods (Gaussian fitting and titration experiments), examples of peak-picking results, theoretical isotopic distributions, comparison of binomial and Gaussian fitting, results of isotopic distribution modeling, and evaluation of the influence of the overlap coefficient on the relative fitting error (PDF)
The authors declare no competing financial interest.
Supplementary Material
References
- Trabjerg E.; Nazari Z. E.; Rand K. D. Conformational Analysis of Complex Protein States by Hydrogen/Deuterium Exchange Mass Spectrometry (HDX-MS): Challenges and Emerging Solutions. TrAC, Trends Anal. Chem. 2018, 106, 125–138. 10.1016/j.trac.2018.06.008. [DOI] [Google Scholar]
- Oganesyan I.; Lento C.; Wilson D. J. Contemporary Hydrogen Deuterium Exchange Mass Spectrometry. Methods 2018, 144, 27–42. 10.1016/j.ymeth.2018.04.023. [DOI] [PubMed] [Google Scholar]
- Zheng J.; Strutzenberg T.; Pascal B. D.; Griffin P. R. Protein Dynamics and Conformational Changes Explored by Hydrogen/Deuterium Exchange Mass Spectrometry. Curr. Opin. Struct. Biol. 2019, 58, 305–313. 10.1016/j.sbi.2019.06.007. [DOI] [PubMed] [Google Scholar]
- Engen J. R.; Botzanowski T.; Peterle D.; Georgescauld F.; Wales T. E. Developments in Hydrogen/Deuterium Exchange Mass Spectrometry. Anal. Chem. 2021, 93, 567–582. 10.1021/acs.analchem.0c04281. [DOI] [PubMed] [Google Scholar]
- James E. I.; Murphree T. A.; Vorauer C.; Engen J. R.; Guttman M. Advances in Hydrogen/Deuterium Exchange Mass Spectrometry and the Pursuit of Challenging Biological Systems. Chem. Rev. 2022, 122, 7562–7623. 10.1021/acs.chemrev.1c00279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hvidt A.; Nielsen S. O. Hydrogen Exchange in Proteins. Adv. Protein Chem. 1966, 21, 287–386. 10.1016/s0065-3233(08)60129-1. [DOI] [PubMed] [Google Scholar]
- Englander S. W.; Kallenbach N. R. Hydrogen Exchange and Structural Dynamics of Proteins and Nucleic Acids. Q. Rev. Biophys. 1983, 16, 521–655. 10.1017/S0033583500005217. [DOI] [PubMed] [Google Scholar]
- Konermann L.; Pan J.; Liu Y.-H. Hydrogen Exchange Mass Spectrometry for Studying Protein Structure and Dynamics. Chem. Soc. Rev. 2011, 40, 1224–1234. 10.1039/C0CS00113A. [DOI] [PubMed] [Google Scholar]
- Wei H.; Mo J.; Tao L.; Russell R. J.; Tymiak A. A.; Chen G.; Iacob R. E.; Engen J. R. Hydrogen/Deuterium Exchange Mass Spectrometry for Probing Higher Order Structure of Protein Therapeutics: Methodology and Applications. Drug Discovery Today 2014, 19, 95–102. 10.1016/j.drudis.2013.07.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu X. R.; Huang R. Y.-C.; Zhao F.; Chen G.; Tao L. Advances in Mass Spectrometry-Based Epitope Mapping of Protein Therapeutics. J. Pharm. Biomed. Anal. 2022, 215, 114754 10.1016/j.jpba.2022.114754. [DOI] [PubMed] [Google Scholar]
- Pirrone G. F.; Wang H.; Canfield N.; Chin A. S.; Rhodes T. A.; Makarov A. A. Use of MALDI-MS Combined with Differential Hydrogen–Deuterium Exchange for Semiautomated Protein Global Conformational Screening. Anal. Chem. 2017, 89, 8351–8357. 10.1021/acs.analchem.7b01590. [DOI] [PubMed] [Google Scholar]
- Deng B.; Lento C.; Wilson D. J. Hydrogen Deuterium Exchange Mass Spectrometry in Biopharmaceutical Discovery and Development – A Review. Anal. Chim. Acta 2016, 940, 8–20. 10.1016/j.aca.2016.08.006. [DOI] [PubMed] [Google Scholar]
- Tremblay C. Y.; Limpikirati P.; Vachet R. W. Complementary Structural Information for Stressed Antibodies from Hydrogen–Deuterium Exchange and Covalent Labeling Mass Spectrometry. J. Am. Soc. Mass Spectrom. 2021, 32, 1237–1248. 10.1021/jasms.1c00072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sørensen L.; Salbo R. Optimized Workflow for Selecting Peptides for HDX-MS Data Analyses. J. Am. Soc. Mass Spectrom. 2018, 29, 2278–2281. 10.1007/s13361-018-2056-1. [DOI] [PubMed] [Google Scholar]
- Eggertson M. J.; Fadgen K.; Engen J. R.; Wales T. E.. Considerations in the Analysis of Hydrogen Exchange Mass Spectrometry Data. In Methods Mol. Biol. Clifton NJ; 2020; Vol. 2051, pp 407 −435, 10.1007/978-1-4939-9744-2_18. [DOI] [PubMed] [Google Scholar]
- Abzalimov R. R.; Kaltashov I. A. Extraction of Local Hydrogen Exchange Data from HDX CAD MS Measurements by Deconvolution of Isotopic Distributions of Fragment Ions. J. Am. Soc. Mass Spectrom. 2006, 17, 1543–1551. 10.1016/j.jasms.2006.07.017. [DOI] [PubMed] [Google Scholar]
- Weis D. D.; Engen J. R.; Kass I. J. Semi-Automated Data Processing of Hydrogen Exchange Mass Spectra Using HX-Express. J. Am. Soc. Mass Spectrom. 2006, 17, 1700–1703. 10.1016/j.jasms.2006.07.025. [DOI] [PubMed] [Google Scholar]
- Chik J. K.; Vande Graaf J. L.; Schriemer D. C. Quantitating the Statistical Distribution of Deuterium Incorporation to Extend the Utility of H/D Exchange MS Data. Anal. Chem. 2006, 78, 207–214. 10.1021/ac050988l. [DOI] [PubMed] [Google Scholar]
- Lou X.; Kirchner M.; Renard B. Y.; Köthe U.; Boppel S.; Graf C.; Lee C.-T.; Steen J. A. J.; Steen H.; Mayer M. P.; Hamprecht F. A. Deuteration Distribution Estimation with Improved Sequence Coverage for HX/MS Experiments. Bioinformatics 2010, 26, 1535–1541. 10.1093/bioinformatics/btq165. [DOI] [PubMed] [Google Scholar]
- Kan Z.-Y.; Mayne L.; Sevugan Chetty P.; Englander S. W. ExMS: Data Analysis for HX-MS Experiments. J. Am. Soc. Mass Spectrom. 2011, 22, 1906. 10.1007/s13361-011-0236-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guttman M.; Weis D. D.; Engen J. R.; Lee K. K. Analysis of Overlapped and Noisy Hydrogen/Deuterium Exchange Mass Spectra. J. Am. Soc. Mass Spectrom. 2013, 24, 1906–1912. 10.1007/s13361-013-0727-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindner R.; Lou X.; Reinstein J.; Shoeman R. L.; Hamprecht F. A.; Winkler A. Hexicon 2: Automated Processing of Hydrogen-Deuterium Exchange Mass Spectrometry Data with Improved Deuteration Distribution Estimation. J. Am. Soc. Mass Spectrom. 2014, 25, 1018–1028. 10.1007/s13361-014-0850-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salisbury J. P.; Liu Q.; Agar J. N. QUDeX-MS: Hydrogen/Deuterium Exchange Calculation for Mass Spectra with Resolved Isotopic Fine Structure. BMC Bioinf. 2014, 15, 403. 10.1186/s12859-014-0403-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kan Z.; Ye X.; Skinner J. J.; Mayne L.; Englander S. W. ExMS2: An Integrated Solution for Hydrogen–Deuterium Exchange Mass Spectrometry Data Analysis. Anal. Chem. 2019, 91, 7474–7481. 10.1021/acs.analchem.9b01682. [DOI] [PubMed] [Google Scholar]
- Na S.; Lee J.-J.; Joo J. W. J.; Lee K.-J.; Paek E. DeMix: Decoding Deuterated Distributions from Heterogeneous Protein States via HDX-MS. Sci. Rep. 2019, 9, 3176. 10.1038/s41598-019-39512-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yamamoto T.; Yamagaki T.; Satake H. Development of Software for the In-Depth Analysis of Protein Dynamics as Determined by MALDI Mass Spectrometry-Based Hydrogen/Deuterium Exchange. Mass Spectrom. 2019, 8, S0082. 10.5702/massspectrometry.S0082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouyssié D.; Lesne J.; Locard-Paulet M.; Albigot R.; Burlet-Schiltz O.; Marcoux J. HDX-Viewer: Interactive 3D Visualization of Hydrogen–Deuterium Exchange Data. Bioinformatics 2019, 35, 5331–5333. 10.1093/bioinformatics/btz550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hourdel V.; Volant S.; O’Brien D. P.; Chenal A.; Chamot-Rooke J.; Dillies M.-A.; Brier S. MEMHDX: An Interactive Tool to Expedite the Statistical Validation and Visualization of Large HDX-MS Datasets. Bioinformatics 2016, 32, 3413–3419. 10.1093/bioinformatics/btw420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lau A. M. C.; Ahdash Z.; Martens C.; Politis A. Deuteros: Software for Rapid Analysis and Visualization of Data from Differential Hydrogen Deuterium Exchange-Mass Spectrometry. Bioinformatics 2019, 35, 3171–3173. 10.1093/bioinformatics/btz022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lumpkin R. J.; Komives E. A. DECA, A Comprehensive, Automatic Post-Processing Program for HDX-MS Data. Mol. Cell. Proteomics 2019, 18, 2516–2523. 10.1074/mcp.TIR119.001731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Puchała W.; Burdukiewicz M.; Kistowski M.; Dąbrowska K. A.; Badaczewska-Dawid A. E.; Cysewski D.; Dadlez M. HaDeX: An R Package and Web-Server for Analysis of Data from Hydrogen–Deuterium Exchange Mass Spectrometry Experiments. Bioinformatics 2020, 36, 4516–4518. 10.1093/bioinformatics/btaa587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lau A. M.; Claesen J.; Hansen K.; Politis A. Deuteros 2.0: Peptide-Level Significance Testing of Data from Hydrogen Deuterium Exchange Mass Spectrometry. Bioinformatics 2021, 37, 270–272. 10.1093/bioinformatics/btaa677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smit J. H.; Krishnamurthy S.; Srinivasu B. Y.; Parakra R.; Karamanou S.; Economou A. Probing Universal Protein Dynamics Using Hydrogen-Deuterium Exchange Mass Spectrometry-Derived Residue-Level Gibbs Free Energy. Anal. Chem. 2021, 93, 12840–12847. 10.1021/acs.analchem.1c02155. [DOI] [PubMed] [Google Scholar]
- Zhang N.; Yu X.; Zhang X.; D’Arcy S. HD-EXplosion: Visualization of Hydrogen–Deuterium Exchange Data as Chiclet and Volcano Plots with Statistical Filtering. Bioinformatics 2021, 37, 1926–1927. 10.1093/bioinformatics/btaa892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seetaloo N.; Kish M.; Phillips J. J. HDfleX: Software for Flexible High Structural Resolution of Hydrogen/Deuterium-Exchange Mass Spectrometry Data. Anal. Chem. 2022, 94, 4557–4564. 10.1021/acs.analchem.1c05339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Largy E.; Gabelica V. Native Hydrogen/Deuterium Exchange Mass Spectrometry of Structured DNA Oligonucleotides. Anal. Chem. 2020, 92, 4402–4410. 10.1021/acs.analchem.9b05298. [DOI] [PubMed] [Google Scholar]
- Weis D. D.; Wales T. E.; Engen J. R.; Hotchko M.; Eyck L. F. Identification and Characterization of EX1 Kinetics in H/D Exchange Mass Spectrometry by Peak Width Analysis. J. Am. Soc. Mass Spectrom. 2006, 17, 1498–1509. 10.1016/j.jasms.2006.05.014. [DOI] [PubMed] [Google Scholar]
- Pedrioli P. G. A.; Eng J. K.; Hubley R.; Vogelzang M.; Deutsch E. W.; Raught B.; Pratt B.; Nilsson E.; Angeletti R. H.; Apweiler R.; Cheung K.; Costello C. E.; Hermjakob H.; Huang S.; Julian R. K.; Kapp E.; McComb M. E.; Oliver S. G.; Omenn G.; Paton N. W.; Simpson R.; Smith R.; Taylor C. F.; Zhu W.; Aebersold R. A Common Open Representation of Mass Spectrometry Data and Its Application to Proteomics Research. Nat. Biotechnol. 2004, 22, 1459–1466. 10.1038/nbt1031. [DOI] [PubMed] [Google Scholar]
- Keller A.; Eng J.; Zhang N.; Li X.; Aebersold R. A Uniform Proteomics MS/MS Analysis Platform Utilizing Open XML File Formats. Mol. Syst. Biol. 2005, 1, 2005.0017. 10.1038/msb4100024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martens L.; Chambers M.; Sturm M.; Kessner D.; Levander F.; Shofstahl J.; Tang W. H.; Römpp A.; Neumann S.; Pizarro A. D.; Montecchi-Palazzi L.; Tasman N.; Coleman M.; Reisinger F.; Souda P.; Hermjakob H.; Binz P.-A.; Deutsch E. W. MzML—a Community Standard for Mass Spectrometry Data. Mol. Cell. Proteomics 2011, 10, R110.000133 10.1074/mcp.R110.000133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kessner D.; Chambers M.; Burke R.; Agus D.; Mallick P. ProteoWizard: Open Source Software for Rapid Proteomics Tools Development. Bioinformatics 2008, 24, 2534–2536. 10.1093/bioinformatics/btn323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chambers M. C.; Maclean B.; Burke R.; Amodei D.; Ruderman D. L.; Neumann S.; Gatto L.; Fischer B.; Pratt B.; Egertson J.; Hoff K.; Kessner D.; Tasman N.; Shulman N.; Frewen B.; Baker T. A.; Brusniak M.-Y.; Paulse C.; Creasy D.; Flashner L.; Kani K.; Moulding C.; Seymour S. L.; Nuwaysir L. M.; Lefebvre B.; Kuhlmann F.; Roark J.; Rainer P.; Detlev S.; Hemenway T.; Huhmer A.; Langridge J.; Connolly B.; Chadick T.; Holly K.; Eckels J.; Deutsch E. W.; Moritz R. L.; Katz J. E.; Agus D. B.; MacCoss M.; Tabb D. L.; Mallick P. A Cross-Platform Toolkit for Mass Spectrometry and Proteomics. Nat. Biotechnol. 2012, 30, 918–920. 10.1038/nbt.2377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fischer B.; Neumann S.; Gatto L.; Kou Q.; Rainer J.. MzR, 2017. https://bioconductor.org/packages/mzR (accessed 2022-06-09).
- Wickham H.; Averick M.; Bryan J.; Chang W.; McGowan L.; François R.; Grolemund G.; Hayes A.; Henry L.; Hester J.; Kuhn M.; Pedersen T.; Miller E.; Bache S.; Müller K.; Ooms J.; Robinson D.; Seidel D.; Spinu V.; Takahashi K.; Vaughan D.; Wilke C.; Woo K.; Yutani H. Welcome to the Tidyverse. J. Open Source Softw. 2019, 4, 1686. 10.21105/joss.01686. [DOI] [Google Scholar]
- Dowle M.; Srinivasan A.. Data.Table: Extension of ‘data.Frame’, 2021. https://CRAN.R-project.org/package=data.table.
- Fairbanks M.Tidytable: Tidy Interface to “Data.Table”, 2022. https://CRAN.R-project.org/package=tidytable.
- Prohaska T.; Irrgeher J.; Benefield J.; Böhlke J. K.; Chesson L. A.; Coplen T. B.; Ding T.; Dunn P. J. H.; Gröning M.; Holden N. E.; Meijer H. A. J.; Moossen H.; Possolo A.; Takahashi Y.; Vogl J.; Walczyk T.; Wang J.; Wieser M. E.; Yoneda S.; Zhu X.-K.; Meija J. Standard Atomic Weights of the Elements 2021 (IUPAC Technical Report). Pure Appl. Chem. 2022, 94, 573–600. 10.1515/pac-2019-0603. [DOI] [Google Scholar]
- Sadygov R. G. Poisson Model To Generate Isotope Distribution for Biomolecules. J. Proteome Res. 2018, 17, 751–758. 10.1021/acs.jproteome.7b00807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kan Z.-Y.; Walters B. T.; Mayne L.; Englander S. W. Protein Hydrogen Exchange at Residue Resolution by Proteolytic Fragmentation Mass Spectrometry Analysis. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, 16438–16443. 10.1073/pnas.1315532110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Westermark P. O.PeakPick: Peak Picking Methods Inspired by Biological Data, 2015. https://CRAN.R-project.org/package=peakPick.
- Byrd R. H.; Lu P.; Nocedal J.; Zhu C. A Limited Memory Algorithm for Bound Constrained Optimization. SIAM J. Sci. Comput. 1995, 16, 1190–1208. 10.1137/0916069. [DOI] [Google Scholar]
- Chang W.; Cheng J.; Allaire J.; Sievert C.; Schloerke B.; Xie Y.; Allen J.; McPherson J.; Dipert A.; Borges B.. Shiny: Web Application Framework for R, 2021. https://CRAN.R-project.org/package=shiny.
- Chang W.; Borges Ribeiro B.. Shinydashboard: Create Dashboards with “Shiny”, 2021. https://CRAN.R-project.org/package=shinydashboard.
- Granjon D.ShinydashboardPlus: Add More “AdminLTE2” Components to “Shinydashboard”, 2021. https://CRAN.R-project.org/package=shinydashboardPlus.
- Perrier V.; Meyer F.; Granjon D.. ShinyWidgets: Custom Inputs Widgets for Shiny, 2022. https://CRAN.R-project.org/package=shinyWidgets.
- Gabelica V.; Rosu F.; Pauw E. D. A Simple Method to Determine Electrospray Response Factors of Noncovalent Complexes. Anal. Chem. 2009, 81, 6708–6715. 10.1021/ac900785m. [DOI] [PubMed] [Google Scholar]
- Wickham H.Ggplot2: Elegant Graphics for Data Analysis; Springer-Verlag: New York, 2016. [Google Scholar]
- Kassambara A.Ggpubr: “ggplot2” Based Publication Ready Plots, 2020. https://CRAN.R-project.org/package=ggpubr.
- Slowikowski K.Ggrepel: Automatically Position Non-Overlapping Text Labels with “Ggplot2”, 2021. https://CRAN.R-project.org/package=ggrepel.
- Arnold J. B.Ggthemes: Extra Themes, Scales and Geoms for “Ggplot2”, 2021. https://CRAN.R-project.org/package=ggthemes.
- Sievert C.; Schloerke B.; Cheng J.. Thematic: Unified and Automatic “Theming” of “Ggplot2,” “Lattice,” and “base” R Graphics, 2021. https://CRAN.R-project.org/package=thematic.
- Aphalo P. J.Ggpmisc: Miscellaneous Extensions to “Ggplot2”, 2022. https://CRAN.R-project.org/package=ggpmisc.
- Ghosh A.; Largy E.; Gabelica V. DNA G-Quadruplexes for Native Mass Spectrometry in Potassium: A Database of Validated Structures in Electrospray-Compatible Conditions. Nucleic Acids Res. 2021, 49, 2333–2345. 10.1093/nar/gkab039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fu C.-Y.; Prevelige P. E. Dynamic Motions of Free and Bound O29 Scaffolding Protein Identified by Hydrogen Deuterium Exchange Mass Spectrometry. Protein Sci. 2006, 15, 731–743. 10.1110/ps.051921606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Cian A.; Delemos E.; Mergny J.-L.; Teulade-Fichou M.-P.; Monchaud D. Highly Efficient G-Quadruplex Recognition by Bisquinolinium Compounds. J. Am. Chem. Soc. 2007, 129, 1856–1857. 10.1021/ja067352b. [DOI] [PubMed] [Google Scholar]
- Marchand A.; Granzhan A.; Iida K.; Tsushima Y.; Ma Y.; Nagasawa K.; Teulade-Fichou M.-P.; Gabelica V. Ligand-Induced Conformational Changes with Cation Ejection upon Binding to Human Telomeric DNA g-Quadruplexes. J. Am. Chem. Soc. 2015, 137, 750–756. 10.1021/ja5099403. [DOI] [PubMed] [Google Scholar]
- Marchand A.; Gabelica V. Folding and Misfolding Pathways of G-Quadruplex DNA. Nucleic Acids Res. 2016, 44, 10999–11012. 10.1093/nar/gkw970. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



