MaxQuant Software for Ion Mobility Enhanced Shotgun Proteomics

Nikita Prianichnikov; Heiner Koch; Scarlet Koch; Markus Lubeck; Raphael Heilig; Sven Brehmer; Roman Fischer; Jürgen Cox

doi:10.1074/mcp.TIR119.001720

. 2020 Mar 10;19(6):1058–1069. doi: 10.1074/mcp.TIR119.001720

MaxQuant Software for Ion Mobility Enhanced Shotgun Proteomics^*

Nikita Prianichnikov ^‡, Heiner Koch ^§, Scarlet Koch ^§, Markus Lubeck ^§, Raphael Heilig ^¶, Sven Brehmer ^§, Roman Fischer ^¶, Jürgen Cox ^‡,^‖,^**

PMCID: PMC7261821 PMID: 32156793

The ion mobility enhanced MaxQuant software is introduced, offering an end to end computational workflow for LC-IMS-MS/MS shotgun proteomics data. Highly parallelizable 4D feature detection and de-isotoping algorithms are featured and non-linear multi-dimensional mass recalibration is performed. A new matching between runs (MBR) algorithm that utilizes aligned peptide collisional cross sections (CCS). MS1 level label-free quantification is also implemented providing good results. MaxQuant for LC-IMS-MS/MS is part of the basic MaxQuant release and can be downloaded from http://maxquant.org.

Keywords: Bioinformatics, bioinformatics software, mass spectrometry, label-free quantification, quantification

Graphical Abstract

graphic file with name zjw0052061240008.jpg

Highlights

Highly parallelizable 4D feature detection in ion mobility enhanced shotgun proteomics.
Multidimensional non-linear mass, retention time and ion mobility recalibration.
Collision cross section aware matching between runs.
Label-free quantification of ion mobility MS data.

Abstract

Ion mobility can add a dimension to LC-MS based shotgun proteomics which has the potential to boost proteome coverage, quantification accuracy and dynamic range. Required for this is suitable software that extracts the information contained in the four-dimensional (4D) data space spanned by m/z, retention time, ion mobility and signal intensity. Here we describe the ion mobility enhanced MaxQuant software, which utilizes the added data dimension. It offers an end to end computational workflow for the identification and quantification of peptides and proteins in LC-IMS-MS/MS shotgun proteomics data. We apply it to trapped ion mobility spectrometry (TIMS) coupled to a quadrupole time-of-flight (QTOF) analyzer. A highly parallelizable 4D feature detection algorithm extracts peaks which are assembled to isotope patterns. Masses are recalibrated with a non-linear m/z, retention time, ion mobility and signal intensity dependent model, based on peptides from the sample. A new matching between runs (MBR) algorithm that utilizes collisional cross section (CCS) values of MS1 features in the matching process significantly gains specificity from the extra dimension. Prerequisite for using CCS values in MBR is a relative alignment of the ion mobility values between the runs. The missing value problem in protein quantification over many samples is greatly reduced by CCS aware MBR.MS1 level label-free quantification is also implemented which proves to be highly precise and accurate on a benchmark dataset with known ground truth. MaxQuant for LC-IMS-MS/MS is part of the basic MaxQuant release and can be downloaded from http://maxquant.org.

Ion mobility spectrometry (1–3) (IMS)¹ separates molecules in the gas phase by their collisional cross section (CCS) which is the effective area of a molecule quantifying the likelihood of scattering events with the gas. It can be coupled to mass spectrometry (MS) for which it constitutes a separation dimension in addition to mass over charge (m/z). Together with liquid chromatography (LC) and tandem mass spectrometry (MS/MS) one obtains LC-IMS-MS/MS shotgun proteomics, a promising strategy for the analysis of complex samples (4–7). Specifically, the object of our studies is the timsTOF Pro instrument (8) which is a time-of-flight (TOF) mass spectrometer utilizing a trapped ion mobility spectrometry (9–11) (TIMS)device operated with the parallel accumulation-serial fragmentation (PASEF) scan mode (12).

MaxQuant (13, 14) is a popular software platform for LC-MS/MS shotgun proteomics possessing a large ecosystem of algorithms for comprehensive data analysis (15). It incorporates the peptide search engine Andromeda (16) and the companion software Perseus (17, 18) offers a complete solution for the downstream bioinformatics analysis. MaxQuant performs quantification with labels (19) and via the MaxLFQ algorithm (20) on label-free data. MaxQuant achieves high peptide mass accuracies thanks to its advanced nonlinear recalibration algorithms (21, 22). It contains comfortable visualization capabilities (23) for the inspection of the raw data and runs on Windows and Linux operating systems (24).

The aim of this publication is to combine these two factors, i.e. to make MaxQuant capable of analyzing timsTOF Pro data. Although ion mobility information has been used previously in data-independent-acquisition (DIA) workflows (25), here we focus on its integration into data-dependent acquisition (DDA). In general, a main challenge originates from the inflation of raw data by the added dimension. To keep the computation time required for the analysis down to a realistic amount is of highest importance. In this manuscript we describe the updated computational workflow of MaxQuant for ion mobility-enhanced shotgun proteomics data, which has been optimized for computational performance. It is based on 4D feature detection in the space augmented by the extra dimension provided by IMS. Matching between runs is a crucial algorithmic step in MaxQuant for retrieving MS1 features in data-dependent acquisition shotgun proteomics, in order to make quantification of peptides and proteins more reproducible over many samples. It requires high mass accuracy and precise relative retention time measures, obtained by non-linear retention time alignment in order to restrict false positives matches. Here we present a new matching algorithm that considers ion mobility values of peptide features measured on a timsTOF Pro instrument. Similarly, as for the retention times, we observe the necessity for relative alignment of ion mobility values between LC-IMS-MS/MS runs, and include a nonlinear multi-sample alignment algorithm into the MaxQuant workflow. Matching the 4D features using these aligned CCS values did significantly improve specificity. Furthermore, we observe a strong positive impact on the missing value problem in quantitative proteomics. All new algorithmic steps as well as their application to example data are detailed in the Results section.

EXPERIMENTAL PROCEDURES

Cell Culture and Sample Preparation

Whole protein extracts of human cervical cancer cells (HeLa) were purchased from Promega (Madison, WI) and digestion was performed according to the protocol of Wang et al. (26). Briefly, the protein lyophilisate was re-constituted in water and trifluoroethanol (1:1). Disulfide bonds were reduced with dithiothreitol at a concentration of 5 mm and alkylated with chloroacetamide (20 mm) in ammonium bicarbonate buffer, followed by 90 min incubation in the dark. Trypsin (Promega) was added in a protease: protein ratio (wt/wt) of 1:100 and incubation was performed overnight at 37 °C. Digestion was stopped by acidification with formic acid to pH 2 and peptides were desalted on C18 cartridges (3 m Empore) and dried in a vacuum centrifuge. For the mixed species experiments, tryptic protein digests of H. sapiens (HeLa), S. cerevisiae (Promega) and E. coli (Waters, Milford, MA) were mixed in two different experiments leading to a ratio of 1:1 (HeLa), 1:2 (S. cerevisiae) and 1:4 (E. coli) between the two samples.

The human blood plasma samples were collected from acute inflammation patients and were depleted for the 12 most abundant proteins using spin columns (Pierce, Waltham, MA) according to the manufacturer's instructions. Ethics approval was granted nationally (REC Reference Number 05/MRE00/38, 08/H0505/78, and 08/H0706/94) and for individual participating centers, with informed consent obtained from patients or their legal representative. In brief, plasma samples were spun at 15,000 × g for 10 min and 10 μl were transferred into a Top 12 abundant protein depletion spin column. The slurry was fully suspended and incubated on an end-over-end mixer for 1h at room temperature. To collect the unbound proteins spin columns were centrifuged for 2 min at 1000 × g yielding about 500 μl of depleted plasma protein solution. Proteins were precipitated following a trichloroacetic acid (TCA)/sodium deoxycholate (DOC) protocol (27). Briefly, sodium deoxycholate solution (2% w/v) was added to the samples to a final concentration of 125 μg/ml, vortexed and incubated at room temperature for 15 min. After incubation TCA (24% w/v) was added to a final concentration of 6%, vortexed and centrifuged at 12,000 × g at 4°C for 30 min. The supernatant was aspirated without disturbing the protein pellet and discarded. To remove excess TCA protein pellets were washed with ice-cold acetone, spun at 12,000 × g at 4°C for 5 min. The protein pellets were resuspended in 10 μl 6 m urea, 40 μl water and 150 μl SMART digest buffer. The samples were transferred into tubes containing immobilized SMART trypsin and digested for 2 h at 70°C and 1400rpm. After digestion samples were diluted 1:1 with 0.1% TFA and desalted (SOLAμ HRP plate, Pierce) using a vacuum manifold. Eluates were dried in a vacuum centrifuge and resuspended in 0.1% formic acid. Peptide concentration was determined using Pierce Quantitative Colorimetric Peptide assay. Samples were diluted to 5 ng/μl in 0.1% of formic acid in water for subsequent transfer to Evotips.

Liquid Chromatography

A nanoElute (Bruker Daltonics, Bremen, Germany) high pressure nanoflow system was connected to the timsTOF Pro, an ion-mobility spectrometry quadrupole time of flight mass spectrometer (Bruker Daltonics). Peptides were reconstituted in 0.1% FA and 200 ng were delivered to reversed phase analytical columns (25 cm × 75 μm i.d., IonOopticks) with pulled emitter tips. Liquid chromatography was performed at 50 °C and peptides were separated on the analytical column using a 120 min gradient (solvent A: 0.1% FA; solvent B: 0.1% FA, in ACN) at a flow rate of 400 nl/min. A linear gradient from 2–17% B was applied for 60 min, followed by 17–25% B for the next 30 min and followed by a step to 37% B for 10 min and a step to 80% B for 10 min followed by 10 min of washing at 80% B. For the plasma proteome samples, the Evosep One (Evosep, Odense Denmark) was used to achieve low overhead times and a high sample throughput by using the 100 sample/day method. Separation was performed by transferring 100 ng proteolytic digest from the sample loop on short columns (8 cm × 100 μm i.d.) at flow rate of 1.5 μl/min achieving 11.5 min gradient times.

Ion Mobility Mass Spectrometry

For all experiments, the timsTOF Pro was operated in PASEF mode. Ions entering the instrument were orthogonally deflected into an electrodynamic funnel and were trapped in the front region of the trapped ion mobility (TIMS) analyzer. The TIMS tunnel is separated into two parts (“dual TIMS” design), allowing for accumulation of entering ions in the first part and trapped ion mobility elution in the second part. The dual TIMS design allows usage of almost 100% of the ions if equal accumulation and analysis times are used. Although ions are eluted from the second part, accumulation of new ions can already take place in the first part for subsequent transfer into the second part. Trapped ion mobility separation was achieved by repulsion of an increasing longitudinal electric field gradient (ramp) and a drag force of incoming gas from ambient air. Dependent on the collisional cross sections and charge states high mobility ions are accumulated at the front part and low mobility ions at the end of the analyzer. To achieve close to 100% duty cycle, we set the ramp and accumulation time for the electric field gradient to 100 ms. For each topN acquisition cycle and long nanoLC runs, one full frame and 10 PASEF MS/MS frames, each containing on average 12 MS/MS spectra, were acquired resulting in a cycle time of 1.1 s. MS and MS/MS spectra were recorded from 100 to 1700 m/z and precursor ions for PASEF scans were selected in real time by the precursor selection algorithm. A polygon filtering was applied in the m/z and ion mobility area to exclude the low m/z of singly charged ions for PASEF precursor selection. For the 2 h runs a “target value” of 20.000 was applied to repeatedly schedule MS precursors for PASEF MS/MS spectra until this intensity value is reached and an ion mobility range (1/K₀) of 0.6–1.6 Versus/cm² was used. By using one full frame and 4 PASEF MS/MS frames we have optimized data acquisition for short gradients on the Evosep system to achieve a good MS1 sampling rate for quantification at a cycle times of 0.5 s. A target value of 6.000 was applied to repeatedly schedule MS precursors with low intensity in digests of plasma proteomes and the ion mobility range was limited to 0.85–1.3 Versus/cm². For all experiments the quadrupole isolation width was set to 2 Th for m/z < 700 and 3 Th for m/z > 700. Collision energy was changed in 5 steps within a TIMS elution ramp from 52 eV for 0–19% of the ramp time; 47 eV from 19 to 38%; 42 eV from 38 to 57%; 37 eV from 57 to 76%; and 32 eV for 76 to 95%.

Experimental Design and Statistical Rationale

The complete dataset reported in this study comprises 226 raw files consisting of 4 batches. The exact number of batch sizes are n = 10 for Fig. 2, 6A, 6C; n = 2 for Fig. 3, 4, 5; n = 208 for Fig 6B, 6D; n = 6 for Fig. 7. Replicate injections were performed to assess the technical reproducibility of the respective methods and their quantitative accuracy. The exact N numbers are n = 10 in Fig 6A, 6C; n = 3 in Fig. 7. This study does not draw biological conclusions, which is why process and biological replicates or controls were not performed. Bioinformatic analysis and visualization was performed in Python (Jupyter Notebook).

Fig. 6. — **Protein quantification coverage.** A, Number of protein groups quantified in 10 replicates without and with matching between runs. B, Number of proteins groups quantified in N out of 10 replicates without and with matching between runs. C, Number of protein groups quantified in 208 short human plasma runs. D, Gain of quantified protein groups in C, by matching between runs.

Fig. 3. — **Retention time and ion mobility alignment.** A, Difference in retention time between matched feature pairs plotted against the retention time of the feature in one of the runs. The point density is color-coded in plots *A–D. B*, Same as in A, but after retention time alignment has been applied. C, Like A, but now the difference in 1/K₀ within the feature pair is plotted against 1/K₀ in one of the runs, indicating differences between runs in terms of ion mobility. D, Same as in C, but after ion mobility alignment has been applied.

Fig. 4. — **Accuracy of matching between runs.** A, Retention time match difference distribution before alignment. B, Retention time match difference distribution after alignment. C, 1/K₀ match difference distribution before alignment. D, 1/K₀ match difference distribution after alignment.

Fig. 5. — **Specificity of ion mobility enhanced matching between runs.** A, Matches were performed without retention time restriction and then divided into “true” (Δt < 42s) and “false” (Δt > 42s) matches. The percentage of these matches is shown as a function of the window size in 1/K₀ that was applied to the matching. B, The gain in specificity by using ion mobility as a function of the window size in 1/K₀.

Fig. 7. — **LFQ on a benchmark dataset.** A, LFQ intensity plotted against fold change between replicate groups. (Both logarithmic.) Vertical lines correspond to fold changes expected by the mixing of species-derived samples. B, Same as A, but with matching between runs. *C–D*, Histograms of data in *A–B*, projected on the horizontal axes.

MaxQuant Software Framework

MaxQuant version 1.6.6.0 was used to perform all data analysis including the Andromeda search engine, which is downloadable from http://maxquant.org. MaxQuant has a plugin API for raw data access, which has been implemented for the access to Bruker TDF raw data format. MaxQuant is written in C# (.NET Framework 4.7.2 or higher) and runs on Windows and Linux operating systems (24). MaxQuant can be run interactively from a user interface or alternatively be called from the command line. Projects analyzed in MaxQuant can be automatically uploaded to the PRIDE repository as a 'complete' submission (28). A MaxQuant help forum can be visited at https://groups.google.com/group/maxquant-list/. Bug reports should be addressed to https://maxquant.myjetbrains.com/youtrack/.

For searches with HeLa data we used human UniProt sequences (UP000005640, version January 26, 2019) containing 73,920 proteins. For LFQ validation data set we used additionally E. Coli (UP000000625, version December 18, 2018) and S. cerevisiae (UP000002311, version November 9, 2018) sequences containing 4352 and 6049 proteins respectfully. All searches were performed with oxidation of methionine and protein N-terminal acetylation as variable modifications and cysteine carbamidomethylation as fixed modification. Trypsin was selected as protease allowing up to two missed cleavages, and the peptide mass was limited to a maximum of 4600 Da. The initial mass tolerance was 70 ppm for precursor ions and 40 ppm for fragment ions. Subsequently, MaxQuant applies individual precursor mass tolerances to each PSM (13). The retention time and 1/K₀ matching windows used in the analyses were 0.7 min and 0.05 Versus/cm², respectively. PSM and protein FDRs were both applied at 1%. In general, values of parameters in MaxQuant have not been changed from their default values unless explicitly stated.

RESULTS

Computational Workflow for Data-dependent LC-IMS-MS/MS Data

The MaxQuant workflow is an end-to-end solution, taking the mass spectrometric raw data as input and providing output tables on several levels, e.g. a table for results on the level of protein groups, of peptides or of peptide spectrum matches (PSMs). The identification strategy is peptide database search engine-based, utilizing the integrated Andromeda search engine as the main source of identifications. Also, for ion mobility-enhanced data the identification of peptides will be done by conventional Andromeda searches for which the input spectra are projected to the usual form of m/z-intensity profiles. The workflow consists of a series of data processing steps, the most important ones of which are summarized in Table I. Some computational steps needed substantial adaptation in order to support IMS, whereas other steps remained nearly unchanged. For instance, feature detection needed to be massively adapted, because adding a dimension to the raw data space requires new concepts for the detection of features. Also the preparation of MS/MS spectra for database search is different for ion-mobility enhanced data, because it involves a projection from the higher-dimensional data to conventional spectra. The alignment of raw data between samples will be affected because, in addition to the alignment of retention times, it may be necessary to also calibrate ion mobility measurements relative to each other. The process of matching features between runs for the sake of increasing the feature set usable for quantification will also need adaptations because the collision cross section values can be used to make the matching more specific. On the other hand, there are algorithms that need no adaptation from the standard workflow: for instance, grouping proteins into redundant protein groups or determining their false discovery rate (FDR) will remain unchanged. In the following sub-sections we describe how the individual data processing steps have been adapted for LC-IMS-MS/MS data.

Table I. Main processing steps of the computational workflow. The table lists the most important and time consuming steps in the MaxQuant pipeline for LC-IMS-MS/MS data. In the second column, the number of plus signs signifies the extent of changes in the code in order to accommodate ion mobility information. For instance, “+++” for “feature detection” indicates fundamental changes. The right column contains the percentage of computation time that is spent on the step on the HeLa data set consisting of ten replicates, on 60 core machine running Windows Server 2016.

Workflow component	Changes for IMS	Percentage computation time
Feature detection	+++	31.2%
MS/MS preparation	+++	15.6%
Initial search	-	14.1%
Recalibration	++	1.8%
Main search	-	19.1%
PSM FDR	-	1.9%
Alignment	++	12.5%
Match between runs	++	0.4%
Protein groups/FDR	–	2.6%
Label-free quantification	–	0.3%
Writing of results	+	0.4%

Open in a new tab

Feature Detection

The task of feature detection becomes more challenging in the higher-dimensional ion mobility-enhanced data. Although in LC-MS data, peak boundaries were determined by a closed line in the m/z-retention time plane, now a peak is bounded by a two-dimensional closed surface in three-dimensional space. In principle the task is like segmentation of 3-dimensional voxel data into regions of independent signals. However, because the shapes of feature boundaries in MS data typically exhibit certain regularities, we want to exploit these to obtain a well performing algorithm, because otherwise computation time remains a bottleneck for ion-mobility enhanced MS data. Fig. 1 shows an overview of the steps involved in feature detection. First, the data is interpolated onto a common mass grid (Fig. 1A) whereas the number of scans in the ion mobility direction remains the same. The lattice spacing of the new m/z grid depends on m/z, to ensure that the local point density is adapted to the peak width that is approximately expected by the resolution, and is calculated such that

Δ m / z = \frac{m / z}{4 R}

where R plays the role of an effective instrument resolution which we take as 32,000 by default. For interpolating intensities into one of the resulting spectra we consider a window of ion mobility indices around the current index over which the signals are averaged. All raw intensity measurements within the ion mobility window and within a mass window corresponding to three sigma of the peak resolution R are smoothed with a Gaussian kernel, whose width is locally adapted to correspond to the resolution R. This results in a data cube with a regular gridding in all three dimensions and smoothly varying signal intensities over the cube. All the data gridding and smoothing is done on the fly when a spectrum from the data cube is needed and, in order to save space, not written to disk. We proceed by slicing the cube into planes perpendicular to the ion mobility dimension (Fig. 1B). For each fixed ion mobility value we obtain a pseudo LC-MS run with signal intensities depending only on m/z and retention time. In these we apply the conventional MaxQuant algorithm for detecting peak boundaries as closed lines. Neighboring ion mobility slices are usually similar to each other regarding their signal intensity distribution. Therefore, in order to save computing time, ion mobility slices can be omitted and only every n^th slice is used in the data analysis, where n is a user-definable parameter with default value set to three, which was the value used for the analyses in this manuscript. Although feature detection is performed only on every n^th slice, all data is used because of the smoothing step prior to feature detection. The original MaxQuant feature detection algorithm processes then each of these slices resulting in irregularly shaped peak boundaries, which do not have to be rectangular (Fig. 1C). Because the processing of one slice does not depend on the processing of other slices, this step is trivially parallelized onto a specified number of threads less or equal to the number of slices. Once the peak boundaries in the slices are obtained, overlapping areas are clustered across slices to obtain closed surfaces enclosing m/z-retention time-ion mobility volumes (Fig. 1D). These volume elements define the base of each feature and each point inside the volume has an intensity value attached, defining the m/z, ion mobility and retention time-dependent intensity profile of the feature. The total feature intensity is given by the integral of signals over the volume.

Fig. 1. — **Elements of feature detection.** A, Raw data is mapped to a common grid. Although the ion mobility scans already form a regular grid, the m/z values of data centroids are irregular. All intensity values within an ion mobility window and within a mass range are mapped to a common mass grid with a spacing that is monotone increasing with m/z. To obtain the processed intensities on the common grid, raw intensities are averaged using a Gaussian kernel with a locally adapted width according to the effective resolution. B, The raw data cube is sliced along the ion mobility axis to obtain planes with signal intensity as a function of m/z and retention time. In these planes, features can be detected with the algorithms used for feature detection in the conventional MaxQuant workflow for LS-MS data without ion mobility. C, The result of the conventional LC-MS MaxQuant algorithm are feature boundaries in each m/z-retention time plane. These boundaries are closed curves each surrounding the base area of a peak. D, The base areas of peaks found in C, are clustered between consecutive planes to obtain closed surfaces surrounding the three-dimensional base volumes of the 4D features.

The de-isotoping step aims at grouping those peaks, which are different isotopic forms of the same peptide molecule, together into isotope patterns. For this purpose, we generalized the de-isotoping algorithm in MaxQuant, which makes use of the correlation of peak intensities as a function of retention time, to also considers correlations in ion mobility direction. The isotopic peak clustering is a two-step process. The first step consists of a pre-clustering of features. Here, two features are put together into a cluster whenever two criteria are met: (1) their difference in m/z is compatible with an averaging (29) mass difference between two consecutive peaks in an isotope pattern and (2) The cosine correlation between the two intensity patterns over retention time and ion mobility exceeds a threshold. Because these pre-clusters are assembled based on pair-wise relations between peaks, they can be too large and inconsistent regarding charge state, typically containing, in addition to the main isotope pattern of the cluster, other correlating peaks. These are separated in the second step which refines the pre-clusters based on several criteria, as, for instance, consistency of charge state. As an example we report numbers of scans, features, isotope patterns and identifications found in a single typical LC-IMS-MS/MS run in Table II.

Table II. Numbers of features and other elements in a single LC-IMS-MS/MS run. For a typical run with a 2-hour gradient we show the numbers of scans, detected elements and identified entities.

Workflow component	Number of items per run in HeLa sample	Number of items per run in plasma sample
MS1 frames	7828	1729
MS2 frames	52679	3990
IMS scans per frame	918	918
PASEF precursors	233437	44214
4D features	2164978	540917
4D isotope patterns	409379	65501
MS/MS submitted	114477	16928

Open in a new tab

Extraction of MS/MS Spectra

The timsTOF Pro effectively gains MS/MS spectrum acquisition speed by measuring multiple PASEF fragmentation patterns in one frame. It generates up to 110 of these MS/MS frames per second, which, in the example shown in Table II amounts to 233437 raw PASEF precursors distributed over 52679 MS/MS frames in a two-hour LC-IMS-MS/MS run. Multiple scans can be triggered for the same precursor. This is done deliberately, because the acquisition software decides to allocate additional MS/MS scans to a low-abundant precursor in order to accumulate signal for the fragmentation spectrum, to increase the likelihood of its identification. There can be additional multiplicity in terms of multiple MS/MS scans per precursor, which happens accidentally, and only becomes visible post acquisition by the MaxQuant analysis. These resequencing events are usually further apart in retention time and only the 4D peak detection reveals that they belong to the same precursor. All MS/MS spectra that were acquired for the same precursor are accumulated in MaxQuant. In them equal ions are added up, and only a single summed spectrum per 4D precursor is submitted to the database search. Fragmentation spectra are projected from the higher-dimensional raw data by integrating the MS/MS frame intensities over the ion mobility index range for each PASEF scan. This results in a single conventional MS/MS spectrum per MaxQuant precursor that can be submitted to the Andromeda search engine in the conventional way. In the projected and summed spectra de-isotoping is performed and in case an isotope pattern is detected, its members are removed from the spectrum and added as the monoisotopic peak of a singly charged fragment. If this procedure leads to overlapping peaks within a mass tolerance, resulting from multiple charge states for the same fragment, the corresponding peak intensities are added up to a single peak.

In the example run in Table II the initial number of PASEF precursors is 233437 (run 20190122_HeLa_QC_Slot1–47_1_3219.d) which gets accumulated to 114477 MS/MS scans by MaxQuant corresponding to unique 4D precursors. Only these accumulated scans are submitted to the database search. Usually, a large fraction of MS1 precursors remains without an MS/MS scan taken for them (294,902 in the example in Table II), which makes them ideal targets for exploiting them for quantification with the help of the ion mobility-enhanced matching between runs algorithm. (See later subsection on alignment and matching between runs.)

Mass Recalibration

Nonlinear mass recalibration is a crucial prerequisite for obtaining high peptide mass accuracies in MS-based proteomics. For LC-MS/MS data, MaxQuant performs nonlinear recalibration of the m/z range. The recalibration function used for this depends nonlinearly on m/z, retention time and signal intensity. Peptides present in the sample are used as standards, which eliminates the requirement for spike-in of molecules as calibration standards. A preliminary round of peptide identification with simple identification criteria, as, for instance, a fixed Andromeda score cutoff, is performed for generating such a list of peptides that are going to be used as internal standards. By default, we use an Andromeda score cutoff of 70. Once the recalibration function is determined and applied to the whole data, in the subsequent analysis, more stringent mass tolerances can be applied to precursor masses, for instance in the main peptide search and in the matching of MS1 features between runs. These precursor mass windows can even be adapted to each individual peptide by exploiting the variability of multiple mass measurements within the 4D precursor. Also, in the main search more refined statistical methods will be used for the peptide identification process than a simple score cutoff.

Ion mobility adds another variable, potentially confounding the mass error of MS1 features in a nonlinear way. Hence, the m/z recalibration function can now depend on four variables in total. We split the general dependence of the mass error in parts per million (p.p.m.) into four additive components, each of them depending only on a single variable

Δ m / z = f_{1} (m / z) + f_{2} (R T) + f_{3} (I) + f_{4} (I M)

A constant contribution can be arbitrarily split among the four contributions. Hence, three of the four functions can be constrained to have a certain value, e.g. at an argument of choice. We determine first the ion mobility dependent part f₄(IM) by calculating a window median of the mass deviations over ion mobility index windows and subtract it from the original mass deviations. We then apply the conventional MaxQuant algorithm depending on the other three dimensions to the ion mobility independent residual deviation. The m/z and retention time dependent parts are modeled as piecewise linear functions, whereas the intensity dependent part is assumed to be a low degree polynomial in the logarithm of the signal intensity.

In Fig. 2A–2D we show the individual contributions of the four variables to the mass error. For each variable, the residual error is shown after the other three dependences have been considered. For instance, in Fig. 2A Δm/z − f₂(RT) − f₃(I) − f₄(IM) is plotted against m/z to show the m/z dependent contribution to the mass error. It can be observed that the ion mobility-dependent component has the largest amplitude. It explains 52%of the variance captured by the model. Hence, it is crucial to include an ion mobility-dependent component in the mass recalibration. The residuals after complete recalibration (supplemental Fig. S1) show no remaining systematic effects. As can be seen in the mass error histograms before and after recalibration (Fig. 2E–2F) the mass accuracy improves considerably upon recalibration. The median absolute deviation (MAD) is 2.79 p.p.m. before and 0.94 p.p.m after recalibration, corresponding to a 3-fold improvement.

Alignment and Matching Between Runs

To alleviate the stochasticity of shotgun proteomics, MaxQuant employs the matching between runs algorithm, which consists of transferring identifications of MS1 features between samples based on accurate mass and retention time values (30). One promise of LC-IMS-MS is that this matching between runs becomes more specific by exploiting the additional dimension of ion mobility, which can be used to eliminate false positive matches. In order to use the retention time as a criterion for matching features, MaxQuant allows to first perform a nonlinear recalibration of retention times. It is an open question if such an alignment is also necessary prior to using ion mobility values for this kind of matching.

To address this question we matched features between two LC-IMS-MS/MS runs. These were runs 2823 and 2799 from the data set “20181129HeLafraction22,” which were measured on the same mass spectrometer and column, but days apart without recalibrating mass or mobility. We first find feature pairs with one feature from each run by applying mass, retention time and ion mobility windows. Masses are already recalibrated, which implies that small windows can be used. The actual criterion for matching features between the two runs applies the individual mass tolerances determined by MaxQuant after mass recalibration. Furthermore, we use wide windows for retention time and ion mobility index because the alignment has not yet been performed in these two dimensions. For instance, time windows of several (15 by default) minutes could be used. Whenever there is more than one matching feature found in the second run for a feature in the first run, we take only the closest match according to a weighted distance in m/z, retention time and ion mobility index space.

Based on the feature pairs found by this procedure, we create plots of retention time difference within each feature pair against retention time in one of the runs (Fig. 3A) and similarly for ion mobility index difference against ion mobility index (Fig. 3C). In each plot the local point density has been color coded. As can be seen for this typical case, a nonlinear recalibration function needs to be applied in order to make the retention times in the two runs comparable. This is also the case for the ion mobility index: it is necessary to do ion mobility alignment in addition to retention time alignment. In the example shown, a linear recalibration would be enough. Nevertheless, we apply the same nonlinear recalibration algorithm that is used for retention time alignment also for the ion mobility direction, in order to be prepared for the general case. As can be seen in Fig. 3B and 3D, after alignment the retention times and ion mobility indices become comparable between runs. After alignment, the matching requirement is, in addition to the masses matching, that the peak maximum is within a tighter retention time window, typically 42 s, and the aligned ion mobility index match is bounded by a tighter window as well.

The distribution of time differences between matching features as well as of the ion mobility index differences are shown in Fig. 4. In Fig. 4A the matching time differences before alignment are shown whereas Fig. 4B displays the distribution of time differences after alignment. Only after alignment is the distribution centered on zero. The full width half maximum (FWHM) for the retention time differences is 8.5 s after alignment in this two-hour run. Fig. 4C–4D shows similar distributions before and after alignment for the ion mobility index. As can be seen, besides a centering to zero, the FWHM improves dramatically through alignment, decreasing by a factor of 2.59, which brings it down to less than 1% of the total ion mobility range. This again displays the need for relative alignment of ion mobility measurements in the general case before it is being used for matching of features. Aligned and calibrated ion mobilities should then be comparable between different platforms, because they are a property of the molecule and not of the measuring systems, as retention times are.

Although so far we have described how two runs are aligned and its features are matched, MaxQuant can perform the same for multiple samples as well, without singling out one of the samples as a master run. The alignment is done by first building a guide tree based on the similarity between the runs. The alignment starts then between the two most similar runs. After these have been aligned, they are merged to an effective aligned sample. The process continues by always taking the yet not merged most similar samples from the guide tree, align and merge them, and continue until all samples are aligned.

In order to estimate the gain in accuracy that is brought about by using ion mobility as an additional criterion for matching features between runs, we perform a special analysis of matches with relatively loose criteria. Now, matches are performed without using retention time but with applying a varying matching window in 1/K₀. (See Fig. 5.) Then we want to use the fact that matches with large retention time differences have a larger likelihood of being false than matches with small retention time difference. We introduce the (not strictly correct) nomenclature of calling a match with retention time difference larger than 42 s a “false” match and when it is smaller than 42 s a “true” match. Note that the threshold is picked arbitrarily, just to allow us distinguishing two populations of matches one of which is enriched and one depleted of false positives. In Fig. 5A the numbers of true and false matches are shown as a function of the window size in 1/K₀ used for accepting matches. Both curves are plateauing to the right which corresponds to only mass-based matching. When decreasing the 1/K₀ window the false matches are decreasing more rapidly than the true matches. In order to quantify the 1/K0 window dependent gain we show in Fig. 5B the ratio of the two curves for true and false matches, which before are normalized to both plateau at 1. Although a large 1/K0 window provides no gain in terms of specificity, a smaller window provides an up to 3-fold gain in specificity. One should keep in mind that the gain curve is meant to demonstrate the trend of data improvement by false positive removal with the help of ion mobility matching. To obtain the true gain curve, a more realistic target decoy model for MS1 features would have to be applied.

Feature Coverage and Label-free Quantification

The main reason for applying matching between runs is to make the MS1 features needed for quantification appear consistently across samples and by that remove the missing value problem that shotgun proteomics potentially possesses because of the stochasticity of MS/MS selection. We first assess the extent of the problem in technical replicates of cellular proteomes. For this we used data of ten replicates of HeLa cell lysate measured on the timsTOF Pro as described in the Experimental Procedures and analyzed them with MaxQuant, once without and once with matching between runs. Fig. 6A shows for each of the ten technical replicates the number of protein groups identified and quantified without and with match between runs. Although without matching between runs, on average 5358 protein groups were found per sample, match between runs increased this number by 381 on average. The number of protein groups that were quantified in all ten replicates is increased by matching between runs by nearly 1000 to 5503 (Fig. 6B), whereas the corresponding increase on the peptide level is much higher (supplemental Fig. S2).

Reproducibility of quantification between samples is a much more challenging problem in plasma samples with their high dynamic range of concentrations and stronger variability. To this end we analyzed 208 plasma samples of human donors with single shot runs on a timsTOF Pro with a 11.5 min LC gradient. (See experimental procedures.) In Fig. 6C the number of protein groups identified and quantified per sample is plotted for each of the 208 plasma samples in ascending order. Without matching between runs this number reaches from 49 to 176 protein groups with an average of 103. The gain in number of quantified protein groups achieved by matching between runs (Fig. 6D) is 90% on average.

To judge quantitative accuracy of label-free quantification, we recorded a dataset with known ground truth by mixing three cellular proteomes from different species in known ratios, similar to the strategy applied in the context of data-independent acquisition (DIA) data (31). Although human proteins are expected to have a 1:1 ratio, all yeast proteins are expected to go up by a 2:1 ratio whereas all E. coli proteins go down by a 1:4 ratio. The MaxLFQ algorithm (20) can be applied to ion mobility enhanced data without major changes. Our adaptation to timsTOF Pro data uses the signal intensities of the 4D MS1 features as input but is otherwise unaltered from its established LC-MS/MS version. The benchmark data consists of two replicate groups consisting of 3 samples each. The runs in a replicate group have been quantified as a single experiment with the MaxLFQ algorithm, which has been run once with and once without the matching between runs option activated. The resulting logarithmic LFQ intensities in the protein groups table have been normalized in the Perseus software by subtracting the most frequent value. Fold changes between the two replicate groups are plotted against the summed LFQ intensity in Fig. 7A and 7B without and with matching, respectively. The respective fold change data distributions are shown in Fig. 7C and 7D. The fold changes are centered on the positions that were expected for the three different species. The number of quantified proteins increases from 5819 to 6626 by using the matching between runs. There is no discernible intensity-dependent trend in the fold changes, meaning there are no nonlinearities in quantification at the upper or lower end of the dynamic range. Also, the matching does not lead to a systematic change in the population of fold changes or to a discernible increased rate of wrongly quantified proteins, supporting the view that the matching is specific, as independently found with the method introduced in Fig. 5.

Computational Performance

A major challenge in the project was to achieve acceptable computational performance for large datasets with many LC-IMS-MS/MS runs being analyzed together in one project. Parallelization on multiple cores was achieved for nearly the entire computational workflow. Performance improvements will continue to be worked on and integrated into future software versions. The current split of computation time for a typical dataset onto the different workflow components can be seen in Table I. Dominating parts are feature detection, MS/MS preparation, the searches, as well as the alignment. The ion-mobility specific algorithmic parts and the Bruker TDF raw data format readers run and Linux and Windows operating systems as the whole MaxQuant framework does. We specifically tested the software successfully on Windows 2012 Server and Linux Ubuntu, version 18.04.3 LTS. All data sets were analyzed on a 60 core Intel Xeon CPU E7–4870 v2 system (120 logical cores) with 256 GB of RAM (32 × 8GB 1333MHz DDR3_SDRAM/RDIMM) and 7.8 TB RAID0 consisting of eight SAS hard drives. All data shown in Fig. 6 (blood plasma proteome and HeLa replicates) and Fig. 7 (multi-species mixture) was run on Windows for the publication. Total running times were ∼31 h, 20 h, and 14 h, respectively.

DISCUSSION

We implemented a novel computational workflow in MaxQuant that uses peptide CCS values to largely alleviate the missing value problem, typical in DDA shotgun proteomics. The newly developed MaxQuant CCS re-alignment algorithm does not require external or recognizable calibrants added to the sample, and successfully aligns ion mobilities from different runs against each other, prior to performing MBR. It has profound implications on protein quantification which we show to be possible with label free methods to good proteome depths and with enhanced precision. The combination of this IMS-QTOF hardware and the MaxQuant software provides the user with a robust platform for shotgun proteomics with deep and reproducible proteome coverage. Furthermore, the missing value problem of DDA shotgun proteomics because of its inherent stochasticity is appreciably reduced.

Currently, one of our development emphases is to improve computational performance of the total workflow. Although with the current performance, data analysis is feasible and not prohibitively slower compared with conventional LC-MS/MS data, speed gains in future software releases will still have a positive impact on the everyday usage of MaxQuant on standard computer hardware. We are in an iterative development process of finding computational bottlenecks in the workflow and alleviating them. Given the significant improvements described in this paper, we foresee a CCS aware MaxQuant version soon that will run just as fast as the current version for conventional LC-MS/MS data. Going forward, the combination of DIA with IMS in combination with MaxQuant's advanced feature detection and recalibration routines has the potential to increase robustness of proteomics analyses even further. The application of deep learning for the prediction of MS/MS spectra, retention time and ion mobilities, and its future integration into MaxQuant (32) has the potential to further improve protein identification rates and LFQ.

DATA AVAILABILITY

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD014777.

Supplementary Material

Supplementary Figures

155063_1_supp_468101_q4z5cq.pdf^{(402.4KB, pdf)}

Supplementary Fig. 2

155063_1_supp_468175_q4z9zx.pdf^{(46KB, pdf)}

Acknowledgments

We thank Nicolai Bache and Ole Bjeld Horning from Evosep Biosystems for their support to run the plasma proteome samples and Thomas Kosinski from Bruker for sample preparation and LC-MS/MS analyses.

Footnotes

* The authors have declared a conflict of interest. The authors state that they have potential conflicts of interest regarding this work: S.K., H.K., M.L., and S.B. are employees of Bruker.

This article contains supplemental Figures and Tables.

¹ The abbreviations used are:

IMS: ion mobility spectrometry
LC-MS: liquid chromatography-mass spectrometry
TIMS: trapped ion mobility spectrometry
QTOF: quadrupole time-of-flight
MBR: matching between runs
CCS: collisional cross section
1/K₀: inverse reduced ion mobility
MS: mass spectrometry
LC-IMS-MS/MS: liquid chromatography-ion mobility spectrometry-tandem mass spectrometry.

REFERENCES

1. Kanu A. B., Dwivedi P., Tam M., Matz L., and Hill H. H. (2008) Ion mobility-mass spectrometry. J. Mass Spectrom. 43, 1–22 [DOI] [PubMed] [Google Scholar]
2. Cumeras R., Figueras E., Davis C. E., Baumbach J. I., and Gràcia I. (2015) Review on Ion Mobility Spectrometry. Part 2: hyphenated methods and effects of experimental parameters. Analyst 140, 1391–1410 [DOI] [PMC free article] [PubMed] [Google Scholar]
3. May J. C., and McLean J. A. (2015) Ion mobility-mass spectrometry: Time-dispersive instrumentation. Anal. Chem. 87, 1422–1436 [DOI] [PMC free article] [PubMed] [Google Scholar]
4. Valentine S. J., Plasencia M. D., Liu X., Krishnan M., Naylor S., Udseth H. R., Smith R. D., and Clemmer D. E. (2006) Toward plasma proteome profiling with ion mobility-mass spectrometry. J. Proteome Res. 5, 2977–84 [DOI] [PubMed] [Google Scholar]
5. Baker E. S., Livesay E. A., Orton D. J., Moore R. J., Danielson W. F., Prior D. C., Ibrahim Y. M., LaMarche B. L., Mayampurath A. M., Schepmoes A. A., Hopkins D. F., Tang K., Smith R. D., and Belov M. E. (2010) An LC-IMS-MS platform providing increased dynamic range for high-throughput proteomic studies. J. Proteome Res. 9, 997–1006 [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Geromanos S. J., Hughes C., Ciavarini S., Vissers J. P. C., and Langridge J. I. (2012) Using ion purity scores for enhancing quantitative accuracy and precision in complex proteomics samples. Anal. Bioanal. Chem. 404, 1127–1139 [DOI] [PubMed] [Google Scholar]
7. Helm D., Vissers J. P. C., Hughes C. J., Hahne H., Ruprecht B., Pachl F., Grzyb A., Richardson K., Wildgoose J., Maier S. K., Marx H., Wilhelm M., Becher I., Lemeer S., Bantscheff M., Langridge J. I., and Kuster B. (2014) Ion mobility tandem mass spectrometry enhances performance of bottom-up proteomics. Mol. Cell Proteomics 13, 3709–3715 [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Meier F., Brunner A.-D., Koch S., Koch H., Lubeck M., Krause M., Goedecke N., Decker J., Kosinski T., Park M. A., Bache N., Hoerning O., Cox J., Räther O., and Mann M. (2018) Online parallel accumulation – serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol Cell Proteomics. 17, 2534–2545 [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Fernandez-Lima F. A., Kaplan D. A., and Park M. A. (2011) Note: Integration of trapped ion mobility spectrometry with mass spectrometry. Rev. Sci. Instrum. 82, 126106. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Fernandez-Lima F., Kaplan D. A., Suetering J., and Park M. A. (2011) Gas-phase separation using a trapped ion mobility spectrometer. Int. J. Ion. Mobil. Spectrom. 14 [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Silveira J. A., Ridgeway M. E., and Park M. A. (2014) High resolution trapped ion mobility spectrometery of peptides. Anal. Chem. 86, 5624–5627 [DOI] [PubMed] [Google Scholar]
12. Meier F., Beck S., Grassl N., Lubeck M., Park M. A., Raether O., and Mann M. (2015) Parallel accumulation-serial fragmentation (PASEF): Multiplying sequencing speed and sensitivity by synchronized scans in a trapped ion mobility device. J. Proteome Res. 14, 5378–5387 [DOI] [PubMed] [Google Scholar]
13. Cox J., and Mann M. (2008) MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 [DOI] [PubMed] [Google Scholar]
14. Tyanova S., Temu T., and Cox J. (2016) The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 11, 2301–2319 [DOI] [PubMed] [Google Scholar]
15. Sinitcyn P., Rudolph J. D., and Cox J. (2018) Computational Methods for Understanding Mass Spectrometry–Based Shotgun Proteomics Data. Annu. Rev. Biomed. Data Sci. 1, 207–234 [Google Scholar]
16. Cox J., Neuhauser N., Michalski A., Scheltema R. A., Olsen J. V., and Mann M. (2011) Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res. 10, 1794–1805 [DOI] [PubMed] [Google Scholar]
17. Tyanova S., Temu T., Sinitcyn P., Carlson A., Hein M. Y., Geiger T., Mann M., and Cox J. (2016) The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 13, 731–740 [DOI] [PubMed] [Google Scholar]
18. Tyanova S., and Cox J. (2018) Perseus: a bioinformatics platform for integrative analysis of proteomics data in cancer research in Methods in Molecular Biology 1711, 133–148 [DOI] [PubMed] [Google Scholar]
19. Tyanova S., Mann M., and Cox J. (2014) MaxQuant for in-depth analysis of large SILAC datasets. Methods Mol. Biol. 1188, 351–364 [DOI] [PubMed] [Google Scholar]
20. Cox J., Hein M. Y., Luber C. A., Paron I., Nagaraj N., and Mann M. (2014) Accurate Proteome-wide Label-free Quantification by Delayed Normalization and Maximal Peptide Ratio Extraction, Termed MaxLFQ. Mol. Cell Proteomics 13, 2513–2526 [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Cox J., Michalski A., and Mann M. (2011) Software lock mass by two-dimensional minimization of peptide mass errors. J. Am. Soc. Mass. Spectrom. 22, 1373–1380 [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Cox J., and Mann M. (2009) Computational principles of determining and improving mass precision and accuracy for proteome measurements in an Orbitrap. J. Am. Soc. Mass. Spectrom. 20, 1477–1485 [DOI] [PubMed] [Google Scholar]
23. Tyanova S., Temu T., Carlson A., Sinitcyn P., Mann M., and Cox J. (2015) Visualization of LC-MS/MS proteomics data in MaxQuant. Proteomics 15, 1453–1456 [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Sinitcyn P., Tiwary S., Rudolph J. D., Gutenbrunner P., Wichmann C., Yılmaz Ş Hamzeiy H, Salinas F., and Cox J. (2018) MaxQuant goes Linux. Nat. Methods 15, 401. [DOI] [PubMed] [Google Scholar]
25. Distler U., Kuharev J., Navarro P., Levin Y., Schild H., and Tenzer S. (2014) Drift time-specific collision energies enable deep-coverage data-independent acquisition proteomics. Nat. Methods 11, 167–170 [DOI] [PubMed] [Google Scholar]
26. Wang H., Qian W. J., Mottaz H. M., Clauss T. R. W., Anderson D. J., Moore R. J., Camp D. G., Khan A. H., Sforza D. M., Pallavicini M., Smith D. J., and Smith R. D. (2005) Development and evaluation of a micro- and nanoscale proteomic sample preparation method. J. Proteome Res. 4, 2397–2403 [DOI] [PMC free article] [PubMed] [Google Scholar]
27. Bensadoun A., and Weinstein D. (1976) Assay of proteins in the presence of interfering materials. Anal. Biochem. 70, 241–250 [DOI] [PubMed] [Google Scholar]
28. Perez-Riverol Y., Csordas A., Bai J., Bernal-Llinares M., Hewapathirana S., Kundu D. J., Inuganti A., Griss J., Mayer G., Eisenacher M., Pérez E., Uszkoreit J., Pfeuffer J., Sachsenberg T., Yilmaz Ş Tiwary S, Cox J., Audain E., Walzer M., Jarnuczak A. F., Ternent T., Brazma A., and Vizcaíno J. A. (2019) The PRIDE database and related tools and resources in 2019: Improving support for quantification data. Nucleic Acids Res. 47, D442–D450 [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Senko M. W., Beu S. C., and McLaffertycor F. W. (1995) Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions. J. Am. Soc. Mass Spectrom. 6, 229–233 [DOI] [PubMed] [Google Scholar]
30. Pasa-Tolic L., Masselon C., Barry R. C., Shen Y., and Smith R. D. (2004) Proteomic analyses using an accurate mass and time tag strategy. BioTechniques 37, 621–636 [DOI] [PubMed] [Google Scholar]
31. Navarro P., Kuharev J., Gillet L. C., Bernhardt O. M., MacLean B., Röst H. L., Tate S. A., Tsou C. C., Reiter L., Distler U., Rosenberger G., Perez-Riverol Y., Nesvizhskii A. I., Aebersold R., and Tenzer S. (2016) A multicenter study benchmarks software tools for label-free proteome quantification. Nat. Biotechnol. 34, 1130–1136 [DOI] [PMC free article] [PubMed] [Google Scholar]
32. Tiwary S., Levy R., Gutenbrunner P., Salinas Soto F., Palaniappan K. K., Deming L., Berndl M., Brant A., Cimermancic P., and Cox J. (2019) High quality MS/MS spectrum prediction for data-dependent and -independent acquisition data analysis. Nat. Methods 16, 519–525 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figures

155063_1_supp_468101_q4z5cq.pdf^{(402.4KB, pdf)}

Supplementary Fig. 2

155063_1_supp_468175_q4z9zx.pdf^{(46KB, pdf)}

Data Availability Statement

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD014777.

[B1] 1. Kanu A. B., Dwivedi P., Tam M., Matz L., and Hill H. H. (2008) Ion mobility-mass spectrometry. J. Mass Spectrom. 43, 1–22 [DOI] [PubMed] [Google Scholar]

[B2] 2. Cumeras R., Figueras E., Davis C. E., Baumbach J. I., and Gràcia I. (2015) Review on Ion Mobility Spectrometry. Part 2: hyphenated methods and effects of experimental parameters. Analyst 140, 1391–1410 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] 3. May J. C., and McLean J. A. (2015) Ion mobility-mass spectrometry: Time-dispersive instrumentation. Anal. Chem. 87, 1422–1436 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] 4. Valentine S. J., Plasencia M. D., Liu X., Krishnan M., Naylor S., Udseth H. R., Smith R. D., and Clemmer D. E. (2006) Toward plasma proteome profiling with ion mobility-mass spectrometry. J. Proteome Res. 5, 2977–84 [DOI] [PubMed] [Google Scholar]

[B5] 5. Baker E. S., Livesay E. A., Orton D. J., Moore R. J., Danielson W. F., Prior D. C., Ibrahim Y. M., LaMarche B. L., Mayampurath A. M., Schepmoes A. A., Hopkins D. F., Tang K., Smith R. D., and Belov M. E. (2010) An LC-IMS-MS platform providing increased dynamic range for high-throughput proteomic studies. J. Proteome Res. 9, 997–1006 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6] 6. Geromanos S. J., Hughes C., Ciavarini S., Vissers J. P. C., and Langridge J. I. (2012) Using ion purity scores for enhancing quantitative accuracy and precision in complex proteomics samples. Anal. Bioanal. Chem. 404, 1127–1139 [DOI] [PubMed] [Google Scholar]

[B7] 7. Helm D., Vissers J. P. C., Hughes C. J., Hahne H., Ruprecht B., Pachl F., Grzyb A., Richardson K., Wildgoose J., Maier S. K., Marx H., Wilhelm M., Becher I., Lemeer S., Bantscheff M., Langridge J. I., and Kuster B. (2014) Ion mobility tandem mass spectrometry enhances performance of bottom-up proteomics. Mol. Cell Proteomics 13, 3709–3715 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8] 8. Meier F., Brunner A.-D., Koch S., Koch H., Lubeck M., Krause M., Goedecke N., Decker J., Kosinski T., Park M. A., Bache N., Hoerning O., Cox J., Räther O., and Mann M. (2018) Online parallel accumulation – serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol Cell Proteomics. 17, 2534–2545 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9. Fernandez-Lima F. A., Kaplan D. A., and Park M. A. (2011) Note: Integration of trapped ion mobility spectrometry with mass spectrometry. Rev. Sci. Instrum. 82, 126106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10. Fernandez-Lima F., Kaplan D. A., Suetering J., and Park M. A. (2011) Gas-phase separation using a trapped ion mobility spectrometer. Int. J. Ion. Mobil. Spectrom. 14 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11. Silveira J. A., Ridgeway M. E., and Park M. A. (2014) High resolution trapped ion mobility spectrometery of peptides. Anal. Chem. 86, 5624–5627 [DOI] [PubMed] [Google Scholar]

[B12] 12. Meier F., Beck S., Grassl N., Lubeck M., Park M. A., Raether O., and Mann M. (2015) Parallel accumulation-serial fragmentation (PASEF): Multiplying sequencing speed and sensitivity by synchronized scans in a trapped ion mobility device. J. Proteome Res. 14, 5378–5387 [DOI] [PubMed] [Google Scholar]

[B13] 13. Cox J., and Mann M. (2008) MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 [DOI] [PubMed] [Google Scholar]

[B14] 14. Tyanova S., Temu T., and Cox J. (2016) The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 11, 2301–2319 [DOI] [PubMed] [Google Scholar]

[B15] 15. Sinitcyn P., Rudolph J. D., and Cox J. (2018) Computational Methods for Understanding Mass Spectrometry–Based Shotgun Proteomics Data. Annu. Rev. Biomed. Data Sci. 1, 207–234 [Google Scholar]

[B16] 16. Cox J., Neuhauser N., Michalski A., Scheltema R. A., Olsen J. V., and Mann M. (2011) Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res. 10, 1794–1805 [DOI] [PubMed] [Google Scholar]

[B17] 17. Tyanova S., Temu T., Sinitcyn P., Carlson A., Hein M. Y., Geiger T., Mann M., and Cox J. (2016) The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 13, 731–740 [DOI] [PubMed] [Google Scholar]

[B18] 18. Tyanova S., and Cox J. (2018) Perseus: a bioinformatics platform for integrative analysis of proteomics data in cancer research in Methods in Molecular Biology 1711, 133–148 [DOI] [PubMed] [Google Scholar]

[B19] 19. Tyanova S., Mann M., and Cox J. (2014) MaxQuant for in-depth analysis of large SILAC datasets. Methods Mol. Biol. 1188, 351–364 [DOI] [PubMed] [Google Scholar]

[B20] 20. Cox J., Hein M. Y., Luber C. A., Paron I., Nagaraj N., and Mann M. (2014) Accurate Proteome-wide Label-free Quantification by Delayed Normalization and Maximal Peptide Ratio Extraction, Termed MaxLFQ. Mol. Cell Proteomics 13, 2513–2526 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21] 21. Cox J., Michalski A., and Mann M. (2011) Software lock mass by two-dimensional minimization of peptide mass errors. J. Am. Soc. Mass. Spectrom. 22, 1373–1380 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22] 22. Cox J., and Mann M. (2009) Computational principles of determining and improving mass precision and accuracy for proteome measurements in an Orbitrap. J. Am. Soc. Mass. Spectrom. 20, 1477–1485 [DOI] [PubMed] [Google Scholar]

[B23] 23. Tyanova S., Temu T., Carlson A., Sinitcyn P., Mann M., and Cox J. (2015) Visualization of LC-MS/MS proteomics data in MaxQuant. Proteomics 15, 1453–1456 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B24] 24. Sinitcyn P., Tiwary S., Rudolph J. D., Gutenbrunner P., Wichmann C., Yılmaz Ş Hamzeiy H, Salinas F., and Cox J. (2018) MaxQuant goes Linux. Nat. Methods 15, 401. [DOI] [PubMed] [Google Scholar]

[B25] 25. Distler U., Kuharev J., Navarro P., Levin Y., Schild H., and Tenzer S. (2014) Drift time-specific collision energies enable deep-coverage data-independent acquisition proteomics. Nat. Methods 11, 167–170 [DOI] [PubMed] [Google Scholar]

[B26] 26. Wang H., Qian W. J., Mottaz H. M., Clauss T. R. W., Anderson D. J., Moore R. J., Camp D. G., Khan A. H., Sforza D. M., Pallavicini M., Smith D. J., and Smith R. D. (2005) Development and evaluation of a micro- and nanoscale proteomic sample preparation method. J. Proteome Res. 4, 2397–2403 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B27] 27. Bensadoun A., and Weinstein D. (1976) Assay of proteins in the presence of interfering materials. Anal. Biochem. 70, 241–250 [DOI] [PubMed] [Google Scholar]

[B28] 28. Perez-Riverol Y., Csordas A., Bai J., Bernal-Llinares M., Hewapathirana S., Kundu D. J., Inuganti A., Griss J., Mayer G., Eisenacher M., Pérez E., Uszkoreit J., Pfeuffer J., Sachsenberg T., Yilmaz Ş Tiwary S, Cox J., Audain E., Walzer M., Jarnuczak A. F., Ternent T., Brazma A., and Vizcaíno J. A. (2019) The PRIDE database and related tools and resources in 2019: Improving support for quantification data. Nucleic Acids Res. 47, D442–D450 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B29] 29. Senko M. W., Beu S. C., and McLaffertycor F. W. (1995) Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions. J. Am. Soc. Mass Spectrom. 6, 229–233 [DOI] [PubMed] [Google Scholar]

[B30] 30. Pasa-Tolic L., Masselon C., Barry R. C., Shen Y., and Smith R. D. (2004) Proteomic analyses using an accurate mass and time tag strategy. BioTechniques 37, 621–636 [DOI] [PubMed] [Google Scholar]

[B31] 31. Navarro P., Kuharev J., Gillet L. C., Bernhardt O. M., MacLean B., Röst H. L., Tate S. A., Tsou C. C., Reiter L., Distler U., Rosenberger G., Perez-Riverol Y., Nesvizhskii A. I., Aebersold R., and Tenzer S. (2016) A multicenter study benchmarks software tools for label-free proteome quantification. Nat. Biotechnol. 34, 1130–1136 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B32] 32. Tiwary S., Levy R., Gutenbrunner P., Salinas Soto F., Palaniappan K. K., Deming L., Berndl M., Brant A., Cimermancic P., and Cox J. (2019) High quality MS/MS spectrum prediction for data-dependent and -independent acquisition data analysis. Nat. Methods 16, 519–525 [DOI] [PubMed] [Google Scholar]

PERMALINK

MaxQuant Software for Ion Mobility Enhanced Shotgun Proteomics*

Nikita Prianichnikov

Heiner Koch

Scarlet Koch

Markus Lubeck

Raphael Heilig

Sven Brehmer

Roman Fischer

Jürgen Cox