Abstract
Proteomics has become an increasingly important tool in medical and medicinal applications. It is necessary to improve the analytical throughput for these applications, particularly in large-scale drug screening to enable measurement of a large number of samples. In this study, we aimed to establish an ultrafast proteomic method based on 5-min gradient LC and quadrupole-Orbitrap mass spectrometer (Q-Orbitrap MS). We precisely optimized data-independent acquisition (DIA) parameters for 5-min gradient LC and reached a depth of >5000 and 4200 proteins from 1000 and 31.25 ng of HEK293T cell digest in a single-shot run, respectively. The throughput of our method enabled the measurement of approximately 80 samples/day, including sample loading, column equilibration, and wash running time. We demonstrated that our method is applicable for the screening of chemical responsivity via a cell stimulation assay. These data show that our method enables the capture of biological alterations in proteomic profiles with high sensitivity, suggesting the possibility of large-scale screening of chemical responsivity.
Keywords: Q-Orbitrap MS, data-independent acquisition, HT proteomics, Short LC gradients
Introduction
In recent years, mass spectrometry (MS)-based proteomics have been applied in both clinical and medicinal fields, as represented by biomarker discovery and drug development.1−4 An improvement in the analytical throughput is a requisite for these applications because of the requirement of analyzing a large number of samples. Usually, nano liquid chromatography (LC) is used in MS-based proteomics because of its high sensitivity and low flow rate that improves the ionization efficiency of peptides by electrospraying. Recent in-depth proteomics, comprising nano LC and data-independent acquisition (DIA/SWATH), have reached a depth of more than 7,000 protein identifications from mammalian tissues and cells in a single-shot run.5−8 However, in-depth proteomics require a longer acquisition time, making it challenging to improve the analytical throughput. Therefore, new chromatographic and mass spectrometric approaches are required to overcome this challenge. The sensitivity of MS has improved over several years, making micro-flow LC possible for high-throughput (HT) proteomic analysis. Recently, high-flow-rate LC-based methods have been developed.9−14 Bache et al. reported the identification of 5,446 proteins from HeLa cells using a 20 min-per-sample method based on a novel LC system combined with online solid phase extraction and micro-/nano-flow LC.9 Further improvements in the analytical throughput have been attempted by combining the applied-DIA/SWATH with quadrupole time-of-flight MS (Q-TOF MS)12,13 along with ion mobility spectrometry.12 Meier et al. reported approximately 3000 proteins from HeLa cells using 5.6-min gradient LC and ion mobility/applied-SWATH.12 Messner et al. obtained 5004 proteins from K562 cells using 5-min micro-flow LC and applied SWATH.13 Consequently, “ultrafast proteomics” have become feasible for large-sample studies.
Q-Orbitrap and Q-TOF MS have been widely used in the proteome research because of their high resolution and sensitivity, especially in DIA.15,16 DIA utilizes isolation windows to co-isolate and elute fragment peptides regardless of their signal intensity, thereby providing a systematic collection of peptide fragments. Consequently, DIA allows the identification of peptides with high sensitivity and improved reproducibility. In DIA, a narrow isolation window width with higher mass resolution (required longer transient time) decreases the spectral complexity.5 Thus, Q-Orbitrap MS, which enables the regulation of mass resolution via the injection of precursor/product ions in the C-trap into the Orbitrap mass analyzer, has a potential advantage in improving sensitivity; ions in Q-Orbitrap MS are initially accumulated and/or fragmented in the ion routing pole and then transferred to the C-trap.14 Q-Orbitrap is considered unsuitable for HT proteome analysis because of its relatively slow scan speed arising from ion accumulation; therefore, Q-TOF MS has been mainly applied for the development of HT proteomic methods.11−13 Meanwhile, it has not been fully investigated whether Q-Orbitrap MS is unsuitable for HT proteomics. The mass range, mass resolution, and isolation window width in Q-Orbitrap MS for HT proteomic analysis remain poorly understood.
Herein, we evaluated DIA parameters combined with 5-min gradient LC and Q-Orbitrap MS. Based on the findings, we established a simple method with high throughput and deep proteome coverage without any specialized equipment. Moreover, for future high-throughput proteomic applications, we performed cell stimulation assays using chemical substances and continuous proteome analysis of stimulated cell samples.
Experimental Section
Cell Culture
HEK293T cells (Lot No. 19I006; ECACC, Wiltshire, U.K.) were cultured in a 15 cm dish at 80% confluence in Dulbecco’s modified Eagle’s medium (DMEM; Thermo Fisher Scientific, Waltham, MA) containing 10% fetal bovine serum (FBS; Thermo Fisher Scientific) and 1% penicillin/streptomycin (Fujifilm Wako, Osaka, Japan) at 37 °C in a 5% CO2 incubator. The cells were detached using TrypLE Express (Thermo Fisher Scientific) at 37 °C for 5 min and then collected in phosphate-buffered saline (PBS; Nacalai Tesque, Kyoto, Japan). After collection, 1 × 106 cells were precipitated at 1000 rpm and 4 °C for 2 min. The precipitate was frozen immediately at −80 °C and kept frozen until protein extraction.
Sample Preparation for LC-MS/MS
Cell samples were prepared as described in a previous study.17 In brief, precipitates were dissolved in 100 mM Tris-HCl (pH 8.5) containing 2% sodium dodecyl sulfate (SDS) using BIORUPTOR BR-II (SONIC BIO Co., Kanagawa, Japan) with settings at “High” and “30 s On/Off” cycle for a duration of 5 min. The extracted proteins were quantified using a Pierce BCA Protein Assay Kit (Thermo Fisher Scientific) at 1000 ng/μL. The extracts were reduced with 10 mM dithiothreitol for 30 min at 50 °C, followed by alkylation with 30 mM iodoacetamide for 30 min at 25 °C in the dark. Protein purification and digestion were performed using the sample preparation (SP3) method.17,18 The tryptic digestion was performed using 500 ng/μL Trypsin/Lys-C Mix (Promega, Madison, WI) overnight at 37 °C. Cell digests were purified using GL-Tip SDB (GL Sciences, Tokyo, Japan) according to the manufacturer’s protocol. The peptides were dissolved again in 3% acetonitrile (ACN) containing 0.1% trifluoroacetic acid (TFA) and then quantified using a Lunatic UV/Vis absorbance spectrometer (Unchained Labs, Pleasanton, CA).19
5-Min Gradient LC-MS/MS
Two mobile phases, A and B, containing H2O and 80% ACN/H2O (4/1, v/v) were prepared in 0.1% formic acid (FA). Then, 1 or 2 μL of the digested peptides were loaded directly using a 150 μm-inner-diameter × 5 cm capillary column (Aurora C18, particle size 1.6 μm, 120 Å; IonOpticks, VIC, Australia) at 8.0 μL/min and 40 °C for 4.0 min; to this end, an UltiMate 3500 RSLC nano system (Thermo Fisher Scientific) equipped with a 10 μL injection loop was employed. The linear gradient was executed at 1.5 μL/min and 40 °C, and the gradient conditions were as follows: 0–4.0 min, B 2% held; 4.0–4.1 min, B 2–5%; 4.1–8.0 min, B 5–36%; 8.0–8.8 min, B 36–95%; and 8.8–9.0 min, B 95% held. MS acquisitions were initiated 4.0 min after sample injection. The eluted peptides were detected using a quadrupole Orbitrap Exploris 480 hybrid mass spectrometer (Thermo Fisher Scientific) equipped with an ING Nano ion source (AMR Inc., Tokyo, Japan). The total acquisition time of one sample was 9.5 min, corresponding to the sum of the sample loading time, gradient time, and overhead time, which were 4.0, 5.0, and 0.5 min, respectively. The wash running time between the samples was 8.0 min. Overall, the injection time was 17.5 min (approximately 80 samples/day).
Screening of Suitable DIA Parameters for 5-Min Gradient LC
To evaluate DIA with 5-min gradient LC, we designed 46 DIA methods comprising five groups with different combinations of the precursor mass range (narrow range, 120 Da; wide range, 240 Da), mass resolutions (7500, 15,000, 30,000), and isolation window width (4, 8, 16 Da). The five groups were marked as Res7.5k/W4, Res15k/W8, Res30k/W16, Res15k/W4, and Res30k/W8, respectively, in the present study (Table 1). In all methods, the MS1 scan range was set as full scan, m/z 400–1100, and mass resolution as 7500. The auto gain control (AGC) target was set as 1 × 106 and the maximum injection time as 10 ms for MS1. The AGC target for MS2 was set as 3 × 106, at 28% normalized collision energy. In DIA, the maximum injection time values at mass resolutions of 7500, 15,000, and 30,000 were set at 7, 22, and 54 ms, respectively. The overlapping windows of the Res7.5k/W4, Res15k/W8, Res30k/W16, Res15k/W4, and Res30k/W8 groups were set as 2, 4, 8, 2, and 4 Da, respectively.
Table 1. Information of 46 DIA Methods for 5-min Gradient LC.
wide
mass range method |
narrow
mass range method |
|||
---|---|---|---|---|
resolution/isolation window/mass range | resolution/isolation window/mass range | |||
7500/4 Da | 15000/8 Da | 30000/16 Da | 15000/4 Da | 30000/8 Da |
Res7.5kW4/400-640 | Res15k/W8/400-640 | Res30k/W16/400-640 | Res15k/W4/400-520 | Res30k/W8/400-520 |
Res7.5k/W4/450-690 | Res15k/W8/450-690 | Res30k/W16/450-690 | Res15k/W4/450-570 | Res30k/W8/450-570 |
Res7.5k/W4/500-740 | Res15k/W8/500-740 | Res30k/W16/500-740 | Res15k/W4/500-620 | Res30k/W8/500-620 |
Res7.5k/W4/550-790 | Res15k/W8/550-790 | Res30k/W16/550-790 | Res15k/W4/550-670 | Res30k/W8/550-670 |
Res7.5k/W4/600-840 | Res15k/W8/600-840 | Res30k/W16/600-840 | Res15k/W4/600-720 | Res30k/W8/600-720 |
Res7.5k/W4/650-890 | Res15k/W8/650-890 | Res30k/W16/650-890 | Res15k/W4/650-770 | Res30k/W8/650-770 |
Res7.5k/W4/700-940 | Res15k/W8/700-940 | Res30k/W16/700-940 | Res15k/W4/700-820 | Res30k/W8/700-820 |
Res7.5k/W4/750-990 | Res15k/W8/750-990 | Res30k/W16/750-990 | Res15k/W4/750-870 | Res30k/W8/750-870 |
Res15k/W4/800-920 | Res30k/W8/800-920 | |||
Res15k/W4/850-970 | Res30k/W8/850-970 | |||
Res15k/W4/900-1020 | Res30k/W8/900-1020 |
Cell Stimulation Assay
HEK293T cells were cultured in six-well plates (Thermo Fisher Scientific) at 70% confluence in DMEM, and then this medium was replaced with another medium containing chemical substances. All chemical substances were purchased from Avanti Polar Lipids (AL). The cells were stimulated with different concentrations (0.1, 1.0, and 10 μM) of 1-myristoyl-2-hydroxy-sn-glycero-3-phosphate (LPA(14:0)), d-erythro-sphingosine-1-phosphate (S1P), d-erythro-sphingosine 18:1 (Sph), 1,2-dioleoyl-sn-glycero-3-phospho-(1′-myo-inositol-3′-phosphate) (PI3P), 1,2-dioleoyl-sn-glycero-3-phospho-(1′-myo-inositol-4′-phosphate) (PI4P), 1,2-dioleoyl-sn-glycero-3-phospho-(1′-myo-inositol-5′-phosphate) (PI5P), d-myo-inositol-1,4,5-triphosphate (IP3), and d-myo-inositol-1,3,4,5-tetraphosphate (IP4). After 24 h of stimulation, individual cells were collected by lysis in 400 μL of 2% SDS in Tris-HCl (pH8.5) followed by a PBS wash. Proteomic samples were prepared using the above-mentioned protocols, and samples were placed in 96-well plates (Eppendorf, Hamburg, Germany). The cell stimulation assay was performed in triplicate.
Data Processing
The raw data files were converted to mzML files (using parameters “peak picking” and “demultiplex”) by ProteoWizard (version: 3.0.19254). The converted mzML files were transformed into DIA files for search using DIA-NN11,20 (version:1.8, https://github.com/vdemichev/DiaNN). Peptides and proteins were identified and quantified from each DIA file using a two-step search. First, a spectral library for the library-free search was generated from the human protein sequence database (UniProt id UP000005640, reviewed, canonical, 20381 entries) using DIA-NN. The DIA-NN search parameters were as follows: protease, trypsin/P; missed cleavages, 1; peptide length range, 7–45; precursor charge range, 2–4; fragment ion m/z range, 200–1800; mass accuracy, 15 ppm; static modification, cysteine carbamidomethylation; and enabled “Remove likely interferences” and “Use isotopologues.” The precursor mass range was varied according to the DIA method (Table 1). Additional commands were set as follows: peak translation and -relaxed-prot-inf. The protein identification threshold was set at <1% for both peptide and protein false discovery rate (FDR) values. Next, a protein search was performed again using the generated specific spectral library. The protein quantity was measured as the sum of the quantities of the four best precursors. In addition, the raw data files were analyzed against a human spectral library using Scaffold DIA (Proteome Software Inc., Portland, OR). The human spectral library was built from the human protein database (UniProt id UP000005640, reviewed, canonical) established by Prosit.21 The Scaffold DIA search parameters were as follows: experimental data search enzyme, trypsin; maximum missed cleavage sites, 1; precursor mass tolerance, 15 ppm; fragment mass tolerance, 15 ppm; and static modification, cysteine carbamidomethylation. The peptide and protein FDR values were set to <1%. Peptide quantification was performed using the EncyclopeDIA22 algorithm in Scaffold DIA. For each peptide, four fragment ions of the highest quality were selected for quantification. Protein values were normalized using linear median normalization between samples.
Statistical Analysis of the Cell Stimulation Assay
Statistical quantitative analysis was performed using Perseus.23 Samples were categorized into individual dose groups (0.1, 1.0, and 10 μM) for each substance. The intensity values were log2-transformed, and proteins that quantified more than 70% of each group were chosen. Normalization was performed by width adjustment prior to imputation of missing values (downshift = 1.8, width = 0.3). One-way ANOVA for each substance group was performed, and significantly different proteins (p < 0.05) were extracted. Subsequently, hierarchical clustering was performed using the extracted proteins, and characteristic clusters were selected for each group. All other statistical analyses were performed using GraphPad Prism 9 (GraphPad Software, San Diego, CA).
Pathway and Gene Ontology (GO) Enrichment Analysis
Pathway analysis was performed by the enrichment analysis of WikiPathway using Enricher (https://maayanlab.cloud/Enrichr/).24 The proteins with a significant difference (p < 0.05) between each stimulation and control were analyzed by this platform. Gene ontology enrichment analysis was performed for each cluster using Metascape (https://metascape.org/gp/index.html#/main/step1). The top five terms of the adjusted p-value in the biological process were selected in each cluster, and the dot plot was generated using R software (ggplot2 library). To evaluate the Res30k/W8/650–770 method, we performed GO enrichment analysis using DAVID (https://david.ncifcrf.gov/home.jsp) and extracted the proteins with GO terms MF of “kinase activity,” “protein kinase activity,” “receptor signaling protein serine/threonine kinase activity,” “transcription coactivator activity,” “transcription factor binding”, and “GTPase activity.”
Results and Discussion
Optimization of DIA for 5-Min Gradient LC
Q-TOF MS has been utilized in ultrafast proteomics because of its fast scan rate in high-resolution MS.11−13 This study aimed to establish an ultrafast method based on Q-Orbitrap MS by optimizing DIA with fast LC. We constructed 46 methods combining a different number of isolation windows, mass ranges, and resolutions, with the same scan cycle time, to evaluate DIA systematically for 5-min gradient LC (Table 1). We then screened the DIA with a higher number of identified proteins and peptides from the above 46 methods using 5-min gradient LC (Figure S1). Nine DIA methods (Res30k/W8/500–620, Res30k/W8/550–670, Res30k/W8/600–720, Res30k/W8/650–770, Res15k/W4/650–770, Res15k/W8/500–740, Res15k/W8/550–790, Res15k/W8/600–840, and Res15k/W8/650–890) showed high protein and peptide identification (Figure 1A, Dashed line). In the precursor mass range, these methods cover m/z 500–890 (starting to end at Res30k/W8d/500–620 to Res15k/W8/650–890; Table 1). The number of identified peptides in the MS range of m/z 400–640 and 450–690 exceeded 15,000, but few proteins were identified. In the low MS range, many peptides were identified because peaks of long peptides of triple charge or higher were observed in addition to short peptide peaks. However, owing to the increase in the number of peptide peaks detected, the MS2 pattern of DIA became more complicated, making it difficult to identify trace peptides. In higher MS ranges such as m/z 700–940, 750–990, 850–970, and 900–1020, fewer peptide peaks were observed, and the number of identified proteins did not increase. For shorter gradients, MS ranges of approximately m/z 500–890 were a suitable range that allowed us to increase the number of identified proteins, which was attributed to the combination of the number of peptides detected and the reduced complexity of MS2.
To verify the results of this screening, we measured the nine selected methods in triplicate and compared the proteins and peptides. Consequently, the methods with narrow precursor mass ranges (120 Da: Res15k/W4/650–770, Res30k/W8/650–770, Res30k/W8/600–720, Res30k/W8/550–670, and Res30k/W8/500–620) indicated a higher number of protein identifications than the wide mass range methods (240 Da: Res15k/W8/650–890, Res15k/W8/600–840, Res15k/W8/550–790, and Res15k/W8/500–740) (Figure 1B and Table S1). In contrast, there were fewer peptide identifications in the narrow mass range than in the wide mass range (Figure 1C and Table S1). Importantly, more peptides were co-eluted in the shorter LC gradient. A narrow mass range leads to a decrease in the number of detectable peptides, thus reducing spectral complexity. Thus, a narrow mass range is effective for improving protein identification in the shorter gradient LC.
Among the narrow methods, Res30k/W8/650–770 showed the highest protein identification, with an average of 5,055 proteins (Figure 1B and Table S1). Moreover, Res30k/W8/650–770 showed the highest reproducibility with respect to protein quantities (median coefficient of variation (CV) 11.6%, Table S1). To verify the protein identification of Res30k/W8/650–770 using a search algorithm other than DIA-NN, we reanalyzed the data obtained from the nine methods (indicated in Figure 1B,C) using Scaffold DIA with EncyclopeDIA. Results showed that the Res30k/W8/650–770 method showed the highest protein identifications even for Scaffold DIA software, although the protein and peptide identifications in Scaffold DIA were fewer than those in DIA-NN (Figure S2 and Table S2). In addition, protein and peptide identifications for the nine methods showed similar trends in Scaffold DIA (Figure S2 and Table S2); i.e., the number of protein and peptide identifications in narrow methods was higher and fewer, respectively, compared with those in wide methods. These results suggest that our method not only works with DIA-NN but also with other software. Based on these results, we determined that Res30k/W8/650–770 is the most efficient DIA for combining 5-min gradient LC and Q-Orbitrap MS, that is, precursor mass range, m/z 650–770; mass resolution, 30,000; and isolation window width, 8 Da. The throughput of our method enabled the measurement of approximately 80 samples/day, including the sample loading, column equilibration, and wash running time.
Evaluation of Proteome Coverage and Reproducibility of Optimized DIA Parameters
To further examine the proteome coverage and robustness of the optimized DIA parameters, we measured 31.25, 62.5, 125, 250, 500, and 1000 ng of cell digests in a single-shot run. Our method showed the highest number of identified proteins for 1000 ng of digest injection, whereas the difference in the number of identified proteins was small between 500 and 1000 ng of digest injection (Figure 2A). The average numbers of identified proteins in the 31.25, 62.5, 125, 250, 500, and 1000 ng digests were 4209, 4544, 4763, 4975, 5094, and 5150, respectively. When measuring based on our method in a single-shot run in triplicate, more than 5000 proteins, namely, 5176, 5145, and 5129 proteins, were identified (Figure 2A).
We identified 5,186 proteins and quantified 4930 proteins in triplicate using a 1000 ng cell digest (Figure 2B). Of these, 3880 and 2467 proteins indicated CV < 0.2 and 0.1, respectively (median CV of 10.2%). The CV% of proteins was not significantly different from that of previous ultrafast methods (median CVs of 10.3 and 6.4%12,13). Among the identified proteins, the dynamic range of intensity was >104 (Figure 2C, left panel). To investigate the proteomic coverage of our method, we focused on the number of identified minor component proteins, represented as kinases, transcriptional factors, and small GTPases. In total, 139, 186, and 99 for kinases, transcription factor-related proteins, and GTPases, respectively, were identified using our method (Figure 2C, right panel). Kinase and transcription factor identifications were fewer than those of our previous method (502 and 1029 identifications, respectively: 90 min gradient at a flow rate of 100 nL/min).7 Given the difference in measurement time, we considered that our method indicated an adequate depth of proteome coverage, especially in clinical applications. Our method is suitable for the routine measurement of numerous samples because of its analytical throughput and may be effective for the diagnosis and screening of biomarker candidates. As shown in Figure 3, our method showed good protein identification rates of 134.7, 19.9, and 5.2 proteins/ng for 31.25, 250, and 1000 ng digest injections, respectively. These results imply that our method requires a lower sample volume for protein extraction. Consequently, our method reduces sample volume, acquisition time, and data size.
High-Throughput Proteomic Application by a Chemical Stimulation Assay of Cells
Drug screening should be performed for many samples; thus, HT methods are required. One of the important aims of applying proteomics to HT methods is to capture the differences in protein profiles as a consequence of stimulation. To obtain findings for future HT proteomic applications, we investigated whether our method enables continuous measurement of multiple samples and captures biological alterations in proteome profiles via a cell stimulation assay using eight chemical substances. Subsequently, we performed a pathway analysis for each stimulation. Information on the pathway analysis with statistical significance is summarized in Table S3. The adequacy of the assay was verified by observing the bioactivity of LPA(14:0) and S1P stimulation. LPA(14:0) and S1P are bioactive lysolipids, and their G-protein-coupled receptors (GPCRs) are ubiquitously expressed in human cells.25−27 LPA and S1P receptor (LPAR and S1PR) signaling regulate various biological functions, including cytoskeleton and adherens junction assembly, mediated by actin polymerization, followed by GTPase activation.25,26 In 0.1 μM LPA stimulation, upregulation of the “Regulation of actin cytoskeleton” pathway suggested that LPA stimulation activates LPAR/Rho/Rho-associated protein kinase signaling.26 In addition, PLCB3 (1-phosphatidylinositol 4,5-bisphosphate phosphodiesterase β-3; phospholipase C-β3) was included in upregulated pathways by LPA(14:0) stimulation (Table S3). LPAR is known to activate LPAR/phospholipase C-β3/AKT/ERK signaling.26,28 These observations support the idea that LPAR signaling is activated by LPA stimulation. Although annotated pathways such as S1PR signaling were not found in this assay, an increase in actin- and tubulin-related proteins, including ACTB (β-actin), FLNA (Filamin A), and TUBA1A (Tubulin Alpha 1a) (Table S3), was supported to enhance membrane trafficking and actin polymerization via SIPR signaling activation. Our findings are consistent with those of previous studies, which support the adequacy of our method. S1P stimulation resulted in the highest number of both up- and downregulated pathways: 8, 7, and 14 pathways at 0.1, 1.0, and 10 μM, respectively (total 29 pathways) (Table S3). These results suggest that S1P-S1PR signaling regulates numerous biological and physiological functions. For other stimulations, 8, 10, 7, 9, 11, and 8 pathways were annotated by Sph, PI3P, PI4P, PI5P, IP3, and IP4 stimulation, respectively (Table S3). Substance-specific altered pathways were observed in each stimulation such as “Protein export” for 10 μM PI3P and “Ubiquinone and other terpenoid-quinone biosynthesis” for 1.0 μM PI4P stimulation, whereas several altered pathways were overlapped in individual stimuli (Table S3). Thus, each stimulation may have a different effect on intracellular signaling. However, it is unclear how these substances act in the extracellular space as their GPCRs have not yet been identified. Thus, further investigation is required to clarify the mechanisms underlying the bioactivity.
Additionally, we performed hierarchical clustering and GO enrichment analysis of the proteins with statistical significance for each substance (Figure S3A–H). In this analysis, we found that a dose-dependent response was an efficient bioactivity for each substance. Thus, we selected two clusters that indicated that the bioactivity increased or decreased in a dose-dependent manner after each stimulation (Figure 4A,B). The number of significantly altered proteins is summarized in Table S4. Predictably, actin filament, membrane traffic, and cell adhesion-related terms (GO:0030048, GO:0030039, GO:1900024, R-HSA-2029482, R-HSA-199991, indicated by asterisks in Figure 5) were significant GO terms in both LPA(14:0) and S1P stimulation. In addition, “signaling by receptor tyrosine kinase (R-HSA-9006934)” was significantly increased after LPA(14:0) stimulation. LPA signaling induces the transactivation of an epidermal growth factor receptor (EGFR),29,30 which is a receptor tyrosine kinase. In this assay, EPS15, CRK, MID, GTF2F2, PTK2, and WASF2 were annotated with “signaling by receptor tyrosine kinase (R-HSA-9006934)”. LPA(14:0) stimulation significantly increased the levels of these proteins compared with those of the control (EPS15, PTK2, and CRK at 10 μM LPA(14:0); MDK at 0.1 and 10 μM LPA(14:0); GTF2F2 and WASF2 at 1.0 μM LPA(14:0)) (Figure S4). The EGFR levels were not significantly different between the LPA stimulation and control groups (Figure S4). We investigated the bioactivity of Sph, PI3P, PI4P, PI5P, IP3, and IP4 using our HT proteomic platform (Figure 5). For instance, Sph stimulation was found to be involved in mitochondrial localization (GO:0051654, GO:0051646, GO:0047497, and GO:0034643). IP3 stimulation was mainly associated with translation and peptide synthesis (GO:0043043 and 0006412). Our results may be helpful for researchers planning to perform HT screening using these compounds. Consequently, our method enabled the measurement of approximately 80 samples and further alteration of proteomic profiles by chemical stimulation.
Conclusions
In this study, we revealed that the optimized DIA parameters are efficient for improving protein coverage and they enable us to capture the alteration of proteome profiles when using 5-min gradient LC and Q-Orbitrap MS. Of note, these DIA parameters show a high number of protein identifications even when a smallamount of digest is injected. To our knowledge, this study is the first to evaluate the detailed DIA parameters for HT proteomics using Q-Orbitrap MS. Overall, our results may provide helpful information for enhancing HT proteomic applications using Q-Orbitrap MS in future medical and medicinal fields.
Acknowledgments
The authors thank Mrs. Hiroko Kinoshita for technical assistance.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jproteome.2c00121.
Representative total ion chromatogram of 5-min gradient LC-MS/MS (Figure S1); re-analysis of data obtained from selected nine methods by analyzing Scaffold DIA (Figure S2); hierarchical clustering and GO enrichment analysis of HEK293T cell proteins stimulated by (A) LPA(14:0), (B) S1P, (C) Sph, (D) PI3P, (E) PI4P, (F) PI5P, (G) IP3, and (IP4) (Figure S3); levels of EGFR and proteins annotated with “signaling by receptor tyrosine kinase (R-HSA-9006934)” by LPA(14:0) stimulation (Figure S4) (PDF)
Number of identified proteins and peptides in triplicate runs for selected nine DIA methods by analyzing DIA-NN (Table S1); number of identified proteins and peptides in triplicate runs for selected nine DIA methods by analyzing Scaffold DIA (Table S2); information of annotated pathway with statistical significance for each stimulation (Table S3); summary of cell stimulation assay using eight chemical substances (Table S4) (XLSX)
Author Contributions
§ M.I. and R.K. contributed equally to this work. Y.K. and O.O. conceived and supervised the study. M.I. and Y.K. performed the LC-MS analysis. R.K. and H.S. performed the computational work. M.I., D.N., and R.N. performed the biological experiments, and R.K., M.G., and K.F. analyzed the data. M. I., R.K., and Y.K. wrote the manuscript, and O.O. edited the manuscript. All authors have read and approved the final manuscript.
The authors declare no competing financial interest.
Notes
Mass spectrometry proteomics data were deposited in the ProteomeXchange Consortium via the jPOST partner repository31 with the dataset identifier PXD034227 for ProteomeXchange and JPST001513 for jPOST.
Supplementary Material
References
- Meyer J. G.; Schilling B. Clinical applications of quantitative proteomics using targeted and untargeted data-independent acquisition techniques. Exp. Rev. Proteomics 2017, 14, 419–429. 10.1080/14789450.2017.1322904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geyer P. E.; Holdt L. M.; Teupser D.; Mann M. Revisiting biomarker discovery by plasma proteomics. Mol. Syst. Biol. 2017, 13, 942. 10.15252/msb.20156297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Savitski M. M.; Reinhard F. B.; Franken H.; Werner T.; Savitski M. F.; Eberhard D.; Martinez Molina D.; Jafari R.; Dovega R. B.; Klaeger S.; Kuster B.; Nordlund P.; Bantscheff M.; Drewes G. Tracking cancer drugs in living cells by thermal profiling of the proteome. Science 2014, 346, 1255784 10.1126/science.1255784. [DOI] [PubMed] [Google Scholar]
- Williams S. A.; Kivimaki M.; Langenberg C.; Hingorani A. D.; Casas J. P.; Bouchard C.; Jonasson C.; Sarzynski M. A.; Shipley M. J.; Alexander L.; Ash J.; Bauer T.; Chadwick J.; Datta G.; DeLisle R. K.; Hagar Y.; Hinterberg M.; Ostroff R.; Weiss S.; Ganz P.; Wareham N. J. Plasma protein patterns as comprehensive indicators of health. Nat. Med. 2019, 25, 1851–1857. 10.1038/s41591-019-0665-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bruderer R.; Bernhardt O. M.; Gandhi T.; Xuan Y.; Sondermann J.; Schmidt M.; Gomez-Varela D.; Reiter L. Optimization of Experimental Parameters in Data-Independent Mass Spectrometry Significantly Increases Depth and Reproducibility of Results. Mol. Cell. Proteomics 2017, 16, 2296–2309. 10.1074/mcp.RA117.000314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hebert A. S.; Prasad S.; Belford M. W.; Bailey D. J.; McAlister G. C.; Abbatiello S. E.; Huguet R.; Wouters E. R.; Dunyach J. J.; Brademan D. R.; Westphall M. S.; Coon J. J. Comprehensive Single-Shot Proteomics with FAIMS on a Hybrid Orbitrap Mass Spectrometer. Anal. Chem. 2018, 90, 9529–9537. 10.1021/acs.analchem.8b02233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kawashima Y.; Watanabe E.; Umeyama T.; Nakajima D.; Hattori M.; Honda K.; Ohara O. Optimization of Data-Independent Acquisition Mass Spectrometry for Deep and Highly Sensitive Proteomic Analysis. Int. J. Mol. Sci. 2019, 20, 5932. 10.3390/ijms20235932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muntel J.; Gandhi T.; Verbeke L.; Bernhardt O. M.; Treiber T.; Bruderer R.; Reiter L. Surpassing 10 000 identified and quantified proteins in a single run by optimizing current LC-MS instrumentation and data analysis strategy. Mol. Omics 2019, 15, 348–360. 10.1039/C9MO00082H. [DOI] [PubMed] [Google Scholar]
- Bache N.; Geyer P. E.; Bekker-Jensen D. B.; Hoerning O.; Falkenby L.; Treit P. V.; Doll S.; Paron I.; Müller J. B.; Meier F.; Olsen J. V.; Vorm O.; Mann M. A Novel LC System Embeds Analytes in Pre-formed Gradients for Rapid, Ultra-Robust Proteomics. Mol. Cell Proteomics 2018, 17, 2284–2296. 10.1074/mcp.TIR118.000853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bian Y.; Zheng R.; Bayer F. P.; Wong C.; Chang Y. C.; Meng C.; Zolg D. P.; Reinecke M.; Zecha J.; Wiechmann S.; Heinzlmeir S.; Scherr J.; Hemmer B.; Baynham M.; Gingras A. C.; Boychenko O.; Kuster B. Robust, reproducible and quantitative analysis of thousands of proteomes by micro-flow LC–MS/MS. Nat. Commun. 2020, 11, 157 10.1038/s41467-019-13973-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Messner C. B.; Demichev V.; Wendisch D.; Michalick L.; White M.; Freiwald A.; Textoris-Taube K.; Vernardis S. I.; Egger A. S.; Kreidl M.; Ludwig D.; Kilian C.; Agostini F.; Zelezniak A.; Thibeault C.; Pfeiffer M.; Hippenstiel S.; Hocke A.; von Kalle C.; Campbell A.; Hayward C.; Porteous D. J.; Marioni R. E.; Langenberg C.; Lilley K. S.; Kuebler W. M.; Mülleder M.; Drosten C.; Suttorp N.; Witzenrath M.; Kurth F.; Sander L. E.; Ralser M. Ultra-High-Throughput Clinical Proteomics Reveals Classifiers of COVID-19 Infection. Cell Systems 2020, 11, 11–24. e4. 10.1016/j.cels.2020.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meier F.; Brunner A. D.; Frank A.; Ha A.; Bludau I.; Voytik E.; Kaspar-Schoenefeld S.; Lubeck M.; Raether O.; Bache N.; Aebersold R.; Collins B. C.; Röst H. L.; Mann M. diaPASEF: parallel accumulation–serial fragmentation combined with data-independent acquisition. Nat. Methods 2020, 17, 1229–1236. 10.1038/s41592-020-00998-0. [DOI] [PubMed] [Google Scholar]
- Messner C. B.; Demichev V.; Bloomfield N.; Yu J.S.L.; White M.; Kreidl M.; Egger A. S.; Freiwald A.; Ivosev G.; Wasim F.; Zelezniak A.; Jürgens L.; Suttorp N.; Sander L. F.; Kurth F.; Liley K. S.; Mülleder M.; Tate S.; Ralser M. Ultra-fast proteomics with Scanning SWATH. Nat. Biotechnol. 2021, 39, 846–854. 10.1038/s41587-021-00860-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bekker-Jensen D. B.; Martínez-Val A.; Steigerwald S.; Rüther P.; Fort K. L.; Arrey T. N.; Harder A.; Makarov A.; Olsen J. V. A Compact Quadrupole-Orbitrap Mass Spectrometer with FAIMS Interface Improves Proteome Coverage in Short LC Gradients. Mol. Cell Proteomics 2020, 19, 716–729. 10.1074/mcp.TIR119.001906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eliuk S.; Makarov A. Evolution of Orbitrap mass spectrometry instrumentation. Annu. Rev. Anal. Chem. 2015, 8, 61–80. 10.1146/annurev-anchem-071114-040325. [DOI] [PubMed] [Google Scholar]
- Ludwig C.; Gillet L.; Rosenberger G.; Amon S.; Collins B. C.; Aebersold R. Data-independent acquisition-based SWATH-MS for quantitative proteomics: a tutorial. Mol. Syst. Biol. 2018, 14, e8126 10.15252/msb.20178126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakamura R.; Nakajima D.; Sato H.; Endo Y.; Ohara O.; Kawashima Y. A Simple Method for In-Depth Proteome Analysis of Mammalian Cell Culture Conditioned Media Containing Fetal Bovine Serum. Int. J. Mol. Sci. 2021, 22, 2565. 10.3390/ijms22052565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sielaff M.; Kuharev J.; Bohn T.; Hahlbrock J.; Bopp T.; Tenzer S.; Distler U. Evaluation of FASP, SP3, and iST Protocols for Proteomic Sample Preparation in the Low Microgram Range. J. Proteome Res. 2017, 16, 4060–4072. 10.1021/acs.jproteome.7b00433. [DOI] [PubMed] [Google Scholar]
- Maia T. M.; Staes A.; Plasman K.; Pauwels J.; Boucher K.; Argentini A.; Martens L.; Montoye T.; Gevaert K.; Impens F. Simple Peptide Quantification Approach for MS-Based Proteomics Quality Control. ACS Omega. 2020, 5, 6754–6762. 10.1021/acsomega.0c00080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Demichev V.; Messner C. B.; Vernardis S. I.; Lilley K. S.; Ralser M. DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat. Methods 2020, 17, 41–44. 10.1038/s41592-019-0638-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gessulat S.; Schmidt T.; Zolg D. P.; Samaras P.; Schnatbaum K.; Zerweck J.; Knaute T.; Rechenberger J.; Delanghe B.; Huhmer A.; Reimer U.; Ehrlich H. C.; Aiche S.; Kuster B.; Wilhelm M. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 2019, 16, 509–518. 10.1038/s41592-019-0426-7. [DOI] [PubMed] [Google Scholar]
- Searle B. C.; Pino L. K.; Egertson J. D.; Ting Y. S.; Lawrence R. T.; MacLean B. X.; Villén J.; MacCoss M. J. Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry. Nat. Commun. 2018, 9, 5128 10.1038/s41467-018-07454-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tyanova S.; Temu T.; Sinitcyn P.; Carlson A.; Hein M. Y.; Geiger T.; Mann M.; Cox J. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 2016, 13, 731–740. 10.1038/nmeth.3901. [DOI] [PubMed] [Google Scholar]
- Xie Z.; Bailey A.; Kuleshov M. V.; Clarke D.J.B.; Evangelista J. E.; Jenkins S. L.; Lachmann A.; Wojciechowicz M. L.; Kropiwnicki E.; Jagodnik K. M.; Jeon M.; Ma’ayan A. Gene set knowledge discovery with Enrichr. Curr. Protoc. 2021, 1, e90 10.1002/cpz1.90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin M. E.; Herr D. R.; Chun J. Lysophosphatidic acid (LPA) receptors: signaling properties and disease relevance. Prostaglandins Other Lipid Mediat. 2010, 91, 130–138. 10.1016/j.prostaglandins.2009.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yung Y. C.; Stoddard N. C.; Chun J. LPA receptor signaling: pharmacology, physiology, and pathophysiology. J. Lipid Res. 2014, 55, 1192–1214. 10.1194/jlr.R046458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drexler Y.; Molina J.; Mitrofanova A.; Fornoni A.; Merscher S. Sphingosine-1-Phosphate Metabolism and Signaling in Kidney Diseases. J. Am. Soc. Nephrol. 2021, 32, 9–31. 10.1681/ASN.2020050697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin F. T.; Lai Y. J. Regulation of the LPA2 Receptor Signaling through the Carboxyl-Terminal Tail-Mediated Protein-Protein Interactions. Biochim. Biophys. Acta 2008, 1781, 558–562. 10.1016/j.bbalip.2008.04.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Inoue A.; Arima N.; Ishiguro J.; Prestwich G. D.; Arai H.; Aoki J. LPA-producing enzyme PA-PLA1α regulates hair follicle development by modulating EGFR signaling. EMBO J. 2011, 30, 4248–4260. 10.1038/emboj.2011.296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- No Y. R.; He P.; Yoo B. K.; Yun C. C. Regulation of NHE3 by lysophosphatidic acid is mediated by phosphorylation of NHE3 by RSK2. Am. J. Physiol. Cell Physiol. 2015, 309, C14–C21. 10.1152/ajpcell.00067.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Okuda S.; Watanabe Y.; Moriya Y.; Kawano S.; Yamamoto T.; Matsumoto M.; Takami T.; Kobayashi D.; Araki N.; Yoshizawa A. C.; Tabata T.; Sugiyama N.; Goto S.; Ishihama Y. jPOSTrepo: an international standard data repository for proteomes. Nucleic Acids Res. 2017, 45, D1107–D1111. 10.1093/nar/gkw1080. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.