Abstract
The inability to quantify large numbers of proteins in tissues and biofluids with high precision, sensitivity, and throughput is a major bottleneck in biomarker studies. We previously demonstrated that coupling immunoaffinity enrichment using anti-peptide antibodies (SISCAPA) to multiple reaction monitoring mass spectrometry (MRM-MS) produces Immunoprecipitation MRM-MS (immuno-MRM-MS) assays that can be multiplexed to quantify proteins in plasma with high sensitivity, specificity, and precision. Here we report the first systematic evaluation of the interlaboratory performance of multiplexed (8-plex) immuno-MRM-MS in three independent labs. A staged study was carried out in which the effect of each processing and analysis step on assay coefficient of variance, limit of detection, limit of quantification, and recovery was evaluated. Limits of detection were at or below 1 ng/ml for the assayed proteins in 30 μl of plasma. Assay reproducibility was acceptable for verification studies, with median intra- and interlaboratory coefficients of variance above the limit of quantification of 11% and <14%, respectively, for the entire immuno-MRM-MS assay process, including enzymatic digestion of plasma. Trypsin digestion and its requisite sample handling contributed the most to assay variability and reduced the recovery of target peptides from digested proteins. Using a stable isotope-labeled protein as an internal standard instead of stable isotope-labeled peptides to account for losses in the digestion process nearly doubled assay accuracy for this while improving assay precision 5%. Our results demonstrate that multiplexed immuno-MRM-MS can be made reproducible across independent laboratories and has the potential to be adopted widely for assaying proteins in matrices as complex as plasma.
Hundreds of proteins with claimed potential as diagnostic or prognostic biomarkers of disease presence or progression have emerged from discovery proteomics studies, but few have progressed to use in the clinic. The failure to deliver on the promise of plasma protein biomarkers is in part due to the lack of sufficiently precise methods to measure changes in abundance of numerous candidate protein biomarkers in hundreds of patient samples. We call this critical step “verification,” and it bridges discovery and clinical validation of biomarkers (1–5). Verification of candidate protein biomarkers currently relies primarily on standard immunoassays. Analytically validated immunoassays measure target analytes with high specificity, sensitivity, and throughput, although these are not without problems caused by autoantibodies and off target interferences. In addition, antibody reagents suitable for constructing useful sandwich immunoassays exist for only a small fraction of the proteome. Developing reliable immunoassays for every potential candidate biomarker is not practical at present because of the long development times and high cost (3), which have led to a dearth of new protein biomarkers (6). Alternative methods that have similar performance attributes to immunoassays are clearly needed for the biomarker verification step.
The performance characteristics of multiplexed SID-MRM-MS1 assays for direct measurement of proteins in plasma have been systematically and rigorously evaluated, including the intra- and interlaboratory reproducibility of the technology, using well established methods for analytical validation of clinical assays (7–9). These studies demonstrated that the technology is highly specific, sensitive, and precise for direct quantification of proteins in plasma. Limits of quantification (LOQ) by direct analysis of digested plasma by SID-MRM-MS were in the range of 1–5 μg/ml of protein (9), with intralaboratory reproducibility between 6.6 and 54.9% and interlaboratory imprecision of 18.5% as measured at the LOQ. Importantly, these excellent performance levels were achieved regardless of the specific LC or MS instrument employed, provided that proper care was exercised in sample preparation and data analysis.
Although many proteins of clinical utility are present in blood at or above 1 μg/ml (e.g. C-reactive protein), most disease-specific protein biomarkers are present at ng/ml and lower levels (e.g. cardiac troponins in heart failure, prostate specific antigen in prostate cancer and thyroglobulin in thyroid cancer) (6). The complexity and 1011 dynamic range of protein abundances in blood severely limit the sensitivity of protein quantification in plasma by SID-MRM-MS. A number of approaches have been taken to address this problem. Keshishian et al. (8) developed a method involving abundant protein depletion using commercial immunoaffinity columns followed by limited peptide fractionation (6–8 fractions) prior to on-line LC-SID-MRM-MS. Using this approach, termed fractionation MRM (fMRM), robust, multiplexed quantification of proteins at the bottom of the ng/ml range in plasma was achieved with an overall technical reproducibility between 2.6 and 37% with a median CV of 9.8% at 2.5 ng/ml. This technology has been used to develop multiplexed assays for known and novel markers of cardiovascular injury that are sufficiently sensitive and reproducible to assess levels of these proteins in patient samples (10). A limitation of the approach is that throughput of the approach fMRM is lower than for direct MRM-MS analysis primarily because of the increased amount of instrument time required for analysis of each sample fraction. Zhang and co-workers (11, 12) developed a highly selective approach that specifically enriches N-glycosylated peptides and proteins and then quantifies the formerly glycosylated peptides by MRM-MS. This approach permitted precise quantification of N-linked glycoproteins at low ng/ml range in plasma. Unlike the fMRM approach, the choice of which peptide to use for quantification is strictly limited to peptides that had been N-glycosylated. If the formerly N-glycosylated peptide does not have suitable HPLC retention or ionization properties for LC-MS/MS, it will not be usable for quantification. In addition, a change in the level of one of these peptides could be due to either a change in the level of the protein or a change in the extent of glycosylation at this particular site, the latter occurring without a change in protein level.
Stable isotope standards and capture by anti-peptide antibodies (SISCAPA) employs immunoaffinity enrichment of targeted peptides from digested tissue or biofluids prior to analysis by MRM-MS (immuno-MRM) (13). A number of reports from our laboratories (14–17), as well as from others (18–21), have demonstrated the ability of immuno-MRM assays to achieve LODs of ∼1 ng/ml for proteins using as little as 10 μl of plasma with process reproducibility similar to that of MRM-MS and fMRM-MS. Immuno-MRM assays have been developed and applied to measure a range of known and potentially novel protein biomarkers in cancer (14, 15, 22) and cardiovascular disease (16). An automated magnetic bead-based platform capable of high throughput processing for immuno-MRM has been developed and tested to measure proteins with high precision (15, 17). Multiplexing of immuno-MRM assays can be achieved in plex levels of at least 10 by combining anti-peptide antibodies produced against different distinct peptides (17). Mixtures of these antibodies are used to immuno-affinity enrich the target peptides from plasma in multiplex. Subsequent washing and elution of the beads releases all peptides captured for analysis by LC-MRM-MS. The pharmaceutical and biotechnology industries are now beginning to develop immuno-MRM assays internally or in partnership with academia and contract research organizations to measure proteins in preclinical animal models for pharmacodynamic assessment of drug candidates (19–21). These studies have clearly demonstrated the advantages and potentials of immuno-MRM to construct multiplexed assays to measure large numbers of proteins in biofluids and to overcome limitations of traditional immunoassays.
Here we report a systematic evaluation of the intra- and interlaboratory performance of multiplexed immuno-MRM assays, a prerequisite for more widespread acceptance and adoption of the technology in academia and industry. Protocols were developed to assess the intra- and interlaboratory reproducibility of the technique performed at multiple sites and to systematically examine the sources of imprecision in each phase of the assay process. The results for the 8-plex assay developed define the achievable performance of immuno-MRM using automated bead capture and washing and provide additional support that immuno-MRM-MS is valuable technology for bridging the gap between discovery and clinical validation of candidate protein biomarkers.
EXPERIMENTAL PROCEDURES
Peptide Standards, Protein Standards, Rabbit Anti-peptide Polyclonal Antibodies, and Plasma
Eight peptides specific to six proteins were selected using methods and criteria previously described (8, 9) (Table I). Briefly, all possible tryptic peptides for each protein were selected in silico using Agilent Spectrum Mill Peptide Selector (http://proteomics.broadinstitute.org/millhtml/mssluice.htm; Agilent, Santa Clara, CA) and chosen on the basis of uniqueness, size, and lack of modifiable residues. They were synthesized with a single stable isotope-labeled amino acid using Fmoc (N-(9-fluorenyl)methoxycarbonyl) chemistry and purified by reversed phase chromatography to greater than 90% (MIT Biopolymer Lab, Cambridge, MA). [13C6]Leucine (98.4% isotopic enrichment), [13C6]lysine (99.6% isotopic enrichment), and [13C6]arginine (99.2% isotopic enrichment) were purchased from Cambridge Isotope Laboratories (Andover, MA). [13C6]Arginine and [13C6]lysine were attached to Tentagel S resin (Rapp Polymere, Tübingen, Germany) prior to synthesis. Corresponding unlabeled peptides were also made (Genscript, Piscataway, NJ). Peptide concentrations were determined by amino acid analysis (Dana Farber Cancer Institute, Boston, MA).
Table I. Proteins, unique peptides, enrichment antibodies and selected masses for MRM-MS.
Detailed scheduled MRM methods are listed in Supplemental Methods. Transition ions in bold are used for all calculations including reporting relative abundance, calculating LOD, LOQ and CV. Stable isotope labeled amino acids are in blue (one AA) or red (all AA) and carbamidomethylated cysteine is in green. Antibody grades were assigned in the initial screening assays (17) based primarily on the limits of detection determined from a mini-curve prepared as 10-fold step dilutions from 500 fmol to 0.5 fmol.
In general, the proteins were selected for assay development because of an expected low concentration (less than 1 ng/ml) in plasma so that standard addition experiments could be generated with standard peptides in the absence of endogenous analyte. Granulocyte colony-stimulating factor protein standard (CSF3) was purchased from US Biological (Swampscott, MA; catalogue number G8951-26P). Unlabeled and fully 15N-labeled forms of recombinant S100B were obtained from Argonne National Laboratory (Argonne, IL). Proteins S100A7, S100A8, S100A12, and IL1RN were purchased from Abnova (Taipei, Taiwan) and were all expressed with GST tags. Protein concentrations were determined by colorimetric methods or amino acid analysis and used without further formulation. Protein purity was determined to be greater than 90% by SDS-PAGE. Although the GST-tagged protein standards each yielded the expected tryptic peptides when digested in buffer, none of the targeted peptides were observed by MRM when the proteins were spiked into plasma prior to digestion. The reason for a lack of digestion of the GST-tagged proteins in plasma is unclear, but could be due to facile homodimerization of the GST tag that could decrease the accessibility of the tagged proteins, thereby limiting digestion by trypsin or, alternatively, that the dimerized GST tag could decrease the solubility of the tagged protein, again resulting in low or no yield of the targeted peptides. No further work was done using the GST-tagged proteins.
Rabbit polyclonal antibodies were generated in New Zealand White rabbits following a standard 98-day protocol (Epitomics, Burlingame, CA) as previously described (17). In brief, peptides were synthesized to 80% purity with an additional cysteine on the C terminus and conjugated to keyhole limpet hemocyanin for immunization. Peptides were combined into groups of five, and each rabbit was immunized with a different mixture of five distinct peptides. Antisera titers were measured by peptide ELISA (23) to determine the peptides that elicited the highest immune response. Anti-peptide antibodies, typically specific for two or three peptides per protein, were purified separately by affinity chromatography using the immunizing peptide covalently attached to a Sulfolink column (Thermo Fisher Scientific, San Jose, CA). Affinity-purified polyclonal antibodies were shipped in 1× PBS, 0.02% sodium azide to one site where they were formulated to 25% glycerol and aliquotted for long term storage at −20 °C. Antibodies were sent to each site on wet ice and stored at 4 °C until use. Plasma pooled from deidentified human specimens was purchased from Bioreclamation Inc. (Westbury, NY) and treated as not human subjects research for samples prepared in this study.
Sample Preparation for All Study Phases
All of the samples were prepared at one site to remove initial sample preparation as a source of variability. Curves containing eight light peptides unique to six proteins ranging from 3 amol/μl to 500 fmol/μl were prepared in triplicate. The same concentrations of analyte were used in each phase of the study to directly compare the performance across all phases (supplemental Table 1). In phase 1, peptides were spiked into digested plasma diluted 1:500 in 5% acetic acid, 0.03% CHAPS to simulate nonspecifically bound plasma remaining after magnetic antibody bead enrichment (120 μg/ml). In phase 2, the peptides were spiked into digested plasma diluted 1:5 in 1× PBS, 0.03% CHAPS to simulate a resuspended plasma digest prepared for magnetic antibody bead enrichment. In phases 3 and 4, two proteins, S100B (unlabeled (light) and stable isotope-labeled (heavy) forms) and CSF3 (light form only) were added by standard addition into plasma at molar concentrations equivalent to those used in the peptide curves to simulate biomarkers in a clinical sample (supplemental Table 1). The highest concentration, 500 fmol/μl, was not prepared for phases 3 or 4 because of limiting amounts of reagents. The concentration of heavy peptide standard was 5 fmol/μl for phases 1 and 2 and 10 fmol/μl for phases 3 and 4. All of the samples were frozen and then distributed to each site.
Plasma Enzymatic Digestion
Digestion was performed using urea as the denaturant as described previously (8). Conditions based on digestion of 100 μl of plasma were scaled accordingly to digest 10 μl to 100 ml of plasma as required. Bulk plasma digests, used as a background matrix for assay evaluation, were digested with bovine trypsin (Sigma). Mock biomarker samples, containing protein standards, were digested with porcine trypsin (Promega, Madison, WI). Briefly, 100 μl of plasma were reduced and denatured in 7.5 m urea, 0.2 m Tris buffer, pH 8, and 40 mm tris(2-carboxyethyl) phosphine at 37 °C for 30 min, alkylated in the dark at ambient temperature with 120 mm iodoacetamide for 30 min, diluted 1:6 with 0.2 m Tris buffer pH 8, and incubated for 16 h with trypsin in a 1:50 enzyme to substrate on a Thermomixer (Eppendorf AG, Hamburg, Germany) at 37 °C and 750 RPM. After 16 h, a second volume of trypsin was added at the same ratio, and incubation continued for an additional 4 h. Formic acid was added to 2% final concentration to quench the reaction.
Samples were diluted 1:2 with 0.1% formic acid and loaded onto Oasis® HLB 3cc (60 mg) extraction cartridges (Waters, Milford, MA) that were prewashed and equilibrated with 2 ml of 90% acetonitrile, 0.1% formic acid followed by 2 ml of 0.1% formic acid. Stable isotope-labeled (heavy) peptides (100 fmol) were added to phase 3 and 4 samples prior to loading. After loading, each sample was washed with 5 ml of 0.1% formic acid and eluted with 2 volumes of 400 μl of 90% acetonitrile, 0.1% formic acid. Samples were vacuum-centrifuged or lyophilized to dryness and stored at −80 °C until analysis.
Automated Peptide Immunoaffinity Enrichment
Fifty microliters of each sample were added to a 96-well microtiter plate in triplicate. A bulk mixture of all eight anti-peptide antibodies was prepared to a final concentration of 20 μg/ml in sufficient quantity to add 1 μg of each antibody to each sample. Sixteen microliters of 1-μm protein G magnetic beads (Invitrogen) were added to each sample and mixed briefly. The plates were sealed with aluminum adhesive seal mats and gently inverted on a Labquake® (Thermo Fisher Scientific) rotator overnight at 4 °C. After overnight incubation, the plate was transferred onto a KingFisher magnetic bead processor (Thermo Fisher Scientific) equipped with a PCR magnet head. The plates were processed as described previously (15) with modifications (supplemental methods). Briefly, the beads were transferred from the incubation plate into a 250-μl plate containing 1× PBS, 0.03% CHAPS and mixed for 1.5 min. The beads were subsequently transferred and mixed twice more, with the second wash plate containing one-tenth the salt concentration. In the final step, the beads were transferred into a 100-μl PCR plate containing 25 μl of 5% acetic acid to elute the bound material. Elution plates were sealed with an adhesive aluminum seal mat and frozen at −80 °C until analysis.
NanoLC/MRM-MS Analysis
4000 Q TRAP® LC/MS/MS system (ABSciex, Foster City, CA) equipped with a nano-ion spray source was coupled to a NanoLC two-dimensional system (Eksigent, Dublin, CA) and set to the following parameters across three sites: spray voltage of 2000–2200 V, curtain gas of 20–25, nebulizer gas of 3–15, interface heater temperature of 150 °C, collision gas set to “medium,” entrance potential of 6–10, and unit resolution for both quadrupoles. Mixtures of standard peptides were infused, and the parameters were optimized for each transition ion (Table I). The Eksigent NanoLC two-dimensional system was configured with an autosampler with magnetic plate holder and 10-μl loop, PepMap C18, 300-μm × 5-mm trap column (Dionex, Sunnyvale, CA), and a self-pack PicoFrit® 75-μm analytical column with a 10-μm emitter (New Objective, Woburn, MA) packed to 10 cm with ReproSil-Pur C18-AQ 3-μm 200 Å reversed phase medium (Dr. Maisch GmbH, Entringen, Germany) and configured with the following solvents: CH1A (2% acetonitrile, 0.1% formic acid), CH1B (2% acetonitrile, 0.1% formic acid), CH2A (0.1% formic acid), and CH2B (90% acetonitrile, 0.1% formic acid). A third valve was installed to back flush the trap column (supplemental Fig. 1).
Full loop injections (10 μl) of each sample were loaded onto the trap column with CH1 A: CH1B (50:50) at 10 μl/min for 1.5 min. After loading, CH2 was switched into line with the trap and analytical columns, and samples were eluted with the following gradient at 300 nl/min over 18 min: 2.3% B isocratically for 3 min, 2.3% B to 40% B in 10 min, 40% B to 90% B for 2 min, and hold at 90% B for 3 min. After 18 min, the system was returned to 2.3% B for 19 min, increasing the flow rate to 400 nl/min for the last 10 min and switching the trap column in-line with CH1 for the last 2 min at 10 μl/min. Heavy peptide mixtures (50 fmol) in digested plasma diluted 1:1000 in 5% acetic acid were injected prior to analysis to condition the system and scheduled with the MRM method monitoring three transitions per peptide per isotopic version (light, heavy, heavy 15N). Five abundant plasma peptides that nonspecifically bound to the beads were also monitored for a total of 68 transitions (supplemental methods).
Data Analysis and Calculations
Extracted ion chromatograms of all transition ions were integrated using MultiQuantTM (AB Sciex Foster City, CA) version 1.2. Integrated peak areas were manually inspected to make sure the automated settings in the software integrated the same peak and retention time for each light and heavy pair of peptides.
Calculation of Protein Concentration Using Spiked (Single AA) Labeled Heavy Peptide
Light peptide to heavy (single AA) peptide peak area ratios (PAR) were entered into the following equation to calculate the observed concentration of protein in the original sample reported in ng/mL: (PAR) × (13C standard peptide concentration (fmol/μL)) × (analysis volume (μL)) × (Protein molecular weight (fg/fmol))/(original process volume (μL))/1000 (fg/μL to ng/mL conversion factor)=.
Calculation of Protein Concentration Using Spiked Uniformly Labeled (All AA) Peptide Derived from Heavy Protein
Light peptide to uniformly heavy (all AA) peptide PARs were entered into the following equation to calculate the observed concentration of protein in the original sample reported in ng/ml: (PAR) × (heavy protein concentration (ng/ml)/(normalization factor) = where the normalization factor is the average peak area ratio of the light peptide to the corresponding uniformly 15N-labeled peptide. The normalization factor accounts for the fact that the light protein and the uniformly 15N-labeled protein are spiked in at identical levels.
Calculation of Intralaboratory and Interlaboratory % CV, % RE, LOD, and LOQ
Imprecision (% CV) was calculated using the mean of the process replicates (i) within each site (intralab) and (ii) across all sites (interlab). Precision was calculated as 1 minus % CV. The percentage of relative error (% RE) was calculated as (observed − expected)/expected × 100. Accuracy and percent recovery were calculated as 1 plus % RE. Limit of detection (LOD) and quantification (LOQ) were calculated according to a method we described previously (10), based on a variation of the methods described by Linnet (24) and Currie (25). Linear regression was used to fit the calibration curve and determine the slope and intercept, along with respective standard error estimates, for data in the detectable range. Standard errors were reported to quantify variation and can be used to calculate appropriate confidence intervals. Weighted robust linear regression using Tukey's biweight estimator (26, 27) minimized the influence of outliers while appropriately accounting for the heteroscedasticity of the data. All of the calculations were performed using custom programs written in R (28).
RESULTS
Design of Interlaboratory Study
The primary goal of this study was to establish an automated, multiplexed anti-peptide antibody enrichment coupled to SID-MRM-MS analysis for quantification of proteins in plasma with well defined and suitable performance characteristics to enable distribution of the technology and the assays across laboratories. The performance of the immuno-MRM assay workflow across three independent laboratories was evaluated in a series of experiments that were implemented in four distinct phases as detailed below and in Fig. 1. Four phases were designed to evaluate intra- and interlaboratory reproducibility by determining the linear response, percent recovery, and LODs and LOQs for each unique peptide derived from target protein. % CV was calculated for each peptide at each site, as well as across all sites to estimate the process imprecision and interlaboratory reproducibility.
Fig. 1.

Design of interlaboratory study. The study consisted of four phases. In each phase, samples were prepared centrally, and the same molar concentrations of peptide or protein were used to generate response curves (see supplemental Table 1). Phase 1 consisted of pooled, digested diluted plasma (120 μg/ml) spiked with standard peptides at 12 concentrations plus a blank (no light peptide added) analyzed by LC-MRM (lightest shading). Phase 2 consisted of pooled, digested plasma (10 mg/ml) spiked with standard peptides at 12 concentrations, enriched by SISCAPA and analyzed by LC-MRM. Phase 3 consisted of undigested plasma (60 mg/ml) spiked with standard proteins, digested centrally, distributed, enriched by SISCAPA, and analyzed by LC-MRM. Phase 4 consisted of undigested plasma (60 mg/ml) spiked with standard proteins, distributed, digested locally, enriched by SISCAPA, and analyzed by LC-MRM (darkest shading). The data were centrally analyzed and statistically evaluated.
Phase 1 was used to establish the baseline LC-SID-MRM-MS response for the peptides at each laboratory in the absence of immunoaffinity enrichment, but in a background simulating the complexity of peptides nonspecifically bound to 1-μm protein G magnetic beads after SISCAPA immunoaffinity enrichment. The curves were prepared in triplicate for each site by spiking light peptide standards at 12 different concentrations into digested plasma diluted 1:500 (120 μg/ml). The background level of digested plasma used closely approximated the abundance and distribution of peptides we have detected nonspecifically bound to these magnetic beads when incubated 1:10 (v:v) with digested plasma in the absence of antibody (based on data-dependent LC-MS/MS analysis on an Orbitrap; data not shown). Phase 2 introduced immunoaffinity enrichment of the peptides (SISCAPA) and was designed to evaluate the efficiency and reproducibility of heavy and light peptide capture by anti-peptide antibodies. The samples were prepared by spiking peptides at the same molar concentrations as in Phase 1 into 10 μl of digested plasma diluted 1:5 (12 mg/ml), enriched using antibodies and magnetic beads (SISCAPA; KingFisher), and analyzed by LC-MRM-MS. Phases 3 and 4 were designed to determine the variability introduced by trypsin digestion in the context of the assay workflow. In phase 3, the samples were spiked with the target proteins, digested, spiked with heavy peptide, desalted at one site, and then distributed to each site for antibody enrichment and LC-MRM-MS analysis. Phase 4 evaluated the entire workflow by simulating the analysis of a real set of biomarker assays for proteins in plasma. In Phase 4, undigested plasma spiked with the target proteins spanning a range of concentrations was processed entirely at each site.
Linearity of Response and Peptide Recovery
Overlaid plots for each analyte of the experimentally determined average concentrations for all phases of the study help visualize the linearity of response and percent recovery of each peptide across the sites throughout the study. Representative curves for the peptide IQGDGAALQEK from CSF3, a growth factor and cytokine, are shown in Fig. 2. Results for the other peptides are shown in supplemental Fig. 2, A–G. Each curve represents the interlaboratory performance plotting the average measurement at each phase (color-coded lines). Because the measurements spanned a large dynamic range, linearity of response was assessed by the variability of the slope and intercept (21) calculated using a robust linear model for all data collected in the detectable range between 4.3 ng/ml (0.23 fmol/μl) and 3.1 μg/ml (167 fmol/μl) for each phase in the study (Table II). Standard error of slope was less than 0.02 for all peptides between 0.23 and 167 fmol/μl in this study, indicating a robust linear response between assays and sites. Therefore, immuno-MRM-MS can be used to determine the relative levels of protein between samples containing a large range of concentrations without first running an assay to estimate the concentration, which is often a prerequisite of other protein measurement systems. Below these concentrations, the peptide response was no longer linear due to low ion counts, background noise, and, in some cases, the presence of trace peptide carryover from prior analysis (see detailed discussion of LOD and LOQ below). Intercept (supplemental Table 4), an estimation of endogenous levels of peptide in plasma, has a 95% confidence interval (intercept ± 1.96 S.E.) that encompasses 0 ng/ml for all peptides, indicating the absence of any endogenous peptide, with the exception of S100A8.ALN in phase 2 (b = 4.12 ± 0.01; supplemental Table 4), which was included in this study as an example of an endogenous protein detected by immuno-MRM-MS in plasma.
Fig. 2.
Comparison of standard curves for granulocyte colony-stimulating factor peptide across the phases in the study. Observed average concentration (averaged across all three sites, with three replicates at each site, for a total of n = 9) was plotted against the expected concentration for each phase of the study (phase 1, blue line; phase 2, green line; phase 3, gray line; phase 4, red line) and compared with theoretical (dashed line). The error bars show the standard deviations. The asterisk indicates the 2.1 fmol/μl concentration at which recovery was determined.
Table II. Summary of interlaboratory immuno-MRM assay performance.
Slope, LOD, and LOQ are median values from three sites determined using the same transition ion in the linear range of the assay as described under “Experimental Procedures.” The percentage of recovery was determined at two concentrations, one at or near the LOD (0.23 fmol/μl) and one in the lower half of the linear range of the assay (2.1 fmol/μl). LOD and LOQ are reported in fmol/μl (peptide/protein) and ng/ml (protein). The slopes were determined for curve concentrations in the linear range using weighted robust linear regression as described under “Experimental Procedures.” The standard errors were reported to quantify variation and can be used to calculate appropriate confidence intervals. Higher recoveries were observed at the limit of detection because of the influence of background signal. The recovery reported in phase 2 for S100A8.ALN is artifactual because endogenous protein was detected in the background plasma digest. NA, not applicable, protein not spiked into plasma; ND, not determined.
| GeneName.PepSeq | Phase | Slope (±standard error)a | Percentage of recovery |
LOD |
LOQ |
|||
|---|---|---|---|---|---|---|---|---|
| 0.23 fmol/μlb | 2.1 fmol/μlc | fmol/μl (nm) | ng/ml (protein) | fmol/μl (nm) | ng/ml (protein) | |||
| S100A12.GHF | 1 | 1.03 ± 0.02 | 107.0% | 102.2% | ||||
| 2 | 1.09 ± 0.02 | 164.5% | 114.1% | 0.20 | 2.1 | 0.59 | 6.3 | |
| 3 | NA | NA | NA | NA | NA | NA | NA | |
| 4 | NA | NA | NA | NA | NA | NA | NA | |
| S100B.AMV | 1 | 1.18 ± 0.023 | 128.2% | 124.6% | ||||
| 2 | 1.11 ± 0.03 | 141.8% | 119.8% | 0.08 | 0.9 | 0.25 | 2.7 | |
| 3 | 0.40 ± 0.02 | 142.3% | 57.0% | 0.11 | 1.2 | 0.32 | 3.5 | |
| 4 | 0.42 ± 0.008 | 59.4% | 41.0% | 0.07 | 0.8 | 0.22 | 2.4 | |
| S100B.ELI | 1 | 1.37 ± 0.03 | 125.8% | 133.6% | ||||
| 2 | 1.08 ± 0.03 | 149.2% | 117.3% | 0.07 | 0.8 | 0.20 | 2.3 | |
| 3 | 0.19 ± 0.003 | 25.8% | 19.5% | 0.02 | 0.2 | 0.05 | 0.5 | |
| 4 | 0.51 ± 0.02 | 63.9% | 54.8% | 0.14 | 1.6 | 0.43 | 4.8 | |
| CSF3.IQG | 1 | 0.85 ± 0.01 | 106.5% | 89.8% | ||||
| 2 | 0.77 ± 0.01 | 81.5% | 75.2% | 0.03 | 0.5 | 0.09 | 1.6 | |
| 3 | 0.36 ± 0.01 | 58.6% | 40.4% | 0.04 | 0.7 | 0.11 | 2.2 | |
| 4 | 0.41 ± 0.01 | 55.5% | 42.2% | 0.06 | 1.1 | 0.17 | 3.2 | |
| IL1RN.IDV | 1 | 0.65 ± 0.02 | 75.1% | 60.3% | ||||
| 2 | 0.63 ± 0.01 | 89.4% | 69.8% | 0.14 | 2.8 | 0.41 | 8.3 | |
| 3 | NA | NA | NA | NA | NA | NA | NA | |
| 4 | NA | NA | NA | NA | NA | NA | NA | |
| S100A7.GTN | 1 | 1.09 ± 0.01 | 116.4% | 111.1% | ||||
| 2 | 1.08 ± 0.01 | 121.9% | 112.8% | 0.07 | 0.8 | 0.21 | 2.4 | |
| 3 | NA | NA | NA | NA | NA | NA | NA | |
| 4 | NA | NA | NA | NA | NA | NA | NA | |
| S100A7.ENF | 1 | 1.12 ± 0.02 | 132.1% | 117.7% | ||||
| 2 | 0.99 ± 0.01 | 99.2% | 97.1% | 0.03 | 0.3 | 0.08 | 0.9 | |
| 3 | NA | NA | NA | NA | NA | NA | NA | |
| 4 | NA | NA | NA | NA | NA | NA | NA | |
| S100A8.ALN | 1 | 0.91 ± 0.01 | 104.7% | 97.5% | ||||
| 2 | 0.91 ± 0.02 | 1894%d | 298.4%d | ND | ND | ND | ND | |
| 3 | ND | ND | ND | ND | ND | ND | ND | |
| 4 | ND | ND | ND | ND | ND | ND | ND | |
a Slope was determined using all curve concentrations in the linear range of the assay.
b Determined at the LOD of the assay.
c Determined at a concentration in the lower half of the linear range of the assay.
d Recovery for S100A8.ALN is artifactual as endogenous protein was detected in the background plasma digest.
The percentage of recovery following sample processing and immuno-MRM-MS analysis was determined from the relative error of each peptide as measured by MRM-MS at two concentrations, one at or near the LOD of the assay (0.23 fmol/μl) and one in the lower half of the linear range of the assay (2.1 fmol/μl) (Table II, Fig. 2, and supplemental Fig. 2, A–G). In Phase 1, the percentage of recovery in the linear range (2.1 fmol/μl) ranged from 60.3% (IL1RN.IDV) to 133.6% (S100B.ELI) for all peptides (Table II, column 3). These values reflect the true unadjusted amount of peptide added to prepare the curves and represent the experimentally derived maximum response for evaluating recovery in all subsequent phases in the study. In phase 2, where the same peptide mixtures were spiked into a more complex digest of plasma and enriched by immunoaffinity capture prior to MRM-MS, the recovery was similar to phase 1, ranging from 69.8% (IL1RN.IDV) to 119.8% (S100B.AMV) for all peptides at 2.1 fmol/μl (Table II, column 3). The percentages of recovery were higher at or near the limit of detection (0.23 fmol/μl) because of the influence of background signal. Apparent recovery of S100A8.ALN was much higher than expected at both concentration levels (298.4% at 2.1 fmol/μl and 1894.3% at 0.23 fmol/μl), indicating that endogenous S100A8 protein was present at ∼70 ng/ml in the background plasma digest. Slope was also calculated from the robust linear regression as an alternative value of recovery using all the concentration levels (n = 7) in the linear range of the assay. In general, slope values (Table II, column 1), converted to a percentage, compare favorably with the percentage of recovery values at one concentration in the linear range (Table II, column 3), suggesting that this is a suitable alternative for determining assay recovery. In phases 3 and 4, where proteins were spiked into plasma and digested prior to immuno-MRM-MS, the response was still linear from 0.23 to 167 fmol/μl, but the determined recoveries were distinctly less compared with phase 1, ranging from 19.5 to 54.8% for the peptides measured. The percentages of recovery at 2.1 fmol/μl of the CSF3 peptide in phases 3 and 4 as shown in Fig. 2 were 40.4 and 42.2%, respectively.
Intra- and Interlaboratory Reproducibility
The reproducibility of the quantitative measurements for each peptide within and across laboratories in each study phase were evaluated (“Experimental Procedures” and Fig. 1). Intralaboratory CVs for phase 1 constitute a measure of the technical variation caused by instrument and data acquisition because samples were prepared centrally. As summarized at the bottom of Table III, the median of intralaboratory CVs determined in the linear range in phase 1 was 3.4% for all peptides in the study (n = 8). Introducing SISCAPA capture into the process in phase 2 did not significantly affect assay imprecision relative to MRM-MS alone (median CV = 3.1%), because 13C-labeled peptides were present from the beginning of the analysis, and the antibody captures both forms without discrimination. Median intralaboratory CV approximately doubled to ∼7.0% when variability arising from the tryptic digestions performed at a single site in process replicate was introduced in phase 3. Performing each part of the analysis process entirely at each site as in phase 4 only marginally increased the intralaboratory CV relative to phase 3. However, primarily because of the differences in the mean determined between sites, the median interlaboratory CV of all peptides was higher (13.9%) than the median intralaboratory CV (8.0%) (Table III, column 1, bottom, phase 4).
Table III. Intra- and interlaboratory imprecision of multiplex immuno-MRM measurements.
The table shows CV (%) calculated from the means of process replicates, i.e. single process-single injection within sites (intralaboratory, n = 3) and across sites (interlaboratory, n = 9) and reported as the median for all values in the linear range, at a concentration near the LOD of the assay (0.23 fmol/μl) and at a concentration in the lower half of the linear range of the assay (2.1 fmol/μl). Unusually high CVs were observed for peptide IL1RN.IDV in phase 2 at 0.23 fmol/μl because this concentration is below the LOQ.
| ID (GeneName.PepSeq) | Site | Median CV in linear range of assay (%)a |
0.23 fmol/μl (%)b |
2.1 fmol/μl (%)c |
|||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phase 1 | Phase 2 | Phase 3 | Phase 4 | Phase 1 | Phase 2 | Phase 3 | Phase 4 | Phase 1 | Phase 2 | Phase 3 | Phase 4 | ||
| S100B.AMV | 1 | 3.1 | 2.0 | 11.0 | 4.0 | 4.8 | 4.6 | 11.0 | 9.1 | 3.1 | 1.0 | 14.6 | 2.9 |
| 2 | 2.5 | 2.9 | 8.2 | 10.0 | 4.7 | 4.0 | 8.2 | 22.3 | 1.2 | 2.9 | 24.3 | 11.0 | |
| 3 | 3.4 | 1.7 | 5.9 | 9.3 | 7.4 | 5.4 | 26.4 | 19.0 | 1.6 | 2.4 | 13.3 | 0.9 | |
| Interlaboratory | 11.7 | 10.8 | 18.9 | 13.9 | 9.5 | 20.7 | 34.1 | 18.6 | 10.8 | 10.8 | 22.9 | 9.8 | |
| S100B.ELI | 1 | 2.1 | 1.1 | 5.0 | 5.6 | 4.6 | 4.4 | 7.4 | 10.6 | 1.0 | 1.1 | 6.7 | 6.5 |
| 2 | 1.5 | 1.6 | 1.7 | 17.9 | 3.1 | 5.0 | 13.6 | 20.7 | 1.3 | 2.8 | 2.5 | 9.9 | |
| 3 | 4.0 | 3.2 | 7.0 | 8.0 | 14.5 | 3.0 | 20.8 | 10.0 | 3.9 | 1.6 | 8.7 | 0.4 | |
| Interlaboratory | 13.2 | 14.2 | 9.4 | 18.6 | 11.7 | 28.3 | 17.6 | 23.0 | 11.2 | 10.6 | 10.1 | 18.6 | |
| CSF3.IQG | 1 | 1.8 | 3.3 | 13.9 | 3.8 | 3.0 | 4.4 | 14.0 | 2.4 | 1.1 | 3.5 | 19.3 | 7.3 |
| 2 | 11.1 | 3.0 | 4.9 | 13.8 | 13.0 | 10.1 | 28.7 | 13.8 | 5.5 | 3.9 | 13.5 | 14.7 | |
| 3 | 6.7 | 6.4 | 13.0 | 7.6 | 14.4 | 7.9 | 33.8 | 8.8 | 11.7 | 5.6 | 13.7 | 7.6 | |
| Interlaboratory | 8.0 | 6.7 | 14.0 | 11.5 | 19.1 | 12.2 | 24.1 | 18.2 | 8.0 | 6.7 | 16.3 | 14.2 | |
| d | d | ||||||||||||
| S100A8.ALNd | 1 | 1.3 | 3.3 | 7.4 | 5.2 | 2.1 | 3.3 | 1.2 | 2.2 | 5.3 | 10.8 | ||
| 2 | 3.4 | 3.1 | 6.7 | 11.7 | 11.9 | 3.1 | 2.3 | 4.1 | 3.0 | 11.3 | |||
| 3 | 7.4 | 4.0 | 6.5 | 2.6 | 12.2 | 4.6 | 8.0 | 2.8 | 1.6 | 4.3 | |||
| Interlaboratory | 7.2 | 5.5 | 10.0 | 10.6 | 18.2 | 5.5 | 7.2 | 4.8 | 7.3 | 11.2 | |||
| S100A12.GHF | 1 | 8.3 | 2.4 | 16.4 | 1.1 | 8.3 | 3.3 | ||||||
| 2 | 5.3 | 2.5 | 14.5 | 4.8 | 11.1 | 3.5 | |||||||
| 3 | 4.3 | 3.6 | 3.8 | 10.7 | 7.4 | 2.6 | |||||||
| Interlaboratory | 11.9 | 10.2 | 19.8 | 10.2 | 10.9 | 13.0 | |||||||
| IL1RN.IDV | 1 | 3.2 | 5.9 | 21.5 | 20.1 | 4.6 | 9.1 | ||||||
| 2 | 7.5 | 6.5 | 34.5 | 35.5 | 16.7 | 6.5 | |||||||
| 3 | 24.0 | 10.9 | 40.5 | 52.3 | 32.0 | 10.9 | |||||||
| Interlaboratory | 19.2 | 8.1 | 35.9 | 35.8 | 19.2 | 8.1 | |||||||
| S100A7.GTN | 1 | 1.6 | 1.8 | 5.3 | 3.6 | 0.5 | 0.5 | ||||||
| 2 | 3.1 | 3.4 | 7.6 | 3.8 | 5.4 | 7.8 | |||||||
| 3 | 5.2 | 3.8 | 11.6 | 26.2 | 6.9 | 1.5 | |||||||
| Interlaboratory | 4.5 | 7.1 | 8.5 | 13.7 | 4.5 | 7.1 | |||||||
| S100A7.ENF | 1 | 2.0 | 1.4 | 7.5 | 2.1 | 0.4 | 1.3 | ||||||
| 2 | 4.2 | 2.6 | 11.5 | 2.7 | 5.1 | 2.0 | |||||||
| 3 | 3.7 | 4.2 | 4.9 | 10.5 | 7.6 | 3.8 | |||||||
| Interlaboratory | 10.7 | 4.3 | 14.6 | 7.4 | 11.5 | 3.2 | |||||||
| Intralaboratory median (all peptidese) | 3.4 | 3.1 | 7.0 | 8.0 | 7.6 | 4.6 | 14.0 | 10.6 | 4.6 | 2.8 | 11.0 | 7.4 | |
| Interlaboratory median (all peptidese) | 11.2 | 7.6 | 14.0 | 13.9 | 16.4 | 12.9 | 24.1 | 18.6 | 10.8 | 7.6 | 13.2 | 12.7 | |
a The median CV was calculated from the concentrations (0.23–167 fmol/μl) in the linear range of the assay (n = 7 independent concentrations).
b The CV was calculated at or near the linear range of the assay (0.23 fmol/μl).
c The CV was calculated at a concentration in the lower half of the linear range of the assay (2.1 fmol/μl).
d For S100A8.ALN, the CV was calculated at 70 ng/ml (6.7 fmol/μl).
e Median of all peptides (n = 8) in the study.
In phase 2, which evaluated all eight peptide antibodies by immuno-MRM, some assays, such as S100A7.GTN and S100A7.ENF, consistently performed well and had low intralaboratory CVs (<10%), whereas other assays, e.g. IL1RN.IDV were less reproducible. These results correlate to the antibody grade assignment (Table I) determined from the initial screening assays (17), which were based primarily on the limits of detection determined from a mini-curve prepared as 10-fold step dilutions from 500 to 0.5 fmol. Peptides S100A7.GTN and S100A7.ENF were detected by immuno-MRM at the lowest level, 0.5 fmol, and so the corresponding antibodies were given a grade of A. In contrast, the lowest level at which peptide IL1RN.IDV could be detected using antibody enrichment in plasma was 50 fmol, and this antibody was given a grade of D. Assay imprecision increased noticeably at the highest concentration in the phase 1 and 2 studies, 500 fmol/μl, because of variability in the ion counts between sites as LC column and MS detector saturation start to occur toward the end of the curves. Saturation of binding to the antibodies does not contribute significantly to this observed effect, because it was also observed in phase 1 (LC-MRM-MS only). As expected, assay imprecision increased dramatically below the LOQ because of the low ion counts from the analytes and the presence of a continuum of low intensity chemical noise (supplemental Fig. 3, bracketed area).
The interlaboratory CVs provide an estimate of the analytical variability of the measurement for a clinical specimen analyzed at multiple centers. The results for the detectable range of measurements for all phases are illustrated by box and whisker plots in Fig. 3. CVs were plotted for each peptide (by color) for each phase (by shading). Median CV (calculated using the mean of measurements made at all sites throughout the linear range; see “Experimental Procedures”) is represented by a horizontal black line in each box. The interlaboratory reproducibility across all labs and all four phases of the study was, in general, very good (Fig. 3 and Table III), but overall greater than the corresponding intralaboratory imprecision (Table III and supplemental Tables 2). The median interlaboratory CV for all peptides in the linear range of the assays (i.e. for protein concentrations of ∼1 ng/ml to 3 μg/ml) was less than 20%. Trypsin digestion of proteins spiked into plasma prior to peptide capture and analysis (study phases 3 and 4) significantly increased the median interlaboratory CVs relative to SISCAPA capture of spiked peptides (study phases 2) from ∼8% to ∼14% (Table III, bottom row). Increased variability because of enzymatic digestion was also observed in an interlaboratory study of SID-MRM-MS (9). As expected, imprecision of the measurements at peptide concentrations at or near the determined LOD (see Table II and “LOD and LOQ”) was higher at every phase of the study for all peptides studied (Table III). Most notably are the higher CVs observed for peptide IL1RN.IDV in phase 2 (Table III, column 2), with a range of intralaboratory CVs from 20.1 to 52.3% and an interlaboratory CVs of 35.8%, because this concentration, 0.23 fmol/μl or 5 ng/ml protein, is at or below the detectable limit for the peptide.
Fig. 3.
Interlaboratory imprecision of assays for each phase in the study. Box and whisker plots of interlaboratory CV determined from the mean of three replicates across three sites (n = 9) for concentrations ranging from 0.08 to 500 fmol/μl. CVs were plotted for each peptide (by color) for each phase (by shading: phase 1, lightest shading; phase 4, darkest shading). Median of all peptides for a study is shown by black horizontal line. The whiskers extend up to 1.5 times the interquartile range. Markers outside the whiskers are considered outliers (circled). The highest concentration, 500 fmol/μl, was not prepared for phases 3 or 4 because of limiting amounts of reagents.
LOD and LOQ
The LOD and LOQ of the SISCAPA assays were determined using a modification of methods proposed by Linnet (24) and Currie (25) that account for the variability of the blank sample and a sample with the analyte spiked in at a low concentration (10). LOD and LOQ for all peptides for all sites (supplemental Table 3) and median interlaboratory LOD and LOQ were calculated (Table II) to facilitate comparison of assay performance across sites. The interlaboratory LOD and LOQ obtained in study phase 2, where light and heavy peptide standards were spiked into digested plasma background and distributed for measurement at the sites and then immuno-enriched, ranged from 0.03 to 0.20 fmol/μl (LOD) and 0.08 to 0.59 fmol/μl (LOQ), which corresponds to protein concentrations of 0.3–2.1 ng/ml (LOD) and 0.9–6.3 ng/ml (LOQ). These values represent the maximum sensitivities expected to be able to be achieved in these assays using peptide immuno-enrichment alone. These values do not take into account recovery of the peptides from protein digestion, which occurs upstream from immuno-enrichment in the assay workflow (Fig. 1, panels 3 and 4). For the three peptides for which proteins were available, tryptic digestion at a single site (phase 3) generally resulted in only small increases in the LOD of the assays to a maximum of 1.6 ng/ml (Table II). Proteolytic digestion at each site (phase 4) did not further increase the LOD of the assays.
Improving Assay Accuracy and Precision Using Stable Isotope-labeled Protein Standards
We investigated whether the accuracy and precision of the assay could be improved through the use of fully 15N-labeled proteins as internal standards as suggested previously (29, 30). Standard addition experiments were prepared by adding equimolar amounts of unlabeled and 15N-labeled S100B protein at each concentration level into plasma. Response curves for SISCAPA captured peptides S100B.AMV and S100B.ELI from the digested proteins are shown in Fig. 4. As in previous experiments, heavy peptide standards were spiked prior to desalting the plasma digest. Transition ions were monitored for the specific light peptide and two isotopically labeled versions of the heavy peptide: (i) the synthetic heavy peptide containing one 13C-labeled amino acid and (ii) heavy peptide containing uniformly 15N-labeled amino acids. The amount of light protein present was determined using two different methods (Table IV and Fig. 4; and described under “Experimental Procedures”). First, the concentration of light S100B protein was calculated using the peak area ratio of light peptide (from digested light protein) to spiked heavy synthetic peptide (Fig. 4, red lines) as was performed in previous phases. In the second approach, the concentration of the light S100B protein was determined using the peak area ratio of light peptide to uniformly 15N-labeled heavy peptide produced by tryptic digestion of the respective heavy form of the protein. This latter calculation results in direct determination of protein concentration in nanograms (without conversion from moles) because protein concentration of the standard was known, and responses could be normalized in a site-specific manner because the ratio of spiked light to heavy protein was constant throughout the curve. Precision and accuracy were calculated to compare these two methods of determining protein concentration (Table IV). Interlaboratory assay precision improved 6%, from 83 to 89%, which was very similar to the precision observed in phases 1 and 2, where digestion of proteins was not performed. Interlaboratory accuracy improved substantially, from 44–54% when synthetic heavy peptide was used to 95–101% when the heavy protein was used as the internal standard. Because the heavy standard protein undergoes the same processing as the light target protein, the peptide recovery is the same for light and heavy peptide. Use of heavy proteins in this manner did not improve the LOD of the method because neither light nor heavy peptide was detected below 2.3 fmol/μl.
Fig. 4.
Improvement of assay precision and accuracy using uniformly 15N-labeled protein standard. Standard curve plots of the interlaboratory average observed concentration of S100B versus the expected concentration, using two unique peptides, AMVALIDVFHQYSGR (A) and ELINNELSHFLEEIK (B). Protein concentration determined using 15N peptide (blue line) was closer to the theoretical line (dashed) than when 13C heavy peptide (red line) was used. The error bars show the standard deviations.
Table IV. Precision and accuracy of immuno-MRM assays using stable isotope peptide and protein standards.
Precision is reported as 1 − median % CV calculated from the mean of process replicates, i.e. single process-single injection within sites (intralaboratory, n = 3) and across sites (interlaboratory, n = 9). Accuracy was as 1 + median % RE calculated from the means of the process replicates in the linear range as was performed on CV to calculate precision. Both values were calculated in the linear assay range using all the measurements determined above the LOD (0.23–167 fmol/μl). The peptide standard refers to the 13C-labeled synthetic peptide; protein standard refers to the uniformly 15N-labeled protein.
| Protein | Peptide | Site | Precision (%)a |
Accuracy (%)b |
||
|---|---|---|---|---|---|---|
| Peptide standard | Protein standard | Peptide standard | Protein standard | |||
| S100B | AMVALIDVFHQYSGR | 1 | 96.0 | 96.1 | 43.3 | 100.8 |
| 2 | 90.0 | 84.4 | 45.6 | 105.2 | ||
| 3 | 90.7 | 94.9 | 43.9 | 103.0 | ||
| Interlaboratory | 86.1 | 88.9 | 44.3 | 101.0 | ||
| S100B | ELINNELSHFLEEIK | 1 | 94.4 | 96.5 | 53.5 | 100.1 |
| 2 | 82.1 | 91.0 | 52.0 | 97.0 | ||
| 3 | 92.0 | 93.4 | 63.0 | 99.6 | ||
| Interlaboratory | 81.4 | 90.0 | 54.8 | 95.5 | ||
a Calculated from median % CV across the linear range.
b Calculated from % RE across the linear range.
DISCUSSION
The results of the first multi-site evaluation of SISCAPA coupled to MRM-MS presented here demonstrate that the technology, including the semi-automated method we developed, is transferable between laboratories and has sufficient precision for use in biomarker verification studies performed across multiple sites to quantify proteins in plasma with multiplexed assays. Under conditions simulating a real interlaboratory biomarker verification study, the overall interlaboratory % CV, (including protein digestion, desalting, peptide antibody enrichment, and scheduled LC-MRM-MS analysis) was below 25% at or near the LOQ and below 20% at or near the midpoint of the linear range for each analyte peptide (Table III). Limits of detection were at or below 1 ng/ml for the assayed proteins using just 10–30 μl of plasma (Table II). The multi-staged design of the study enabled us to obtain useful insights into the sources of variation. Importantly, we have shown that the addition of the semi-automated, anti-peptide immunoaffinity capture (SISCAPA) to LC-MRM-MS analysis process does not significantly increase either intra- or interlaboratory variability relative to MRM-MS (9). Our results demonstrate that multiplexed immuno-MRM-MS (where both immunoprecipitation and MRM-MS of target peptides are done as mixtures) is a sensitive and precise technology that can be widely implemented for assaying proteins in matrices as complex as plasma.
A number of prior studies have evaluated the intra- and/or interlaboratory performance of MRM, fMRM, and immuno-MRM assays with respect to precision, linearity, and sensitivity (9, 31). In the first published assessment of the interlaboratory performance of SID-MRM-MS for measuring proteins in digested plasma, Addona et al. (9) demonstrated detection limits of ∼1 μg/ml without prior enrichment of proteins or peptides and showed that the assays were reproducible across laboratories with CVs of ∼25%. Keshishian et al. (10) showed that combining immuno-affinity depletion of proteins and limited chromatographic fractionation of peptides improves the LOD of SID-MRM-MS assays ∼1000-fold (∼1 ng/ml) with intralaboratory CVs of 10–25% when starting with 100 μl of plasma. Here we have shown that the interlaboratory performance of multiplexed immuno-MRM-MS assays are consistent with those observed in prior studies conducted in single laboratories (15, 16, 18), as well as by MRM-MS and fMRM-MS. Importantly, by enriching for target peptides using anti-peptide antibodies prior to SID-MRM-MS, LOD improved ∼1000-fold without increasing the interlaboratory imprecision relative to these other approaches. These findings support the value of generating antibodies for large scale biomarker verification studies to verify the increasing number of analytes in larger cohorts of patient samples.
Many factors contribute to the LOD and LOQ of immuno-MRM assays, including antibody capture efficiency, peptide ionization efficiency, and peptide recovery from protein digestion. In addition, more subtle effects can increase the LOD. These antibodies most likely recognize a linear epitope because tryptic peptides were used as immunogens. However, other peptides with sequences different from the analyte may contain part of the linear or conformational epitope (32) that could potentially be recognized by the antibody and simultaneously enriched. Furthermore, if the recognized epitope is contained in a peptide from an abundant protein, the antibody may become saturated, thereby decreasing the amount of analyte peptide captured. It is important to note that in this case only assay sensitivity is affected, not assay selectivity, because any captured peptide that is not the true analyte will be filtered out in the triple quadrupole mass spectrometer at either the precursor or product mass levels and will likely be separated by LC retention time as well. In contrast, auto-antibodies and heterophilic anti-reagent antibodies are known to produce false positive and false negative results in sandwich immunoassays (33–35). These types of interferences are not observed in immuno-MRM-MS because the auto- and heterophilic antibodies are inactivated by reduction, alkylation, and enzymatic digestion during sample preparation for SISCAPA (36–38).
The present study focuses on interlaboratory evaluation of assay performance because large scale biomarker verification studies will need to be performed across laboratories in hundreds or even thousands of patient samples to accommodate the workload. In addition, it is imperative to understand the limits of analytical variation of the assay before assessing the variability of the patient population. The percentages of change in the determined peptide/protein concentrations were consistent and linear from site to site. For example, the fold change between each concentration in the linear range was 3.01 ± 0.16 for CSF3 (phase 2) for all sites where the curves were prepared using a 3-fold serial dilution. Our results indicate that the magnitude of differential regulation of proteins observed in patients samples assayed at one site by immuno-MRM-MS would be the highly similar if not identical to the values that other sites would determine analyzing the same samples. Thus, immuno-MRM-MS has sensitivity comparable with fMRM-MS and like MRM-MS is transferable across laboratories.
In general, the interlaboratory variation of immuno-MRM, including all sample processing steps (phases 3 and 4), was roughly twice the corresponding intralaboratory CVs across the linear range of the assays (Table III, column 1, bottom), averaging 14% compared with 7%. Interlaboratory variation in the linear range of the assays was also observed to be higher than intralaboratory variation in a previous MRM-MS study (9). With very tightly controlled intralaboratory CVs, the apparent larger interlaboratory CVs are attributable to differences in the mean concentrations determined at each site. We hypothesize that, despite using exactly identical samples, this small but consistent difference in measurement is due to hitherto unforeseen differences in the MS or LC, as well as other small variations within the analysis path. We suspect the latter, because (i) the changes are, for the most part, rather small and (ii) the variation is not consistent across sites, peptides, and phases of the study. Such differences between sites could arise through the accumulated effect of small contributions from factors such as variations in chromatographic quality (i.e. variation in peak tailing, peak width, etc.), instrument calibration affecting precursor, and product ion isolation window locations and width, as well as variations in how software (or analyst) integrates the data. Such variations are not unique to MS-based assays. Immunoassays can have large interlaboratory variations that are typically compensated for by using an appropriate common calibration standard (39, 40). Similar approaches would be applicable to improve the accuracy of MRM-MS and SISCAPA assays. Here, in the 15N-labeled protein standard experiment (Table IV and see “Experimental Procedures”), we used the observed peak area ratio at each site to normalize the response for that site. Once normalization was applied, the accuracy of each site improved, as did the interlaboratory accuracy, and this led to slight increases in overall precision as well. This technique could also be applied to assays where an internal heavy protein standard is not available. If response curves similar to phase 2 samples are simultaneously prepared and run with true patient samples (mimicked by phase 4 samples), the response observed at each site, i.e. the slope (as shown in supplemental Table 4), could be used to calculate a site-specific calibration factor for that particular day and set of runs in a peptide-specific manner.
As was demonstrated in the MRM-MS interlaboratory study (9), trypsin digestion and its requisite sample handling contributed the most to SISCAPA assay variability and reduced the recovery of target peptides from digested proteins. Here we have shown that using a stable isotope-labeled (heavy) recombinant protein as an internal standard instead of stable isotope-labeled (heavy) peptides approximately doubled the accuracy of the assay for the protein studied, from 50 to 98% on average for two peptides for S100B. This improvement is due to similar (if not identical) efficiency of proteolytic release and recovery of the target peptides from the heavy-labeled protein and the native analyte protein. These results are consistent with previous findings from a number of laboratories (29, 30) demonstrating that intact albumin could be quantified by LC-MS in patient urine using a corresponding 15N-labeled protein standard. We also found that interlaboratory assay precision improved slightly from 83 to 89%, similar to the levels observed in the absence of digestion. Provided that the protein standard accurately recapitulates the behavior of its endogenous counterpart (which may not always be true because of differences in post-translational modifications or unanticipated effects of the tag used, for example), the presence of the heavy protein at time of digestion takes into account small differences in digestion efficiency and peptide recovery in the process replicates, thereby improving the precision of the measurements in addition to the accuracy. Further improvement in the precision and accuracy of the method is expected by using liquid handling robotics for automated reduction, alkylation, and proteolytic digestion.
Although the immuno-MRM-MS and fMRM approaches have similar LOD, LOQ, and quantitative precision, these technologies differ in important practical ways relative to the ability to conduct large scale patient studies requiring the analysis of hundreds of patient samples. The practical upper multiplex level of MRM and fMRM is in excess of 100 and is only limited by the scan speed/sensitivity and scheduling capabilities of the specific triple quadrupole LC-MS/MS platform employed. In contrast, multiplex levels of higher than 50 have yet to be demonstrated for immuno-MRM-MS. The principle drawback of SISCAPA is the considerable added time (∼6 months) and upfront reagent costs of approximately $4000/anti-peptide antibody to produce the anti-peptide antibody reagents. Cost and time are significant factors for the fMRM-MS method as well. The requisite first step of fMRM is immunoaffinity depletion using commercial columns. These columns cost in the vicinity of $10,000 and have limited lifespans. Their use adds approximately $100/sample. Most importantly, throughput of fMRM-MS is 6–8-fold lower than for immuno-MRM-MS analyses because of the need to analyze each SCX or high pH RP peptide fraction by LC-MRM-MS, and even at the highest plex levels, the throughput difference is still 3–4-fold lower than immuno-MRM. Another factor in favor of the immuno-MRM-MS approach is that it requires only 10–30% of the starting input plasma required for fMRM-MS to achieve comparable ng/ml sensitivity (10–30 μl for immuno-MRM versus 100–200 μl for fMRM). The time and total cost (instrument use and reagents) for a hypothetical verification study assaying levels of 100 distinct analyte peptides (using 100 heavy peptides analogs) in 100 patient samples is roughly the same for immuno-MRM and fMRM. Although fMRM takes considerably more MS instrument time for sample analysis, this is balanced by the up-front time (∼6 months) and cost of anti-peptide antibody reagents (supplemental Table 5). However, with more than ∼100 patient samples, the added cost of instrument and personnel time for fMRM-MS analysis of a 100-plex analyte assay outweighs the initial cost and delivery time of antibody reagents. For these reasons, we consider fMRM-MS to be most useful as the method for defining which assays to move into immuno-MRM assay development for large clinical sample studies and in situations where the number of samples is relatively small, such as cell-based biology studies.
Based on the results presented here, we are optimistic that immuno-MRM-MS reagents and the overall assay technology can be widely deployed for assaying proteins in plasma and tissue as MRM-MS technology has already been shown to be. We believe that immuno-MRM technology has sufficient intra- and interlaboratory assay precision and sensitivity for use today in large scale verification studies of biomarker candidate proteins in patient biofluid samples in any disease of interest, as well as in quantitative biology studies in tissues and cells.
Acknowledgments
We thank Amina Aziz, Lynda Dieckman, and Lee Makowski (Argonne National Laboratory) for providing the unlabeled and 15N-labeled S100B protein standards. We also thank Xiuwen Liu and Guo-Liang Yu (Epitomics) for in-depth knowledge of handling and use of rabbit polyclonal antibodies. We thank Susan Abbatiello and Hasmik Keshishian for careful review and insightful comments on manuscript.
Footnotes
* This work was supported by National Institutes of Health Grant 1U24 CA126476-02 and 1U24CA160034 (to S. A. C., N. L. A., and A. G. P.) from the NCI as part of the NCI Clinical Proteomic Technologies Initiative, National Institutes of Health Grant R01HL096738-02 and HHSN268201000033C (to S. A. C.) from the National Heart, Lung and Blood Institute, and a platform grant from Genome Canada and Genome British Columbia. This work was also supported in part by funds from the Women's Cancer Research Fund of the Entertainment Industry Foundation (to S. A. C. and A. G. P.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
This article contains supplemental material.
1 The abbreviations used are:
- SID-MRM-MS
- stable isotope dilution multiple reaction monitoring mass spectrometry
- CV
- coefficient of variance
- MRM
- multiple reaction monitoring
- fMRM
- (chromatographic) fractionation MRM-MS
- immuno-MRM-MS
- immunoprecipitation-MRM-mass spectrometry
- LOD
- limit of detection
- LOQ
- limit of quantification
- PAR
- peak area ratio
- SISCAPA
- stable isotope standards and capture with anti-peptide antibodies
- RE
- relative error
- AA
- amino acid(s).
REFERENCES
- 1. Rifai N., Gillette M. A., Carr S. A. (2006) Protein biomarker discovery and validation: The long and uncertain path to clinical utility. Nat. Biotechnol. 24, 971–983 [DOI] [PubMed] [Google Scholar]
- 2. Carr S. A., Anderson L. (2008) Protein quantitation through targeted mass spectrometry: The way out of biomarker purgatory? Clin. Chem. 54, 1749–1752 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Paulovich A. G., Whiteaker J. R., Hoofnagle A. N., Wang P. (2008) The interface between biomarker discovery and clinical validation: The tar pit of the protein biomarker pipeline. Proteomics Clin. Appl. 2, 1386–1402 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Whiteaker J. R., Lin C., Kennedy J., Hou L., Trute M., Sokal I., Yan P., Schoenherr R. M., Zhao L., Voytovich U. J., Kelly-Spratt K. S., Krasnoselsky A., Gafken P. R., Hogan J. M., Jones L. A., Wang P., Amon L., Chodosh L. A., Nelson P. S., McIntosh M. W., Kemp C. J., Paulovich A. G. (2011) A targeted proteomics-based pipeline for verification of biomarkers in plasma. Nat. Biotechnol. 29, 625–634 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Addona T. A., Shi X., Keshishian H., Mani D. R., Burgess M., Gillette M. A., Clauser K. R., Shen D., Lewis G. D., Farrell L. A., Fifer M. A., Sabatine M. S., Gerszten R. E., Carr S. A. (2011) A pipeline that integrates the discovery and verification of plasma protein biomarkers reveals candidate markers for cardiovascular disease. Nat. Biotechnol. 29, 635–643 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Anderson N. L., Anderson N. G. (2002) The human plasma proteome: History, character, and diagnostic prospects. Mol. Cell. Proteomics 1, 845–867 [DOI] [PubMed] [Google Scholar]
- 7. Anderson L., Hunter C. L. (2006) Quantitative mass spectrometric multiple reaction monitoring assays for major plasma proteins. Mol. Cell. Proteomics 5, 573–588 [DOI] [PubMed] [Google Scholar]
- 8. Keshishian H., Addona T., Burgess M., Kuhn E., Carr S. A. (2007) Quantitative, multiplexed assays for low abundance proteins in plasma by targeted mass spectrometry and stable isotope dilution. Mol. Cell. Proteomics 6, 2212–2229 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Addona T. A., Abbatiello S. E., Schilling B., Skates S. J., Mani D. R., Bunk D. M., Spiegelman C. H., Zimmerman L. J., Ham A. J., Keshishian H., Hall S. C., Allen S., Blackman R. K., Borchers C. H., Buck C., Cardasis H. L., Cusack M. P., Dodder N. G., Gibson B. W., Held J. M., Hiltke T., Jackson A., Johansen E. B., Kinsinger C. R., Li J., Mesri M., Neubert T. A., Niles R. K., Pulsipher T. C., Ransohoff D., Rodriguez H., Rudnick P. A., Smith D., Tabb D. L., Tegeler T. J., Variyath A. M., Vega-Montoto L. J., Wahlander A., Waldemarson S., Wang M., Whiteaker J. R., Zhao L., Anderson N. L., Fisher S. J., Liebler D. C., Paulovich A. G., Regnier F. E., Tempst P., Carr S. A. (2009) Multi-site assessment of the precision and reproducibility of multiple reaction monitoring-based measurements of proteins in plasma. Nat. Biotechnol. 27, 633–641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Keshishian H., Addona T., Burgess M., Mani D. R., Shi X., Kuhn E., Sabatine M. S., Gerszten R. E., Carr S. A. (2009) Quantification of cardiovascular biomarkers in patient plasma by targeted mass spectrometry and stable isotope dilution. Mol. Cell. Proteomics 8, 2339–2349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Zhang H., Li X. J., Martin D. B., Aebersold R. (2003) Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry. Nat. Biotechnol. 21, 660–666 [DOI] [PubMed] [Google Scholar]
- 12. Zhou Y., Aebersold R., Zhang H. (2007) Isolation of N-linked glycopeptides from plasma. Anal. Chem. 79, 5826–5837 [DOI] [PubMed] [Google Scholar]
- 13. Anderson N. L., Anderson N. G., Haines L. R., Hardie D. B., Olafson R. W., Pearson T. W. (2004) Mass spectrometric quantitation of peptides and proteins using stable isotope standards and capture by anti-peptide antibodies (SISCAPA). J. Proteome Res. 3, 235–244 [DOI] [PubMed] [Google Scholar]
- 14. Whiteaker J. R., Zhao L., Zhang H. Y., Feng L. C., Piening B. D., Anderson L., Paulovich A. G. (2007) Antibody-based enrichment of peptides on magnetic beads for mass-spectrometry-based quantification of serum biomarkers. Anal. Biochem. 362, 44–54 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Whiteaker J. R., Zhao L., Anderson L., Paulovich A. G. (2010) An automated and multiplexed method for high throughput peptide immunoaffinity enrichment and multiple reaction monitoring mass spectrometry-based quantification of protein biomarkers. Mol. Cell. Proteomics 9, 184–196 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Kuhn E., Addona T., Keshishian H., Burgess M., Mani D. R., Lee R. T., Sabatine M. S., Gerszten R. E., Carr S. A. (2009) Developing multiplexed assays for troponin I and interleukin-33 in plasma by peptide immunoaffinity enrichment and targeted mass spectrometry. Clin. Chem. 55, 1108–1117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Whiteaker J. R., Zhao L., Abbatiello S. E., Burgess M., Kuhn E., Lin C. W., Pope M. E., Razavi M., Anderson N. L., Pearson T. W., Carr S. A., Paulovich A. G. (2011) Evaluation of large scale quantitative proteomic assay development using peptide affinity-based mass spectrometry. Mol. Cell. Proteomics 10.1074/mcp.M110.005645 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Hoofnagle A. N., Becker J. O., Wener M. H., Heinecke J. W. (2008) Quantification of thyroglobulin, a low-abundance serum protein, by immunoaffinity peptide enrichment and tandem mass spectrometry. Clin. Chem. 54, 1796–1804 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Ocaña M. F., Neubert H. (2010) An immunoaffinity liquid chromatography-tandem mass spectrometry assay for the quantitation of matrix metalloproteinase 9 in mouse serum. Anal. Biochem. 399, 202–210 [DOI] [PubMed] [Google Scholar]
- 20. Neubert H., Gale J., Muirhead D. (2010) Online high-flow peptide immunoaffinity enrichment and nanoflow LC-MS/MS: Assay development for total salivary pepsin/pepsinogen. Clin. Chem. 56, 1413–1423 [DOI] [PubMed] [Google Scholar]
- 21. Berna M. J., Zhen Y., Watson D. E., Hale J. E., Ackermann B. L. (2007) Strategic use of immunoprecipitation and LC/MS/MS for trace-level protein quantification: Myosin light chain 1, a biomarker of cardiac necrosis. Anal. Chem. 79, 4199–4205 [DOI] [PubMed] [Google Scholar]
- 22. Whiteaker J. R., Zhang H., Zhao L., Wang P., Kelly-Spratt K. S., Ivey R. G., Piening B. D., Feng L. C., Kasarda E., Gurley K. E., Eng J. K., Chodosh L. A., Kemp C. J., McIntosh M. W., Paulovich A. G. (2007) Integrated pipeline for mass spectrometry-based discovery and confirmation of biomarkers demonstrated in a mouse model of breast cancer. J. Proteome Res. 6, 3962–3975 [DOI] [PubMed] [Google Scholar]
- 23. Pope M. E., Soste M. V., Eyford B. A., Anderson N. L., Pearson T. W. (2009) Anti-peptide antibody screening: Selection of high affinity monoclonal reagents by a refined surface plasmon resonance technique. J. Immunol. Methods 341, 86–96 [DOI] [PubMed] [Google Scholar]
- 24. Linnet K. (1993) Evaluation of regression procedures for methods comparison studies. Clin. Chem. 39, 424–432 [PubMed] [Google Scholar]
- 25. Currie L. A. (1968) Limits for qualitative detection and quantitative determination. Anal. Chem. 40, 586–593 [Google Scholar]
- 26. Yergey A. L. (1988) The presentation of calibration curves and quantitative data. Biomed. Environ. Mass Spectrom. 15, 465. [DOI] [PubMed] [Google Scholar]
- 27. Zorn M. E., Gibbons R. D., Sonzogni W. C. (1997) Weighted least-squares approach to calculating limits of detection and quantification by modeling variability as a function of concentration. Anal. Chem. 69, 3069–3075 [DOI] [PubMed] [Google Scholar]
- 28. R Development Core Team (2011). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria: ISBN 3–900051-07–0, URL http://www.R-project.org/ [Google Scholar]
- 29. Brun V., Dupuis A., Adrait A., Marcellin M., Thomas D., Court M., Vandenesch F., Garin J. (2007) Isotope-labeled protein standards. Mol. Cell. Proteomics 6, 2139–2149 [DOI] [PubMed] [Google Scholar]
- 30. Singh R., Crow F. W., Babic N., Lutz W. H., Lieske J. C., Larson T. S., Kumar R. (2007) A liquid chromatography-mass spectrometry method for the quantification of urinary albumin using a novel N-15-isotopically labeled albumin internal standard. Clin. Chem. 53, 540–542 [DOI] [PubMed] [Google Scholar]
- 31. Hoofnagle A. N. (2010) Quantitative clinical proteomics by liquid chromatography-tandem mass spectrometry: Assessing the platform. Clin. Chem. 56, 161–164 [DOI] [PubMed] [Google Scholar]
- 32. Kuby J. (1997) Immunology, 3rd Ed., pp. 85–108, W. H. Freeman, New York [Google Scholar]
- 33. Hoofnagle A. N., Wener M. H. (2009) The fundamental flaws of immunoassays and potential solutions using tandem mass spectrometry. J. Immunol. Methods 347, 3–11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Dale G. L., Gaddy P., Pikul F. J. (1994) Antibodies against biotinylated proteins are present in normal human serum. J. Lab. Clin. Med. 123, 365–371 [PubMed] [Google Scholar]
- 35. Hennig C., Rink L., Fagin U., Jabs W. J., Kirchner H. (2000) The influence of naturally occurring heterophilic anti-immunoglobulin antibodies on direct measurement of serum proteins using sandwich ELISAs. J. Immunol. Methods 235, 71–80 [DOI] [PubMed] [Google Scholar]
- 36. Kricka L. J. (1999) Human anti-animal antibody interferences in immunological assays. Clin. Chem. 45, 942–956 [PubMed] [Google Scholar]
- 37. Rotmensch S., Cole L. A. (2000) False diagnosis and needless therapy of presumed malignant disease in women with false-positive human chorionic gonadotropin concentrations Lancet 356, 712–715 [DOI] [PubMed] [Google Scholar]
- 38. Willman J. H., Martins T. B., Jaskowski T. D., Hill H. R., Litwin C. M. (1999) Heterophile antibodies to bovine and caprine proteins causing false-positive human immunodeficiency virus type 1 and other enzyme-linked immunosorbent assay results. Clin. Diagn. Lab. Immun. 6, 615–616 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Findlay J. W., Smith W. C., Lee J. W., Nordblom G. D., DeSilva B. S., Khan M. N., Bowsher R. R., Das I. (2000) Validation of immunoassays for bioanalysis: A pharmaceutical industry perspective. J. Pharmaceut. Biomed. 21, 1249–1273 [DOI] [PubMed] [Google Scholar]
- 40. Mire-Sluis A. R., Barrett Y. C., Devanarayan V., Koren E., Liu H., Maia M., Parish T., Scott G., Shankar G., Shores E., Swanson S. J., Taniguchi G., Wierda D., Zuckerman L. A. (2004) Recommendations for the design and optimization of immunoassays used in the detection of host antibodies against biotechnology products. J. Immunol. Methods 289, 1–16 [DOI] [PubMed] [Google Scholar]




