Abstract
In mass spectrometry, reliable quantification requires correction for variations in ionization efficiency between samples. The preferred method is the addition of a stable isotope labeled internal standard (SIL-IS). In targeted metabolomics, a dedicated SIL-IS for each metabolite of interest may not always be realized due to high cost or limited availability. We recently completed the analysis of more than 70 biomarkers, each with a matching SIL-IS, across 4 mass spectrometry-based platforms (one GC-MS/MS and 3 LC-MS/MS). Using data from calibrator and quality control samples added to sixty 96-well trays (analytical runs), we calculated analytical precision (CV) retrospectively. The use of integrated peak area for all metabolites and internal standards allowed us to calculate precision for all matching analyte (A) SIL-IS (IS) pairs as well as for all non-matching A/IS pairs within each platform (total n=1442). The median between-run precision for matching A/IS across the 4 platforms was 2.7 - 5.9 %. The median CV for non-matching A/IS (corresponding to pairing analytes with a non-SIL-IS) was 2.9 - 10.7 percentage points higher. Across all platforms, CVs for non-matching A/IS increased with increasing difference in retention time (Spearman´s rho of 0.17 to 0.93). The CV-difference for non-matching vs. matching A/IS was often, but not always, smaller when analyte and internal standard were close structural analogs.
Introduction
The field of metabolomics has seen great expansion in recent years in step with a continual increase in performance of new analytical platforms. Much of the recent development has involved mass spectrometry methods with multiple-stage (e.g. tandem MS) detectors for high selectivity combined with increased ionization efficiencies and improved signal/noise ratios enabling the detection of low-concentration endogenous metabolites. Metabolomics studies can be divided into targeted and non-targeted analysis with the latter involving detection of thousands of signals, essentially non-quantitative and with largely deferred identification (1), although during the last few years great strides have been made towards developing semi-targeted methods with extensive analyte coverage and good quantitative performance (2–4). In contrast, targeted analysis involves the quantitation of a limited set of pre-selected analytes (5).
A recognized limitation of mass spectrometry-based quantitative analysis is the requirement for external or internal calibration to correct for variations in ionization efficiency. Of particular concern is ion suppression/enhancement that is known to vary across samples and with retention time. Currently, the preferred method to correct for ion suppression involves the addition of a stable-isotope labeled internal standard (SIL-IS) where retention times can be expected to be close to, or identical to that of the analyte (5, 6). In typical metabolomics settings, analytical runs will involve dozens or even hundreds of analytes. In such cases, the inclusion of a matching SIL-IS for every analyte can be challenging in terms of cost and availability. An alternative is to analyze several analytes against a single or a limited number of internal standards based on structural and/or retention time similarity (7), if so, steps should be taken to ensure that quantification with the selected internal standards meet some minimum assay acceptance criteria. The analytical performance of different choices of internal standards have been compared in a number of studies (8–10), but few have undertaken a systematic comparison covering a large range of metabolites in terms of chemical and physical characteristics.
We recently analyzed more than 70 biomarkers across four MS/MS based analytical platforms as part of a large cohort study. Each biomarker was quantified using a matching (structurally identical) SIL-IS. The samples were analyzed in 96-well plates with each plate including a fixed number of calibration and quality control samples. Using data for these samples we were able to calculate within- and between-run precision retrospectively. We used the integrated peak areas directly to investigate all possible analyte/internal standard (A/IS) pairings within a given analytical platform. Analytes paired with its own specific SIL-IS are referred to as matching A/IS and all other combinations are non-matching A/IS. The primary objective of the study was to quantify the difference in analytical precision between non-matching and matching A/IS. A secondary objective was to assess or quantify the impact of structural similarity, and similarity of retention time on analytical precision for nonmatching A/IS.
Experimental Section
Source material
During the period October 2019 to April 2020 we analyzed 4980 plasma samples as part of the project «Nutritional Intervention Preconception and During Pregnancy to Maintain Healthy Glucose Metabolism and Offspring Health» (NiPPeR) (11). The samples were distributed into sixty 96-well plates each containing 83 project samples, 6 calibrator samples, and 6 quality controls. An analytical run is defined as the analysis of one 96-well plate. The calibrator was pooled EDTA plasma from healthy individuals purchased from Innovative research, Inc (www.innov-research.com). Quality control 1 (3 samples in each run) was pooled serum from Innovative research, Inc that were spiked for most analytes to serve as a high concentration control, and quality control 2 (3 samples in each run) was serum pooled from healthy personnel (n=23) at the Bevital laboratory and surrounding research units in Bergen, Norway. For precision calculations, the data from calibrator and quality controls were treated identically, hence, for the purpose of this study the three sample types (calibrator, and quality controls 1 and 2) were labeled source material 1 to 3 (S1, S2, and S3).
Sample processing, chromatography, and tandem mass-spectrometry
The analytical platforms are labeled GC, LC1, LC2, and LC3. All sample processing was performed on Hamilton robotic workstations (Bonaduz, Switzerland) equipped with disposable tips and liquid detection. An overview of each platform is shown in Table 1. Briefly, the first step consisted of adding a protein precipitation agent (ethanol or trichloroacetic acid) containing internal standards (for GC this step was preceded by addition of a reducing agent to liberate thiol compounds) followed by centrifugation. Further processing for platforms GC and LC3 included liquid-liquid extraction to separate water- and fat-soluble compounds. The aqueous phase was subject to chemical derivatization before analysis on GC, and the organic phase was reconstituted in methanol prior to analysis on LC3. For platforms LC1 and LC2 the aqueous phase was applied directly to the LC-MS/MS system.
Table 1. Main characteristics of analytical platformsa .
Platform | GC | LC1 | LC2 | LC3 |
---|---|---|---|---|
Type of analysis | GC-MS/MS | LC-MS/MS | LC-MS/MS | LC-MS/MS |
Metabolites analyzed | Amino- and carboxylic acids | Choline derivatives, charged or methylated amino acids | Kynurenine pathway metabolites and B-vitamins | Lipid-soluble vitamins |
Pretreatment | dithioerythriol | |||
Protein precipitation | ethanol | trichloroacetic acid | trichloroacetic acid | ethanol |
Liquid liquid extraction | Isooctane/chloroformb | Isooctane/chloroformb | ||
Derivatization | methylchloroformate | |||
Column | CP Sil 24 CB | 150x4.6 mm, 3 um, phenyl | 150x4.6 mm 3.5 um, C8 | 50x4.6 mm, 2.7 um, C18 |
Mobile phase | helium | acetic acid:methanol | acetic acid:HFBA:acetonitrile | methanol:NH4-formate |
Elution | temperature step gradient | isocratic | step gradient | isocratic |
MS, ion source | ESI, positive mode | ESI, positive mode | ESI, positive mode | APCI, positive mode |
Analytes (n) | 25 | 16 | 24 | 6 |
A/IS combinations (n)c | 625 | 256 | 576 | 36 |
Range of concentrations (μM)d | 0.15 - 500 | 0.5 - 300 | < 0.01 - 70 | < 0.01 - 5 |
Analyte peak area (log10) median (range) | 4.5 (2.8, 6.7) | 5.0 (3.7, 6.5) | 5.8 (4.2, 6.9) | 4.5 (3.1, 6.1) |
IS peak area (log10) median (range) | 4.6 (2.7, 6.6) | 5.8 (4.3, 6.8) | 5.5 (4.1 - 6.5) | 4.9 (3.5 - 5.9) |
RT (min), median (range) | 3.7 (2.5, 7.5) | 2.2 (2.0, 3.5) | 3.6 (2.1, 4.8) | 1.7 (1.3, 3.1) |
Ranges reported are across n analytes within each platform.
Platforms GC and LC3 shared sample processing up to and including this point.
Number of A/IS combinations investigated (all possible)
Range of typically observed (median) concentrations across n analytes. Abbreviations: ESI, electrospray ionization. APCI, atmospheric pressure chemical ionization. A, analyte. IS, internal standard. RT, retention time. HFBA heptafluorobutyric acid.
Further details for GC and LC3 (12), LC1 (13) and LC2 (14) were published previously. Most SIL-IS were labeled with deuterium (2-10 substitutions) except for histidine and homoarginine (LC1), that were labeled with 15N, and neopterin and thiamine (LC2) that were labeled with 15N and 13C, respectively (Tables S1–S4).
Statistical methods
Coefficients of variation (CV %) were calculated, assuming lognormal distributions, as 100*(eSD(X)-1) (eq 1) (15) where X was either log(A), log(IS) or log(A/IS). Different concentrations across source material S1, S2 and S3 were normalized by appropriate correction in regression models. Specifically: normalization of signal (either peak areas or ratios) across source material and run was obtained by employing a multilevel hierarchical linear regression model with source material as fixed effect and run as random effect and varying intercepts. The residuals from this regression were input as X in (eq 1). The same regression model, without correction for source material, was used to obtain CVs for S1, S2, and S3, separately. For some analyses that either assessed or demonstrated the association of CV with peak area we used data from source material 1 only, as specified. As part of constant quality control monitoring during analysis a calibrator sample was removed if it fell outside tolerance values established for each analytical run. When such measures were deemed inadequate the whole run was subject to repeat measurement. In our retrospective analysis of raw data we aimed at analysing only primary runs and we used all quality control samples for our main CV calculations. To mimic the data cleaning performed during analysis of project samples we removed primary runs that displayed unusually high within-run CVs (> 3.5 * median within-run CV). By this criterion, 14 runs across 71 analytes (0.3 % of the data), closely matching the number and identity of the runs that were reanalyzed during the course of the project, were removed before calculation of CVs. The associations of between-run CV with retention time, peak area, and absolute difference in peak area (matching A/IS) and the association of CVs with difference in retention time between analyte and all nonmatching IS were assessed by Spearman’s correlation. R version 4.0.3 was used for all statistical calculations with package “lme4” for multilevel, hierarchical, regression.
Results And Discussion
Main characteristics of analytical platforms
The number of analytes included on platforms GC, LC1, LC2 and LC3 was 25, 16, 24 and 6, respectively. The number of A/IS pairings investigated was the square of these numbers (Table 1). The metabolites analyzed on platforms GC and LC1 were mostly of intermediate to high concentrations (> 1μM), whereas metabolites measured on platforms LC2 and LC3 largely covered the nanomolar range. Within each platform the range of typically observed concentrations of metabolites exceeded 500 fold (Table 1). The concentration of each internal standard is given in supporting information tables S1–S4.
CV calculations based on uncorrected peak areas
We calculated within- and between-run CVs for all analytes (A) and internal standards (IS) separately based on peak area and compared the results to the corresponding matching A/IS. Some details of these calculations are shown in Figure 1 (using data from source material 1). Notably, the detector response, measured as peak areas, was either stable for many consecutive runs or demonstrated a gradual decline or sudden shifts over time. Explanations for such phenomena have been discussed in detail previously (16). Furthermore, while the variation around mean peak area was fairly constant across consecutive runs there were also instances of large variation for some runs. In Figure 1, this applies to runs 45-48 for the two analytes 3-methylhistidine and choline. Notably, because changes in peak areas tended to occur in parallell for analyte and matching IS, such variation were largely eliminated for the ratio A/IS, as demonstrated for the two analytes in Figure 1. Within- and between-run CVs for all analytes across the 4 analytical platforms are provided in supporting information Table S1 - S4, and between-run CVs based on source material 1 are presented in Figure 2. A notable finding for all platforms was similar CVs for analytes and their matching IS (Tables S1- S4 and Figure 2). Significantly, within-run CVs for uncorrected peak areas varied across a large range from < 1% to > 100% for the 60 runs (Tables S1- S4). This demonstrated that the factors that affected the variability in peak area for analyte or IS were specific to each compound, a finding that, by itself, underscores the importance of using (as far as possible) structurally identical internal standards. Tables S1 through S8 and Figure 2 also demonstrates that CVs on platforms GC, LC1 and LC3 were always lower for matching A/IS than for uncorrected peak areas. A few exceptions were found on platform LC2 were matching A/IS for the low-abundance metabolites thiamin monophosphate, flavin adenin mononucleotide and anthranilic acid demonstrated higher CVs than their uncorrected IS. We attribute this phenomenon to known stability issues for these analytes in serum/plasma.
Figure 1. Peak areas (raw values) for analyte and internal standard across 60 runs.
Data for two analytes, 3-methylhistidine and choline, on platform LC1 are shown. Corresponding between-run CVs (with adjustment for run) are depicted in Figure 2. The analyte to internal standard ratio (A/IS) is included for comparison and is arbitrarily centered at 5.5 on the y-axis. Peak areas from source material 1 (pooled EDTA plasma) were used for this analysis.
Figure 2. Between-run CVs for analyte and internal standard separately.
Results are compared to CVs for corresponding matching A/IS. For analyte and IS the size of each symbol reflect mean peak areas. For A/IS the size of each symbol reflects the harmonic mean (emphasizing the smaller value) peak area for analyte and IS. The symbols, when interpreted as spheres, represent accurate relative areas within, but not across, platforms. All calculations were based on peak areas from source material 1 (pooled EDTA plasma).
Taken together, these analyses demonstrates that within-run CVs, even for uncorrected peak areas, can be low as has been noted by others (17, 18), and in many cases performance can be maintained for several consecutive runs. However, it is also apparent that analytical drift, either gradual or abrupt, may happen even when investigated in a single homogenous source material using the same analytical equipment and operated by the same personnel on a day to day basis. Thus, as commented previously (1), accurate correction and normalization of analyte signals becomes increasingly important when analysing large sample numbers that necessitates many runs and/or analysis over extended time periods.
Within-run CVs for matching A/IS
Within-run CVs for matching A/IS were calculated based on six S1, three S2 and three S3 samples for each run (96-well plate) using a regression model adjusted for source material. Figure 3 shows data across all 60 runs for the best and poorest performing analyte on each platform. Generally, we observed little to no trend in terms of decreasing or increasing within-run CVs during the course of the project (7 months) and, importantly, a narrow range of within-run CVs for matching A/IS compared to uncorrected peak areas (tables S1 through S8).
Figure 3. Variation across 60 runs for selected analytes (matching A/IS).
The analytes with the lowest (left) and highest (right) between-run CVs within each platform are shown. Data for S1, S2 and S3 are plotted in that order with color-coding as indicated. Arrows indicate runs that had a within-run CV > 3.5 times the median across 60 runs. S1; pooled EDTA plasma; S2; pooled serum spiked for most analytes, S3; pooled serum
Between-run CVs for matching and non-matching A/IS
Between-run CVs were calculated using a regression model adjusted for source material and run and for each source individually (S1, S2 or S3) by adjusting for run. CVs based on all data were generally higher than CVs for any individual source (Table 2). The use of pooled samples for evaluation of precision, in our case any one of S1, S2 or S3, has been criticized, as it will not capture the actual variation in ion suppression, or relative matrix effect, that may exist across individual patient/participant samples, (19), However, Matuszewski et al. also suggested that a SIL-IS was able to correct for relative matrix effect within acceptance criteria (19). Another study found that individual samples regularly exhibited less ion suppression than pooled samples (16). To cover all possibilities, but also to maximize external validity, we chose to report our main results using the combined data of all three pooled serum/plasma sources.
Table 2. Summary of between-run precisiona .
Matching A/IS | Non-matching A/IS | ||||||
---|---|---|---|---|---|---|---|
Platform | Analytes(n) | S1 | S2 | S3 | All | Allb | CV differencec |
GC | 25 | 2.3 (1.3, 3.6) |
1.9 (1.0, 3.8) |
2.2 (1.2, 4.0) |
2.7 (1.4, 4.1) |
11.3 (2.7, 25.8) |
8.6 (-0.3, 23.6) |
LC1 | 16 | 4.4 (2.9, 6.9) |
3.7 (2.3, 6.1) |
3.5 (2.2, 10.2) |
4.4 (3.0, 7.9) |
11.5 (4.3, 31.4) |
7.0 (0.5, 27.9) |
LC2 | 24 | 4.9 (3.6, 8.5) |
4.3 (2.9, 11.8) |
4.6 (3.1, 9.6) |
5.5 (3.8, 11.6) |
8.5 (4.7, 14.2) |
2.9 (-1.1, 7.6) |
LC3 | 6 | 5.6 (3.8, 7.3) |
5.5 (3.0, 7.2) |
5.0 (2.6, 8.1) |
5.9 (3.6, 8.1) |
16.0 (7.6, 28.3) |
10.7 (1.8, 23.0) |
Numbers are median (range) between-run CVs (%) calculated for all data (All) or for individual sources (S1 - S3) as indicated. Ranges are across n analytes or
across all non-matching A/IS within a platform.
Median (range) difference (non-matching A/IS - corresponding matching A/IS) across all non-matching A/IS within a platform. Abbreviations S1, pooled EDTA plasma. S2, pooled serum spiked for most analytes. S3 pooled serum.
The median CV for matching A/IS (using all data) on each platform varied from 2.7-5.9% and median difference in CV between non-matching and matching A/IS varied from 2.9 to 10.7 percentage points (Table 2). Figure 4 shows detailed results for seven selected analytes within each platform. The analytes were selected based on high structural similarity and/or similar retention time: Left panels show retention times for analytes and their matching internal standards, middle panels show CVs for A/IS pairings, and right panels show CVs versus difference in retention time for analyte versus internal standard. Most, but not all, deuterium-labeled internal standards eluted slightly earlier than their matching analyte. A detailed explanation of this isotope effect can be found in (20). As anticipated, no difference in retention time was found for 15N or 13C labeled SIL-IS, including 15N-labeled homoarginine (hArg) and 13C-labeled thiamin (Thi) (Figure 4). The variability in retention time measured across 60 runs was greater for the late eluting analytes on all platforms except LC2, possibly related to the step-gradient elution employed on that platform. A feature common to all platforms, but especially notable on LC3, was a marked symmetry around the downward sloping diagonal (containing the matching A/IS, Figure 4, middle panels), and around zero difference in retention time (right panels). This was due to similar precision for “inverse” A/IS pairings: Generally, for two analytes, A1 and A2, the pairings A1/A2-IS and A2/A1-IS tended to demonstrate similar CVs.
Figure 4. Results for selected analytes on platforms GC through LC3.
Left: The mean ± 1 SD retention time (across 60 runs) is shown for analytes and their matching SIL-IS. Middle: CVs for all A/IS pairings. Right: CVs according to difference in retention time (analyte - internal standard). Abbreviations: A, analyte, IS, internal standard, aHB; 2-hydroxybutyrate, bHB; 3-hydroxybutyrate, HIB 3-hydroxyisobutyrate; Leu; leucine, Ile; isoleucine, Orn; ornithine, Lys; lysine, His; histidine, m1His; 1-methylhistidine, hArg; homoarginine, m3His; 3-methylhistidine, SDMA; symmetric dimethylarginine, ADMA; asymmetric dimethylarginine, HK; 3-hydroxykynurenine, HAA; 3-hydroxyanthranilic acid, XA; xanthurenic acid, KA; kynurenic acid, Thi; thiamine, Kyn; kynurenine, AA; anthranilic acid, D3; 25 hydroxyvitamin D3, VitA; all-trans-retinol, D2; 25 hydroxyvitamin D2, gToc; gamma-tocopherol, aToc; alpha-tocopherol, K1; phylloquinone
On GC, the three structural analogs, 2-hydroxybutyrate (aHB), 3-hydroxybutyrate (bHB) and 3-hydroxyisobutyrate (HIB), and the two analogues leucine and isoleucine, displayed almost identical retention times, whereas ornithine and lysine, which differ by one methylene-group, displayed non-overlapping retention times (Figure 4). The median CV difference for non-matching A/IS within the three groups of structural analogs was 4.0 compared to 8.6 across all analytes on GC.
On LC1, the histidine analogs histidine (His), 1-methyl-histidine (m1His), and 3-methyl-histidine (m3His) displayed non-overlapping retention times. Within this group, CV differences varied from 3.9 to 10.7. The retention times for arginine (Arg) and homoarginine (hArg) were also non-overlapping, and both eluted among the histidines. We found histidine-IS to be the best substitute for arginine-IS and 1-methyl-histidine-IS was the best substitute for homoarginine-IS. The median CV difference for the two groups of structural analogs described above, and for symmetric, and asymmetric dimethylarginine (SDMA and ADMA) was 5.5 compared to 7.0 across all analytes on LC1.
LC2 contained several groups of similar analytes, however, most of them differed by a functional group or a phosphate group expected to influence both retention time and ionization characteristics (6). In Figure 4, we included 6 analytes along the kynurenine pathway of tryptophan degradation, and thiamine, which eluted in the middle of this group. All the included analytes had non-overlapping retention times except for kynurenic acid (KA) and thiamine (Thi). As shown, CV differences were low for many non-matching A/IS, e.g. for KA, four alternative IS had CV differences ≤ 1percentage point. Again, there was no obvious pattern to what substitutions might entail the smallest CV difference (Figure 4).
LC3 contained two pairs of structural analogs: vitamins D3 and D2, and gamma- and alpha-tocopherol. Vitamin A (all-trans-retinol) coeluted with vitamins D3 and D2 and combinations of D3 and D2 with Vitamin A-IS carried modest CV differences of 2.7 and 2.0, respectively with corresponding inverse combinations less favorable. (Figure 4) The median CV difference for the two groups of structural analogs was 3.3 compared to 10.7 across all analytes on LC3.
Analytical precision according to retention time and peak area
As indicated in Figure 4 (right panels), there was an association between CV and difference in retention time (analyte - internal standard) on all platforms. We quantified these relations as the correlation of CVs versus the absolute value of retention time differences (Table 3). Correction of ion suppression effects are believed to depend on the closeness in retention time of analyte and IS (21). The results for platforms LC1 and LC2 (Spearman´s rho of 0.17 and 0.24, respectively) indicated rather moderate effects, as also found in a previous report (9). In contrast, we found comparatively strong correlations on platforms GC and LC3 (Spearman’s rho of 0.54 and 0.93, respectively). Notably, these platforms shared a liquidliquid extraction (LLE) step during sample preparation, and the GC-method also comprised a final LLE to extract the derivatized products. As reviewed by Wieling, an important reason for inclusion of a (processed) internal standard in chromatography was, and is, to correct for phase transfer during sample processing (22). We therefore speculate that small differencesin the partition between liquid phases, in particular, for the fat-soluble analytes on LC3, may have impacted precision in proportion to the difference in retention time between analyte and IS. Deuterium-labeled internal standards often do not coelute exactly with the analyte, especially in LC-MS/MS applications, and are, for this reason generally considered non-perfect internal standards (20, 23, 24). Consequently, higher CVs for LC-MS/MS, compared to GC-MS/MS platforms might be related to differences in the extent of correction for matrix effects. For platforms LC1 and LC2, this may have reduced the association of CV with differences in retention time as well as overall CV difference for non-matching A/IS.
Table 3. Precision versus retention time and peak areaa .
Non-matching A/IS | Matching A/IS | |||||||
---|---|---|---|---|---|---|---|---|
Platform | Δ retention timeb | retention timec | peak aread | Δ peak areae | ||||
rho | p-value | rho | p-value | rho | p-value | rho | p-value | |
GC | 0.54 | < 0.001 | −0.08 | 0.7 | −0.15 | < 0.001 | 0.14 | < 0.001 |
LC1 | 0.17 | < 0.001 | 0.01 | 1 | −0.57 | < 0.001 | 0.2 | < 0.001 |
LC2 | 0.24 | < 0.001 | 0.05 | 0.8 | −0.37 | < 0.001 | 0.02 | 0.6 |
Results of Spearman’s correlation analyses.
CV vs the absolute difference in retention time between analyte and all non-matching IS.
CV vs the mean retention time for analyte and matching IS
CV vs the harmonic mean (emphasizing the smaller value) peak area of each matching A/IS pair.
CV vs the absolute difference in peak are between analyte and IS. The two latter correlations were performed with mutual adjustment. Abbreviations A, analyte, IS, internal standard.
We also examined the relation between CV and retention time per se for matching A/IS pairs and found essentially no association across all platforms (Table 3. Note that the basis for this and the following analyses are also depicted graphically in Figure 2). There was a negative association of CV with peak area (meaning that larger areas gave rise to smaller CVs) for all four platforms. This underscores the importance of accurate peak integration for precision measures. Of note, the precision for matching A/IS on LC2 was comparable to those on LC1 despite the 500-fold difference in typical metabolite concentrations. We attribute this to a better signal/noise ratio of the LC-MS/MS system employed on LC2 compared to LC1, resulting in comparable peak areas (Table 1). Some researchers have found increased ion suppression with increasing concentration difference between analyte and internal standard (25–27). In our data we found moderately higher CVs for analytes with a larger difference in peak area for analyte and IS within platforms GC and LC1 but no signifiant association within platforms LC2 and LC3 (Table 3).
Benefits of dedicated SIL-IS beyond correction for ion suppression
We excluded a few analytes on platform GC that were found to be unstable at some stage of the sample preparation. One such example was histidine. Because histidine and histidine-IS degraded at the same rate, between-run CV for this analyte was low (3.1%). For the same reason, histidine-IS paired with other analytes on GC produced CVs in excess of 20% in all cases. It should be noted, though, that different stability of analyte and internal standard have been reported, and in such cases correction by the IS led to increased CV (6).
Isotope dilution GC-MS (as used on GC) has been termed a «nearly matrix-independent reference method» (5), and, as expected, the precision for matching A/IS on this platform was the highest among the 4 analytical platforms. Still, the median CV difference for non-matching IS on GC was second highest. As discussed above, explanations for this may be more related to chromatographic effects during sample preparation than to differences in ion suppression.
The inclusion of a dedicated SIL-IS for all analytes has some practical benefits that are worth mentioning: In chromatographic settings that involve dozens of analytes eluting during a short time span, correct peak identification is essential, especially for low abundance analytes. A SIL-IS eluting at the same time, or within a short, constant, interval facilitates accurate peak identification, and hence both automatic and manually controlled peak integration.
Parameters not addressed in this study
Ionization by atmospheric chemical ionization (APCI) is known to be less prone to ion suppression than electro spray ionization (ESI) (5). We were unable to compare APCI and ESI directly, however, we noted that CVs for matching A/IS was highest on the platform that utilized APCI (LC3). Possible explanations could include the challenge of obtaining chromatographic separation of matrix components from the low abundance, fat-soluble, metabolites analysed on this platform, but also the effect of peak area integration on CVs. Across all platforms we employed sample preparation that mainly consisted of solvent extraction/deproteinization by ethanol or acid. Additional purification, e.g. by LLE or solid phase extraction (SPE), may remove more phospholipids and therefore reduce ion suppression (21, 28, 29). We employed a shared LLE-step on platforms GC and LC3 mainly for the purpose of more efficient utilization of precious source material. As discussed above, there was no suggestion of or any way to verify a benefit of LLE in terms of improving precision (for matching A/IS) on LC3. A recognized drawback of SPE that also applies to LLE is that it will often fail to remove substances that coelutes with the analyte during chromatography (21, 30). In some cases it may even lead to increased ion suppression (31). Several previous studies noted that a lack of SIL-IS might also impair accuracy, linearity and lower limit of quantitation (LLOQ) (6, 9, 17, 19). Due to the post-hoc nature of this study we were unable to evaluate such measures.
Conclusions
In this study we quantified the difference between using a non-matching A/IS pair compared to a matching A/IS pair on analytical precision in four MS-based analytical platforms. Median differences ranged from small to moderate (3 - 11 percentage points) with wide variation (-1 to 28 percentage points) within and across platform. In cases characterized by high structural similarity or closeness of retention time between analyte and internal standard, differences tended to be smaller. Larger CV differences for A/IS pairs with increasing difference in retention time was found, but mainly in the two platforms that included a liquid-liquid extraction step during sample preparation. The latter finding is a reminder of the importance of close similarity of analyte and IS to ensure similar behavior during all steps of sample preparation. Although not specifically addressed in this study, it should be noted that inclusion of dedicated SIL-IS may also positively impact other quality measures including accuracy, linearity, and LLOQ as well as correct peak identification in multianalyte quantitative mass spectrometry applications.
Supplementary Material
Acknowledgements
The authors would like to thank Gry Kvalheim, Marit Krokeide, Randi Heimdal and Ove Aarseth for pertinent information during the analysis and write-up of the manuscript.
Footnotes
Author Contributions
Ø.M and A.U. conceived of, and developed the main ideas for the statistical analyses. A.U. performed statistical analyses and wrote the manuscript. A.M. Ø.M. K.M, K.M.G. and P.M.U. reviewed the manuscript for important intellectual content.
References
- 1.Gertsman I, Barshop BA. Promises and pitfalls of untargeted metabolomics. J Inherit Metab Dis. 2018;41:355–366. doi: 10.1007/s10545-017-0130-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Shi X, Wang S, Jasbi P, Turner C, Hrovat J, Wei Y, Liu J, Gu H. Database-Assisted Globally Optimized Targeted Mass Spectrometry (dGOT-MS): Broad and Reliable Metabolomics Analysis with Enhanced Identification. Anal Chem. 2019;91:13737–13745. doi: 10.1021/acs.analchem.9b03107. [DOI] [PubMed] [Google Scholar]
- 3.Zha H, Cai Y, Yin Y, Wang Z, Li K, Zhu Z-J. SWATHtoMRM: Development of High-Coverage Targeted Metabolomics Method Using SWATH Technology for Biomarker Discovery. Analytical Chemistry. 2018;90:4062–4070. doi: 10.1021/acs.analchem.7b05318. [DOI] [PubMed] [Google Scholar]
- 4.Zhong F, Xu M, Zhu J. Development and application of time staggered/mass staggered-globally optimized targeted mass spectrometry. Journal of Chromatography B. 2019;1120:80–88. doi: 10.1016/j.jchromb.2019.04.051. [DOI] [PubMed] [Google Scholar]
- 5.Vogeser M, Seger C. Pitfalls associated with the use of liquid chromatographytandem mass spectrometry in the clinical laboratory. Clin Chem. 2010;56:1234–1244. doi: 10.1373/clinchem.2009.138602. [DOI] [PubMed] [Google Scholar]
- 6.Stokvis E, Rosing H, Beijnen JH. Stable isotopically labeled internal standards in quantitative bioanalysis using liquid chromatography/mass spectrometry: necessity or not. Rapid Commun Mass Spectrom. 2005;19:401–407. doi: 10.1002/rcm.1790. [DOI] [PubMed] [Google Scholar]
- 7.Townsend MK, Clish CB, Kraft P, Wu C, Souza AL, Deik AA, Tworoger SS, Wolpin BM. Reproducibility of metabolomic profiles among men and women in 2 large cohort studies. Clin Chem. 2013;59:1657–1667. doi: 10.1373/clinchem.2012.199133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bornø A, van Hall G. Quantitative amino acid profiling and stable isotopically labeled amino acid tracer enrichment used for in vivo human systemic and tissue kinetics measurements. J Chromatogr B Analyt Technol Biomed Life Sci. 2014;951-952:69–77. doi: 10.1016/j.jchromb.2014.01.019. [DOI] [PubMed] [Google Scholar]
- 9.Krautbauer S, Büchler C, Liebisch G. Relevance in the Use of Appropriate Internal Standards for Accurate Quantification Using LC-MS/MS: Tauro-Conjugated Bile Acids as an Example. Anal Chem. 2016;88:10957–10961. doi: 10.1021/acs.analchem.6b02596. [DOI] [PubMed] [Google Scholar]
- 10.Lanckmans K, Sarre S, Smolders I, Michotte Y. Use of a structural analogue versus a stable isotope labeled internal standard for the quantification of angiotensin IV in rat brain dialysates using nano-liquid chromatography/tandem mass spectrometry. Rapid Commun Mass Spectrom. 2007;21:1187–1195. doi: 10.1002/rcm.2950. [DOI] [PubMed] [Google Scholar]
- 11.Godfrey KM, Cutfield W, Chan SY, Baker PN, Chong YS, NiPPeR SG. Nutritional Intervention Preconception and During Pregnancy to Maintain Healthy Glucose Metabolism and Offspring Health (“NiPPeR”): study protocol for a randomised controlled trial. Trials. 2017;18:131. doi: 10.1186/s13063-017-1875-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Midttun Ø, McCann A, Aarseth O, Krokeide M, Kvalheim G, Meyer K, Ueland PM. Combined Measurement of 6 Fat-Soluble Vitamins and 26 Water-Soluble Functional Vitamin Markers and Amino Acids in 50 μL of Serum or Plasma by High-Throughput Mass Spectrometry. Anal Chem. 2016;88:10427–10436. doi: 10.1021/acs.analchem.6b02325. [DOI] [PubMed] [Google Scholar]
- 13.Midttun Ø, Kvalheim G, Ueland PM. High-throughput, low-volume, multianalyte quantification of plasma metabolites related to one-carbon metabolism using HPLC-MS/MS. Anal Bioanal Chem. 2013;405:2009–2017. doi: 10.1007/s00216-012-6602-6. [DOI] [PubMed] [Google Scholar]
- 14.Midttun Ø, Hustad S, Ueland PM. Quantitative profiling of biomarkers related to B-vitamin status, tryptophan metabolism and inflammation in human plasma by liquid chromatography/tandem mass spectrometry. Rapid Commun Mass Spectrom. 2009;23:1371–1379. doi: 10.1002/rcm.4013. [DOI] [PubMed] [Google Scholar]
- 15.Kirkwood TBL. Geometric means and measures of dispertion. Biometrics. 1979;35:908–909. [Google Scholar]
- 16.Tan A, Hussain S, Musuku A, Massé R. Internal standard response variations during incurred sample analysis by LC-MS/MS: Case by case trouble-shooting. J Chromatogr B Analyt Technol Biomed Life Sci. 2009;877:3201–3209. doi: 10.1016/j.jchromb.2009.08.019. [DOI] [PubMed] [Google Scholar]
- 17.Ask KS, Lid M, Øiestad EL, Pedersen-Bjergaard S, Gjelstad A. Liquid-phase microextraction in 96-well plates - calibration and accurate quantification of pharmaceuticals in human plasma samples. J Chromatogr A. 2019;1602:117–123. doi: 10.1016/j.chroma.2019.06.013. [DOI] [PubMed] [Google Scholar]
- 18.Tokuoka SM, Yasumoto A, Kita Y, Shimizu T, Yatomi Y, Oda Y. Limitations of deuterium-labeled internal standards for quantitative electrospray ionization mass spectrometry analysis of fatty acid metabolites. Rapid Commun Mass Spectrom. 2020:34.:e8814. doi: 10.1002/rcm.8814. [DOI] [PubMed] [Google Scholar]
- 19.Matuszewski BK. Standard line slopes as a measure of a relative matrix effect in quantitative HPLC-MS bioanalysis. J Chromatogr B Analyt Technol Biomed Life Sci. 2006;830:293–300. doi: 10.1016/j.jchromb.2005.11.009. [DOI] [PubMed] [Google Scholar]
- 20.Berg T, Karlsen M, Oiestad AM, Johansen JE, Liu H, Strand DH. Evaluation of 13C- and 2H-labeled internal standards for the determination of amphetamines in biological samples, by reversed-phase ultra-high performance liquid chromatography-tandem mass spectrometry. J Chromatogr A. 2014;1344:83–90. doi: 10.1016/j.chroma.2014.04.020. [DOI] [PubMed] [Google Scholar]
- 21.Furey A, Moriarty M, Bane V, Kinsella B, Lehane M. Ion suppression; a critical review on causes, evaluation, prevention and applications. Talanta. 2013;115:104–122. doi: 10.1016/j.talanta.2013.03.048. [DOI] [PubMed] [Google Scholar]
- 22.Wieling J. LC-MS-MS Experiences with internal standards. Chromatographia. 2002;55:S107–S113. [Google Scholar]
- 23.Ciccimaro E, Blair IA. Stable-isotope dilution LC-MS for quantitative biomarker analysis. Bioanalysis. 2010;2:311–341. doi: 10.4155/bio.09.185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang S, Cyronak M, Yang E. Does a stable isotopically labeled internal standard always correct analyte response? A matrix effect study on a LC/MS/MS method for the determiantion of carvedilol enantiomers in human plasma. Journal of Pharmaceutical and Biomedical Analysis. 2007;43:701–707. doi: 10.1016/j.jpba.2006.08.010. [DOI] [PubMed] [Google Scholar]
- 25.Liang HR, Foltz RL, Meng M, Bennett P. Ionization enhancement in atmospheric pressure chemical ionization and suppression in electrospray ionization between target drugs and stable-isotope-labeled internal standards in quantitative liquid chromatography/tandem mass spectrometry. Rapid Commun Mass Spectrom. 2003;17:2815–2821. doi: 10.1002/rcm.1268. [DOI] [PubMed] [Google Scholar]
- 26.Remane D, Wissenbach DK, Meyer MR, Maurer HH. Systematic investigation of ion suppression and enhancement effects of fourteen stable-isotope-labeled internal standards by their native analogues using atmospheric-pressure chemical ionization and electrospray ionization and the relevance for multi-analyte liquid chromatographic/mass spectrometric procedures. Rapid Commun Mass Spectrom. 2010;24:859–867. doi: 10.1002/rcm.4459. [DOI] [PubMed] [Google Scholar]
- 27.Sojo LE, Lum G, Chee P. Internal standard signal suppression by co-eluting analyte in isotope dilution LC-ESI-MS. Analyst. 2003;128:51–54. doi: 10.1039/b209521c. [DOI] [PubMed] [Google Scholar]
- 28.Sitnikov DG, Monnin CS, Vuckovic D. Systematic Assessment of Seven Solvent and Solid-Phase Extraction Methods for Metabolomics Analysis of Human Plasma by LC-MS. Sci Rep. 2016:6.:38885. doi: 10.1038/srep38885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Tulipani S, Llorach R, Urpi-Sarda M, Andres-Lacueva C. Comparative analysis of sample preparation methods to handle the complexity of the blood fluid metabolome: when less is more. Anal Chem. 2013;85:341–348. doi: 10.1021/ac302919t. [DOI] [PubMed] [Google Scholar]
- 30.Hewavitharana AK. Matrix matching in liquid chromatography-mass spectrometry with stable isotope labelled internal standards--is it necessary. J Chromatogr A. 2011;1218:359–361. doi: 10.1016/j.chroma.2010.11.047. [DOI] [PubMed] [Google Scholar]
- 31.Gosetti F, Mazzucco E, Zampieri D, Gennaro MC. Signal suppression/enhancement in high-performance liquid chromatography tandem mass spectrometry. Journal of Chromatography A. 2010;1217:3929–3937. doi: 10.1016/j.chroma.2009.11.060. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.