Abstract
Serum and plasma are commonly used in metabolomic-epidemiology studies. Their metabolome is susceptible to differences in pre-analytical conditions and the impact of this is unclear. Participant-matched EDTA-plasma and serum samples were collected from 37 non-fasting volunteers and profiled using a targeted nuclear magnetic resonance (NMR) metabolomics platform (n = 151 traits). Correlations and differences in mean of metabolite concentrations were compared between reference (pre-storage: 4 °C, 1.5 h; post-storage: no buffer addition delay or NMR analysis delay) and four pre-storage blood processing conditions, where samples were incubated at (i) 4 °C, 24 h; (ii) 4 °C, 48 h; (iii) 21 °C, 24 h; and (iv) 21 °C, 48 h, before centrifugation; and two post-storage sample processing conditions in which samples thawed overnight (i) then left for 24 h before addition of sodium buffer followed by immediate NMR analysis; and (ii) addition of sodium buffer, then left for 24 h before NMR profiling. We used multilevel linear regression models and Spearman’s rank correlation coefficients to analyse the data. Most metabolic traits had high rank correlation and minimal differences in mean concentrations between samples subjected to reference and the different conditions tested, that may commonly occur in studies. However, glycolysis metabolites, histidine, acetate and diacylglycerol concentrations may be compromised and this could bias results in association/causal analyses.
Keywords: metabolomics, serum, plasma, nuclear magnetic resonance, pre-analytical phase
1. Introduction
The development of high-throughput methods for quantifying multiple ‘omic traits in large-scale epidemiological studies, using stored biosamples, has the potential to rapidly advance our understanding of how human physiology and metabolism vary across the lifecourse. Applying these methods to existing samples from very densely pheno- and genotyped cohorts also has the potential to enhance our understanding of causal mechanisms for disease. To do this in a scientifically rigorous way, further information is required regarding the potential impact of differing pre-analytical conditions (e.g., variations in incubation duration and temperature before centrifugation) of stored samples that are likely to vary between cohorts and potentially within the same cohort over time. For example, since sample collection from longitudinal epidemiological cohorts are collected over many decades, it may be impossible to ensure that identical protocols are used throughout the lifetime of the study. In clinical cohorts and in pregnancy/birth cohorts, such as the Avon Longitudinal Study of Parents and Children [1,2] and Born in Bradford [3] cohorts, some samples (e.g., during antenatal care or for cord-blood) are obtained during routine clinical practice where health care needs takes precedence over speedy sample processing and rapid storage. Yet these samples, collected early in life, are often the most valuable for research related to the long-term effects of childhood exposures. Other cohorts face different challenges, for example, the Health Survey for England [4] and Understanding Society [5] collect biological samples in participants’ homes, which is likely to increase pre-analytical variability.
Additionally, we are increasingly interested in cohort collaborations (e.g., University College London-London School of Hygiene & Tropical Medicine-Edinburgh-Bristol [6] consortium, COnsortium of METabolomics Studies [7], Cohorts for Heart and Aging Research in Genomic Epidemiology [8] consortium) and broader international comparisons, for example, between determinants of health and well-being in low-, middle- and high-income countries and how these differ. Consequently, we often need to combine data from samples collected in very different circumstances. For all these reasons, it is important to recognise that biosample collection protocols are often a compromise between reducing any effects of pre-analytical variation with maximising the amount of data that can be obtained from samples, within the financial and practical restraints of a given collection sweep.
In clinical chemistry, pre-analytics account for 60–80% of laboratory errors [9,10,11,12], and current standard operating procedures for blood handling in metabolomics are generally based on best practices [13] (not evidence-based) for conventional biochemistry tests [9,14]. While several studies have explored the impact of pre-analytical conditions on a small number of commonly assessed biomarkers in epidemiology [13,15,16,17,18], metabolomics—the simultaneous quantification of large numbers of metabolic traits—has particular challenges as different metabolites may have different susceptibilities to degradation [19,20,21,22,23].
The aim of this study was to determine the differences in metabolite concentrations between reference and six pre-analytical conditions, which reflect conditions potentially arising during sample collection and processing in large-scale epidemiological studies. We used both Spearman’s rank correlation and differences in mean concentrations to address this aim. Rank correlations are useful for considering any impact of pre-analytical effects on subsequent association or causal effect analyses using metabolomics data, as bias of these results is likely to be minimal if participants are still ranked similary with respect to metabolite concentrations. Differences in mean concentrations are important when describing distributions of a metabolite in a population or comparing differences between populations.
Throughout this manuscript, we use the term pre-storage instead of (sample) pre-processing to indicate the phase of the metabolomic workflow that occurs before long-term storage of samples. This was done to avoid confusion with the term (data) pre-processing which is used, particularly in untargeted metabolomic studies, to mean the several steps undertaken after metabolomics data acquisition (e.g., peak alignment, normalization).
2. Results
Characteristics of study participants are shown in Table 1, with metabolic trait distributions in Table S1, Supplementary Materials. Most participants were female (78%) and 46% were between 21 and 35 years old; 41% had drunk alcohol within the 24 h period prior to blood sampling.
Table 1.
Characteristics | n | % |
---|---|---|
Female | 29 | 78 |
Age | ||
(21–35) | 17 | 46 |
(36–50) | 10 | 27 |
(51–65) | 10 | 27 |
Ever smoked, No | 15 | 41 |
Time since last alcohol consumption | ||
Never Drinks | 2 | 5 |
<1 Week | 16 | 43 |
1–4 Weeks | 4 | 11 |
24 h before blood collection | 15 | 41 |
The following main conclusions can be drawn from our analyses of the effect of pre-analytical conditions on blood metabolic traits:
72% and 93% of metabolic traits across pre- and post-storage conditions, respectively, had Spearman’s correlations, between metabolite concentrations of samples subjected to reference and variant conditions, above 0.8 (Tables S2–S5, Supplementary Materials; Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8; Figures S1–S4, Supplementary Materials). Exceptions are described in the next two bullet points.
In both biofluids, diacylglycerol and histidine concentrations had, generally, Spearman’s correlation coefficient of ~0.5 or lower between reference and variant pre- and post-storage conditions (Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7; Tables S2–S5, Supplementary Materials).
Sphingomyelins, conjugated linoleic acid, lactate, glucose (serum), pyruvate (serum), glycerol (serum), phenylalanine (EDTA-plasma) and acetate had Spearman rank correlations between ~0.7 and −0.06 between concentration of samples subjected to variant pre-storage conditions and reference.
Glycolysis-related metabolites (glucose, lactate, pyruvate) and amino acids showed marked differences in mean concentration between reference and pre-storage conditions and post-storage conditions (amino acids only) (Figures S5–S7, Supplementary Materials; Tables S6–S9, Supplementary Materials).
Branched and aromatic amino acids had smaller changes in mean differences from the reference and variant pre-storage conditions in EDTA-plasma than in serum (Figure S5, Supplementary Materials; Tables S6 and S7, Supplementary Materials).
For pyruvate, pre-storage delay and temperature appeared to interact, with mean levels slightly decreasing/stable from reference levels per 24 h at 4 °C and increasing per 24 h at 21 °C (Figure S5, Supplementary Materials; Table S6, Supplementary Materials).
As an illustration of the magnitude of effects in serum, mean levels of glucose decreased by 0.91 mmol/L (95% CI: 0.87 to 0.95) and 1.9 mmol/L (95% CI: 1.7 to 2.1) for each 24 h delay, compared with the reference sample, at 4 °C and 21 °C, respectively. Pyruvate levels decreased by 0.013 mmol/L per 24 h (95% CI: 0.008 to 0.018) at 4 °C but increased by 0.73 mmol/L per 24 h (95% CI: 0.64, 0.82) at 21 °C.
Post-storage conditions affected histidine, phenylalanine and low-density lipoprotein (LDL) particle size, with differences in mean levels of up to 1.4 SD from reference (Figure S6, Supplementary Materials; Tables S8 and S9, Supplementary Materials).
3. Discussion
In this experiment, we showed that most metabolites, including lipids, lipoproteins and fatty acids, are minimally affected by different sample pre-analytical conditions that were designed to reflect plausible variation in large-scale epidemiological studies that collected samples at different time points and different populations. We explored the rank correlation and differences in mean levels of different pre-analytical conditions for over 151 metabolic traits. The pre-analytical conditions that we explored reflect realistic situations in large-scale epidemiological studies. We believe this is an important contribution to the literature given the increasing use of high-throughput metabolomics in stored samples from large epidemiological studies.
In this study, 72% and 93% of metabolite traits across pre- and post-storage conditions, respectively, had Spearman’s correlations, between metabolite concentrations of samples subjected to reference and variant conditions above 0.8. This suggests that, for most measurements, there is little effect of different pre-analytical conditions on metabolic concentrations or, where there are effects, these are similar across paticipants. Most metabolites also had similar mean concentrations between the reference and different tested pre-analytical conditions (91% and 97%, across pre and post-storage conditions, respectively, had <0.5 SD), with glycolysis and some amino acids in particular having both, low rank correlations and high differences in mean concentrations (up to 1.4 SD).
Previous studies examining pre-analytical effects on metabolites have largely explored differences in mean levels and measured this using clinical chemistry tests [15,16,17,18,24], mass spectrometry [14,25,26,27] and NMR [19,24,28,29,30,31,32], and with targeted and untargeted platforms. Where it is possible to make comparisons of these previous studies with our results, these are summarized in Table S10, Supplementary Materials. Pre-storage delay and incubation temperature of uncentrifuged samples can cause changes to serum and plasma metabolomes because blood cells and enzymes are still metabolically active inside the sample tube, resulting in the uptake and release of metabolites. Concordant findings include the reported stability of triglycerides [16,17,18], high density lipoprotein-cholesterol (C) [18], low density lipoprotein-C (4 °C) [18], total cholesterol (4 °C) [18], apolipoprotein A-I and B [18] and creatinine [18]. Our observed changes in glucose, pyruvate, lactate and alanine agree with previous studies [17,18,19,26,29], and are likely due to blood cell activity, primarily red blood cells (RBC) [23,33]. The absence of RBC in post-storage samples explains why these metabolites are minimally affected by the tested post-storage conditions [17,20]. Increases in concentrations of most amino acids with the pre- and post-storage (non-ideal) conditions that we tested here are also consistent with previous studies [14,29]. One possible reason for this is protein degradation, for example, in the case of phenylalanine [14,29] accumulation, as it is an uncommon component of proteins and has few degradation pathways [14]. Glutamine decreased [14,29], converted into glutamate, and high levels of the latter prevented reliable quantification of pyruvate, since pyruvate only gives one peak in the 1H NMR spectrum, which can be overlapped by glutamate signals. Our results also show that amino acids in serum are more affected by pre-storage conditions than in EDTA-plasma, possibly due to the inhibition of metal-dependent proteases by EDTA in the latter, and the activation of a range of proteases during coagulation, in the former. Histidine and acetate had opposite changes in serum and plasma, likely due to the influence of the presence/absence of anti-coagulant. Post-storage effects were less pronounced than pre-storage ones, indicating that most of the sample degradation is related to blood cell activities [23,34]. Nevertheless, we found changes in histidine and phenylalanine and, although not directly comparable to the conditions we tested, others [14,19] have reported changes in some metabolites when delays occur after centrifugation. These are potentially caused by enzymes released due to cell damage during centrifugation and other proteins [14,19,34]. While our study has explored the effects on metabolite concentrations using one quantitative NMR platform, we would anticipate similar results with other metabolomic platforms that measure the same metabolites, as pre-analytic conditions affect specific metabolites (as described above) rather than being related to the type of metabolomic platform (only concentration accuracy and precision may differ across platforms).
Potential Bias in Epidemiological Studies of Pre-Analytic Conditions
Rank correlations are useful for considering any impact of pre-analytical effects on subsequent association or causal effect analyses using metabolomics data (i.e., analytical studies), because the bias of these results is likely to be minimal if participants are still ranked similarly with respect to metabolite concentrations. Differences in mean concentrations are important in descriptive studies where the aim is to describe the distribution of a metabolite in a population or between two populations. For example, our results suggest that, on average, glucose levels from samples that were left for 24 h at 21 °C before being processed and then placed in long-term storage would be 1.9 mmol/L lower than the true value. Therefore, if mean levels of glucose from one study (e.g., from a population of South Asian adults) where samples had been left for 24 h at 21 °C were compared to those from another study (e.g., from a population of European adults) where samples had been left for no longer than 1.5 h at 4 °C (our reference/ideal conditions), we may incorrectly conclude that mean glucose concentrations are lower in South Asian compared to European adults or mask higher concentrations in South Asians. In our study, there was some consistency between the Spearman rank correlation and difference in mean concentration results. In particular, pre-analytical conditions for glycolysis metabolites and some amino acids affected both rank correlation and mean differences, and studies with less than ideal pre-analytical conditions should consider whether any analyses might be biased for these metabolites.
In analytical studies, the extent of any bias or random errors will depend on the particular research question, the exposure, the outcome and covariable measurements included in the analyses, as well as sample size. In addition, in association and causal effect analyses, random error and systematic bias must be considered relative to exposures, outcomes and covariables. For example, if we were interested in the potential effect of multiple metabolites (exposures) on incident coronary heart disease (CHD) (outcome), systematic bias would occur if the effect of pre-analytic conditions on metabolites differed between those who experienced CHD and those who did not (i.e., pre-analytical conditions would be associated with both, metabolite concentrations and incidence of CHD, therefore pre-analytical conditions would be a confounder of the association between metabolites and CHD). This could occur if samples from cases of CHD are handled more often (e.g., have more freeze–thaw cycles) than those of controls, which may occur in prospective nested case control studies. These cycles may enhance the impact of pre-analytical effects, particularly for labile metabolites. However, we acknowledge that we have not assessed that matter here. Random error could also bias results in this example. Since researchers at the time of collecting and processing serum/plasma are unlikely to know the future risk of CHD and as sample processing differences are likely to affect all samples similarly within a study, this may be more likely than systematic bias. In this case, effects of the pre-analytical conditions are random with respect to CHD and the expectation with random error is that it biases results towards the null. Let us consider a second example where metabolites are outcomes, for example, if we were interested in the effect of body mass index (BMI) (exposure) on metabolites (outcomes). As in the first example, if pre-analytical conditions that result in measurement error in the metabolites are associated with BMI, there will be systematic bias that, depending on the extent of measurement error and how strongly, and in which direction, it is related to BMI and confounders, could bias results towards or away from the null to a large or small extent. If the pre-analytical conditions are not related to BMI, the expectation would be bias towards the null. A key thing here is whether BMI (or other exposure) was measured some time before bio-samples were collected (i.e., in prospective studies) or at the same time (cross-sectional studies). It is not plausible that pre-analytical conditions could (retrospectively) influence prior measured BMI, and therefore in prospective studies there would be random error with an expectation of bias towards the null. However, in cross-sectional studies, and in particular if the exposure of interest was another biomarker measured in the same samples (e.g., if we were looking at the association of a cytokine with metabolites), then systematic bias is plausible. Regardless of metabolites being exposure or outcome in the analysis, associations between pre-analytical conditions and covariables also need to be considered. As with the examples above, random error would be expected to bias metabolite–covariable associations towards the null and this may result in residual confounding in the main analyses.
The above discussion and examples refer to single studies with single centres, whereas metabolomic (and other ‘omic analytical studies) are increasingly multi-studies, something that is important for replication and increasing sample size. Systematic bias could occur in multi-centre studies or collaboration pooling results from a number of different studies, if sample handling and the likelihood of having an outcome (e.g., CHD) or the distribution of an exposure (e.g., BMI) varies by centre.
In summary, for most of the metabolites that we assessed there were high rank correlations and low differences in means between plausible, but less than ideal pre-analytical conditions, compared with the ideal. However, for the metabolites that show low rank correlation and/or high differences in mean levels (specifically glycolysis metabolites, histidine, acetate and diacylglycerol), researchers should consider the extent to which these effects might have biased results towards the null (due to random error or systematic bias that masks a true effect) or away from the null (due to systematic bias or the introduction of residual confounding). Ideally, one would want to compare results with other studies in similar populations that had “ideal” pre-analytic conditions. Such bias and confounding are likely to be more pronounced in studies with small sample sizes. In large collaborations where results from several studies are pooled, we would recommend specifically exploring the extent to which pre-analytic processing might have influenced between-study heterogeneity.
4. Materials and Methods
Samples from 37 healthy volunteers were collected. Exclusion criteria for the study were the following: clotting/bleeding disorders, anaemia, use of anti-coagulant medication or insulin treatment and presence of blood borne viruses. Participants provided written informed consent and completed a questionnaire. Ethical approval for the study was obtained from the South West Frenchay Proportionate Review Committee, Bristol, UK, reference 14/SW/0087. An overview of the experimental design is given in Figure 9. Ten venous blood tubes were drawn for each non-fasting participant (5 EDTA-plasma, 5 serum). Further details are provided in Appendix A.
4.1. Reference Samples
All samples were processed within 1.5 h of blood withdrawal. Once centrifuged, serum and EDTA-plasma samples were aliquoted into 1.5 mL microtubes (E1415-2240, STARLAB, Milton Keynes, UK) and immediately frozen and stored at −80 °C for one month prior to NMR analysis. Prior to profiling, frozen samples were thawed in the refrigerator (4 °C) overnight, prepared with sodium phosphate buffer [35,36,37] and then immediately run through the NMR spectrometer.
4.2. Pre-Storage Handling: Effect of Pre-Centrifugation Delay and Temperature
We compared metabolic trait concentrations, quantified by the NMR platform, between four pre- and two post-storage conditions and the reference samples. The four pre-storage conditions had the following combination of incubation temperature/duration before centrifugation (i) 4 °C for 24 h; (ii) 4 °C for 48 h; (iii) 21 °C for 24 h; and (iv) 21 °C for 48 h. These conditions were chosen to reflect pre-storage variations likely to occur in epidemiological studies. Participant-matched serum and EDTA-plasma samples were processed according to these four conditions.
4.3. Post-Storage Handling: Effect of Buffer Addition Delay and NMR Profiling Delay
Two variations to the standard Nightingale Health© NMR protocol [35,36,37] were investigated to evaluate the impact of sample preparation and instrumental analysis delay. Matched serum and EDTA-plasma aliquots of each participant previously prepared according to the reference pre-storage conditions, described above, were subjected to two post-storage conditions. Samples thawed overnight (4 °C) and afterwards were either (i) left for 24 h, at 4 °C in the dark, before addition of sodium phosphate buffer followed by immediate NMR analysis (buffer addition delay); or (ii) addition of buffer before being left for 24 h, at 4 °C in the dark, then NMR profiling (NMR analysis delay).
4.4. Nuclear Magnetic Resonance Metabolomics Platform
A high-throughput NMR metabolomics platform, Nightingale Health©, at the University of Bristol, was used to quantify up to 151 lipoproteins, lipids and metabolites in serum and plasma. The platform applies a single experimental setup, providing the simultaneous quantification of routine lipids, 14 lipoprotein subclasses and lipids transported by these particles, various fatty acids (FA) and FA traits (e.g., chain length, degree of unsaturation), amino acids, ketone bodies, glycolysis and gluconeogenesis-related metabolites, fluid balance and one inflammation metabolite. Most of these are quantified in clinically meaningful concentrations (e.g., mmol/L), and particle size in nm. Details of this platform and its use in epidemiological studies have been described elsewhere [38,39,40,41]. Pyruvate, glycerol and glycine are not quantified in EDTA-plasma samples due to the interfering resonances of EDTA on their signals.
4.5. Statistical Analysis
We examined both the Spearman’s rank correlation and differences in mean concentrations of metabolic trait values/concentrations, between reference and six pre-analytical conditions likely to occur in large-scale epidemiological studies.
All analyses were conducted using participant-matched serum and EDTA-plasma samples. As bias in subsequent association or causal analyses involving metabolic traits is likely to be minimal if participants are still ranked similary with respect to metabolite levels (even if there are differences in mean levels), we considered the rank correlation coefficients between the reference and each pre-analytical conditions to be our main analyses. Confidence intervals for Spearman’s rank correlation coefficients were computed by bootstrap with 200 repetitions.
Differences in mean concentrations of metabolite levels between reference and each pre-analytic condition are particularly important in analyses of distributions within and between populations (i.e., descriptive studies). To assess changes in mean concentrations of metabolic traits due to the different pre-analytical conditions, we used linear regression models with random intercepts. Such models account for within individual clustering of observations as each individual participant contributes with multiple samples for analysis.
All metabolic traits were scaled to standard deviation (SD) units (by subtracting the mean and dividing by the standard deviation); this was done separately for EDTA-plasma and serum, and pre- and post-storage conditions. Standardization allows the comparison of metabolic traits with different units and/or different concentration ranges. Results in measured concentration units (e.g., mmol/L) are provided in the Supplementary Materials. Full specification of the models can be found in Appendix A. First, we evaluated the impact of pre-storage duration, before centrifugation, by keeping temperature constant (at 4 °C and 21 °C). Incubation duration was entered as a continuous term (levels: 0 = reference, 1 = 24 h, 2 = 48 h) in the model with betas representing the standardized mean difference in metabolite concentration per 24 h increment in incubation time at 4 °C and 21 °C, respectively. Next, we investigated the impact of post-storage conditions to estimate the standardized mean difference in metabolite concentration comparing delays in buffer addition and NMR profiling to the reference (levels: 0 = reference, 1 = buffer addition delay or NMR profiling delay).
Metabolic traits are highly correlated [38]. Nine principal components, in this study, explain at least 95% of the variance of serum and EDTA-plasma metabolome (see Text S1, Supplementary Materials), and this number is used to correct conventional p-value thresholds for nominal significance using the Bonferroni method (alpha = 0.05/9 = 0.006) [41,42]. Here, we focused on effect size and precision [43,44]. The rationale for defining the number of independent tests via principal component analysis (PCA) has been discussed previously [25,29,30], and more information is available in Text S1, Supplementary Materials.
In addition, to better appreciate the effects of pre-storage temperature and duration of incubation on the overall metabolic profile [45], we also used PCA [46]. In PCA scores plots, the metabolic profile of each sample is shown as a single data point, therefore each participant is represented by five data points corresponding to each pre-storage condition. Samples (data points) close to one another have similar metabolic composition in comparison to samples further apart [47]. Therefore, if pre-storage conditions do not impact metabolic profiles, samples will not cluster by pre-storage conditions.
Statistical analyses were conducted using R version 3.0.1 (R Foundation for Statistical Computing, Vienna, Austria).
5. Conclusions
Most serum and EDTA-plasma metabolic traits, quantified by Nightingale Health© NMR platform (87% of measured traits are lipid-related), are minimally affected by the pre- and post-storage conditions tested, with the exception of glycolysis metabolites, histidine, acetate and diacylglycerol. Potential bias, and its magnitude and direction, of association and causal effect estimates as a result of the impact of pre-analytical conditions, on some metabolites, will depend on the specific research questions being addressed. This should be considered for analyses with glycolsis metabolites, histadine, acetate and diaglycerol in particular. Furthermore, in large collaborations where data are pooled from different studies, the possible impact of pre-analytical conditions on between-study heterogeneity should be explored.
Acknowledgments
The authors are very grateful to all the participants who donated blood samples for this study. Sample processing and NMR analysis were carried out at the Bristol Bioresource Laboratory and the NMR Metabolomics facility at the University of Bristol.
Supplementary Materials
The following are available online at https://www.mdpi.com/2218-1989/9/4/64/s1, Text S1: Multiple testing correction: Principal component analysis (PCA) based Bonferroni correction, Figure S1: Serum, pre-storage handling effects (Spearman’s rank correlation), Figure S2: EDTA-plasma, pre-storage handling effects (Spearman’s rank correlation), Figure S3: Serum, post-storage handling effects (Spearman’s rank correlation), Figure S4: EDTA-plasma, post-storage handling effects (Spearman’s rank correlation), Figure S5: Pre-storage handling effects: standardized mean differences in metabolic trait concentration, in serum and EDTA-plasma, per 24 h increment in incubation duration at 4 °C and 21 °C, Figure S6: Post-storage handling effects: standardized mean differences in metabolic trait concentration, in serum and EDTA-plasma, comparing delays in sample preparation (i.e., 24 h delay in buffer addition) and NMR profiling (i.e., 24 h delay in NMR analysis) to the reference (no delays), for serum and plasma samples, Figure S7: Principal component analysis (PCA) on serum non-lipid-related metabolic traits subjected to five different pre-storage conditions, Table S1: Characteristics of metabolic traits: metabolic trait concentration (or value) in serum and EDTA-plasma samples, subjected to the reference pre- and post-storage conditions, from individuals who contributed to at least one pair of exposure–outcome analysis (N = 37), Table S2: Spearman’s correlation: serum, pre-storage handling effects. Spearman’s rank correlation coefficients between metabolic trait concentrations (or values) in reference samples (condition: 4 °C, 1.5 h) and samples incubated at (i) 4 °C, 24 h; (ii) 4 °C, 48 h; (iii) 21 °C, 24 h; and (iv) 21 °C, 48 h, before centrifugation, Table S3: Spearman’s correlation: EDTA-plasma, pre-storage handling effects. Spearman’s rank correlation coefficients between metabolic concentrations (or values) in reference samples (conditions: 4 °C, 1.5 h) and samples incubated at (i) 4 °C, 24 h; (ii) 4 °C, 48 h; (iii) 21 °C, 24 h; and (iv) 21 °C, 48 h, before centrifugation, Table S4: Spearman’s correlation: serum, post-storage handling effects. Spearman’s rank correlation coefficients between metabolic concentrations (or values) in reference conditions samples (i.e., no buffer addition delay or NMR analysis delay) and samples (i) left for 24 h before addition of sodium buffer followed by immediate NMR analysis (i.e., buffer delay); and (ii) addition of sodium buffer, then left for 24 h before NMR analysis (i.e., NMR delay), Table S5: Spearman’s correlation: EDTA-plasma, post-storage handling effects. Spearman’s rank correlation coefficients between metabolic concentrations (or values) in reference conditions samples (i.e., no buffer addition delay or NMR analysis delay) and samples (i) left for 24 h before addition of sodium buffer followed by immediate NMR analysis (i.e., buffer delay); and (ii) addition of sodium buffer, then left for 24 h before NMR analysis (i.e., NMR delay), Table S6: Serum, pre-storage handling effects: mean differences in metabolite concentrations (or trait value) per 24 h increment in incubation duration at 4 °C and 21 °C, for serum samples, Table S7: EDTA-plasma, pre-storage handling effects: mean differences in metabolite concentrations (or trait value) per 24 h increment in incubation duration at 4 °C and 21 °C, for EDTA-plasma samples, Table S8: Serum, post-storage handling effects: mean differences in metabolite concentrations (or trait value) comparing delays in buffer addition (i.e., buffer delay) and NMR profiling (i.e., NMR delay) to the reference (no delays), for serum samples, Table S9: EDTA-plasma, post-storage handling effects: mean differences in metabolite concentrations (or trait value) comparing delays in buffer addition (i.e., buffer delay) and NMR profiling (i.e., NMR delay) to the reference (no delays), for EDTA-plasma samples, Table S10: Literature table: summary of previous studies assessing the effects of pre- and post-storage handling conditions on serum or plasma metabolic traits also measured by Nightingale Health© NMR platform.
Appendix A
Appendix A.1. Supplementary Methods
Order of blood collection was (i) 5 × 5 mL serum SSTII (NHS supply chain KFK114), (ii) 1 × 6 mL K2 EDTA (NHS supply chain KFK286) and (iii) 4 × 4 mL K3 EDTA (NHS supply chain KFK042). Immediately after blood withdrawal, each of the serum and EDTA-plasma tubes was inverted 6 and 8–10 times, respectively. Serum and EDTA-plasma samples were stored on ice while transported to the laboratory.
Appendix A.2. Reference Samples
Serum tubes were placed upright in a tube rack in the refrigerator (4 °C) for 45 min to allow blood clotting, then centrifuged at speed 1500× g for 10 min at 18–25 °C (3K15, Sigma Laborzentrifugen, Osterode am Harz, Germany). EDTA-plasma tubes were centrifuged as soon as possible after blood withdrawal at 2260× g for 10 min at 4–5 °C (Heraeus Megafuge 16R, Thermo Scientific, city, state abbreviation, county).
Appendix A.3. Pre-storage Handling: Effect of Pre-Storage Delay and Temperature
Serum and plasma tubes were covered with tin foil and kept upright in a tube rack, and afterwards stored on a roller mixer in the dark (Bibby Scientific©,Stone, UK) at 4 °C or 21 °C, for 24 h or 48 h prior to centrifugation. Centrifugation was performed as per the reference protocol; subsequently, samples were aliquoted and immediately frozen and stored at −80 °C for one month until nuclear magnetic resonance (NMR) analysis.
Appendix A.4. Nuclear Magnetic Resonance (NMR) Metabolomics Platform
Up to 230 quantified metabolomic measures were obtained per sample of serum or EDTA-plasma, using a 1D proton (1H) NMR spectroscopy-based platform described previously [35,36,37]. Briefly, 260 μL plasma and 260 μL sodium phosphate buffer (75 mM Na2HPO4, 0.08% sodium 3-(trimethylsilyl) propionate-2,2,3,3-d4, 0.04% sodium azide in 80%/20% H2O/D2O, pH 7.4) were mixed and transferred to NMR tubes using an 8-channel, Varispan Janus liquid handling robot (PerkinElmer, Waltham, MA, USA). NMR spectra were acquired using a Bruker Avance III HD 500MHz spectrometer (Billerica, MA, USA) with a room temperature 5 mm, inverse triple resonance TXI probe and a Bruker Avance III HD 600 MHz spectrometer (Billerica, MA, USA) equipped with a nitrogen-cooled triple resonance probe (CryoProbe Prodigy TCI, Billerica, MA, USA). Both spectrometers were equipped with SampleJet auto-samplers with cooled (6 °C) sample storage. Spectra were acquired using standardized parameters via three NMR experiments or “molecular windows” to characterize lipoproteins, low molecular weight metabolites and lipids. Lipid spectra were acquired after a standardised lipid extraction procedure performed on each sample using a VIAFLO 96 channel electronic pipette (Integra Biosciences, Tardisstrasse, Switzerland). Data pre-processing and quantification were as previously described [35,36,37].
Appendix A.5. Statistical Analysis
Specification of random intercept model for analysis of pre-storage and post-storage conditions:
yij = β0 + µ0j + (β1)xij + eij, | (A1) |
where yij is the standardized or unstandardized (for figures and tables, respectively) metabolite concentration at each condition (xij) for individual j and β0 representing the average intercept of the model or the overall mean. Deviations from the average intercept for individual j is represented by µ0j. The eij term describes the deviation of the ith measurement on the jth individual from the mean standardized (or mean) metabolite concentration. This is the residual error term. For our analysis, β1 is the fixed parameter of interest, representing the standardized mean (or mean) difference in metabolite concentration for a 1-unit increase in xij. For analysis of pre-storage conditions, incubation duration was entered as continuous term in the model (levels: 0 = 1.5 h [reference], 1 = 24 h, 2 = 48 h) with β1 representing the standardized mean (or mean) difference in metabolite concentration for each 24 h increment in pre-storage incubation duration. This pre-storage condition model was run separately for samples incubated at 4 °C and 21 °C and for serum and EDTA-plasma samples. For analysis of post-storage conditions, β1 is estimated for each level of post-storage condition, representing the standardized mean (or mean) difference in metabolite concentration comparing delays in (a) post-storage buffer addition and (b) NMR profiling, to the reference (levels: 0 = no delays in buffer addition or NMR profiling [reference], 1 = buffer addition delay or NMR profiling delay).
Author Contributions
Conceptualization, D.A.L., H.J.M. and S.R.; methodology, D.L.S.F. and J.S.B.; formal analysis, D.L.S.F.; sample collection, A.G., V.Y. and J.L.M.; experimental work, H.J.M. and M.G.; writing—original draft preparation, D.L.S.F. and D.A.L.; writing—review and editing, all authors; funding acquisition, S.R., A.G. and D.A.L.
Funding
This work was supported by “Cohorts and Longitudinal Studies Enhancement Resources” (CLOSER), as part of a collaborative research program and was funded by the ESRC grant reference: ES/K000357/1. D.L.S.F., M.G., H.M., J.M., S.R., A.G., and D.A.L. work in a Unit that receives funds from the University of Bristol and the UK Medical Research Council (MC_UU_00011/6). D.A.L. is a UK National Institute of Health Research Senior Investigator (NF-0616-10102).
Conflicts of Interest
D.A.L. has received support from several national and international government and charitable funders as well as Medtronic LTD and Roche Diagnostics research that is not related to the study presented in this paper. The other authors report no conflicts. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
References
- 1.Boyd A., Golding J., Macleod J., Lawlor D.A., Fraser A., Henderson J., Molloy L., Ness A., Ring S., Davey Smith G.D. Cohort Profile: The ‘Children of the 90s’-the index offspring of the Avon Longitudinal Study of Parents and Children. Int. J. Epidemiol. 2013;42:111–127. doi: 10.1093/ije/dys064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Fraser A., Macdonald-Wallis C., Tilling K., Boyd A., Golding J., Davey Smith G., Henderson J., Macleod J., Molloy L., Ness A., et al. Cohort Profile: The Avon Longitudinal Study of Parents and Children: ALSPAC mothers cohort. Int. J. Epidemiol. 2013;42:97–110. doi: 10.1093/ije/dys066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wright J., Small N., Raynor P., Tuffnell D., Bhopal R., Cameron N., Fairley L., Lawlor D.A., Parslow R., Petherick E.S., et al. Cohort Profile: The Born in Bradford multi-ethnic family cohort study. Int. J. Epidemiol. 2013;42:978–991. doi: 10.1093/ije/dys112. [DOI] [PubMed] [Google Scholar]
- 4.Mindell J., Biddulph J.P., Hirani V., Stamatakis E., Craig R., Nunn S., Shelton N. Cohort Profile: The Health Survey for England. Int. J. Epidemiol. 2012;41:1585–1593. doi: 10.1093/ije/dyr199. [DOI] [PubMed] [Google Scholar]
- 5.Understanding Society The UK Household Longitudinal Study (UKHLS) [(accessed on 1 January 2019)];2009 Available online: https://www.understandingsociety.ac.uk/
- 6.Shah T., Engmann J., Dale C., Shah S., White J., Giambartolomei C., McLachlan S., Zabaneh D., Cavadino A., Finan C., et al. Population Genomics of Cardiometabolic Traits: Design of the University College London-London School of Hygiene and Tropical Medicine-Edinburgh-Bristol (UCLEB) Consortium. PLoS ONE. 2013;8 doi: 10.1371/annotation/89b51e89-a415-49c7-9caa-8dfcf6fde855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Albanes D., Moore S., Ulrich C., Stolzenberg-Solomon R., Poole E., Temprosa M., Verma M., Boerwinkle E., Casas J., Clish C., et al. COnsortium for METabolomics Studies (COMETS): Leveraging Resources to Accelerate Scientific Discovery. Faseb J. 2016;30:lb129. [Google Scholar]
- 8.Psaty B.M., O’Donnell C.J., Gudnason V., Lunetta K.L., Folsom A.R., Rotter J.I., Uitterlinden A.G., Harris T.B., Witteman J.C.M., Boerwinkle E., et al. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Design of Prospective Meta-Analyses of Genome-Wide Association Studies From 5 Cohorts. Circ.-Cardiovasc. Gene. 2009;2 doi: 10.1161/Circgenetics.108.829747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lehmann R. Preanalytics: What can metabolomics learn from clinical chemistry? Bioanalysis. 2015;7:927–930. doi: 10.4155/bio.15.23. [DOI] [PubMed] [Google Scholar]
- 10.Carraro P., Zago T., Plebani M. Exploring the Initial Steps of the Testing Process: Frequency and Nature of Pre-Preanalytic Errors. Clin. Chem. 2012;58:638–642. doi: 10.1373/clinchem.2011.175711. [DOI] [PubMed] [Google Scholar]
- 11.Lippi G., Guidi G.C., Mattiuzzi C., Plebani M. Preanalytical variability: The dark side of the moon in laboratory testing. Clin. Chem. Lab. Med. 2006;44:358–365. doi: 10.1515/CCLM.2006.073. [DOI] [PubMed] [Google Scholar]
- 12.Szecsi P.B., Odum L. Error tracking in a clinical biochemistry laboratory. Clin. Chem. Lab. Med. 2009;47:1253–1257. doi: 10.1515/CCLM.2009.272. [DOI] [PubMed] [Google Scholar]
- 13.World Health Organization . Use of Anticoagulants in Diagnostic Laboratory Investigations. World Health Organization; Geneva, Switzerland: 2002. [Google Scholar]
- 14.Anton G., Wilson R., Yu Z.H., Prehn C., Zukunft S., Adamski J., Heier M., Meisinger C., Romisch-Margl W., Wang-Sattler R., et al. Pre-analytical sample quality: Metabolite ratios as an intrinsic marker for prolonged room temperature exposure of serum samples. PLoS ONE. 2015;10:e0121495. doi: 10.1371/journal.pone.0121495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Key T., Oakes S., Davey G., Moore J., Edmond L.M., McLoone U.J., Thurnham D.I. Stability of vitamins A, C, and E, carotenoids, lipids, and testosterone in whole blood stored at 4 degrees C for 6 and 24 hours before separation of serum and plasma. Cancer Epidem. Biomar. 1996;5:811–814. [PubMed] [Google Scholar]
- 16.Clark S., Youngman L.D., Palmer A., Parish S., Peto R., Collins R. Stability of plasma analytes after delayed separation of whole blood: Implications for epidemiological studies. Int. J. Epidemiol. 2003;32:125–130. doi: 10.1093/ije/dyg023. [DOI] [PubMed] [Google Scholar]
- 17.Boyanton B.L., Jr., Blick K.E. Stability studies of twenty-four analytes in human plasma and serum. Clin. Chem. 2002;48:2242–2247. [PubMed] [Google Scholar]
- 18.Oddoze C., Lombard E., Portugal H. Stability study of 81 analytes in human whole blood, in serum and in plasma. Clin. Biochem. 2012;45:464–469. doi: 10.1016/j.clinbiochem.2012.01.012. [DOI] [PubMed] [Google Scholar]
- 19.Bernini P., Bertini I., Luchinat C., Nincheri P., Staderini S., Turano P. Standard operating procedures for pre-analytical handling of blood and urine for metabolomic studies and biobanks. J. Biomol. NMR. 2011;49:231–243. doi: 10.1007/s10858-011-9489-1. [DOI] [PubMed] [Google Scholar]
- 20.Hernandes V.V., Barbas C., Dudzik D. A review of blood sample handling and pre-processing for metabolomics studies. Electrophoresis. 2017;38:2232–2241. doi: 10.1002/elps.201700086. [DOI] [PubMed] [Google Scholar]
- 21.Dudzik D., Barbas-Bernardos C., Garcia A., Barbas C. Quality assurance procedures for mass spectrometry untargeted metabolomics. a review. J. Pharm. Biomed. Anal. 2018;147:149–173. doi: 10.1016/j.jpba.2017.07.044. [DOI] [PubMed] [Google Scholar]
- 22.Holland N.T., Smith M.T., Eskenazi B., Bastaki M. Biological sample collection and processing for molecular epidemiological studies. Mutat. Res.-Rev. Mutat. 2003;543:217–234. doi: 10.1016/S1383-5742(02)00090-X. [DOI] [PubMed] [Google Scholar]
- 23.Kirwan J.A., Brennan L., Broadhurst D., Fiehn O., Cascante M., Dunn W.B., Schmidt M.A., Velagapudi V. Preanalytical Processing and Biobanking Procedures of Biological Samples for Metabolomics Research: A White Paper, Community Perspective (for “Precision Medicine and Pharmacometabolomics Task Group”-The Metabolomics Society Initiative) Clin. Chem. 2018;64:1158–1182. doi: 10.1373/clinchem.2018.287045. [DOI] [PubMed] [Google Scholar]
- 24.Bervoets L., Louis E., Reekmans G., Mesotten L., Thomeer M., Adriaensens P., Linsen L. Influence of preanalytical sampling conditions on the H-1 NMR metabolic profile of human blood plasma and introduction of the Standard PREanalytical Code used in biobanking. Metabolomics. 2015;11:1197–1207. doi: 10.1007/s11306-015-0774-y. [DOI] [Google Scholar]
- 25.Breier M., Wahl S., Prehn C., Fugmann M., Ferrari U., Weise M., Banning F., Seissler J., Grallert H., Adamski J., et al. Targeted metabolomics identifies reliable and stable metabolites in human serum and plasma samples. PLoS ONE. 2014;9:e89728. doi: 10.1371/journal.pone.0089728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kamlage B., Maldonado S.G., Bethan B., Peter E., Schmitz O., Liebenberg V., Schatz P. Quality markers addressing preanalytical variations of blood and plasma processing identified by broad and targeted metabolite profiling. Clin. Chem. 2014;60:399–412. doi: 10.1373/clinchem.2013.211979. [DOI] [PubMed] [Google Scholar]
- 27.Nishiumi S., Suzuki M., Kobayashi T., Yoshida M. Differences in metabolite profiles caused by pre-analytical blood processing procedures. J. Biosci. Bioeng. 2018;125:613–618. doi: 10.1016/j.jbiosc.2017.11.011. [DOI] [PubMed] [Google Scholar]
- 28.Pinto J., Domingues M.R., Galhano E., Pita C., Almeida Mdo C., Carreira I.M., Gil A.M. Human plasma stability during handling and storage: Impact on NMR metabolomics. Analyst. 2014;139:1168–1177. doi: 10.1039/C3AN02188B. [DOI] [PubMed] [Google Scholar]
- 29.Jobard E., Tredan O., Postoly D., Andre F., Martin A.L., Elena-Herrmann B., Boyault S. A Systematic Evaluation of Blood Serum and Plasma Pre-Analytics for Metabolomics Cohort Studies. Int. J. Mol. Sci. 2016;17:2035. doi: 10.3390/ijms17122035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fliniaux O., Gaillard G., Lion A., Cailleu D., Mesnard F., Betsou F. Influence of common preanalytical variations on the metabolic profile of serum samples in biobanks. J. Biomol. NMR. 2011;51:457–465. doi: 10.1007/s10858-011-9574-5. [DOI] [PubMed] [Google Scholar]
- 31.Barton R.H., Nicholson J.K., Elliott P., Holmes E. High-throughput 1H NMR-based metabolic analysis of human serum and urine for large-scale epidemiological studies: Validation study. Int. J. Epidemiol. 2008;37(Suppl. 1):i31–i40. doi: 10.1093/ije/dym284. [DOI] [PubMed] [Google Scholar]
- 32.Teahan O., Gamble S., Holmes E., Waxman J., Nicholson J.K., Bevan C., Keun H.C. Impact of analytical bias in metabonomic studies of human blood serum and plasma. Anal. Chem. 2006;78:4307–4318. doi: 10.1021/ac051972y. [DOI] [PubMed] [Google Scholar]
- 33.Overfield C.V., Savory J., Heintges M.G. Glycolysis: A re-evaluation of the effect on blood glucose. Clin. Chim. Acta. 1972;39:35–40. doi: 10.1016/0009-8981(72)90296-3. [DOI] [PubMed] [Google Scholar]
- 34.Yin P., Lehmann R., Xu G. Effects of pre-analytical processes on blood samples used in metabolomics studies. Anal. Bioanal. Chem. 2015;407:4879–4892. doi: 10.1007/s00216-015-8565-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Soininen P., Kangas A.J., Wurtz P., Tukiainen T., Tynkkynen T., Laatikainen R., Jarvelin M.R., Kahonen M., Lehtimaki T., Viikari J., et al. High-throughput serum NMR metabonomics for cost-effective holistic studies on systemic metabolism. Analyst. 2009;134:1781–1785. doi: 10.1039/b910205a. [DOI] [PubMed] [Google Scholar]
- 36.Inouye M., Kettunen J., Soininen P., Silander K., Ripatti S., Kumpula L.S., Hamalainen E., Jousilahti P., Kangas A.J., Mannisto S., et al. Metabonomic, transcriptomic, and genomic variation of a population cohort. Mol. Syst. Biol. 2010;6:441. doi: 10.1038/msb.2010.93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Soininen P., Kangas A.J., Wurtz P., Suna T., Ala-Korpela M. Quantitative serum nuclear magnetic resonance metabolomics in cardiovascular epidemiology and genetics. Circ. Cardiovasc. Genet. 2015;8:192–206. doi: 10.1161/CIRCGENETICS.114.000216. [DOI] [PubMed] [Google Scholar]
- 38.Wurtz P., Kangas A.J., Soininen P., Lawlor D.A., Davey Smith G., Ala-Korpela M. Quantitative Serum Nuclear Magnetic Resonance Metabolomics in Large-Scale Epidemiology: A Primer on -Omic Technologies. Am. J. Epidemiol. 2017;186:1084–1096. doi: 10.1093/aje/kwx016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang Q., Ferreira D.L.S., Nelson S.M., Sattar N., Ala-Korpela M., Lawlor D.A. Metabolic characterization of menopause: Cross-sectional and longitudinal evidence. BMC Med. 2018;16 doi: 10.1186/s12916-018-1008-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Wang Q., Wurtz P., Auro K., Makinen V.P., Kangas A.J., Soininen P., Tiainen M., Tynkkynen T., Jokelainen J., Santalahti K., et al. Metabolic profiling of pregnancy: Cross-sectional and longitudinal evidence. BMC Med. 2016;14 doi: 10.1186/s12916-016-0733-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ferreira D.L.S., Williams D.M., Kangas A.J., Soininen P., Ala-Korpela M., Smith G.D., Jarvelin M.R., Lawlor D.A. Association of pre-pregnancy body mass index with offspring metabolic profile: Analyses of 3 European prospective birth cohorts. PLoS Med. 2017;14 doi: 10.1371/journal.pmed.1002376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gao X., Starmer J., Martin E.R. A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms. Genet. Epidemiol. 2008;32:361–369. doi: 10.1002/gepi.20310. [DOI] [PubMed] [Google Scholar]
- 43.Sterne J.A., Davey Smith G. Sifting the evidence—what’s wrong with significance tests? BMJ. 2001;322:226–231. doi: 10.1136/bmj.322.7280.226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wasserstein R.L., Lazar N.A. The ASA’s statement on p-values: Context, process, and purpose. Am. Stat. 2016;70:129–133. doi: 10.1080/00031305.2016.1154108. [DOI] [Google Scholar]
- 45.Saccenti E., Hoefsloot H.C.J., Smilde A.K., Westerhuis J.A., Hendriks M.M.W.B. Reflections on univariate and multivariate analysis of metabolomics data. Metabolomics. 2014;10:361–374. doi: 10.1007/s11306-013-0598-6. [DOI] [Google Scholar]
- 46.Jolliffe I.T., Cadima J. Principal component analysis: A review and recent developments. Philos. Trans. A Math. Phys. Eng. Sci. 2016;374:20150202. doi: 10.1098/rsta.2015.0202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bro R., Smilde A.K. Principal component analysis. Anal. Methods-Uk. 2014;6:2812–2831. doi: 10.1039/C3AY41907J. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.