Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2022 May 3;94(19):6919–6923. doi: 10.1021/acs.analchem.2c00466

1H NMR Signals from Urine Excreted Protein Are a Source of Bias in Probabilistic Quotient Normalization

Gonçalo D S Correia †,#,*, Panteleimon G Takis †,#,*, Caroline J Sands †,#, Anna M Kowalka ¶,§, Tricia Tan ¶,§, Lance Turtle , Antonia Ho , Malcolm G Semple ○,, Peter J M Openshaw , J Kenneth Baillie , Zoltán Takáts †,#, Matthew R Lewis †,#
PMCID: PMC9118196  PMID: 35503092

Abstract

graphic file with name ac2c00466_0007.jpg

Normalization to account for variation in urinary dilution is crucial for interpretation of urine metabolic profiles. Probabilistic quotient normalization (PQN) is used routinely in metabolomics but is sensitive to systematic variation shared across a large proportion of the spectral profile (>50%). Where 1H nuclear magnetic resonance (NMR) spectroscopy is employed, the presence of urinary protein can elevate the spectral baseline and substantially impact the resulting profile. Using 1H NMR profile measurements of spot urine samples collected from hospitalized COVID-19 patients in the ISARIC 4C study, we determined that PQN coefficients are significantly correlated with observed protein levels (r2 = 0.423, p < 2.2 × 10–16). This correlation was significantly reduced (r2 = 0.163, p < 2.2 × 10–16) when using a computational method for suppression of macromolecular signals known as small molecule enhancement spectroscopy (SMolESY) for proteinic baseline removal prior to PQN. These results highlight proteinuria as a common yet overlooked source of bias in 1H NMR metabolic profiling studies which can be effectively mitigated using SMolESY or other macromolecular signal suppression methods before estimation of normalization coefficients.


Urine is a complex chemical mixture which contains metabolic end-products from host and associated microbiota, xenobiotics, and dietary compounds1 in highly variable concentrations.2 Urinalysis is routinely used in the clinic for noninvasive diagnosis of local conditions of urinary tract pathology and infection (e.g., via measurement of leukocytes and nitrite), systemic metabolic disease (e.g., diabetes via glucose), and to assess environmental and nutritional exposure. The ease of sample collection and diagnostic potential has made urine a focal point for biofluid-based metabolomics studies.31H nuclear magnetic resonance (NMR) spectroscopy emerged early in the evolution of the field4 as a suitable platform for such investigations owing to its broad linear dynamic range, excellent reproducibility, high throughput, and quantitative accuracy.57

Urine exhibits strong intraday and interindividual variation in dilution owing to factors such as hydration status, kidney function, diet, medication, and voiding interval. This poses a fundamental challenge in urinalysis, especially of spot urine samples, that must be accounted for when accurately making or comparing chemical measurements8 in both clinical applications and metabolic profiling studies.9 Measurements of urinary creatinine, osmolality, specific gravity, or volume (in 24h collections) are routinely used in clinical biochemistry to account for variable dilution (e.g., albumin-to-creatinine ratio) and allow comparison to normal reference ranges.10 However, in metabolomics studies, the use of profiling technologies such as 1H NMR allows for the estimation of more robust normalization coefficients based on a broader view of the urinary metabolome. Probabilistic quotient normalization (PQN)11 leverages the data captured in the metabolic profile as a whole and is the “gold standard” for computational normalization of 1H NMR spectra in urinalysis studies. PQN uses the complete set of profile measurements to estimate a normalization coefficient for each sample that is representative of its dilution factor relative to a predefined reference (usually a median spectrum). Compared to other normalization methods (e.g., total area normalization), PQN provides robustness against bias from few signals whose intensity dominates the total profile integral. However, PQN is not suitable if a large proportion of variables (50% or more) covary systematically with factors other than the sample dilution.11

The presence of urinary proteins can exert a broad effect on 1H NMR-based metabolic profiles by elevating the spectral baseline and contributing broad signals in a concentration-dependent manner (Figure 1a).12 Because a large proportion of signals across the profile are affected by the presence of substantial proteinuria, the fitness of PQN normalization for samples representative of many phenotypes may be challenged. Proteinuria is frequently encountered in chronic kidney disease,13 diabetes14 and obesity,15 and increases with age.16 Surprisingly, a thorough literature search did not reveal any critical assessment of the impact of proteinuria in the estimation of normalization coefficients. The ongoing Covid-19 pandemic and organized efforts to understand its underlying pathophysiological effects have provided both a need and the opportunity to evaluate this relationship, as proteinuria is prevalent in COVID-19 patients, independent of other comorbidities, and associated with disease severity and patient survival.17,18

Figure 1.

Figure 1

(a) The 1H NMR spectrum of albumin (black line) compared to a real urine sample containing the same amount of albumin (∼7 mg/mL). (b) Eighty-five urine 1H NMR profiles from COVID-19 patients, focusing on the backbone −NH (left panel) and parts of methyl proteinic protons (right panel), showcasing the spectral baseline effect from the presence of proteinuria. (c) SMolESY application on the 1H NMR spectrum of albumin (∼7 mg/mL) and a real urine sample with the same amount of albumin. In both cases SMolESY succeeds in broad signals suppression as well as baseline homogenization, allowing the enhancement of sharp signals from small molecules (i.e., metabolites).

We used an 1H NMR-based metabolic profiling approach to analyze urine samples from patients (n = 1022 spot urine samples from 711 patients) admitted to hospital with COVID-19, collected by the International Severe Acute Respiratory and Emerging Infections Consortium (ISARIC) following the WHO Clinical Characterization Protocol UK (CCP-UK). Further NMR experimental details are reported in the Supporting Information. Figure 1b shows several urine 1H NMR profiles (n = 85) from COVID-19 patients, focusing on the backbone −NH and parts of methyl proteinic protons, clearly illustrating the effects of proteinuria on the small molecule profiles observed. The impact of macromolecular signals on the small molecule profile may be reduced when employing additional NMR experiments beyond the routine one-dimensional general profile experiment (i.e., transverse relaxation (T2) spectral editing experiments such as spin–echo pulse sequences19). However, the need for these additional experiments may not be anticipated at the outset of a urine profiling study, and their use both increases the experimental cost and decreases analysis throughput. We recently introduced small molecule enhancement spectroscopy (SMolESY)12 as a computational alternative for removal of macromolecule-derived signals directly from routine one-dimensional NMR spectral profiles without the need of extra experiments, enabling the more specific and direct analysis of small molecule analytes (Figures 1c, S1, and S2). The approach can also be effectively reversed, removing the sharp small molecule-derived signals and providing an enhanced protein baseline for the purposes of urinary protein quantification (Figure 2a, S3).20 Integration of the proteinic methyl group-containing spectral region between 0.2 and 0.5 ppm (Figure 2a) accurately represents the total amount of urinary protein as validated by comparison with turbidimetric measurement (Figure 2b). In the present study, we compared the PQN coefficients obtained from the standard one-dimensional (1D) 1H NMR spectra and their SMolESY processed counterparts (Figure 3). Although there is a good correlation between both measures (Pearson’s ρ = 0.89), there is also a visible trend in the deviations from the regression line associated with high levels of total protein quantified in each sample, confirming that proteinuria does influence the estimation of PQN coefficients. Urinary protein excretion alone explains approximately 42.3% (r2 = 0.423) of the variance in PQN coefficients estimated from the 1D 1H NMR spectra without macromolecular baseline removal (Figure 4a). This trend is greatly reduced (r2 = 0.163) when the macromolecular signature is removed via SMolESY prior to PQN (Figure 4b), closer to the observed association between urinary creatinine and protein concentration (r2 = 0.063, Figure S5).

Figure 2.

Figure 2

(a) Fifty 1H NMR spectra from the COVID-19 cohort focusing on the proteinic methyl group-containing spectral region between 0.2 and 0.5 ppm (upper panel). After processed SMolESY filtering, the resulting spectra (bottom panel) are free from small metabolites sharp signals (or the remaining signals contain the same negative/positive part with almost zero integral) and their integration provides an estimate of the total protein. (b) The absolute quantification of total urinary protein via NMR highly correlates to the measured protein concentration by clinical methods (for both NMR and clinical methods see Supporting Information).

Figure 3.

Figure 3

Agreement between PQN coefficients estimated from the standard 1H NMR spectra and the corresponding SMolESY processed data. The linear regression trendline (dashed red line) was estimated with the orthogonal least-squares Passing–Bablok method. The regression coefficients, Pearson correlation coefficient (ρ) and the p-value from the two-sided correlation significance test are shown in the top left corner. Data points are colored by the natural logarithm of the estimated protein concentration (mg/mL).

Figure 4.

Figure 4

PQN coefficient variance explained (r2, estimated from the linear regression models plotted in red) by protein concentration, in the (a) standard and (b) SMolESY processed NMR spectra. Protein values equal or below the limit of detection (LOD) = 0.11 mg/mL were excluded for this analysis (final n = 810). PQN coefficients were square root transformed, and urine protein measurements were log-transformed.

A comparison between the PQN coefficients and creatinine concentrations is shown in Figure 5. While creatinine concentration can be affected by multiple biological factors, PQN coefficients should still correlate with creatinine levels in spot urine samples, as observed in Figure 5a,b. Although the Pearson correlation between PQN coefficients and creatinine concentration is only marginally improved in the SMolESY data set (ρ = 0.71 vs ρ = 0.69, respectively) the residuals from the linear regression models (dashed red lines in Figure 5a,b) have a strong association with total urine protein (r2 = 0.43, Figure 5c) in the original spectra, a trend which is reduced in SMolESY data (r2 = 0.1, Figure 5d).

Figure 5.

Figure 5

Correlation between PQN coefficients estimated from (a) the standard 1H NMR or (b) SMolESY processed spectra and creatinine. Pearson correlation coefficient and p-value from the two-sided correlation significance test are shown in each figure. Data points are colored by the natural logarithm of the estimated protein concentration. PQN coefficients and creatinine measurements were square root transformed. (c,d) The residuals from the OLS regression trendlines (dashed red lines in panels a and b) and their residual association with total urine protein (log-transformed). Protein values equal or below the LOD = 0.11 mg/mL were excluded in panels c and d (final n = 810).

Figure 6 shows the principal component analysis (PCA) scores plots obtained when the SMolESY processed data set is normalized using PQN coefficients estimated from the standard 1D NMR spectra (Figure 6a) or from SMolESY data (Figure 6b). Despite the similarity in major trends in both plots, clustering based on protein concentration is more marked when spectra are normalized with the standard PQN procedure (Figure 6a).

Figure 6.

Figure 6

PCA scores plots for the SMolESY processed data set normalized with the PQN coefficients estimated from the (a) standard 1D and (b) SMolESY processed NMR spectra. NMR data was unit-variance scaled. Protein values were log-transformed and values equal or below LOD were imputed by replacement with the LOD value = 0.11 mg/mL.

Normalization procedures are crucial to correctly interpret urinary metabolic profiles. Here, we show that protein signals can confound probabilistic quotient normalization, and it is reasonable to assume this could also happen with other computational normalization methods. We recommend the removal of protein baseline signals prior to estimation of normalization coefficients. This can be performed experimentally21 or with computational methods.22,23 However, we advocate the use of SMolESY, because of its ease of application to 1H NMR spectra (including retrospective application where proteinuria is observed after the fact), being fast and highly effective method for removing broad baseline signals from protein and improving the estimation of normalization coefficients. Our observations and proposed methodology are of high importance for the accurate normalization of urine biofluid 1H NMR spectra, especially in the context of studies on diseases and phenotypes where proteinuria is likely to be present (e.g., diabetes, chronic kidney disease, pregnancy, infection, or protein rich diet).24

Acknowledgments

This research is supported by grants from the National Institute for Health Research (NIHR) [award CO-CIN-01]; the Medical Research Council [grant MC_PC_19059, MC_PC_12025], the MRC UK Consortium for MetAbolic Phenotyping (MAP/UK) [grant number MR/S010483/1], and by the NIHR Health Protection Research Unit (HPRU) in Emerging and Zoonotic Infections at University of Liverpool, in partnership with Public Health England (PHE), in collaboration with the Liverpool School of Tropical Medicine and the University of Oxford [award 200907]; NIHR HPRU in Respiratory Infections at Imperial College London with Public Health England (PHE) [award 200927]; infrastructure support was provided by the NIHR Imperial Biomedical Research Centre (BRC).

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.2c00466.

  • Urine samples collection details; NMR experiments/apparatus; NMR samples preparation SOPs; SMolESY data preparation; urinary creatinine quantification by 1H NMR; urinary total protein quantification by clinical methods and 1H NMR/SMolESY; statistical analyses details; PQN coefficients comparison with urinary creatinine concentration and osmolality measurements by clinical methods; correlation between urinary creatinine concentration and protein excretion; comparison between CPMG and SMolESY estimated PQN coefficients in human plasma heparin 1H NMR spectra; supporting references (PDF)

Author Contributions

G.D.S.C. and P.G.T. contributed equally.

The authors declare no competing financial interest.

Notes

This work was done on behalf of and used human samples collected by the ISARIC4C consortium. The full membership list can be found in the following link: https://isaric4c.net/about/authors/

Supplementary Material

ac2c00466_si_001.pdf (1.9MB, pdf)

References

  1. Bouatra S.; Aziat F.; Mandal R.; Guo A. C.; Wilson M. R.; Knox C.; Bjorndahl T. C.; Krishnamurthy R.; Saleem F.; Liu P.; Dame Z. T.; Poelzer J.; Huynh J.; Yallou F. S.; Psychogios N.; Dong E.; Bogumil R.; Roehring C.; Wishart D. S. The Human Urine Metabolome. PLoS One 2013, 8 (9), e73076 10.1371/journal.pone.0073076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Takis P. G.; Schäfer H.; Spraul M.; Luchinat C. Deconvoluting Interrelationships between Concentrations and Chemical Shifts in Urine Provides a Powerful Analysis Tool. Nat. Commun. 2017, 8, 1662. 10.1038/s41467-017-01587-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Dinges S. S.; Hohm A.; Vandergrift L. A.; Nowak J.; Habbel P.; Kaltashov I. A.; Cheng L. L. Cancer Metabolomic Markers in Urine: Evidence, Techniques and Recommendations. Nat. Rev. Urol. 2019, 16 (6), 339–362. 10.1038/s41585-019-0185-3. [DOI] [PubMed] [Google Scholar]
  4. Nicholson J. K.; O’Flynn M. P.; Sadler P. J.; Macleod A. F.; Juul S. M.; Sönksen P. H. Proton-Nuclear-Magnetic-Resonance Studies of Serum, Plasma and Urine from Fasting Normal and Diabetic Subjects. Biochem. J. 1984, 217 (2), 365–375. 10.1042/bj2170365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Takis P. G.; Ghini V.; Tenori L.; Turano P.; Luchinat C. Uniqueness of the NMR Approach to Metabolomics. TrAC - Trends Anal. Chem. 2019, 120, 115300. 10.1016/j.trac.2018.10.036. [DOI] [Google Scholar]
  6. Keun H. C.; Ebbels T. M. D.; Antti H.; Bollard M. E.; Beckonert O.; Schlotterbeck G.; Senn H.; Niederhauser U.; Holmes E.; Lindon J. C.; Nicholson J. K. Analytical Reproducibility in 1H NMR-Based Metabonomic Urinalysis. Chem. Res. Toxicol. 2002, 15 (11), 1380–1386. 10.1021/tx0255774. [DOI] [PubMed] [Google Scholar]
  7. Beckonert O.; Keun H. C.; Ebbels T. M. D.; Bundy J.; Holmes E.; Lindon J. C.; Nicholson J. K. Metabolic Profiling, Metabolomic and Metabonomic Procedures for NMR Spectroscopy of Urine, Plasma, Serum and Tissue Extracts. Nat. Protoc. 2007, 2 (11), 2692–2703. 10.1038/nprot.2007.376. [DOI] [PubMed] [Google Scholar]
  8. Edmands W. M. B.; Ferrari P.; Scalbert A. Normalization to Specific Gravity Prior to Analysis Improves Information Recovery from High Resolution Mass Spectrometry Metabolomic Profiles of Human Urine. Anal. Chem. 2014, 86 (21), 10925–10931. 10.1021/ac503190m. [DOI] [PubMed] [Google Scholar]
  9. Blaise B. J.; Correia G. D. S.; Haggart G. A.; Surowiec I.; Sands C.; Lewis M. R.; Pearce J. T. M.; Trygg J.; Nicholson J. K.; Holmes E.; Ebbels T. M. D. Statistical Analysis in Metabolic Phenotyping. Nat. Protoc. 2021, 16, 4299–4326. 10.1038/s41596-021-00579-1. [DOI] [PubMed] [Google Scholar]
  10. Helmersson-Karlqvist J.; Ärnlöv J.; Larsson A. Day-to-Day Variation of Urinary NGAL and Rational for Creatinine Correction. Clin. Biochem. 2013, 46 (1–2), 70–72. 10.1016/j.clinbiochem.2012.09.022. [DOI] [PubMed] [Google Scholar]
  11. Dieterle F.; Ross A.; Schlotterbeck G.; Senn H. Probabilistic Quotient Normalization as Robust Method to Account for Dilution of Complex Biological Mixtures. Application In1H NMR Metabonomics. Anal. Chem. 2006, 78 (13), 4281–4290. 10.1021/ac051632c. [DOI] [PubMed] [Google Scholar]
  12. Takis P. G.; Jiménez B.; Sands C. J.; Chekmeneva E.; Lewis M. R. SMolESY: An Efficient and Quantitative Alternative to on-Instrument Macromolecular 1H-NMR Signal Suppression. Chem. Sci. 2020, 11 (23), 6000–6011. 10.1039/D0SC01421D. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Garg A. X.; Kiberd B. A.; Clark W. F.; Haynes R. B.; Clase C. M. Albuminuria and Renal Insufficiency Prevalence Guides Population Screening: Results from the NHANES III. Kidney Int. 2002, 61 (6), 2165–2175. 10.1046/j.1523-1755.2002.00356.x. [DOI] [PubMed] [Google Scholar]
  14. Karalliedde J.; Viberti G. Proteinuria in Diabetes: Bystander or Pathway to Cardiorenal Disease?. J. Am. Soc. Nephrol. 2010, 21 (12), 2020–2027. 10.1681/ASN.2010030250. [DOI] [PubMed] [Google Scholar]
  15. Rosenstock J. L.; Pommier M.; Stoffels G.; Patel S.; Michelis M. F. Prevalence of Proteinuria and Albuminuria in an Obese Population and Associated Risk Factors. Frontiers in Medicine 2018, 5, 122. 10.3389/fmed.2018.00122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Sureshkumar K. K.; Ray T.; Clark B. A. Evaluation and Outcome of Proteinuria in Older and Younger Adults. Journals Gerontol. Ser. A 2003, 58 (4), M378–M381. 10.1093/gerona/58.4.M378. [DOI] [PubMed] [Google Scholar]
  17. Chaudhri I.; Moffitt R.; Taub E.; Annadi R. R.; Hoai M.; Bolotova O.; Yoo J.; Dhaliwal S.; Sahib H.; Daccueil F.; Hajagos J.; Saltz M.; Saltz J.; Mallipattu S. K.; Koraishy F. M. Association of Proteinuria and Hematuria with Acute Kidney Injury and Mortality in Hospitalized Patients with COVID-19. Kidney Blood Press. Res. 2020, 45 (6), 1018–1032. 10.1159/000511946. [DOI] [PubMed] [Google Scholar]
  18. Ouahmi H.; Courjon J.; Morand L.; François J.; Bruckert V.; Lombardi R.; Esnault V.; Seitz-Polski B.; Demonchy E.; Dellamonica J.; Boyer-Suavet S. Proteinuria as a Biomarker for COVID-19 Severity. Front. Physiol. 2021, 12, 12. 10.3389/fphys.2021.611772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Carr H. Y.; Purcell E. M. Effects of Diffusion on Free Precession in Nuclear Magnetic Resonance Experiments. Phys. Rev. 1954, 94 (3), 630–638. 10.1103/PhysRev.94.630. [DOI] [Google Scholar]
  20. Vuckovic I.; Denic A.; Charlesworth M. C.; Šuvakov M.; Bobart S.; Lieske J. C.; Fervenza F. C.; Macura S. 1H Nuclear Magnetic Resonance Spectroscopy-Based Methods for the Quantification of Proteins in Urine. Anal. Chem. 2021, 93 (39), 13177–13186. 10.1021/acs.analchem.1c01618. [DOI] [PubMed] [Google Scholar]
  21. Nagana Gowda G. A.; Raftery D. Quantitating Metabolites in Protein Precipitated Serum Using NMR Spectroscopy. Anal. Chem. 2014, 86 (11), 5433–5440. 10.1021/ac5005103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Worley B.; Sisco N. J.; Powers R. Statistical Removal of Background Signals from High-Throughput 1H NMR Line-Broadening Ligand-Affinity Screens. J. Biomol. NMR 2015, 63 (1), 53. 10.1007/s10858-015-9962-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Brown C. L. J.; Scott H.; Mulik C.; Freund A. S.; Opyr M. P.; Metz G. A. S.; Douglas Inglis G.; Montina T. Fecal 1H-NMR Metabolomics: A Comparison of Sample Preparation Methods for NMR and Novel in Silico Baseline Correction. Metabolites 2022, 12 (2), 148. 10.3390/metabo12020148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Carroll M. F.; Temte J. L. Proteinuria in Adults: A Diagnostic Approach. Am. Fam. Physician 2000, 62 (6), 1333–1340. [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ac2c00466_si_001.pdf (1.9MB, pdf)

Articles from Analytical Chemistry are provided here courtesy of American Chemical Society

RESOURCES