Skip to main content
NIST Author Manuscripts logoLink to NIST Author Manuscripts
. Author manuscript; available in PMC: 2017 Sep 28.
Published in final edited form as: Metrologia. 2016 Sep 28;53(5):1193–1203. doi: 10.1088/0026-1394/53/5/1193

Chemical purity using quantitative 1H-nuclear magnetic resonance: a hierarchical Bayesian approach for traceable calibrations

Blaza Toman 1, Michael A Nelson 1, Katrice A Lippa 1
PMCID: PMC5486237  NIHMSID: NIHMS868463  PMID: 28670006

Abstract

Chemical purity assessment using quantitative 1H-nuclear magnetic resonance spectroscopy is a method based on ratio references of mass and signal intensity of the analyte species to that of chemical standards of known purity. As such, it is an example of a calculation using a known measurement equation with multiple inputs. Though multiple samples are often analyzed during purity evaluations in order to assess measurement repeatability, the uncertainty evaluation must also account for contributions from inputs to the measurement equation. Furthermore, there may be other uncertainty components inherent in the experimental design, such as independent implementation of multiple calibration standards. As such, the uncertainty evaluation is not purely bottom up (based on the measurement equation) or top down (based on the experimental design), but inherently contains elements of both. This hybrid form of uncertainty analysis is readily implemented with Bayesian statistical analysis. In this article we describe this type of analysis in detail and illustrate it using data from an evaluation of chemical purity and its uncertainty for a folic acid material.

Keywords: measurement uncertainty, observation equation, measurement equation, internal calibration, Markov chain Monte Carlo

1. Introduction

Nuclear magnetic resonance (NMR) spectroscopy is a prominent measurement technique for chemical analysis. Quantitative solution-state NMR (qNMR) is considered a primary ratio measurement method, through which ratio quantities of chemical substances can be determined directly [15]. For this reason, qNMR is highly useful for traceable chemical purity assessments [610].

NMR spectroscopy is a measurement method for isotopes that have nonzero nuclear spins, and thus a magnetic moment and angular momentum. Each of these isotopes has opposite spin states (‘ + ’ and ‘ − ’) with equivalent energy, dependent upon neutron and proton composition, that are characterized by a spin quantum number (s). The NMR measurement technique is based on the perturbation of these nuclear magnetic moments when placed within a strong external magnetic field and radiation of a specific (Larmor) radio frequency is absorbed. Electron spins surrounding the nucleus induce opposing magnetic fields, and as such, this resonance is also dependent on the surrounding electronic environment; that is, the molecular structure(s) of the sample material.

After the radiation absorption has ceased, the perturbed magnetic moments relax and precess about the external magnetic field at the corresponding resonance frequencies. The net sum of magnetic precessions induces voltage oscillations that are observable as a time domain signal, known as a free induction decay (FID). This signal is amplified, then converted to an audio frequency signal and digitized, then lastly Fourier transformed to obtain NMR spectral data in the frequency domain. Each individual signal in the spectrum therefore represents a structurally distinct molecular moiety of the measured sample.

The amplitude of each spin component of the FID is directly proportional to the number of corresponding resonant nuclei, and thus the signal intensities of the Fourier transformed spectra can be used to directly infer the ratio amount of nuclei for each unique resonance [1113]. It is for this reason that qNMR, when performed with proper experimental conditions, is considered a direct primary ratio measurement method for quantitative chemical analysis. The common isotope of hydrogen, 1H (≈99.9% abundance, s = 1/2, 2 spin states), is the usual nucleus for high-precision qNMR as it provides the highest analytical sensitivity and excellent linearity of signal intensity with respect to 1H concentration (R ⩾ 0.999) [12, 14]. Figure 1 illustrates the qNMR measurement of folic acid (FA) dissolved in deuterated aqueous (D2O) phosphate buffer.

Figure 1.

Figure 1

1H-NMR measurement data for folic acid in D2O phosphate buffer as: (a) time domain FID and (b) Fourier transformed spectrum.

Precise and unbiased quantity determinations of the purity of (soluble) liquid or solid chemical materials may be made via ratio reference of mass and NMR signal intensity to those of primary chemical standards of known purity [15]. The main component of these standards must contain the appropriate reference nuclei, as well as a distinct spectral peak and common solubility, but they are otherwise not compound-specific. The 1H-qNMR assessment described in this article is an evaluation of the mass purity, PPC, of a neat chemical folic acid material. The measurement equation used to derive this quantity is presented as equation (1).

PPC=(NISNPC)×(MPCMIS)×(APCAIS)×(mISmPC)×PIS (1)

where:

  • NPC multiplicity (# H/peak) of primary chemical species signal

  • NIS multiplicity (# H/peak) of internal reference standard species signal

  • MPC relative molar mass (molecular weight, g mol−1) of primary species

  • MIS relative molar mass (molecular weight, g mol−1) of the internal standard species

  • APC integrated area of primary species signal

  • AIS integrated area of the internal reference standard species signal

  • mPC mass (g) of the composite material weighed for sample solution, adjusted for buoyancy effects

  • mIS mass (g) of the internal reference standard weighed for sample solution, adjusted for buoyancy effects

  • PISknown purity (g g−1) of the internal reference standard

Due to chemical and spectroscopic limitations of the measurement system, purity determinations were first made for select secondary reference materials that were suitable for measurement with the folic acid species. For this evaluation, two high-purity (PIS > 99.99%) primary reference standards, Standard Reference Material (SRM) 350b Benzoic acid (BA) and SRM 84k Potassium Hydrogen Phthalate (KHP), were used as internal calibrators to determine the mass purity of two chemically-distinct secondary reference materials (PIS > 99.9%): methylsulfonylmethane (MSM, also known as dimethyl sulfone) and 2,2-dimethylpropanedioic acid (Me2PDA, also known as dimethylmalonic acid). With this experimental design, the evaluated mass purity of folic acid is traceable to the certified mass fraction purity of both primary standards.

Section 2 describes the statistical model used to evaluate purity of the primary chemical species based on a qNMR analysis. Section 3 describes the implementation of this model during the investigation performed to evaluate the purity of a neat folic acid material. Conclusions are given in section 4.

2. The measurement model in terms of observation equations

The measurement equation (equation (1)) is based on four different measured quantities: APC, AIS, mPC, and mIS. The remaining input quantities are either known constants, NPC, NIS, MPC, and MIS, or quantities with a mean and uncertainty specified by a certificate, PIS. Though molecular masses (MPC, and MIS) are imperfectly known due to natural variation in isotopic abundances, the slight degree of uncertainty is relatively inconsequential. Given such data, the most common approach to the evaluation of purity and its uncertainty would be to apply a standard Guide to the expression of uncertainty in measurement (GUM) [16] analysis to each individual sample’s inputs and propagate the uncertainties through the measurement equation. This would produce a purity estimate and associated uncertainty for each sample. GUM-style analyses are readily accomplished ‘by hand’ or using software such as the NIST Uncertainty Machine [17].

One potentially critical factor for chemical mass purity evaluations is the need to constrain the result to lie within the interval (0 g g−1, 1 g g−1). The usual GUM analysis does not naturally preserve this constraint, which is especially relevant for highly pure materials (PPC > 99%). But whether or not a constraint is necessary, in order to capture the between sample variability (reproducibility uncertainty) of multiple samples, the individual results must be combined to obtain a final purity estimate and uncertainty. When multiple samples with different internal standards are used, as was the case for the evaluation that we describe in detail in the following sections, accounting for all sources of uncertainty in a rigorous manner is not straightforward since the outputs from the measurement equation for single samples may not be statistically independent due to shared sources of uncertainty attributable to the experimental design. Thus, the evaluation of measurement uncertainty of the purity estimate includes both bottom-up elements, that is, uncertainty in the inputs to the measurement equation propagated through the function, and top-down elements, being sources of uncertainty due to factors that are part of the experimental design and whose contribution to uncertainty is accounted for using the variability of the measurements.

A statistically rigorous method of combining these bottom-up and top-down elements of the uncertainty analysis, while also preserving any natural physical and chemical constraints, is best achieved using an observation equation [18] as an integral part of a Bayesian statistical model [19]. An observation equation is a statistical model for the measurements, which in this case are APC, AIS, mPC, and mIS. While uniquely able to account for all of the various constraints and uncertainty sources present in a measurement process, Bayesian analysis requires so called prior distributions for all inputs that are not known constants. These prior distributions summarize all of the information about such quantities that is available before the measurements are obtained.

In order to specify the statistical model for the area measurements, we first note that equation (1) can be written as

PPC×NPCMPC×mPCAPC=PIS×NISMIS×mISAIS=K.

It then follows that:

PIS×NISMIS=K×AISmIS. (2)

Since NIS and MIS are known constants, the constant K converts the ratio of the measurement of the integrated area of the signal to the mass of the material in the sample solution into a measurement of chemical purity. Since the purity of the internal standard is known up to an uncertainty, it possible to estimate K and use it to estimate the purity of the analyte.

To estimate K we first note that

AIS=(PIS×NIS×mISMIS)1K,

a relationship which will define the expected value of the measurement of AIS. We term the uncertainty for this measurement, uAIS. NMR area measurements can generally be represented by a Gaussian distribution, or when uAIS is based on known degrees of freedom, by a Student t distribution. Measurements of sample mass, mIS, can also be represented as Gaussian or Student t distributions with mean mass μIS and uncertainty umIS. Assuming Gaussian distributions, the observation equations for AIS and mIS are therefore:

AIS~N((PIS×NIS×μISMIS)1K,uAIS2) (3)

and

mIS~N(μIS,umIS2)

where the symbol ‘~’ is interpreted ‘distributed as’ and N(,) specifies a Gaussian distribution with given mean and variance. When the uncertainties uAIS or umIS are based on known degrees of freedom, Student t distributions, t(,), should be used instead of N(,).

The main objective of these internal standard-based observation equations is to obtain information about K to be used in the analysis of similar observation equations for the primary chemical species. Generally, the information about the purity of the internal standard is given in a certificate, value μIS and uncertainty uPIS, and can be transformed into a probability distribution. If no constraint on purity is necessary, a Gaussian distribution with the given mean and variance, PIS~N(μIS,uPIS2), may be used. If purity must be constrained to lie between 0 and 1 then the beta distribution

PIS~Beta(a,b)

where

a=μPIS[μPIS(1μPIS)uPIS21],b=(1μPIS)[μPIS(1μPIS)uPIS21]

is appropriate.

Without additional information, a rectangular distribution on the interval (0, c) for some constant c can be used as a prior for K: K ~ R(0, c). Again without additional information, a Gaussian distribution with mean 0 and a large variance is appropriate for μIS: μIS~N(0, large. In our application we found the results to be robust with respect to the choice of the variance of μIS.

While the posterior distribution, which summarizes all the information resulting from a Bayesian analysis, of K cannot be obtained in closed form, a Markov chain Monte Carlo (MCMC) [19] analysis using the free software OpenBUGS [20] is straightforward and produces a sample of random draws from this posterior distribution. The OpenBUGS code is given in the appendix.

The main objective of the analysis described in this article is the estimation of the purity of the primary chemical species. This is accomplished through Bayesian analysis of two observation equations for APC and mPC:

APC~N((PPC×NPC×μPCMPC)1K,uAPC2) (4)

and

mPC~N(μPC,umPC2)

where the quantity of interest now is not K, but the measurand PPC.

As above, all unknown constants need to have prior probability distributions. The analysis of equation (3) produces a random sample from the posterior distribution for K which the OpenBUGS program can use as a prior distribution. A suitable prior distribution for mPC is again Gaussian distribution with mean 0 and a large variance μPCPC ~ N (0, large).

A prior distribution for the measurand PPC needs to be defined expressly for each application, as this is where the specifics of the experimental design are expressed in the form of constraints and/or additional structure of the prior distribution. For example, purity of different samples analyzed using a single standard may be somewhat different due to slightly different measurement conditions or heterogeneity of the material, but the differences would usually be smaller than differences in purity of samples analyzed using two different internal standards. Such details can be modeled using a second level in the prior distribution of PPC creating a so called hierarchical structure.

Evaluation of equations (3) and (4) performed using MCMC preserves all correlations present in the statistical model. The resulting posterior distribution of PPC can be summarized in terms of a mean, standard deviation, and 95% uncertainty interval to produce the desired purity estimate.

3. Examples of purity determination

This section shows application of equations (3) and (4) to the specific case of purity evaluation of folic acid. The OpenBUGS code used for the analysis is given in the appendix.

3.1. Experimental design

For the evaluation of each secondary reference material, four samples containing BA as internal standard and three containing KHP were dissolved in per-deuterated dimethyl sulfoxide, DMSO-d6 (99.9% D atom purity; Cambridge Isotope Laboratories (CIL), Cambridge, MA). These secondary reference materials were evaluated for use in folic acid (FA) measurements because they have 1H resonance signals that do not interfere with those of FA and are mutually soluble in aqueous solutions. For purity determination of FA, four samples containing the MSM and three samples containing the Me2PDA secondary standard were prepared in a K2HPO4/KD2PO4 aqueous (99.99% D atom purity D2O, CIL) buffer solution. Approximately 4 mg–15 mg masses of neat FA and internal standard materials were weighed using an ultra-microbalance (Mettler Toledo UMX5; Columbus, OH).

Experimental NMR data was acquired by a Bruker Avance 600 MHz spectrometer equipped with a 5 mm broadband inverse (BBI) detection probe and operating with Topspin (Version 3.2) software. Experiments were performed with 128 scan repetitions (60 s delay time), a spectral width of 20.0276 ppm, and transmitter frequency offset (O1) of 6.175 ppm. A 90° excitation pulse width was calibrated for each sample. Data acquisition time was 5.45 s for each scan and 131 072 data points were collected for each FID. The probe temperature was 298 K. No 13C decoupling was executed during data acquisition. Processing of the Fourier transformed 1H spectra, including baseline corrections, phase adjustment, and signal integration, was performed manually.

3.2. Purity of the secondary internal standards MSM and Me2PDA

During this investigation and for the reasons described herein, purity of FA was evaluated using two different internal standards, MSM and Me2PDA. The inputs for the purity evaluation of MSM are given in table 1. Masses and areas are based on three measurements, and therefore the uncertainties of the measurements are associated with 2 degrees of freedom. Three or four different samples containing neat chemical and internal standard materials were prepared for each of the purity assessments.

Table 1.

Inputs for Secondary Internal Standard MSM.

Internal standard SRM 350b benzoic acid (BA) SRM 84 k potassium hydrogen phthalate (KHP)

Parameters Sample 1 Sample 2 Sample 3 Sample 4 Sample 1 Sample 2 Sample 3
MPC (g mol−1) 94.13 94.13 94.13 94.13 94.13 94.13 94.13
MIS (g mol−1) 122.121 122.121 122.121 122.121 204.22 204.22 204.22
NIS 1 1 1 1 1 1 1
NPC 1 1 1 1 1 1 1
APC 1411 977 105 975 476 967 1460 479 190 1044 400 884 2049 421 640 1735 241 388 1751 551 403
uAPC 413 545.106 144 615.57 246 609.241 104 452.893 243 774.687 580 978.61 364 045.672
AIS 750028 937 623003 497 518822 717 671799 916 456562022 814572865 747400179
uAIS 73363.6013 349365.9 116331.207 165393.397 40977.0251 360632556 18405.9296
mPC (mg) 13.4375 9.2011 14.3194 9.9598 10.3778 8.2405 6.3565
umPC(mg) 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005
mIS (mg) 9.2704 7.6297 6.6060 8.3209 5.0207 8.4030 5.8872
umIS (mg) 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005
μPIS (g g−1) 0.999 978 0.999 978 0.999 978 0.999 978 0.999 911 0.999 911 0.999 911
uPIS(g g−1) 0.000 044 0.000 044 0.000 044 0.000 044 0.000 054 0.000 054 0.000 054

We specified the prior for PPC. as PPC~N(μPC,uμPC2), assigning μPC the non-informative prior μPC~N(0,d) with d set to a large value. We assign uμPC a non-informative gamma distribution prior, uμPC~ Gamma (e, f) with e and f set as large numbers, that captures the non-negativity of standard deviations. The calculated purities of the secondary internal standards, PIS in corresponding calculations of folic acid purity, were not constrained to a maximum of 1. Given that qNMR is a relative measurement, the effective purity of the internal standard is a normalized ratio of select resonant 1H content with respect to mIS, rather than an absolute chemical mass fraction. Therefore, the PIS may exceed the natural limit of MSM or Me2PDA purity. This may be the case if an assumed MIS is not consistent with the actual isotopic composition, or if there is a systematic bias of AIS associated with an indiscernible chemical impurity. Several qNMR evaluations of the high purity MSM secondary standard over multiple years and by multiple analysts indicate that PIS is greater than 1 if a molecular weight of 94.13 g mol−1 is assumed.

The MCMC posterior distribution of PPC was summarized as a mean and standard deviation. The resulting values for each sample and internal standard are given in table 2.

Table 2.

Summary outputs for purity of secondary internal standard MSM.

Internal standard Sample Posterior mean (g g−1) Posterior standard deviation (g g−1)
BA 1 1.001 0.0004
2 1.0007 0.0024
3 1.0009 0.0011
4 1.0011 0.0008

KHP 1 1.0009 0.0004
2 1.0012 0.0004
3 1.0004 0.0003

The observation equation model as specified in equations (3) and (4) represents only the bottom-up elements of the analysis, and the results in table 2 are essentially identical to the usual uncertainty analysis obtained with the GUM procedure [16].

The top-down elements in the uncertainty quantification, that is, the between sample and between internal standard variability, can be accounted for in a separate layer of the observation equation model. This is preferable to simply averaging or otherwise combining the entries in table 2 because the values of purity for the samples based on the same internal standard are correlated, and the value of this correlation would need to be estimated to produce an accurate uncertainty for the final purity estimate. A hierarchical model [21] on PPC accomplishes this naturally, without the need for separate estimation of this correlation. We define

PPCij~N(αi,τi2),i=1,2;j=1,,3 or 4 (5)

and

αi~N(μP1,ω2)

where i = 1 for BA, and i = 2 for KHP. Here the parameters τi quantify the between-sample variability for each internal standard and the ω quantifies the between-internal standard variability, which is also the basis of the correlation between the sample values. Note that this is the usual random effects model [22], widely used to account for additional uncertainty in interlaboratory studies.

The final estimate of purity is the posterior mean of μP1. For MSM this is 1.0009, with standard uncertainty = 0.0009. For the evaluation of folic acid purity these are interpreted as PIS~N (1.0009, 0.00092). Figure 2 shows the posterior distribution of μP1

Figure 2.

Figure 2

Posterior distribution of purity (g g−1) for MSM. Open square is the posterior mean, solid circles are the endpoints of a 95% uncertainty interval.

The same procedure was followed to obtain a purity estimate for the second internal standard, Me2PDA, using inputs given in table 3. Bayesian analysis of equations (3) and (4), with these data and the same prior distributions as before, produced the values in table 4.

Table 3.

Inputs for secondary internal standard Me2PDA.

Internal standard SRM 350b benzoic acid (BA) SRM 84 k potassium hydrogen phthalate (KHP)

Parameters Sample 1 Sample 2 Sample 3 Sample 4 Sample 1 Sample 2 Sample 3 Sample 4
MPC (g mol−1) 132.11 132.11 132.11 132.11 132.11 132.11 132.11 132.11
MIS (g mol−1) 122.121 122.121 122.121 122.121 204.22 204.22 204.22 204.22
NIS 1 1 1 1 1 1 1 1
NPC 1 1 1 1 1 1 1 1
APC 640246178.3 966957581 966 370 521 864647730 1061975433 1093570394 1097418797 1136552998
uAPC
93106.60606 24507.84 53870.701 73390.8488 86891.1374 63973.5968 37111.3776 67927.198
AIS 815288040.8 1.108 × 109 955889852 1079955713 803319978 944110166 613684339 720228806
uAIS
1082996261 148379.41 8876.6106 357305.977 59360.1527 96723.3087 61710.4119 100666.621
mPC (mg) 8.5701 5.9678 6.7620 7.0961 5.1465 6.0393 5.8252 5.2730
umPC (mg) 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005
mIS (mg) 10.0868 6.3157 6.1778 8.1847 6.0208 8.0579 5.0333 5.1597
umIS(mg) 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005
μPIS (g g−1) 0.999 978 0.999 978 0.999 978 0.999 978 0.999 911 0.999 911 0.999 911 0.999 911
uPIS (g g−1) 0.000044 0.000044 0.000044 0.000044 0.000054 0.000054 0.000054 0.000054

Table 4.

Summary outputs for purity of secondary internal standard Me2PDA.

Internal standard Sample Posterior mean (g g−1) Posterior standard deviation (g g−1)
BA 1 0.9999 0.0004
2 0.9991 0.0004
3 0.9992 0.0001
4 0.999 0.0013

KHP 1 1.0004 0.0002
2 0.9997 0.0004
3 0.9995 0.0003
4 0.9989 0.0005

These estimates were combined using the hierarchical model of equation (5) to obtain the posterior distribution of μP1, the purity of Me2PDA. The posterior mean and standard deviation were 0.9994 and 0.0008, respectively. Figure 3 shows the posterior distribution of Me2PDA.

Figure 3.

Figure 3

Posterior distribution of purity (g g−1) of Me2PDA. Open square is the posterior mean, solid circles are the endpoints of a 95% uncertainty interval.

3.3. Folic acid purity

To estimate the purity of the folic acid material, multiple samples were evaluated using each of the two internal standards. The inputs are given in table 5.

Table 5.

Inputs for folic acid (FA) samples.

Internal standard Methylsulfonylmethane (MSM) 2,2-dimethylpropanedioic acid (Me2PDA)

Parameters Sample 1 Sample 2 Sample 3 Sample 4 Sample 1 Sample 2 Sample 3
MPC (g mol−1) 441.4 441.4 441.4 441.4 441.4 441.4 441.4
MIS (g mol−1) 94.13 94.13 94.13 94.13 132.11 132.11 132.11
NIS 1 1 1 1 1 1 1
Npc 1 1 1 1 1 1 1
AreaPC 435 884 911 546 673 820 312 049 348 445 682 930 627 834 226 575 519 508 479 363 762
uAPC 1341 111.45 1163 993.9 656 777.749 1062 082.87 862 113.073 2004 862.69 616 232.246
AreaIS 1153 787 136 1.88 × 109 746 379 745 1492 167 552 579 576 035 606 371 428 854 524 945
uAIS 576 893.568 937 811.13 373 189.873 746 083.776 289 788.017 303 185.714 427 262.472
mPC (mg) 5.0559 5.2872 4.7875 4.7024 4.4486 5.0038 4.4013
umPC (mg) 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005
mIS (mg) 2.5868 3.5075 2.21 3.0396 1.112 1.4288 2.1288
umIS (mg) 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005 0.0005

As in the purity assessments described above, the analysis proceeds from the same four observation equations, here using the information about the two internal standards (probability distributions of μP1 and μP2 as prior distributions) obtained in the previous section as

AISi~t((μPiμISi×NISiMISi)1Ki,uAISi2,2);mISi~N(μISi,umISi2);i=1,2 (6)

where i = 1 for BA and i = 2 for KHP. The area measurements were based on 3 replicates and so a Student t distribution with 2 degrees of freedom was used. Bayesian analysis of equation (6) leads to a posterior distribution for each Ki which can then be applied in the analysis of the observation equations for APC and mPC:

APCi~t((PPCiμPCi×NPCiMPCi)1Ki,uAPCi2,2);mPCi~N(μPCi,umPCi2);i=1,2 (7)

One difference between this analysis and the two described above is that the purity of folic acid must be constrained to the interval (0 g g−1, 1 g g−1). To produce purity estimates separately for the individual samples, this constraint is applied using a rectangular prior distribution: PPCi~R(0,1). MCMC analysis produced the results in table 6.

Table 6.

Summary outputs for purity of the folic acid (FA) samples.

Internal standard Sample Posterior mean (g g−1) Posterior standard deviation (g g−1)
MSM 1 0.9068 0.0023
2 0.9069 0.0020
3 0.9062 0.0021
4 0.9064 0.0021

Me2PDA 1 0.9044 0.0024
2 0.9051 0.0028
3 0.9058 0.0021

Since somewhat different folic acid purity values were inferred from the two internal standards, the corresponding sample sets were combined separately. The constraint to lie in the interval (0 g g−1, 1 g g−1) requires a different version of the hierarchical model for the purity values:

PPCij~Beta(ai,bi),i=1,2;j=1,,3or 4 (8)

and

ai=μPiσi2;bi=1μPiσi2;μPi~R(0.6,1);σi~R(0.0.1).

The analysis results were not sensitive to the ranges of the uniform distributions for the parameters μPi and σi but convergence was faster with the slightly informative prior distributions given here. As there were different distributions for the measurand depending on the internal standard i, there were two different sets of posterior means and standard deviations as given in table 7.

Table 7.

Summary outputs for purity for the two internal standards.

Internal standard Posterior mean (g g−1) Posterior standard deviation (g g−1)
MSM 0.9065 0.0027
Me2PDA 0.9048 0.0047

It is required to arrive at a single posterior distribution of purity. There are various statistical approaches for combining posterior distributions of a measurand, the most conservative in the sense of producing the largest uncertainty in the estimate is the linear pool method [21]. This treats the two posterior distributions of PPC (call them p1 and p2) as equally likely to be correct and combines them to produce a single probability distribution p:p(x)=p1(x)+p2(x)2 for any value x of PPC.

The value and uncertainty for the purity of FA, PFA, produced in this way was the mean and standard deviation of the posterior distribution of p: 0.9056 g g−1 and 0.0039 g g−1, respectively. Considering this uncertainty to be too conservative, we used an alternative approach that assumes the purities of the samples using either of the two internal standards are related through the same parameters of the hierarchical model, that is:

PPCi~Beta(a,b);a=μPCσ2;b=1μPCσ2 (9)

This model does not account for the between internal standard variability directly, rather it becomes part of the repeatability uncertainty component. Using this model, the estimate of purity is the mean and standard deviation of the posterior distribution of μPC: 0.9058 g g−1 and 0.0011 g g−1, respectively. Figure 4 shows the four posterior distributions.

Figure 4.

Figure 4

Purity (g g−1) of folic acid based on methylsulfonyl-methane (MSM) (blue), 2,2-dimethylpropanedioic acid (Me2PDA) (green), the linear pool consensus (red), and the version without the internal standard component (violet). Open squares are the posterior means, solid circles are the endpoints of the 95% uncertainty intervals.

4. Conclusions

We have developed an observation equation model for chemical purity determinations based on qNMR measurements. The advantage of this approach is that it ensures that any constraints on the purity estimate (for example that the mass-fraction purity should not exceed 1) are satisfied and incorporates both top-down and bottom-up uncertainty evaluations in the same statistical model, thus naturally including correlations of the measurements due to the experimental design. We illustrated Bayesian analysis of this model on measurements of folic acid (FA) purity using two different internal standards. We showed how the purity of the internal standards can first be evaluated and then used as input into the FA analysis. The OpenBUGS code used for the analysis is given in the appendix. This model was developed to be implemented as part of a direct measurement approach for traceable chemical purity assessments via qNMR, and allows observation of natural limits and evaluation of realistic uncertainty intervals.

Acknowledgments

The authors would like to thank David Duewer and Antonio Possolo for many helpful comments and suggestions that greatly improved this article. Certain commercial equipment, instruments, or materials are identified in this paper to specify adequately the experimental procedure. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that the materials or equipment identified are necessarily the best available for the purpose.

Appendix. OpenBUGS code and data file for evaluation of purity of folic acid

{## constants
 # MIS1 is the relative molar mass of MSM
 # MPC1, MPC2 is the relative molar mass of FA
 # MIS2 is the relative molar mass of Me2PDA
## inputs
 # mIS1 is the mass measurement of MSM #
 # umIS1 is the uncertainty of the mass of the MSM weighed for sample solution
 # mPC1 is the mass of FA in samples analyzed using MSM
 # mIS2 is the mass measurement of Me2PDA
 # umIS1 is the uncertainty in the mass of the Me2PDA weighed for sample solution
 # mPC2 is the mass of FA in samples analyzed using Me2PDA
 # AIS1 integrated area of the MSM signal
 # uAIS1 uncertainty in the integrated area of the MSM signal
 # APC1 integrated area of FA signal in samples analyzed using MSM
 # uAPC1 uncertainty in the integrated area of FA signal in samples analyzed using MSM
 # AIS2 integrated area of the Me2PDA signal
 # uAIS2 uncertainty in the integrated area of the Me2PDA signal
 # APC2 integrated area of FA signal in samples analyzed using Me2PDA
 # uAPC2 uncertainty in the integrated area of FA signal in samples analyzed using Me2PDA
## parameters of the probability distributions of the mass measurements
 # mumIS1 is μIS1
 # mumPC1 is μPC1
 # mumIS2 is μIS2
 # mumPC2 is μPC2
## prior parameters
 # muP1 is the purity of MSM
 # muP2 is the purity of Me2PDA
## the measurand
 # mu is μPC, i.e. the mean of the FA purity
  MIS1 <− 94.13
  MPC1 <− 441.4
   precmIS1<1/(umIS1umIS1)
  MIS2 <− 132.11
  MPC2 <− 441.4
   precmIS2<1/(umIS2umIS2)
 # define the parameters of the Beta prior distribution for the purity of FA. The mean of this
 # distribution mu is the measurand.
  a <− mu/(sd * sd)
  b <− (1 − mu)/(sd * sd)
  mu ~ dunif(0.6, 1)
  sd ~ dunif(0, 0.1)
#
 # define the distribution of the purity of MSM and of Me2PDA, as obtained in section 3.2.
 # Variability is given in terms of precision, that is, as 1/0.00092 for MSM and 1/0.00082 for
 # Me2PDA.
  muP1 ~ dnorm(1.0009, 1234 568)
  muP2 ~ dnorm(0.9994, 1562 500)
#
# specify the prior distributions for the 4 samples analyzed using MSM
 for(i in 1:4){pPC1[i] ~ dbeta(a, b)
   mumIS1[i]~dnorm(0,1.0×105)mumPC1[i]~dnorm(0,1.0×105)}
#
# specify the observation equations for the mass measurements for the 4 samples analyzed using # MSM
for(iin1:4){mIS1[i]~dnorm(mumIS1[i],precmIS1)mPC1[i]~dnorm(mumPC1[i],precmIS1)}
#
# specify the observation equation for the area measurements of MSM
 for(i in 1:4){
  k1[i] ~ dunif(0, 0.01)
   meanAIS1[i]<muP1mumIS1[i]/MIS1/K1[i]precAIS1[i]<1/(uAIS1[i]uAIS1[i])}
 for (i in 1:4){
  AIS1 [i] ~ dt(meanAIS1[i], 2)}
#
#specify the observation equation for area of FA for the 4 samples analyzed using MSM
 for(i in 1:4){
  k.cut1[i] <− cut(k1[i])
   meanAPC1[i]<pPC1[i]mumPC1[i]/MPC1/k.cut1[i]precAPC1[i]<1/(uAPC1[i]uAPC1[i])
}
 for(i in 1:4){
  APC1[i] ~ dnorm(meanAPC1[i], precAPC1[i])}
  # specify the prior distributions for the 3 samples analyzed using Me2PDA
 for(i in 1:3){pPC2[i] ~ dbeta(a, b)
   mumPC2[i]~dnorm(0,1.0×105)mumIS2[i]~dnorm(0,1.0×105)}
#
# specify the observation equations for the mass measurements for the 3 samples analyzed using
#Me2PDA
   for(iin 1:3){mIS2[i]~dnorm(mumIS2[i],precmIS2)mPC2[i]~dnorm(mumPC2[i],precmIS2)}
#
# specify the observation equation for the area measurements of Me2PDA
 for(i in 1:3){
  k2[i] ~ dunif(0,0.01)
   meanAIS2[i]<muP2mumIS2[i]/MIS2/k2[i]precAIS2[i]<1/(uAIS2[i]uAIS2[i])}
 for(i in 1:3){
  AIS2[i] ~ dt(meanAIS2[i],precAIS2[i],2)}
# specify the observation equation for area of FA for the 3 samples analyzed using Me2PDA
 for(i in 1:3){
  k.cut2[i] <− cut(k2[i])
   meanAPC2[i]<pPC2[i]mumPC2[i]/MPC2/k.cut2[i]precAPC2[i]<1/(uAPC2[i]uAPC2[i])
}
 for(i in 1:3){
  APC2[i] ~ dnorm(meanAPC2[i], precAPC2[i])}
}
###
Data file:
list(mIS1 = c(0.0025 868,0.0035075,0.00221,0.0030396),
mPC1 = c(0.0050 559,0.0052 872,0.0047 875,0.0047 024),
umIS1 = 0.0000 005,
AIS1 = c(0.011 537 871 36,0.018 76,0.007 463 797 45,0.014 921 675 52),
uAIS1= c(0.000 000 979 46,0.000 000 638 23,0.000 000 766 96,0.000 000 731 31),
APC1 = c(0.004 358 849 11,0.005 466 738 20,0.003 120 493 48,0.004456829 30),
uAPC1= c(0.000 013 411 11,0.000 011 639 93,0.000 006 567 77,0.000 010 620 83),
mIS2 = c(0.001 112,0.001 4288,0.002 1288),
mPC2 = c(0.004 4486,0.005 0038,0.004 4013), umIS2 = 0.000 0005,
AIS2 = c(0.005 795 760 35,0.006 063 714 28,0.008 545 249 45),
uAIS2= c(0.000 000 122 54,0.000 000 134 47,0.000 000 190 03),
APC2 = c(0.006 278 342 26,0.005 755 195 08,0.004 793 63762),
uAPS2= c(0.000 008 621 13,0.000 020 048 63,0.000 006 162 32))

References

  • 1.Quinn TJ. Metrologia. 1997;34:61–5. [Google Scholar]
  • 2.Malz F, Jancke H. J Pharm Biomed Anal. 2004;38:813–23. doi: 10.1016/j.jpba.2005.01.043. [DOI] [PubMed] [Google Scholar]
  • 3.Holzgrabe U, Deubner R, Schollmayer C, Waibel B. J Pharm Biomed Anal. 2005;38:806–812. doi: 10.1016/j.jpba.2005.01.050. [DOI] [PubMed] [Google Scholar]
  • 4.Jancke H, Malz F, Haesselbarth W. Accredit Qual Assur. 2005;10:421–9. [Google Scholar]
  • 5.Pauli GF. Phytochem Anal. 2001;12:28–42. doi: 10.1002/1099-1565(200101/02)12:1<28::AID-PCA549>3.0.CO;2-D. [DOI] [PubMed] [Google Scholar]
  • 6.DeBiévre P, Dybkaer R, Fajgelj A, Hibbert B. Pure Appl Chem. 2011;83:1873–935. [Google Scholar]
  • 7.Nelson M, Bedner M, Lang B, Toman B, Lippa K. Anal Bioanal Chem. 2015;407:8557–69. doi: 10.1007/s00216-015-9013-7. [DOI] [PubMed] [Google Scholar]
  • 8.Weber M, Hellriegel C, Rueck A, Sauermoser R, Wuethrich J. Accredit Qual Assur. 2013;18:91–8. [Google Scholar]
  • 9.Saito T, Ihara T, Koike M, Kinugasa S, Fujimine Y, Nose K, Hirai T. Accredit Qual Assur. 2009;14:79–86. [Google Scholar]
  • 10.Schoenberger T. Anal Bioanal Chem. 2012;403:247–54. doi: 10.1007/s00216-012-5777-1. [DOI] [PubMed] [Google Scholar]
  • 11.Jancke H. NMR Spectroscopy as a primary analytical metho. CCQM Report. 1998;98:1–12. [Google Scholar]
  • 12.Bharti SK, Roy R. Trends Anal Chem. 2012;35:5–26. [Google Scholar]
  • 13.Wells RJ, Cheung J, Hook JM. Accredit Qual Assur. 2004;9:450–6. [Google Scholar]
  • 14.Gadape HH, Parikh KS, editors. J Chem Pharm Res. 2011;3:649. [Google Scholar]
  • 15.Al-Deen TS, Hibbert DB, Hook JM, Wells RJ. Accredit Qual Assur. 2004;9:55–63. [Google Scholar]
  • 16.Joint Committee for Guides in Metrology. Evaluation of Measurement Data—Guide to the Expression of Uncertainty in Measurement. Sevres: International Bureau of Weights and Measures (BIPM); 2008. available at www.bipm.org/utils/common/documents/jcgm/JCGM_100_2008_E.pdf (Accessed: June 2016) [Google Scholar]
  • 17.Lafarge T, Possolo A. NIST Uncertainty Machine. Gaithersburg, MD: National Institute of Standards and Technology; 2015. available at http://uncertainty.nist.gov/ (Accessed: June 2016. [Google Scholar]
  • 18.Possolo A, Toman B. Metrologia. 2007;44:464–75. [Google Scholar]
  • 19.Gelman A, Carlin J, Stern H, Dunson D, Vehtari A, Rubin D. Bayesian Data Analysis. 3rd. Boca Raton, FL: CRC Press; 2013. [Google Scholar]
  • 20.Lunn DJ, Spiegelhalter D, Thomas A, Best N. Stat Med. 2009;28:3049–82. doi: 10.1002/sim.3680. [DOI] [PubMed] [Google Scholar]
  • 21.Toman B. Technometrics. 2007;49:81–7. [Google Scholar]
  • 22.Toman B, Possolo A. Accredit Qual Assur. 2009;14:553–63. [Google Scholar]

RESOURCES