Abstract
Chemical purity assessment using quantitative 1H-nuclear magnetic resonance spectroscopy is a method based on ratio references of mass and signal intensity of the analyte species to that of chemical standards of known purity. As such, it is an example of a calculation using a known measurement equation with multiple inputs. Though multiple samples are often analyzed during purity evaluations in order to assess measurement repeatability, the uncertainty evaluation must also account for contributions from inputs to the measurement equation. Furthermore, there may be other uncertainty components inherent in the experimental design, such as independent implementation of multiple calibration standards. As such, the uncertainty evaluation is not purely bottom up (based on the measurement equation) or top down (based on the experimental design), but inherently contains elements of both. This hybrid form of uncertainty analysis is readily implemented with Bayesian statistical analysis. In this article we describe this type of analysis in detail and illustrate it using data from an evaluation of chemical purity and its uncertainty for a folic acid material.
Keywords: measurement uncertainty, observation equation, measurement equation, internal calibration, Markov chain Monte Carlo
1. Introduction
Nuclear magnetic resonance (NMR) spectroscopy is a prominent measurement technique for chemical analysis. Quantitative solution-state NMR (qNMR) is considered a primary ratio measurement method, through which ratio quantities of chemical substances can be determined directly [1–5]. For this reason, qNMR is highly useful for traceable chemical purity assessments [6–10].
NMR spectroscopy is a measurement method for isotopes that have nonzero nuclear spins, and thus a magnetic moment and angular momentum. Each of these isotopes has opposite spin states (‘ + ’ and ‘ − ’) with equivalent energy, dependent upon neutron and proton composition, that are characterized by a spin quantum number (s). The NMR measurement technique is based on the perturbation of these nuclear magnetic moments when placed within a strong external magnetic field and radiation of a specific (Larmor) radio frequency is absorbed. Electron spins surrounding the nucleus induce opposing magnetic fields, and as such, this resonance is also dependent on the surrounding electronic environment; that is, the molecular structure(s) of the sample material.
After the radiation absorption has ceased, the perturbed magnetic moments relax and precess about the external magnetic field at the corresponding resonance frequencies. The net sum of magnetic precessions induces voltage oscillations that are observable as a time domain signal, known as a free induction decay (FID). This signal is amplified, then converted to an audio frequency signal and digitized, then lastly Fourier transformed to obtain NMR spectral data in the frequency domain. Each individual signal in the spectrum therefore represents a structurally distinct molecular moiety of the measured sample.
The amplitude of each spin component of the FID is directly proportional to the number of corresponding resonant nuclei, and thus the signal intensities of the Fourier transformed spectra can be used to directly infer the ratio amount of nuclei for each unique resonance [11–13]. It is for this reason that qNMR, when performed with proper experimental conditions, is considered a direct primary ratio measurement method for quantitative chemical analysis. The common isotope of hydrogen, 1H (≈99.9% abundance, s = 1/2, 2 spin states), is the usual nucleus for high-precision qNMR as it provides the highest analytical sensitivity and excellent linearity of signal intensity with respect to 1H concentration (R ⩾ 0.999) [12, 14]. Figure 1 illustrates the qNMR measurement of folic acid (FA) dissolved in deuterated aqueous (D2O) phosphate buffer.
Figure 1.

1H-NMR measurement data for folic acid in D2O phosphate buffer as: (a) time domain FID and (b) Fourier transformed spectrum.
Precise and unbiased quantity determinations of the purity of (soluble) liquid or solid chemical materials may be made via ratio reference of mass and NMR signal intensity to those of primary chemical standards of known purity [15]. The main component of these standards must contain the appropriate reference nuclei, as well as a distinct spectral peak and common solubility, but they are otherwise not compound-specific. The 1H-qNMR assessment described in this article is an evaluation of the mass purity, PPC, of a neat chemical folic acid material. The measurement equation used to derive this quantity is presented as equation (1).
| (1) |
where:
NPC multiplicity (# H/peak) of primary chemical species signal
NIS multiplicity (# H/peak) of internal reference standard species signal
MPC relative molar mass (molecular weight, g mol−1) of primary species
MIS relative molar mass (molecular weight, g mol−1) of the internal standard species
APC integrated area of primary species signal
AIS integrated area of the internal reference standard species signal
mPC mass (g) of the composite material weighed for sample solution, adjusted for buoyancy effects
mIS mass (g) of the internal reference standard weighed for sample solution, adjusted for buoyancy effects
PISknown purity (g g−1) of the internal reference standard
Due to chemical and spectroscopic limitations of the measurement system, purity determinations were first made for select secondary reference materials that were suitable for measurement with the folic acid species. For this evaluation, two high-purity (PIS > 99.99%) primary reference standards, Standard Reference Material (SRM) 350b Benzoic acid (BA) and SRM 84k Potassium Hydrogen Phthalate (KHP), were used as internal calibrators to determine the mass purity of two chemically-distinct secondary reference materials (PIS > 99.9%): methylsulfonylmethane (MSM, also known as dimethyl sulfone) and 2,2-dimethylpropanedioic acid (Me2PDA, also known as dimethylmalonic acid). With this experimental design, the evaluated mass purity of folic acid is traceable to the certified mass fraction purity of both primary standards.
Section 2 describes the statistical model used to evaluate purity of the primary chemical species based on a qNMR analysis. Section 3 describes the implementation of this model during the investigation performed to evaluate the purity of a neat folic acid material. Conclusions are given in section 4.
2. The measurement model in terms of observation equations
The measurement equation (equation (1)) is based on four different measured quantities: APC, AIS, mPC, and mIS. The remaining input quantities are either known constants, NPC, NIS, MPC, and MIS, or quantities with a mean and uncertainty specified by a certificate, PIS. Though molecular masses (MPC, and MIS) are imperfectly known due to natural variation in isotopic abundances, the slight degree of uncertainty is relatively inconsequential. Given such data, the most common approach to the evaluation of purity and its uncertainty would be to apply a standard Guide to the expression of uncertainty in measurement (GUM) [16] analysis to each individual sample’s inputs and propagate the uncertainties through the measurement equation. This would produce a purity estimate and associated uncertainty for each sample. GUM-style analyses are readily accomplished ‘by hand’ or using software such as the NIST Uncertainty Machine [17].
One potentially critical factor for chemical mass purity evaluations is the need to constrain the result to lie within the interval (0 g g−1, 1 g g−1). The usual GUM analysis does not naturally preserve this constraint, which is especially relevant for highly pure materials (PPC > 99%). But whether or not a constraint is necessary, in order to capture the between sample variability (reproducibility uncertainty) of multiple samples, the individual results must be combined to obtain a final purity estimate and uncertainty. When multiple samples with different internal standards are used, as was the case for the evaluation that we describe in detail in the following sections, accounting for all sources of uncertainty in a rigorous manner is not straightforward since the outputs from the measurement equation for single samples may not be statistically independent due to shared sources of uncertainty attributable to the experimental design. Thus, the evaluation of measurement uncertainty of the purity estimate includes both bottom-up elements, that is, uncertainty in the inputs to the measurement equation propagated through the function, and top-down elements, being sources of uncertainty due to factors that are part of the experimental design and whose contribution to uncertainty is accounted for using the variability of the measurements.
A statistically rigorous method of combining these bottom-up and top-down elements of the uncertainty analysis, while also preserving any natural physical and chemical constraints, is best achieved using an observation equation [18] as an integral part of a Bayesian statistical model [19]. An observation equation is a statistical model for the measurements, which in this case are APC, AIS, mPC, and mIS. While uniquely able to account for all of the various constraints and uncertainty sources present in a measurement process, Bayesian analysis requires so called prior distributions for all inputs that are not known constants. These prior distributions summarize all of the information about such quantities that is available before the measurements are obtained.
In order to specify the statistical model for the area measurements, we first note that equation (1) can be written as
It then follows that:
| (2) |
Since NIS and MIS are known constants, the constant K converts the ratio of the measurement of the integrated area of the signal to the mass of the material in the sample solution into a measurement of chemical purity. Since the purity of the internal standard is known up to an uncertainty, it possible to estimate K and use it to estimate the purity of the analyte.
To estimate K we first note that
a relationship which will define the expected value of the measurement of AIS. We term the uncertainty for this measurement, . NMR area measurements can generally be represented by a Gaussian distribution, or when is based on known degrees of freedom, by a Student t distribution. Measurements of sample mass, mIS, can also be represented as Gaussian or Student t distributions with mean mass μIS and uncertainty . Assuming Gaussian distributions, the observation equations for AIS and mIS are therefore:
| (3) |
and
where the symbol ‘~’ is interpreted ‘distributed as’ and N(,) specifies a Gaussian distribution with given mean and variance. When the uncertainties or are based on known degrees of freedom, Student t distributions, t(,), should be used instead of N(,).
The main objective of these internal standard-based observation equations is to obtain information about K to be used in the analysis of similar observation equations for the primary chemical species. Generally, the information about the purity of the internal standard is given in a certificate, value μIS and uncertainty , and can be transformed into a probability distribution. If no constraint on purity is necessary, a Gaussian distribution with the given mean and variance, , may be used. If purity must be constrained to lie between 0 and 1 then the beta distribution
where
is appropriate.
Without additional information, a rectangular distribution on the interval (0, c) for some constant c can be used as a prior for K: K ~ R(0, c). Again without additional information, a Gaussian distribution with mean 0 and a large variance is appropriate for μIS: μIS~N(0, large. In our application we found the results to be robust with respect to the choice of the variance of μIS.
While the posterior distribution, which summarizes all the information resulting from a Bayesian analysis, of K cannot be obtained in closed form, a Markov chain Monte Carlo (MCMC) [19] analysis using the free software OpenBUGS [20] is straightforward and produces a sample of random draws from this posterior distribution. The OpenBUGS code is given in the appendix.
The main objective of the analysis described in this article is the estimation of the purity of the primary chemical species. This is accomplished through Bayesian analysis of two observation equations for APC and mPC:
| (4) |
and
where the quantity of interest now is not K, but the measurand PPC.
As above, all unknown constants need to have prior probability distributions. The analysis of equation (3) produces a random sample from the posterior distribution for K which the OpenBUGS program can use as a prior distribution. A suitable prior distribution for mPC is again Gaussian distribution with mean 0 and a large variance μPC:μPC ~ N (0, large).
A prior distribution for the measurand PPC needs to be defined expressly for each application, as this is where the specifics of the experimental design are expressed in the form of constraints and/or additional structure of the prior distribution. For example, purity of different samples analyzed using a single standard may be somewhat different due to slightly different measurement conditions or heterogeneity of the material, but the differences would usually be smaller than differences in purity of samples analyzed using two different internal standards. Such details can be modeled using a second level in the prior distribution of PPC creating a so called hierarchical structure.
Evaluation of equations (3) and (4) performed using MCMC preserves all correlations present in the statistical model. The resulting posterior distribution of PPC can be summarized in terms of a mean, standard deviation, and 95% uncertainty interval to produce the desired purity estimate.
3. Examples of purity determination
This section shows application of equations (3) and (4) to the specific case of purity evaluation of folic acid. The OpenBUGS code used for the analysis is given in the appendix.
3.1. Experimental design
For the evaluation of each secondary reference material, four samples containing BA as internal standard and three containing KHP were dissolved in per-deuterated dimethyl sulfoxide, DMSO-d6 (99.9% D atom purity; Cambridge Isotope Laboratories (CIL), Cambridge, MA). These secondary reference materials were evaluated for use in folic acid (FA) measurements because they have 1H resonance signals that do not interfere with those of FA and are mutually soluble in aqueous solutions. For purity determination of FA, four samples containing the MSM and three samples containing the Me2PDA secondary standard were prepared in a K2HPO4/KD2PO4 aqueous (99.99% D atom purity D2O, CIL) buffer solution. Approximately 4 mg–15 mg masses of neat FA and internal standard materials were weighed using an ultra-microbalance (Mettler Toledo UMX5; Columbus, OH).
Experimental NMR data was acquired by a Bruker Avance 600 MHz spectrometer equipped with a 5 mm broadband inverse (BBI) detection probe and operating with Topspin (Version 3.2) software. Experiments were performed with 128 scan repetitions (60 s delay time), a spectral width of 20.0276 ppm, and transmitter frequency offset (O1) of 6.175 ppm. A 90° excitation pulse width was calibrated for each sample. Data acquisition time was 5.45 s for each scan and 131 072 data points were collected for each FID. The probe temperature was 298 K. No 13C decoupling was executed during data acquisition. Processing of the Fourier transformed 1H spectra, including baseline corrections, phase adjustment, and signal integration, was performed manually.
3.2. Purity of the secondary internal standards MSM and Me2PDA
During this investigation and for the reasons described herein, purity of FA was evaluated using two different internal standards, MSM and Me2PDA. The inputs for the purity evaluation of MSM are given in table 1. Masses and areas are based on three measurements, and therefore the uncertainties of the measurements are associated with 2 degrees of freedom. Three or four different samples containing neat chemical and internal standard materials were prepared for each of the purity assessments.
Table 1.
Inputs for Secondary Internal Standard MSM.
| Internal standard | SRM 350b benzoic acid (BA) | SRM 84 k potassium hydrogen phthalate (KHP) | ||||||
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Parameters | Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 1 | Sample 2 | Sample 3 | |
| MPC (g mol−1) | 94.13 | 94.13 | 94.13 | 94.13 | 94.13 | 94.13 | 94.13 | |
| MIS (g mol−1) | 122.121 | 122.121 | 122.121 | 122.121 | 204.22 | 204.22 | 204.22 | |
| NIS | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| NPC | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| APC | 1411 977 105 | 975 476 967 | 1460 479 190 | 1044 400 884 | 2049 421 640 | 1735 241 388 | 1751 551 403 | |
| uAPC | 413 545.106 | 144 615.57 | 246 609.241 | 104 452.893 | 243 774.687 | 580 978.61 | 364 045.672 | |
| AIS | 750028 937 | 623003 497 | 518822 717 | 671799 916 | 456562022 | 814572865 | 747400179 | |
| uAIS | 73363.6013 | 349365.9 | 116331.207 | 165393.397 | 40977.0251 | 360632556 | 18405.9296 | |
| mPC (mg) | 13.4375 | 9.2011 | 14.3194 | 9.9598 | 10.3778 | 8.2405 | 6.3565 | |
| (mg) | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | |
| mIS (mg) | 9.2704 | 7.6297 | 6.6060 | 8.3209 | 5.0207 | 8.4030 | 5.8872 | |
| (mg) | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | |
| (g g−1) | 0.999 978 | 0.999 978 | 0.999 978 | 0.999 978 | 0.999 911 | 0.999 911 | 0.999 911 | |
| (g g−1) | 0.000 044 | 0.000 044 | 0.000 044 | 0.000 044 | 0.000 054 | 0.000 054 | 0.000 054 | |
We specified the prior for PPC. as , assigning μPC the non-informative prior with d set to a large value. We assign a non-informative gamma distribution prior, ~ Gamma (e, f) with e and f set as large numbers, that captures the non-negativity of standard deviations. The calculated purities of the secondary internal standards, PIS in corresponding calculations of folic acid purity, were not constrained to a maximum of 1. Given that qNMR is a relative measurement, the effective purity of the internal standard is a normalized ratio of select resonant 1H content with respect to mIS, rather than an absolute chemical mass fraction. Therefore, the PIS may exceed the natural limit of MSM or Me2PDA purity. This may be the case if an assumed MIS is not consistent with the actual isotopic composition, or if there is a systematic bias of AIS associated with an indiscernible chemical impurity. Several qNMR evaluations of the high purity MSM secondary standard over multiple years and by multiple analysts indicate that PIS is greater than 1 if a molecular weight of 94.13 g mol−1 is assumed.
The MCMC posterior distribution of PPC was summarized as a mean and standard deviation. The resulting values for each sample and internal standard are given in table 2.
Table 2.
Summary outputs for purity of secondary internal standard MSM.
| Internal standard | Sample | Posterior mean (g g−1) | Posterior standard deviation (g g−1) |
|---|---|---|---|
| BA | 1 | 1.001 | 0.0004 |
| 2 | 1.0007 | 0.0024 | |
| 3 | 1.0009 | 0.0011 | |
| 4 | 1.0011 | 0.0008 | |
|
| |||
| KHP | 1 | 1.0009 | 0.0004 |
| 2 | 1.0012 | 0.0004 | |
| 3 | 1.0004 | 0.0003 | |
The observation equation model as specified in equations (3) and (4) represents only the bottom-up elements of the analysis, and the results in table 2 are essentially identical to the usual uncertainty analysis obtained with the GUM procedure [16].
The top-down elements in the uncertainty quantification, that is, the between sample and between internal standard variability, can be accounted for in a separate layer of the observation equation model. This is preferable to simply averaging or otherwise combining the entries in table 2 because the values of purity for the samples based on the same internal standard are correlated, and the value of this correlation would need to be estimated to produce an accurate uncertainty for the final purity estimate. A hierarchical model [21] on PPC accomplishes this naturally, without the need for separate estimation of this correlation. We define
| (5) |
and
where i = 1 for BA, and i = 2 for KHP. Here the parameters τi quantify the between-sample variability for each internal standard and the ω quantifies the between-internal standard variability, which is also the basis of the correlation between the sample values. Note that this is the usual random effects model [22], widely used to account for additional uncertainty in interlaboratory studies.
The final estimate of purity is the posterior mean of μP1. For MSM this is 1.0009, with standard uncertainty = 0.0009. For the evaluation of folic acid purity these are interpreted as PIS~N (1.0009, 0.00092). Figure 2 shows the posterior distribution of μP1
Figure 2.

Posterior distribution of purity (g g−1) for MSM. Open square is the posterior mean, solid circles are the endpoints of a 95% uncertainty interval.
The same procedure was followed to obtain a purity estimate for the second internal standard, Me2PDA, using inputs given in table 3. Bayesian analysis of equations (3) and (4), with these data and the same prior distributions as before, produced the values in table 4.
Table 3.
Inputs for secondary internal standard Me2PDA.
| Internal standard | SRM 350b benzoic acid (BA) | SRM 84 k potassium hydrogen phthalate (KHP) | |||||||
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| Parameters | Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 1 | Sample 2 | Sample 3 | Sample 4 | |
| MPC (g mol−1) | 132.11 | 132.11 | 132.11 | 132.11 | 132.11 | 132.11 | 132.11 | 132.11 | |
| MIS (g mol−1) | 122.121 | 122.121 | 122.121 | 122.121 | 204.22 | 204.22 | 204.22 | 204.22 | |
| NIS | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| NPC | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| APC | 640246178.3 | 966957581 | 966 370 521 | 864647730 | 1061975433 | 1093570394 | 1097418797 | 1136552998 | |
|
|
93106.60606 | 24507.84 | 53870.701 | 73390.8488 | 86891.1374 | 63973.5968 | 37111.3776 | 67927.198 | |
| AIS | 815288040.8 | 1.108 × 109 | 955889852 | 1079955713 | 803319978 | 944110166 | 613684339 | 720228806 | |
|
|
1082996261 | 148379.41 | 8876.6106 | 357305.977 | 59360.1527 | 96723.3087 | 61710.4119 | 100666.621 | |
| mPC (mg) | 8.5701 | 5.9678 | 6.7620 | 7.0961 | 5.1465 | 6.0393 | 5.8252 | 5.2730 | |
| (mg) | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | |
| mIS (mg) | 10.0868 | 6.3157 | 6.1778 | 8.1847 | 6.0208 | 8.0579 | 5.0333 | 5.1597 | |
| (mg) | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | |
| (g g−1) | 0.999 978 | 0.999 978 | 0.999 978 | 0.999 978 | 0.999 911 | 0.999 911 | 0.999 911 | 0.999 911 | |
| (g g−1) | 0.000044 | 0.000044 | 0.000044 | 0.000044 | 0.000054 | 0.000054 | 0.000054 | 0.000054 | |
Table 4.
Summary outputs for purity of secondary internal standard Me2PDA.
| Internal standard | Sample | Posterior mean (g g−1) | Posterior standard deviation (g g−1) |
|---|---|---|---|
| BA | 1 | 0.9999 | 0.0004 |
| 2 | 0.9991 | 0.0004 | |
| 3 | 0.9992 | 0.0001 | |
| 4 | 0.999 | 0.0013 | |
|
| |||
| KHP | 1 | 1.0004 | 0.0002 |
| 2 | 0.9997 | 0.0004 | |
| 3 | 0.9995 | 0.0003 | |
| 4 | 0.9989 | 0.0005 | |
These estimates were combined using the hierarchical model of equation (5) to obtain the posterior distribution of μP1, the purity of Me2PDA. The posterior mean and standard deviation were 0.9994 and 0.0008, respectively. Figure 3 shows the posterior distribution of Me2PDA.
Figure 3.

Posterior distribution of purity (g g−1) of Me2PDA. Open square is the posterior mean, solid circles are the endpoints of a 95% uncertainty interval.
3.3. Folic acid purity
To estimate the purity of the folic acid material, multiple samples were evaluated using each of the two internal standards. The inputs are given in table 5.
Table 5.
Inputs for folic acid (FA) samples.
| Internal standard | Methylsulfonylmethane (MSM) | 2,2-dimethylpropanedioic acid (Me2PDA) | |||||
|---|---|---|---|---|---|---|---|
|
| |||||||
| Parameters | Sample 1 | Sample 2 | Sample 3 | Sample 4 | Sample 1 | Sample 2 | Sample 3 |
| MPC (g mol−1) | 441.4 | 441.4 | 441.4 | 441.4 | 441.4 | 441.4 | 441.4 |
| MIS (g mol−1) | 94.13 | 94.13 | 94.13 | 94.13 | 132.11 | 132.11 | 132.11 |
| NIS | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Npc | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| AreaPC | 435 884 911 | 546 673 820 | 312 049 348 | 445 682 930 | 627 834 226 | 575 519 508 | 479 363 762 |
| uAPC | 1341 111.45 | 1163 993.9 | 656 777.749 | 1062 082.87 | 862 113.073 | 2004 862.69 | 616 232.246 |
| AreaIS | 1153 787 136 | 1.88 × 109 | 746 379 745 | 1492 167 552 | 579 576 035 | 606 371 428 | 854 524 945 |
| uAIS | 576 893.568 | 937 811.13 | 373 189.873 | 746 083.776 | 289 788.017 | 303 185.714 | 427 262.472 |
| mPC (mg) | 5.0559 | 5.2872 | 4.7875 | 4.7024 | 4.4486 | 5.0038 | 4.4013 |
| (mg) | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 |
| mIS (mg) | 2.5868 | 3.5075 | 2.21 | 3.0396 | 1.112 | 1.4288 | 2.1288 |
| (mg) | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 | 0.0005 |
As in the purity assessments described above, the analysis proceeds from the same four observation equations, here using the information about the two internal standards (probability distributions of μP1 and μP2 as prior distributions) obtained in the previous section as
| (6) |
where i = 1 for BA and i = 2 for KHP. The area measurements were based on 3 replicates and so a Student t distribution with 2 degrees of freedom was used. Bayesian analysis of equation (6) leads to a posterior distribution for each Ki which can then be applied in the analysis of the observation equations for APC and mPC:
| (7) |
One difference between this analysis and the two described above is that the purity of folic acid must be constrained to the interval (0 g g−1, 1 g g−1). To produce purity estimates separately for the individual samples, this constraint is applied using a rectangular prior distribution: . MCMC analysis produced the results in table 6.
Table 6.
Summary outputs for purity of the folic acid (FA) samples.
| Internal standard | Sample | Posterior mean (g g−1) | Posterior standard deviation (g g−1) |
|---|---|---|---|
| MSM | 1 | 0.9068 | 0.0023 |
| 2 | 0.9069 | 0.0020 | |
| 3 | 0.9062 | 0.0021 | |
| 4 | 0.9064 | 0.0021 | |
|
| |||
| Me2PDA | 1 | 0.9044 | 0.0024 |
| 2 | 0.9051 | 0.0028 | |
| 3 | 0.9058 | 0.0021 | |
Since somewhat different folic acid purity values were inferred from the two internal standards, the corresponding sample sets were combined separately. The constraint to lie in the interval (0 g g−1, 1 g g−1) requires a different version of the hierarchical model for the purity values:
| (8) |
and
The analysis results were not sensitive to the ranges of the uniform distributions for the parameters and σi but convergence was faster with the slightly informative prior distributions given here. As there were different distributions for the measurand depending on the internal standard i, there were two different sets of posterior means and standard deviations as given in table 7.
Table 7.
Summary outputs for purity for the two internal standards.
| Internal standard | Posterior mean (g g−1) | Posterior standard deviation (g g−1) |
|---|---|---|
| MSM | 0.9065 | 0.0027 |
| Me2PDA | 0.9048 | 0.0047 |
It is required to arrive at a single posterior distribution of purity. There are various statistical approaches for combining posterior distributions of a measurand, the most conservative in the sense of producing the largest uncertainty in the estimate is the linear pool method [21]. This treats the two posterior distributions of PPC (call them p1 and p2) as equally likely to be correct and combines them to produce a single probability distribution for any value x of PPC.
The value and uncertainty for the purity of FA, PFA, produced in this way was the mean and standard deviation of the posterior distribution of p: 0.9056 g g−1 and 0.0039 g g−1, respectively. Considering this uncertainty to be too conservative, we used an alternative approach that assumes the purities of the samples using either of the two internal standards are related through the same parameters of the hierarchical model, that is:
| (9) |
This model does not account for the between internal standard variability directly, rather it becomes part of the repeatability uncertainty component. Using this model, the estimate of purity is the mean and standard deviation of the posterior distribution of μPC: 0.9058 g g−1 and 0.0011 g g−1, respectively. Figure 4 shows the four posterior distributions.
Figure 4.

Purity (g g−1) of folic acid based on methylsulfonyl-methane (MSM) (blue), 2,2-dimethylpropanedioic acid (Me2PDA) (green), the linear pool consensus (red), and the version without the internal standard component (violet). Open squares are the posterior means, solid circles are the endpoints of the 95% uncertainty intervals.
4. Conclusions
We have developed an observation equation model for chemical purity determinations based on qNMR measurements. The advantage of this approach is that it ensures that any constraints on the purity estimate (for example that the mass-fraction purity should not exceed 1) are satisfied and incorporates both top-down and bottom-up uncertainty evaluations in the same statistical model, thus naturally including correlations of the measurements due to the experimental design. We illustrated Bayesian analysis of this model on measurements of folic acid (FA) purity using two different internal standards. We showed how the purity of the internal standards can first be evaluated and then used as input into the FA analysis. The OpenBUGS code used for the analysis is given in the appendix. This model was developed to be implemented as part of a direct measurement approach for traceable chemical purity assessments via qNMR, and allows observation of natural limits and evaluation of realistic uncertainty intervals.
Acknowledgments
The authors would like to thank David Duewer and Antonio Possolo for many helpful comments and suggestions that greatly improved this article. Certain commercial equipment, instruments, or materials are identified in this paper to specify adequately the experimental procedure. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that the materials or equipment identified are necessarily the best available for the purpose.
Appendix. OpenBUGS code and data file for evaluation of purity of folic acid
| {## constants |
| # MIS1 is the relative molar mass of MSM |
| # MPC1, MPC2 is the relative molar mass of FA |
| # MIS2 is the relative molar mass of Me2PDA |
| ## inputs |
| # mIS1 is the mass measurement of MSM # |
| # is the uncertainty of the mass of the MSM weighed for sample solution |
| # mPC1 is the mass of FA in samples analyzed using MSM |
| # mIS2 is the mass measurement of Me2PDA |
| # is the uncertainty in the mass of the Me2PDA weighed for sample solution |
| # mPC2 is the mass of FA in samples analyzed using Me2PDA |
| # AIS1 integrated area of the MSM signal |
| # uncertainty in the integrated area of the MSM signal |
| # APC1 integrated area of FA signal in samples analyzed using MSM |
| # uncertainty in the integrated area of FA signal in samples analyzed using MSM |
| # AIS2 integrated area of the Me2PDA signal |
| # uncertainty in the integrated area of the Me2PDA signal |
| # APC2 integrated area of FA signal in samples analyzed using Me2PDA |
| # uncertainty in the integrated area of FA signal in samples analyzed using Me2PDA |
| ## parameters of the probability distributions of the mass measurements |
| # is μIS1 |
| # is μPC1 |
| # is μIS2 |
| # is μPC2 |
| ## prior parameters |
| # muP1 is the purity of MSM |
| # muP2 is the purity of Me2PDA |
| ## the measurand |
| # mu is μPC, i.e. the mean of the FA purity |
| MIS1 <− 94.13 |
| MPC1 <− 441.4 |
| MIS2 <− 132.11 |
| MPC2 <− 441.4 |
| # define the parameters of the Beta prior distribution for the purity of FA. The mean of this |
| # distribution mu is the measurand. |
| a <− mu/(sd * sd) |
| b <− (1 − mu)/(sd * sd) |
| mu ~ dunif(0.6, 1) |
| sd ~ dunif(0, 0.1) |
| # |
| # define the distribution of the purity of MSM and of Me2PDA, as obtained in section 3.2. |
| # Variability is given in terms of precision, that is, as 1/0.00092 for MSM and 1/0.00082 for |
| # Me2PDA. |
| muP1 ~ dnorm(1.0009, 1234 568) |
| muP2 ~ dnorm(0.9994, 1562 500) |
| # |
| # specify the prior distributions for the 4 samples analyzed using MSM |
| for(i in 1:4){pPC1[i] ~ dbeta(a, b) |
| # |
| # specify the observation equations for the mass measurements for the 4 samples analyzed using # MSM |
| # |
| # specify the observation equation for the area measurements of MSM |
| for(i in 1:4){ |
| k1[i] ~ dunif(0, 0.01) |
| for (i in 1:4){ |
| AIS1 [i] ~ dt(meanAIS1[i], 2)} |
| # |
| #specify the observation equation for area of FA for the 4 samples analyzed using MSM |
| for(i in 1:4){ |
| k.cut1[i] <− cut(k1[i]) |
| } |
| for(i in 1:4){ |
| APC1[i] ~ dnorm(meanAPC1[i], precAPC1[i])} |
| # specify the prior distributions for the 3 samples analyzed using Me2PDA |
| for(i in 1:3){pPC2[i] ~ dbeta(a, b) |
| # |
| # specify the observation equations for the mass measurements for the 3 samples analyzed using |
| #Me2PDA |
| # |
| # specify the observation equation for the area measurements of Me2PDA |
| for(i in 1:3){ |
| k2[i] ~ dunif(0,0.01) |
| for(i in 1:3){ |
| AIS2[i] ~ dt(meanAIS2[i],precAIS2[i],2)} |
| # specify the observation equation for area of FA for the 3 samples analyzed using Me2PDA |
| for(i in 1:3){ |
| k.cut2[i] <− cut(k2[i]) |
| } |
| for(i in 1:3){ |
| APC2[i] ~ dnorm(meanAPC2[i], precAPC2[i])} |
| } |
| ### |
| Data file: |
| list(mIS1 = c(0.0025 868,0.0035075,0.00221,0.0030396), |
| mPC1 = c(0.0050 559,0.0052 872,0.0047 875,0.0047 024), |
| = 0.0000 005, |
| AIS1 = c(0.011 537 871 36,0.018 76,0.007 463 797 45,0.014 921 675 52), |
| = c(0.000 000 979 46,0.000 000 638 23,0.000 000 766 96,0.000 000 731 31), |
| APC1 = c(0.004 358 849 11,0.005 466 738 20,0.003 120 493 48,0.004456829 30), |
| = c(0.000 013 411 11,0.000 011 639 93,0.000 006 567 77,0.000 010 620 83), |
| mIS2 = c(0.001 112,0.001 4288,0.002 1288), |
| mPC2 = c(0.004 4486,0.005 0038,0.004 4013), = 0.000 0005, |
| AIS2 = c(0.005 795 760 35,0.006 063 714 28,0.008 545 249 45), |
| = c(0.000 000 122 54,0.000 000 134 47,0.000 000 190 03), |
| APC2 = c(0.006 278 342 26,0.005 755 195 08,0.004 793 63762), |
| = c(0.000 008 621 13,0.000 020 048 63,0.000 006 162 32)) |
References
- 1.Quinn TJ. Metrologia. 1997;34:61–5. [Google Scholar]
- 2.Malz F, Jancke H. J Pharm Biomed Anal. 2004;38:813–23. doi: 10.1016/j.jpba.2005.01.043. [DOI] [PubMed] [Google Scholar]
- 3.Holzgrabe U, Deubner R, Schollmayer C, Waibel B. J Pharm Biomed Anal. 2005;38:806–812. doi: 10.1016/j.jpba.2005.01.050. [DOI] [PubMed] [Google Scholar]
- 4.Jancke H, Malz F, Haesselbarth W. Accredit Qual Assur. 2005;10:421–9. [Google Scholar]
- 5.Pauli GF. Phytochem Anal. 2001;12:28–42. doi: 10.1002/1099-1565(200101/02)12:1<28::AID-PCA549>3.0.CO;2-D. [DOI] [PubMed] [Google Scholar]
- 6.DeBiévre P, Dybkaer R, Fajgelj A, Hibbert B. Pure Appl Chem. 2011;83:1873–935. [Google Scholar]
- 7.Nelson M, Bedner M, Lang B, Toman B, Lippa K. Anal Bioanal Chem. 2015;407:8557–69. doi: 10.1007/s00216-015-9013-7. [DOI] [PubMed] [Google Scholar]
- 8.Weber M, Hellriegel C, Rueck A, Sauermoser R, Wuethrich J. Accredit Qual Assur. 2013;18:91–8. [Google Scholar]
- 9.Saito T, Ihara T, Koike M, Kinugasa S, Fujimine Y, Nose K, Hirai T. Accredit Qual Assur. 2009;14:79–86. [Google Scholar]
- 10.Schoenberger T. Anal Bioanal Chem. 2012;403:247–54. doi: 10.1007/s00216-012-5777-1. [DOI] [PubMed] [Google Scholar]
- 11.Jancke H. NMR Spectroscopy as a primary analytical metho. CCQM Report. 1998;98:1–12. [Google Scholar]
- 12.Bharti SK, Roy R. Trends Anal Chem. 2012;35:5–26. [Google Scholar]
- 13.Wells RJ, Cheung J, Hook JM. Accredit Qual Assur. 2004;9:450–6. [Google Scholar]
- 14.Gadape HH, Parikh KS, editors. J Chem Pharm Res. 2011;3:649. [Google Scholar]
- 15.Al-Deen TS, Hibbert DB, Hook JM, Wells RJ. Accredit Qual Assur. 2004;9:55–63. [Google Scholar]
- 16.Joint Committee for Guides in Metrology. Evaluation of Measurement Data—Guide to the Expression of Uncertainty in Measurement. Sevres: International Bureau of Weights and Measures (BIPM); 2008. available at www.bipm.org/utils/common/documents/jcgm/JCGM_100_2008_E.pdf (Accessed: June 2016) [Google Scholar]
- 17.Lafarge T, Possolo A. NIST Uncertainty Machine. Gaithersburg, MD: National Institute of Standards and Technology; 2015. available at http://uncertainty.nist.gov/ (Accessed: June 2016. [Google Scholar]
- 18.Possolo A, Toman B. Metrologia. 2007;44:464–75. [Google Scholar]
- 19.Gelman A, Carlin J, Stern H, Dunson D, Vehtari A, Rubin D. Bayesian Data Analysis. 3rd. Boca Raton, FL: CRC Press; 2013. [Google Scholar]
- 20.Lunn DJ, Spiegelhalter D, Thomas A, Best N. Stat Med. 2009;28:3049–82. doi: 10.1002/sim.3680. [DOI] [PubMed] [Google Scholar]
- 21.Toman B. Technometrics. 2007;49:81–7. [Google Scholar]
- 22.Toman B, Possolo A. Accredit Qual Assur. 2009;14:553–63. [Google Scholar]
