Abstract
This commentary compares 233 CheqSol intrinsic solubility values (log S0) reported in the Wiki-pS0 database for 145 different druglike molecules to the 838 log S0 values determined mostly by the saturation shake-flask (SSF) method for 124 of the molecules from the CheqSol set. The range of log S0 spans from -1.0 to -10.6 (log molar units), averaging at -3.8. The correlation plot between the two methods indicates r2 = 0.96, RMSE = 0.34 log unit, and a slight bias of -0.07 log unit. The average interlaboratory standard deviation (SDi) is slightly better for the CheqSol set than that of the SSF set: SDiCS = 0.15 and SDiSSF = 0.24. The intralaboratory errors reported in the CheqSol method (0.05 log) need to be multiplied by a factor of 3 to match the expected interlaboratory errors for the method. The scale factor, in part, relates to the hidden systematic errors in the single-lab values. It is expected that improved standardizations in the ‘gold standard’ SSF method, as suggested in the recent ‘white paper’ on solubility measurement methodology, should make the SDi of both methods be about ~0.15 log unit. The multi-lab averaged log S0 (and the corresponding SDi) values could be helpful additions to existing training-set molecules used to predict the intrinsic solubility of drugs and druglike molecules.
Keywords: Interlaboratory solubility measurement errors, aqueous intrinsic solubility, shake-flask solubility, potentio- metric solubility, thermodynamic solubility, Henderson-Hasselbalch equation
Introduction
This commentary considers the interlaboratory reproducibility of published aqueous intrinsic solubility data (log S0) for 124 drugs, determined both by the potentiometric CheqSol (CS) method (233 reported log S0CS values) and mostly by the ‘gold standard’ saturation shake-flask (SSF) method (838 log S0SSF). For each drug, its method-dependent interlaboratory measurement standard deviations (SDiCS and SDiSSF) are estimated, by comparing solubility values for a given drug determined in different laboratories. The multi-lab averaged log S0 (and the corresponding SDi values) could be helpful additions to existing training-set molecules used to predict the intrinsic solubility of drugs and druglike molecules.
The present contribution is the third in a series of papers aiming to address contemporary issues of solubility measurement, interpretation and prediction [1,2]. These are intended to serve as prologue/accompaniment to an upcoming session on solubility at the IAPC-8 meeting in Split, Croatia, 9-11 September 2019. Since 2009, the International Association of Physical Chemists (IAPC, www.iapchem.org) series of symposia maintained extensive coverage of the topic of solubility measurement, both from solid state and solution perspectives. At the 2015 IAPC-4 meeting, the special session on solubility measurement resulted in a widely-circulated ‘white paper’ drawing on expert consensus thoughts of scientists from six countries (Hungary, Russia, Serbia, Spain, Sweden, United States) [3]. It is expected that future sessions will continue to cover solubility methods and strategies, to critically address the different needs in pharmaceutical research, spanning from drug discovery to drug development.
Background
Most of the small-molecule research compounds in today’s drug discovery projects are ionizable and poorly soluble in water, and are thus prone to show low and/or erratic in vivo intestinal absorption [4]. In discovery, high-throughput microtitre plate methods are used to estimate solubility, where small volumes of 10 mM DMSO solutions of library compounds are added to buffer solutions to induce formation of solid suspensions in the wells. Such estimates of solubility are needed, in part, to anticipate whether compounds would precipitate in bioassays (and thus indicate false positives). In parallel, methods to predict solubility of molecules play an important role in drug discovery, since virtual screening of compound libraries could prioritize molecules for testing in early in vitro screens [2,5,6].
Saturation shake-flask (SSF) and potentiometric (CheqSol) methods
In more advanced stages of drug research, solubility measurement necessarily becomes more rigorous, where ‘solubility’ refers to the concentration of a solute in a saturated solution, where the dissolved molecule is in a thermodynamic equilibrium with its crystalline form suspended in the solution (of a known pH, composition, and temperature). Accurate solubility measurement of druglike molecules can be very difficult to do well, although to an untrained eye it ought to be as easy as measuring the concentration of a molecule in water. In development projects, thermodynamic solubility measurements are usually done using some variant of the SSF method [3]. As an alternative, a potentiometric procedure, called the dissolution template titration (DTT) method was introduced in 1998 [7] and validated two years later [8]. The methodology is capable of producing highly precise (intralaboratory SD ~0.07 log unit) measurements of aqueous intrinsic solubility, log S0, i.e., the solubility of the uncharged form of an ionizable molecule. A much faster variant of the pH-titration method, called CheqSol, was described in 2005 [9]. Instruments implementing the potentiometric methods have been used in several universities and pharmaceutical companies.
All of the potentiometric methods require that the molecule be ionizable and that the accurately-measured pKa be provided. The molecule cannot be too soluble, since the method depends on the pH difference between a saturated and an unsaturated solution in the titration where the molecule is half ionized. So, it is ideally suited for low-soluble molecules, since these molecules display large pH differences. Furthermore, to calculate the CheqSol log S0, it is assumed that solubility as a function of pH follows the curve predicted by the Henderson-Hasselbalch equation. The molecule needs to be stable to hydrolysis when repeatedly exposed to pH conditions far from neutral. Means to recognize hydrolytic decomposition are important to incorporate into the measurement. Sometimes multiple polymorphs may form in the CheqSol method, which requires solid state characterization to identify. In the traditional SSF method, most of the time, the thermodynamically most-stable form of the solid is the one associated with the measured solubility [3]. Equilibration times are selected to be long (24-168 h) to ensure this expectation.
Challenging measurement
What can make solubility measurement of an ionizable low-soluble molecule so difficult? There are two sides to consider for the reaction at equilibrium: (a) the solid state and (b) the solution. Temperature needs to be regulated and specified. To keep the following examples simple, let’s assume that the solvent is distilled water or an aqueous buffer, and that the crystalline form of the test compound is a free-acid/base or a salt. Multiple circumstances may arise, some making the interpretation of the measurement challenging: (i) the simplest suspension is the one where no drug ionization takes place on equilibration. One needs to measure the concentration of the compound in the saturated solution, and to confirm that the solid state form is unchanged – simple. (ii) However, if the solid introduced is not the thermodynamically most-stable polymorph (or is amorphous), then it is possible that the measured solubility would correspond to a different solid form. That’s important to know. (iii) Complication can arise if a low-soluble weak base is added to water (usually saturated with CO2 from the air). The pH will change, depending on the pKa of the molecule. The ambient CO2 may act as a buffer, so the final pH of the saturated solution needs to be carefully measured. Otherwise, the calculated log S0 could be quite erroneous. (iv) If a solid salt of the compound is added to water, a supersaturated solution may form. On equilibration, there could be two precipitates in the suspension: the original compound salt and the neutral free-acid/base form of the solid (at the pH called ‘pHmax’). Solid state characterization of the solid(s) and the measurement of pH would be highly beneficial. (v) When a buffer medium is used, the analysis of the solution and solid states can be complicated, as water-soluble drug-buffer complexes or aggregates may form [10]. In a supersaturated solution, drug molecules may self-associate as sub-micellar aggregates, particularly if they are surface-active [11,12]. For bases introduced as drug salts into a high-pH solution, the charged drug in the supersaturated solution can disproportionate into oil or undergo precipitation into an amorphous solid, along with which charged water-soluble aggregates may co-exist [10-13]. Given enough time, the multiple phases are expected to undergo transformation into a thermodynamically most-stable crystalline solid. Good understanding of solution chemistry and solid state characterization is essential for correctly interpreting the results of solubility measurements, so that high-quality intrinsic solubility data can be reported [3].
The need for high-quality data in accurate solubility prediction
Accurate prediction of the intrinsic solubility of druglike molecules [2] requires that (a) the log S0 values used to train the prediction method are of high quality (with water solubility values, log Sw, or values measured at a particular pH, log SpH, properly corrected for ionization [14] and with all solubility values referring to the same temperature [15]), and (b) the compounds in the training set cover the druglike chemical space of the test set of compounds.
These two important notions became the focus of a number of studies since 2008, spurred by the publications of Llinàs et al. [16] and Hopfinger et al. [17]. These authors introduced the ‘Solubility Challenge,’ a competition to probe the limits of prediction methods. The CheqSol method was used to measure the log S0 of 132 structurally diverse drugs. The log S0 values of 100 molecules were offered as the training set for the prediction of an external test set of 32 molecules (not found in the training set), whose values were not revealed before the completion of the competition.
In a number of earlier studies, it was suggested that the typical error in measured aqueous solubility is ~0.6 log unit or higher, when the solubility values were collected from many published sources [18]. This suggested that the quality of prediction methods was approaching the experimental limit. However, in the Solubility Challenge competition all of the values came from one laboratory, and the intralaboratory precision (repetitive measurements of the same sample by the same chemist, using the same instrument) of the CheqSol data was reported to be SD = 0.05 log unit. It was not known what the expected interlaboratory precision would be, given the unknown systematic errors that might affect the accuracy of results. For example, when 125 published CheqSol values were compared in 2015 to those obtained by the SSF method, it was reported that r2 = 0.90, prediction root-mean-square-error, RMSE = 0.52 log unit, and there was a slight bias of -0.13 log unit [19]. The values in the comparison came from the Wiki-pS0 database (in-ADME Research), which at that time contained 4557 log S0 entries. The database now has 6355 entries, with many newly added CheqSol and SSF values. It was thus of interest to update and better characterize the comparison of data quality between the CheqSol and the ‘gold standard’ saturation shake-flask methods. In parallel, armed with new curated data, the second Solubility Challenge has just been announced [2], with the prediction submission deadline set to 8 September 2019, the day before the IAPC-8 conference starts. The Excel submission form in Supporting Info at https://pubs.acs.org/doi/suppl/10.1021/acs.jcim.9b00345 is freely downloadable for those interested to participate. The data described below could be a useful addition to other druglike training sets currently in circulation.
Method
Data source: Wiki-pS0 database
The ongoing Wiki-pS0 database project [2,3,15,19], which started in 2011, now has 6355 log S0 entries for 3014 different drug-relevant molecules (solids at room temperature), drawing on the study of 1325 publications. The overall interlaboratory standard deviation, SDiALL = 0.17 log unit, has been estimated from the 870 molecules for which solubility was reported from two or more different sources (comprising 4209 individual S0 values), by taking the average of the individual 870 SD values. The SDiALL, being lower than the older estimate of solubility measurement error (~0.6 log unit [18]), indicates that (i) when legacy data are subjected to critical analysis, as recommended in [3,15,19], improvements in the quality of the extracted log S0 data can be achieved, and (ii) there is room for further improvement to the current prediction methods. Alongside the database, the pDISOL-X program (in-ADME Research) was designed to interpret solubility data and make temperature corrections, to produce a reliable estimate of the underpinning log S0 [10,11,20].
Results
There are 233 reported CheqSol log S0 values in the Wiki-pS0 database for 145 different druglike molecules. For 124 of the molecules, there are 838 reported log S0 determined mostly by the SSF method. Of the 838 entries, 298 (36 %) were log S0 values calculated from log S vs. pH data (using pDISOL-X), based on a total of 2925 individual log SpH measurments. (For 21 of the 145 molecules, SSF data have not been located in the literature.) Table 1 lists the solubility values for the 124 overlapping molecules measured by the CheqSol and SSF methods. The range of log S0 spans from -1.0 to -10.6 (log molar units), averaging at -3.8.
Table 1.
Compound | avg. 25 °C log S0SSF | SDSSF | N | avg. 25 °C log S0CS | SD CS | N | Ref. (CheqSol) |
---|---|---|---|---|---|---|---|
Acebutolol | -2.50 | 0.42 | 2 | -2.68 | 0.31 | 1 | [17] |
Acetaminophen | -0.97 | 0.10 | 18 | -1.03 | 0.05 | 2 | [16,22] |
Acetazolamide | -2.38 | 0.18 | 10 | -2.44 | 0.04 | 1 | [16] |
Alprenolol | -2.83 | 0.16 | 1 | -2.66 | 0.04 | 2 | [22,28] |
Amantadine | -2.76 | 0.50 | 1 | -1.90 | 0.07 | 2 | [22,29] |
Amiodarone | -10.58 | 0.35 | 4 | -9.68 | 0.15 | 1 | [22] |
Amitriptyline | -4.66 | 0.40 | 5 | -4.55 | 0.15 | 1 | [22] |
Amodiaquine | -4.74 | 0.35 | 1 | -5.87 | 0.10 | 2 | [16,22] |
Amoxicillin | -2.13 | 0.06 | 10 | -1.97 | 0.08 | 1 | [17] |
Aripiprazole | -6.74 | 0.16 | 2 | -6.43 | 0.30 | 1 | [28] |
Atenolol | -1.20 | 0.06 | 7 | -1.25 | 0.06 | 4 | [22,23,27,28] |
Atropine | -2.01 | 0.29 | 4 | -2.00 | 0.07 | 1 | [16] |
Barbital,Hexo- | -2.79 | 0.15 | 5 | -2.67 | 0.07 | 1 | [16] |
Barbital,Pheno- | -2.30 | 0.08 | 25 | -2.29 | 0.07 | 1 | [22] |
Bendroflumethiazide | -4.35 | 0.34 | 4 | -4.19 | 0.15 | 2 | [24,29] |
Benzocaine | -2.18 | 0.12 | 13 | -2.33 | 0.31 | 1 | [17] |
Benzoic_Acid | -1.59 | 0.05 | 12 | -1.61 | 0.15 | 1 | [22] |
Benzoic_Acid,4-Hydroxy- | -1.38 | 0.10 | 5 | -1.46 | 0.04 | 1 | [22] |
Benzthiazide | -4.84 | 0.28 | 4 | -4.86 | 0.04 | 2 | [22,24] |
Bifonazole | -6.27 | 0.25 | 4 | -6.01 | 0.15 | 1 | [25] |
Bisoprolol | -2.40 | 0.34 | 2 | -1.46 | 0.26 | 1 | [28] |
Bupivacaine | -3.47 | 0.18 | 9 | -3.08 | 0.21 | 2 | [16,24] |
Carprofen | -4.63 | 0.05 | 1 | -4.70 | 0.22 | 1 | [22] |
Carvedilol | -5.46 | 0.36 | 12 | -4.46 | 0.27 | 1 | [28] |
Cephalothin | -3.40 | 0.32 | 1 | -2.94 | 0.04 | 1 | [16] |
Chlorpheniramine | -2.65 | 0.10 | 1 | -2.60 | 0.12 | 3 | [21,22] |
Chlorpromazine | -5.45 | 0.31 | 11 | -5.55 | 0.04 | 1 | [22] |
Chlorpropamide | -3.15 | 0.17 | 5 | -3.21 | 0.05 | 2 | [16,28] |
Chlorprothixene | -5.66 | 0.38 | 3 | -6.31 | 0.44 | 3 | [16,22] |
Chlorzoxazone | -2.78 | 0.11 | 3 | -2.64 | 0.04 | 2 | [16,29] |
Cimetidine | -1.50 | 0.22 | 7 | -1.69 | 0.04 | 1 | [16] |
Ciprofloxacin | -3.57 | 0.19 | 19 | -3.60 | 0.18 | 1 | [22] |
Cyproheptadine | -5.02 | 0.46 | 4 | -5.00 | 0.15 | 1 | [24] |
Diazoxide | -3.46 | 0.26 | 3 | -3.36 | 0.07 | 1 | [16] |
Dibucaine | -3.70 | 0.35 | 1 | -4.20 | 0.26 | 2 | [17,24] |
Diclofenac | -5.33 | 0.19 | 27 | -5.40 | 0.14 | 6 | [9,21,22,24,26] |
Diethylstilbestrol | -4.38 | 0.38 | 6 | -4.43 | 0.48 | 1 | [17] |
Difloxacin | -3.94 | 0.10 | 2 | -3.60 | 0.04 | 1 | [16] |
Diflunisal | -5.08 | 0.40 | 5 | -4.92 | 0.70 | 6 | [17,21] |
Diltiazem | -2.95 | 0.20 | 1 | -3.05 | 0.16 | 2 | [22] |
Diphenhydramine | -3.29 | 0.64 | 3 | -2.95 | 0.04 | 1 | [22] |
Dipyridamole | -5.13 | 0.13 | 10 | -5.16 | 0.02 | 1 | [17] |
Enrofloxacin | -3.16 | 0.13 | 7 | -3.18 | 0.18 | 1 | [16] |
Famotidine | -2.67 | 0.27 | 5 | -2.65 | 0.04 | 1 | [22] |
Fenoprofen | -3.92 | 0.20 | 1 | -3.70 | 0.26 | 1 | [16] |
Flufenamic_Acid | -5.27 | 0.28 | 10 | -5.35 | 0.04 | 1 | [22] |
Flumequine | -4.10 | 0.08 | 1 | -3.80 | 0.11 | 2 | [16,22] |
Flurbiprofen | -4.36 | 0.19 | 21 | -4.05 | 0.15 | 2 | [16,22] |
Fluvastatin | -3.78 | 0.06 | 1 | -3.87 | 0.16 | 1 | [28] |
Furosemide | -4.51 | 0.20 | 19 | -4.18 | 0.12 | 3 | [21,22] |
Glibenclamide | -6.63 | 0.45 | 10 | -6.41 | 0.07 | 2 | [27,28] |
Gliclazide | -4.27 | 0.40 | 14 | -4.20 | 0.15 | 3 | [23,27,28] |
Glimepiride | -7.22 | 0.42 | 5 | -6.44 | 0.15 | 1 | [27] |
Glipizide | -5.66 | 0.24 | 6 | -5.51 | 0.10 | 3 | [27-29] |
Guanine | -4.09 | 0.07 | 2 | -4.43 | 0.37 | 1 | [16] |
Haloperidol | -5.76 | 0.14 | 8 | -5.52 | 0.19 | 2 | [22,24] |
Hydroflumethiazide | -2.72 | 0.11 | 15 | -2.70 | 0.03 | 3 | [21,29] |
Hydrochlorothiazide | -3.08 | 0.08 | 5 | -2.97 | 0.44 | 1 | [16] |
Ibuprofen | -3.80 | 0.22 | 19 | -3.97 | 0.23 | 4 | [21-23] |
Imipramine | -4.39 | 0.28 | 7 | -4.15 | 0.13 | 4 | [17,21,22] |
Ketoprofen | -3.45 | 0.23 | 20 | -3.23 | 0.02 | 4 | [17,21,24] |
Lidocaine | -1.82 | 0.08 | 19 | -1.87 | 0.05 | 1 | [22] |
Loperamide | -6.80 | 0.32 | 3 | -7.10 | 0.04 | 2 | [16,22] |
Maprotiline | -4.54 | 0.26 | 3 | -4.75 | 0.08 | 2 | [22,24] |
Meclizine | -5.55 | 0.66 | 1 | -6.48 | 0.15 | 1 | [29] |
Meclofenamic_Acid | -6.88 | 0.08 | 2 | -6.56 | 0.42 | 2 | [17,22] |
Mefenamic_Acid | -6.36 | 0.42 | 6 | -6.54 | 0.28 | 2 | [16,22] |
Metoclopramide | -3.18 | 0.28 | 1 | -3.58 | 0.07 | 1 | [29] |
Metoprolol | -1.31 | 0.16 | 2 | -1.22 | 0.01 | 2 | [22,28] |
Metronidazole | -1.25 | 0.06 | 17 | -1.22 | 0.04 | 1 | [16] |
Miconazole | -6.04 | 0.41 | 4 | -5.38 | 0.45 | 2 | [16,22] |
Nadolol | -1.29 | 0.40 | 2 | -1.63 | 0.08 | 2 | [22,28] |
Nalidixic_Acid | -3.76 | 0.26 | 8 | -3.61 | 0.04 | 1 | [16] |
Naphthoic_Acid,2- | -3.82 | 0.28 | 5 | -3.77 | 0.15 | 1 | [22] |
Naphthol,1- | -2.09 | 0.10 | 6 | -1.98 | 0.07 | 1 | [22] |
Naproxen | -4.21 | 0.14 | 15 | -4.41 | 0.25 | 2 | [16,22] |
Niflumic_Acid | -4.18 | 0.13 | 6 | -4.11 | 0.08 | 2 | [16,22] |
Nitrofurantoin | -3.34 | 0.11 | 11 | -3.29 | 0.06 | 2 | [16,22] |
Norfloxacin | -2.88 | 0.16 | 18 | -2.75 | 0.15 | 1 | [22] |
Nortriptyline | -3.94 | 0.23 | 3 | -3.93 | 0.02 | 2 | [16,22] |
Ofloxacin | -1.37 | 0.14 | 6 | -1.27 | 0.04 | 1 | [16] |
Olanzapine | -4.47 | 0.17 | 1 | -4.23 | 0.18 | 1 | [24] |
Orphenadrine | -3.71 | 0.35 | 1 | -3.17 | 0.15 | 1 | [22] |
Paliperidone | -4.56 | 0.18 | 1 | -4.31 | 0.22 | 1 | [28] |
Papaverine | -4.40 | 0.14 | 8 | -4.21 | 0.24 | 4 | [16,21,22,24] |
Phenazopyridine | -3.99 | 0.16 | 6 | -4.19 | 0.07 | 1 | [16] |
Phenol,4-Iodo- | -1.83 | 0.07 | 1 | -1.72 | 0.04 | 1 | [22] |
Phenylbutazone | -4.51 | 0.36 | 10 | -4.39 | 0.04 | 1 | [22] |
Phenytoin | -4.08 | 0.13 | 29 | -3.86 | 0.18 | 1 | [16] |
Phthalic_Acid,2- | -1.46 | 0.02 | 10 | -1.55 | 0.08 | 2 | [16,22] |
Pindolol | -3.81 | 0.14 | 6 | -3.64 | 0.13 | 3 | [22,24,28] |
Pioglitazone | -6.21 | 0.81 | 3 | -6.16 | 0.46 | 1 | [28] |
Piroxicam | -4.46 | 0.26 | 16 | -4.73 | 0.06 | 4 | [16,21,22] |
Pramoxine | -3.46 | 0.39 | 1 | -3.02 | 0.15 | 1 | [22] |
Probenecid | -4.82 | 0.24 | 3 | -4.86 | 0.04 | 1 | [17] |
Procaine | -2.59 | 0.46 | 2 | -1.72 | 0.07 | 1 | [22] |
Prochlorperazine | -4.38 | 0.26 | 1 | -4.75 | 0.15 | 1 | [22] |
Promethazine | -4.39 | 0.19 | 10 | -4.26 | 0.15 | 1 | [22] |
Propranolol | -3.82 | 0.30 | 4 | -3.49 | 0.06 | 6 | [21-24,27,28] |
Pyrimethamine | -3.96 | 0.57 | 3 | -4.11 | 0.22 | 1 | [22] |
Quinine | -3.16 | 0.67 | 5 | -2.80 | 0.01 | 2 | [16,22] |
Ranitidine | -1.70 | 0.57 | 1 | -2.50 | 0.26 | 1 | [16] |
Rosiglitazone | -5.28 | 0.50 | 3 | -5.22 | 0.61 | 1 | [28] |
Salicylic_Acid | -1.88 | 0.08 | 20 | -1.93 | 0.04 | 1 | [17] |
Sertraline | -5.41 | 0.28 | 2 | -4.83 | 0.15 | 1 | [22] |
Sulfacetamide | -1.50 | 0.05 | 5 | -1.52 | 0.04 | 1 | [16] |
Sulfamerazine | -3.10 | 0.07 | 6 | -3.12 | 0.05 | 1 | [22] |
Sulfamethazine | -2.63 | 0.27 | 6 | -2.73 | 0.04 | 1 | [16] |
Sulfamethizole | -2.77 | 0.14 | 5 | -2.78 | 0.14 | 1 | [17] |
Sulfasalazine | -6.47 | 0.08 | 7 | -6.21 | 0.10 | 2 | [16,22] |
Sulfathiazole | -2.61 | 0.23 | 8 | -2.69 | 0.15 | 1 | [22] |
Terfenadine | -7.68 | 0.71 | 10 | -8.40 | 0.15 | 1 | [29] |
Tetracaine | -3.22 | 0.11 | 1 | -3.05 | 0.06 | 2 | [16,24] |
Tetracycline | -3.29 | 0.05 | 6 | -3.00 | 0.09 | 2 | [16,29] |
Tetracycline,Oxy- | -3.27 | 0.07 | 4 | -3.09 | 0.33 | 1 | [16] |
Thiabendazole | -4.13 | 0.46 | 3 | -3.48 | 0.10 | 1 | [17] |
Thymol | -2.18 | 0.05 | 3 | -2.19 | 0.04 | 1 | [22] |
Tolbutamide | -3.55 | 0.11 | 5 | -3.50 | 0.05 | 2 | [17,28] |
Tolmetin | -4.05 | 0.15 | 1 | -4.11 | 0.03 | 2 | [16,22] |
Trazodone | -3.31 | 0.09 | 4 | -3.19 | 0.40 | 2 | [17] |
Trichlormethiazide | -3.64 | 0.12 | 3 | -3.38 | 0.18 | 3 | [16,22] |
Trimethoprim | -2.64 | 0.39 | 9 | -2.95 | 0.15 | 1 | [16] |
Verapamil | -4.36 | 0.28 | 9 | -4.07 | 0.22 | 3 | [21,22] |
Warfarin | -4.77 | 0.22 | 9 | -4.80 | 0.14 | 2 | [22,24] |
a SD refers to the standard deviation of reported values from N different sourcs. The average interlaboratory SDi values for for the two sets of data are: SDiSSF = 0.24 and SDiCS = 0.15 log unit.
Note that indomethacin is not listed in the table. The CheqSol value was not accepted in the Wiki-pS0 database due to the hydrolytic decomposition encountered during the CheqSol assay [30]. On the other hand, the pH-metric DDT method indicated log S0DTT = -5.33 [31], in good agreement with the average of 20 interlaboratory SSF measurements: log S0SSF = -5.49, SDi = 0.23.
Figure 1 shows the correlation plot between the two types of measurements, with each point showing both method interlaboratory error bars. The statistics have improved slightly over the 2015 comparison [19], with current values being r2 = 0.96, RMSE = 0.34 log unit, with a lower slight bias of -0.07 log unit. The average interlaboratory standard deviation is slightly lower for the CheqSol set over that of the SSF set: SDiCS = 0.15 and SDiSSF = 0.24, which probably highlights the benefit of using a highly standardized method (CheqSol) over an ‘open’ method (SSF). It should be kept in mind that the above comparison sets are small. For 870 of these sorts of comparisons, SDiALL = 0.17. The intralaboratory comparison between the methods by one group of researchers performing both the SSF and CheqSol measurements (both highly-standardized) [32] produced the statistics r2 = 0.96, RMSE = 0.20 for 15 compounds, comparable to SDiALL. The latter is the target for computational methods to aim at, provided that the training sets are of high and consistent quality.
Conclusions
This brief commentary reasserts that the quality of the standardized CheqSol measurements is comparable to that of the ‘gold standard’ saturation shake-flask measurements. Measurement errors are much lower than commonly acknowledged in the computational prediction community. The intralaboratory (single instrument) errors reported in the CheqSol method (0.05 log) need to be multiplied by a factor of 3 to match the expected interlaboratory errors for the method (0.15 log). The scale factor, in part, relates to the hidden systematic errors in the single-lab values. It is expected that better standardizations in the ‘open’ SSF methods, as recommended in the ‘white paper’ [3], may equalize the SDi of both methods at about ~0.15 log unit. When solubility prediction methods indicate RMSE below 0.15, ‘overfitting’ is probably taking place, overlapping noise with information.
Acknowledgement
The commentary sprang out of a suggestion made by a reviewer of the earlier paper in the series [2], that it would be helpful if a training set of data were provided for the new Solubility Challenge [2]. We are grateful for this good thought.
Footnotes
Conflict of interest. None.
References
- [1].Bergström C.A.S., Avdeef A.. Perspectives in solubility measurement and interpretation. ADMET & DMPK 7 (2019) 88-105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Llinas A., Avdeef A.. Solubility Challenge revisited after 10 years, with multi-lab shake-flask data, using tight (SD ~0.17 log) and loose (SD ~0.62 log) test sets, J. Chem. Inf. Model. (2019). doi: 10.1021/acs.jcim.9b00345. 10.1021/acs.jcim.9b00345 [DOI] [PubMed] [Google Scholar]
- [3].Avdeef A. Fuguet E. Llinàs A. Ràfols C. Bosch E. Völgyi G. Verbi T.ćBoldyreva E. Takács-Novák K.. Equilibrium solubility measurement of ionizable drugs – consensus recommendations for improving data quality. ADMET & DMPK 4 (2016) 117-178. [Google Scholar]
- [4].Bergström C.A.S., Holm R., Jørgensen S.A., Andersson S.B.E., Artursson P., Beato S., Borde A., Box K., Brewster M., Dressman J., Feng K.-I., Halbert G., Kostewicz E., McAllister M., Muenster U., Thinnes J., Taylor R., Mullertz A.. Early pharmaceutical profiling to predict oral drug absorption: Current status and unmet needs. Eur. J. Pharm. Sci. 57 (2014) 173-199. [DOI] [PubMed] [Google Scholar]
- [5].Dearden J.C.. In silico prediction of aqueous solubility. Expert Opin. Drug Discov. 1 (2006) 31–52. [DOI] [PubMed] [Google Scholar]
- [6].Taskinen J., Norinder U.. In silico prediction of solubility. In: Testa B., van de Waterbeemd H. (Eds.). Comprehensive Medicinal Chemistry II, Elsevier: Oxford, UK, 2007, pp. 627-648. [Google Scholar]
- [7].Avdeef A.. pH-metric solubility. 1. Solubility-pH profiles from Bjerrum plots. Gibbs buffer and pKa in the solid state. Pharm. Pharmacol. Commun. 4 (1998) 165-178. [Google Scholar]
- [8].Avdeef A., Berger C.M., Brownell C.. pH-metric solubility. 2. Correlation between the acid-base titration and the saturation shake-flask solubility-pH methods. Pharm. Res. 17 (2000) 85-89. [DOI] [PubMed] [Google Scholar]
- [9].Stuart M., Box K.. Chasing equilibrium: Measuring the intrinsic solubility of weak acids and bases. Anal. Chem. 77 (2005) 983-990. [DOI] [PubMed] [Google Scholar]
- [10].Marković O.S. Pešić M.P. Shah A.V. Serajuddin A.T.M. Verbi T.Z.ćAvdeef A.. Solubility-pH profile of desipramine hydrochloride in saline phosphate buffer: enhanced solubility due to drug-buffer aggregates. Eur. J. Pharm. Sci. 133 (2019) 264-274. [DOI] [PubMed] [Google Scholar]
- [11].Pobudkowska A., Ràfols C., Subirats X., Bosch E., Avdeef A.. Phenothiazines solution complexity – determination of pKa and solubility-pH profiles exhibiting sub-micellar aggregation at 25 and 37 ℃. Eur. J. Pharm. Sci. 93 (2016) 163-176. [DOI] [PubMed] [Google Scholar]
- [12].Shoghi E., Fuguet E., Bosch E., Ràfols C.. Solubility-pH profile of some acidic, basic and amphoteric drugs. Eur. J. Pharm. Sci. 48 (2012) 290-300. [DOI] [PubMed] [Google Scholar]
- [13].Zografi G., Zarenda I.. The surface activity of phenothiazine derivatives at the air-solution interface. Biochem. Pharmacol. 15 (1966), 591–598. [DOI] [PubMed] [Google Scholar]
- [14].Avdeef A.. Absorption and Drug Development Second Edition, Wiley-Interscience, Hoboken NJ, 2012, Ch. 6. [Google Scholar]
- [15].Avdeef A.. Solubility temperature dependence predicted from 2D structure. ADMET & DMPK 3 (2015) 298-344. [Google Scholar]
- [16].Llinàs A., Glen R.C., Goodman J.M.. Solubility challenge: can you predict solubilities of 32 molecules using a database of 100 reliable measurements? J. Chem. Inf. Model. 48 (2008) 1289-1303. [DOI] [PubMed] [Google Scholar]
- [17].Hopfinger A.J., Esposito E.X., Llinàs A., Glen R.C., Goodman J.M.. Findings of the challenge to predict aqueous solubility. J. Chem. Inf. Model. 49 (2009) 1-5. [DOI] [PubMed] [Google Scholar]
- [18].Katritzky A.R., Wang Y., Sild S., Tamm T., Karelson M.. QSPR studies on vapor pressure, aqueous solubility, and the prediction of water-air partition coefficients. J. Chem. Inf. Model. 38 (1998) 720-725. [Google Scholar]
- [19].Avdeef A.. Suggested improvements for measurement of equilibrium solubility-pH of ionizable drugs. ADMET & DMPK 3 (2015) 84-109. [Google Scholar]
- [20].Völgyi G., Marosi A., Takács-Novák K., Avdeef A.. Salt solubility products of diprenorphine hydrochloride, codeine and lidocaine hydrochlorides and phosphates – Novel method of data analysis not dependent on explicit solubility equations. ADMET & DMPK 1 (2013) 48-62. [Google Scholar]
- [21].Baek K., Jeon S.B., Kim B.K., Kang N.S.. Method validation for equilibrium solubility and determination of temperature effect on the ionization constant and intrinsic solubility of drugs. J. Pharm. Sci. Emerg. Drugs 6 (2018) 1-6. [Google Scholar]
- [22].Box K.J., Comer J.E.A.. Using measured pKa, log P and solubility to investigate supersaturation and predict BCS class. Curr. Drug Metab. 9 (2008) 869-878. [DOI] [PubMed] [Google Scholar]
- [23].Etherson K., Halbert G., Elliott M.. Determination of excipient based solubility increases using the CheqSol method. Int. J. Pharm. 465 (2014) 202-209. [DOI] [PubMed] [Google Scholar]
- [24].Fornells E., Fuguet E., Mañéa M., Ruiz R., Box K., Bosch E., Ràfols C.. Effect of vinylpyrrolidone polymers on the solubility and supersaturation of drugs; a study using the Cheqsol method. Eur. J. Pharm. Sci. 117 (2018) 227-235. [DOI] [PubMed] [Google Scholar]
- [25].Hsieh Y.-L., Ilevbare G.A., van Eerdenbrugh B., Box K.J., Sanchez-Felix M.V., Taylor L.S.. pH-Induced precipitation behavior of weakly basic compounds: determination of extent and duration of supersaturation using potentiometric titration and correlation to solid state properties. Pharm. Res. 29 (2012) 2738-2753. [DOI] [PubMed] [Google Scholar]
- [26].Llinàs A., Burley J.C., Box K.J., Glen R.C., Goodman J.M.. Diclofenac solubility: independent determination of the intrinsic solubility of three crystal forms. J. Med. Chem. 50 (2007) 979-983. [DOI] [PubMed] [Google Scholar]
- [27].Narasimham L.Y.S., Barhate V.D.. Kinetic and intrinsic solubility determination of some β-blockers and antidiabetics by potentiometry. J. Pharmacy Res. 4 (2011) 532-536. [Google Scholar]
- [28].Schönherr D., Wollatz U., Haznar-Garbacz D., Hanke U., Box K.J., Taylor R., Ruiz R., Beato S., Becker D., Weitschies W.. Characterisation of selected active agents regarding pKa values, solubility concentrations and pH profiles by SiriusT3. Eur. J. Pharm. Biopharm. 92 (2015) 155-170. [DOI] [PubMed] [Google Scholar]
- [29].Sköld C., Winiwarter S., Wernevik J., Bergström F., Engström L., Allen R., Box K., Comer J., Mole J., Hallberg A., Lennernäs H., Lundstedt T., Ungell A.-L., Karlén A.. Presentation of a structurally diverse and commercially available drug data set for correlation and benchmarking studies. J. Med. Chem. 49 (2006) 6660-6671. [DOI] [PubMed] [Google Scholar]
- [30].Comer J., Judge S., Matthews D., Towes L., Falcone B., Goodman J., Dearden J.. The intrinsic aqueous solubility of indomethacin. ADMET & DMPK 2 (2014) 18-32. [Google Scholar]
- [31].Bergström C.A.S., Strafford M, Lazarova L., Avdeef A., Luthman K., Artursson P.. Absorption classification of oral drugs based on molecular surface properties. J. Med. Chem. 46 (2003) 558-570. [DOI] [PubMed] [Google Scholar]
- [32].Box K.J., Völgyi G., Baka E., Stuart M., Takács-Novák K., Comer J.E.A.. Equilibrium versus kinetic measurements of aqueous solubility, and the ability of compounds to supersaturate in solution- a validation study. J. Pharm. Sci. 95 (2006) 1298-1307. [DOI] [PubMed] [Google Scholar]