Abstract
The octanol–water partition coefficient (logPow) is an important index for measuring solubility, membrane permeability, and bioavailability in the drug discovery field. In this paper, the logPow values of 58 compounds were predicted by alchemical free energy calculation using molecular dynamics simulation. In free energy calculations, the atomic charges of the compounds are always fixed. However, they must be recalculated for each solvent. Therefore, three different sets of atomic charges were tested using quantum chemical calculations, taking into account vacuum, octanol, and water environments. The calculated atomic charges in the different environments do not necessarily influence the correlation between calculated and experimentally measured ∆Gwater values. The largest correlation coefficient values of the solvation free energy in water and octanol were 0.93 and 0.90, respectively. On the other hand, the correlation coefficient of logPow values calculated from free energies, the largest of which was 0.92, was sensitive to the combination of the solvation free energies calculated from the calculated atomic charges. These results reveal that the solvent assumed in the atomic charge calculation is an important factor determining the accuracy of predicted logPow values.
Keywords: MD simulation, thermodynamic integration, solvation environment, atomic charge
1. Introduction
The octanol–water partition coefficient (logPow) is an index used in drug discovery processes to estimate the solubility, membrane permeability, and bioavailability of compounds [1,2]. Several experimental methods for measuring it have been developed [2,3,4,5,6], but they are time consuming and costly, because experimental methods use the actual compounds, and sometimes, the synthesis of a compound takes a long time. To accelerate the drug discovery process and to reduce its cost, we need a logPow measurement method that is easier, faster, and more accurate than the current ones.
Computational methods for predicting the logPow value have been developed using various techniques. The methods summing the contributions of a compound’s atoms and fragments (e.g., XLOGP3 [7], KOWWIN [8], etc.) rapidly calculate logPow values similar to those obtained experimentally, and have therefore been used in the screening of drug-like compounds [9], making it possible to calculate the logPow values for the tens of millions of compounds in databases such as PubChem [10] and ZINC [11].
The logPow values of compounds can also be predicted accurately by alchemical free energy calculation techniques using molecular dynamics (MD) simulation and Monte Carlo simulation, such as thermodynamic integration (TI) [12] and free energy perturbation (FEP) [13]. These methods have been used to calculate hydration free energy of amino acids [14], nucleobases [15], and ligand/protein binding free energy [16,17,18,19,20,21,22,23,24,25,26,27,28]. They can treat water molecules either implicitly or explicitly. Thus, in the case of ligand/protein binding free energy calculation, they can be used to predict the thermodynamic behavior of a ligand in solvents. In compounds containing water molecules bridging between the protein and ligand, the calculation with explicit water molecules is efficient for calculating binding free energy. In the application of alchemical free energy calculation to logPow calculation, DeBolt and Kollman successfully estimated the logPow using FEP method with modified force field [29]. They were able to reproduce the structural and thermodynamic behaviors of liquid octanol. Using an implicit solvation model, Huang et al. reported a logPow prediction method in which the 3D-RISM-KH theory was used to calculate the solvation free energy in water and in octanol [30]. To reproduce the experimental conditions, Bhatnagar et al. reported an adaptive biasing force method using explicit water and octanol solvents [31]. In another application, saturated octanol solvation was used for the calculation, taking the hydrate state of octanol in the experiment into account [32,33]. These methods were able to estimate logPow values in agreement with the ones measured experimentally. The alchemical methods using all-atom MD simulation require time and computer resources, compared with atom- and fragment-based methods. However, these methods are able to predict the partition coefficient of several solvents, and observe the thermodynamic behavior of compounds in the solvents. Therefore, the free energy calculation using the alchemical method is expected to provide useful information on physical chemistry properties of the compounds.
In this paper, the logPow values for relatively small compounds were predicted from the solvent free energies calculated by TI [34,35]. The calculation of the difference free energy, ∆G, was estimated by the Bennett’s acceptance ratio (BAR) method [36]. To analyze the contribution of the atomic charges of the compounds, three sets of atomic charges were examined: one in vacuum, one in octanol, and the other in water. Solvation free energies in water and in octanol are in good agreement with the ones measured experimentally. The correlation coefficient (R) of the solvation free energies indicated strong correlation between calculated and experimentally measured values. They did not differ significantly among the three sets of atomic charges. The logPow values, however, showed slightly different behaviors. In fact, the R values of logPow depended on the combination of the atomic charges calculated in the different solvents. These results reveal that the environment assumed in the calculation of the atomic charges is a very critical factor for the logPow calculations. Therefore, our results provide useful information for improving the quality of the calculations. Finally, the logPow values of 17 drugs calculated using the best combination of the atomic charges showed good agreement with the values measured experimentally.
2. Results and Discussion
2.1. Solvation Free Energy in Water
To calculate free energy in water, the three charge sets were used independently. In this paper, the notation of the parameter sets calculated with the atomic charges in the different solvents are simplified as Pwater{x}, where x is either v(vacuum), o(octanol), or w(water). It should be noted that, in this work, we did not change the atomic charges in water molecules, because the force field of water molecules was established by the many studies conducted on water molecules [37].
Strong correlations between calculated and experimentally measured ∆Gwater values were observed (Figure 1). In particular, the free energy calculated using Pwater{w}, for which the coefficient of determination (R2) is 0.87, indicated good agreement with experimentally measured values. The free energy calculated using Pwater{v} also showed a strong correlation with the experimental data, even though R2 (=0.84) was the smallest in the three cases. These results confirm that the calculated atomic charges in the different environments do not necessarily influence the correlation between calculated and experimentally measured ∆Gwater values.
On the other hand, the values of atomic charge were sensitive to the solvent in the quantum chemistry calculation (Figure 2). In the three solvents, somewhat large differences in atomic charge values can be found in the charged group containing heteroatoms, e.g., oxygen in acetone, nitrogen in pyridine, and oxygen in octanol. The calculated atomic charges in octanol and water were similar, and were slightly different from the ones in the vacuum state. This tendency can be understood from the graphs drawn for the atomic charges vs. dielectric constant values (Figure S1). The atomic charges dramatically increase or decrease at dielectric constants around 10, and become constant thereafter. These differences can affect the regression line in the scatter diagrams (Figure 1). The slope of the regression line for ∆Gwater values with Pwater(w) is close to that with Pwater(o), whereas that with Pwater(v) is slightly different from the others. However, the R values do not differ significantly among the three parameters. Furthermore, there were strong correlations among three.
∆Gwater values, for which the R values of Pwater(v)–Pwater(o), Pwater(v)–Pwater(w), and Pwater(o)–Pwater(w) pairs, were 0.9912, 0.9885, and 0.9996, respectively. In these cases, the order in which the compounds are arranged according to the ∆Gwater, is more or less the same among the three pairs. These analyses show that the atomic charges were affected by the solvent types assumed in the calculation, but these differences did not affect the correlation of the ∆Gwater values.
The RMSE and MAE values in the three parameter sets show opposite relations to the R values; i.e., the calculated data having strong correlation showed large errors (Table 1). The ∆Gwater of Pwater{v} had the smallest R and R2 values, but the errors, for which the RMSE and MAE values are 4.49 kJ/mol (=1.07 kcal/mol) and 3.65 (=0.87 kcal/mol), respectively, were the smallest in the three parameter sets. On the other hand, the calculation with Pwater{w} shows the largest R value, but the RMSE and MAE values are the largest. In these calculations, the difference of RMSE and MAE values, however, are 1.2 kJ/mol and 0.6 kJ/mol between Pwater{v} and Pwater{w}. These may not be significant differences among the three parameters. Therefore, for all the selections of atomic charges, the BAR method can estimate the ∆Gwater values accurately enough.
Table 1.
Parameter | R | R2 | RMSE a (kJ/mol) | MAE b (kJ/mol) |
---|---|---|---|---|
Pwater{v} | 0.92 | 0.84 | 4.47 | 3.63 |
Pwater{o} | 0.93 | 0.87 | 5.05 | 3.74 |
Pwater{w} | 0.93 | 0.87 | 5.69 | 4.28 |
a Root mean square error; b Mean average error.
2.2. Solvation Free Energy in Octanol
In calculating the free energy in octanol, nine sets of the atomic charges were used: all combinations of the three sets of atomic charges of the compound (3 sets) and octanol (3 sets). The atomic charges of octanol molecule in octanol solvation, the octanol environment, should be considered. However, in this work, to understand the effect of the atomic charges in octanol on the logPow calculation, we took into account the atomic charges in the octanol molecule for vacuum, octanol, and water state.
The compound and octanol parameter sets calculated with the atomic charges in octanol is defined as Poctanol{x, y}, in which x and y (for compound and octanol, respectively) are either v(vacuum), o(octanol), or w(water).
Tendencies similar to those seen in the results obtained with the ∆Gwater calculation could be seen in the results obtained with the ∆Goctanol calculation. It can be seen in Figure 3 that all the calculated ∆Goctanol values showed strong correlations with the ones measured experimentally. The largest R value was 0.9 (Poctanol{v, o} and Poctanol{v, w}), though these values are slightly smaller than that in ∆Gwater (0.93). The calculated ∆Goctanol values distributed toward the small range of the horizontal axis, thus, all the calculations may underestimate the ∆Goctanol value. The tendency of the relationship between the R2 values and error values (RMSE and MAE) is similar to that seen in the ∆Gwater results; i.e., the larger the R2 values are, the larger the RMSE and MAE values are. The largest MAE is 13.26 kJ/mol (=3.16 kcal/mol) in the ∆Goctanol with Poctanol{w, w}. This value is a little bit larger than that in ∆Gwater. This result reveals that the calculation of ∆Goctanol values accumulates more errors than that of ∆Gwater. However, these ∆Goctanol values still showed strong correlation with the experimental data.
In Figure 3 and Table 2, the correlations between the calculated and experimentally measured ∆Goctanol values are shown to be independent of the atomic charges. The values of R and R2 are close in all the ∆Goctanol calculations. The R, R2, RMSE, and MAE values of the ∆Goctanol with Poctanol{o, o} are comparable with the other ones. This means that the atomic charges did not affect the correlation of ∆Goctanol calculations. The ∆Goctanol calculated with either Poctanol{v, o} or Poctanol{v, w} had the largest R and R2 values. Those values indicated a strong correlation with the experimental data. On the other hand, the ∆Goctanol calculated with Poctanol{w, v} had the smallest R and R2 values. However, these values still indicated strong correlation. The differences of R and R2 values between Poctanol{v, o} (or Poctanol{v, w}) and Poctanol{w, v}, are 0.02 and 0.04, respectively. These are not necessarily significant differences. Therefore, it is possible to conclude that the calculated atomic charges in the different environments did not affect the R and R2 values in ∆Goctanol calculations.
Table 2.
Parameter | R | R2 | RMSE a (kJ/mol) | MAE b (kJ/mol) |
---|---|---|---|---|
Poctanol{v, v} | 0.89 | 0.80 | 11.18 | 10.36 |
Poctanol{v, o} | 0.90 | 0.81 | 11.23 | 10.49 |
Poctanol{v, w} | 0.90 | 0.81 | 11.26 | 10.56 |
Poctanol{o, v} | 0.88 | 0.78 | 13.59 | 12.53 |
Poctanol{o, o} | 0.89 | 0.78 | 13.70 | 12.77 |
Poctanol{o, w} | 0.89 | 0.79 | 13.65 | 12.72 |
Poctanol{w, v} | 0.88 | 0.77 | 14.18 | 13.05 |
Poctanol{w, o} | 0.89 | 0.79 | 14.15 | 13.18 |
Poctanol{w, w} | 0.89 | 0.79 | 14.23 | 13.26 |
a Root mean square error; b Mean average error.
2.3. Effect of Water and Octanol Solvents on Free Energy
To investigate the effects of solvents on free energy calculation, the dielectric constant of water and octanol solvents were evaluated in the simulation. The dielectric constant was calculated as [38,39]
(1) |
where V, kb and T are volume, Boltzmann constant, and temperature, respectively. M is the total dipole moment. To calculate the dielectric constant of water and octanol solvents, additional 5 ns MD simulations were performed to generate the products, after 10,000 steps optimization, 50 ps heat-up, and 2 ns equilibration. The dielectric constants were calculated for the trajectories of 5 ns MD simulations.
Overestimated dielectric constants of water solvents were obtained by simulation (Table 3). This value is very similar to that reported by Vega et al. [40]. In Figure 1, the largest slope of the regulations line is found in the calculation with Pwater(v), of which absolute atomic charge values was smallest in the three parameters. Therefore, the pair of the water solvent with which the dielectric constant was underestimated, and the compound with Pwater(v) provided the closest absolute values of ∆G in our calculation.
Table 3.
Solvents | Charge | ε | <μ> (Debye) | Flu c (Debye2) | <V> (Å3) | vdW (kJ/mol) | Elec. (kJ/mol) | |
---|---|---|---|---|---|---|---|---|
Exp. | Sim. | |||||||
Water | 78.49 (25 °C) a | 92.70 | 2.35 | 20,914.2 | 22,290.7 | 14,371.9 | −33,641.5 | |
Octanol | Vacuum | 9.34 (32.1 °C) b | 4.21 | 2.08 | 836.1 | 26,547.7 | −3860.9 | −2778.5 |
Octanol | 3.63 | 2.24 | 678.0 | 26,320.5 | −3768.3 | −3687.8 | ||
Water | 3.42 | 2.27 | 625.2 | 26,347.2 | −3714.9 | −3879.3 |
On the other hand, the dielectric constants of octanol solvent were also underestimated. All the values in the different charges were smaller than the experimentally measured ones. These values were slightly smaller than that in the papers reported by DeBolt and Kollman [29]. They obtained dielectric constants near 5.1 for the liquid octanol at 40 °C. The dielectric constant in octanol solvent gradually decreased by the atomic charges in the order of vacuum, octanol, and water, i.e., these values decreased in proportion to the absolute values of atomic charges. In Table 3, <μ> of octanol with atomic charge in water was larger than that of vacuum because of increasing absolute atomic charges. This means that, owing to increasing <μ>, the values of electrostatic interaction increased. On the other hand, the van der Waals interaction drops with increasing absolute atomic charges. Since the van der Waals parameters were the same among the different simulations, the electrostatic interaction is thought to dominate the simulation. However, in Figure 3, the slopes of the regulation line were dependent on the atomic charges of compounds, rather than those of octanol solvents. Moreover, the dielectric constant with atomic charges in water solvent further differed from the experimentally measured values than that within vacuum. In the simulation of liquid octanol, of which the dielectric constant of the octanol is relatively smaller than that of water solvent, a strong contribution of the electrostatic interaction leads to smaller dielectric constant values that appear as simulation errors. Therefore, in order to improve the calculated free energy values, the balance between electrostatic and van der Waals interactions may need to be investigated in further studies.
2.4. LogPow for Tested Compounds
The logPow values were calculated using the Formula (7) (See Section 3). Here, the notation for the logPow value calculated from ∆Gwater using Pwater{x}, and ∆Goctanol using Poctanol{y, z} was simplified to PlogP{x, {y, z}}, where x, y, and z are either v(vacuum), o(octanol), or w(water).
The logPow values were sensitive to the atomic charges calculated in the solvent. The correlation of the logPow with the experimental data is strongly dependent on the pair of solvation free energies ∆Gwater and ∆Goctanol (Figure 4 and Figure S2). For instance, in Figure 4, completely different correlations of logPow values can be seen in the scatter diagrams with the largest and smallest R2, which are PlogP{w, {v, o}} and PlogP{v, {w, o}}, respectively. In the case of PlogP{w, {v, o}}, the logPow was calculated from ∆Gwater with Pwater{w}, and ∆Goctanol with Poctanol{v, o}. This combination is expected to result in logPow values closely correlated with the experimental ones.
On the other hand, in the case of PlogP{v, {w, o}}, the logPow values showed very weak correlation with the experimental data. The calculated logPow values were based on ∆Gwater with Pwater{v} and ∆Goctanol with Poctanol{w,o}. The free energies in water and octanol both showed strong correlations with the experimental data, and the RMSE and MAE values were not as significantly different from the others, as mentioned above. The slope of the regression line for ∆Gwater with Pwater{v}, however, was over 1, whereas that of the regression line for ∆Goctanol with Poctanol{w,o} was under 1. These differences raised the large RMSE and MAE values slightly (Table 4). As a result, the logPow values showed very weak correlation. These results reveal that the combination of the ∆Gwater and ∆Goctanol is very important for the correct logPow. That is, the selection of the solvent in the calculation of the atomic charges is very important for the logPow calculation. Thus, the inappropriate selection of the solvents for the calculation of atomic charges results in the weak correlation between the logPow and the experimental data. This tendency can be observed in the logPow with PlogP{w,{w, w}}. The ∆Gwater with Pwater{w}, and ∆Goctanol with Poctanol{w, w}, showed the strongest correlation in our test data. The values, however, showed moderate correlation (R = 0.77) not stronger than that of the ∆Gwater with PlogP{w, {v, o}}.
Table 4.
Parameters | R | R2 | RMSE 1 (kJ/mol) | MAE 2 (kJ/mol) |
---|---|---|---|---|
PlogP{v, {v, v}} | 0.77 | 0.59 | 2.16 | 2.04 |
PlogP{v, {v, o}} | 0.80 | 0.64 | 2.17 | 2.07 |
PlogP{v, {v, w}} | 0.82 | 0.68 | 2.17 | 2.07 |
PlogP{v, {o, v}} | 0.55 | 0.31 | 2.61 | 2.42 |
PlogP{v, {o, o}} | 0.60 | 0.36 | 2.63 | 2.46 |
PlogP{v, {o, w}} | 0.62 | 0.38 | 2.62 | 2.45 |
PlogP{v, {w, v}} | 0.49 | 0.24 | 2.72 | 2.51 |
PlogP{v, {w, o}} | 0.54 | 0.29 | 2.72 | 2.53 |
PlogP{v, {w, w}} | 0.54 | 0.29 | 2.74 | 2.55 |
PlogP{o, {v, v}} | 0.92 | 0.84 | 1.40 | 1.31 |
PlogP{o, {v, o}} | 0.92 | 0.85 | 1.42 | 1.34 |
PlogP{o, {v, w}} | 0.92 | 0.85 | 1.43 | 1.35 |
PlogP{o, {o, v}} | 0.85 | 0.73 | 1.79 | 1.69 |
PlogP{o, {o, o}} | 0.87 | 0.76 | 1.82 | 1.73 |
PlogP{o, {o, w}} | 0.88 | 0.77 | 1.81 | 1.72 |
PlogP{o, {w, v}} | 0.82 | 0.67 | 1.90 | 1.78 |
PlogP{o, {w, o}} | 0.84 | 0.71 | 1.90 | 1.81 |
PlogP{o, {w, w}} | 0.84 | 0.71 | 1.92 | 1.82 |
PlogP{w, {v, v}} | 0.92 | 0.85 | 1.26 | 1.17 |
PlogP{w, {v, o}} | 0.92 | 0.86 | 1.29 | 1.20 |
PlogP{w, {v, w}} | 0.92 | 0.85 | 1.31 | 1.21 |
PlogP{w, {o, v}} | 0.88 | 0.78 | 1.63 | 1.55 |
PlogP{w, {o, o}} | 0.90 | 0.80 | 1.66 | 1.59 |
PlogP{w, {o, w}} | 0.90 | 0.81 | 1.66 | 1.58 |
PlogP{w, {w, v}} | 0.86 | 0.75 | 1.73 | 1.64 |
PlogP{w, {w, o}} | 0.88 | 0.77 | 1.75 | 1.66 |
PlogP{w, {w, w}} | 0.88 | 0.77 | 1.76 | 1.68 |
1 Root mean square error; 2 Mean average error.
The chemical group’s effect on the logPow values of a compound is shown in Figure 5. In the logPow calculation with PlogP{w, {v, o}}, the errors of each compound were ~2.0 in all groups, except the ethers (for ethyl phenyl ether, the error was 3.16). The calculated ∆Gwater value of ethyl phenyl ether was an overestimate. The calculated and the experimentally measured ∆Gwater values were −10.22 kJ/mol and −17.92 kJ/mol, respectively. On the other hand, the calculated ∆Goctanol was an underestimate. The calculated and experimentally measured ∆Goctanol values were −34.91 kJ/mol and −23.65 kJ/mol, respectively. These differences caused large errors in logPow values. Similar errors can be found in the logPow values calculated with PlogP{w, {w, w}}. The profile of histograms for PlogP{w, {w, w}} is similar to that of histograms for PlogP{w, {v, o}}. The error of ethyl phenyl ether stands out from the other compounds. The error of each compound was around 2.0, but these errors are slightly larger than that of the values calculated with PlogP{w, {v, o}}. Therefore, the R values for PlogP{w, {w, w}} were slightly smaller than those for PlogP{w, {v, o}}.
On the other hand, most of the logPow values calculated with PlogP{v, {w, o}} were larger than those calculated with the other parameters. Those of the compounds in the phosphate group, which has a strong polarity at the phosphoric acid moiety, showed particularly large errors. The selected environments in the charge calculation was inconsistent, i.e., the vacuum state was selected in ∆Gwater calculation and water environment was selected in ∆Goctanol calculations. These inadequate combinations induce large errors in the logPow calculation. A similar tendency can be found in the other groups with polar moieties, such as ether and ketone groups. The selection of the environment of the calculated charges, therefore, is important in logPow calculation.
To analyze the contributions of the atomic charges in different environments, the R and R2 values of logPow in Table 4 were compared. In the calculated ∆Gwater, the R and R2 values become small in the order of Pwater{v}, Pwater{o}, and Pwater{w} (Figure S3a). In terms of the atomic charges of the compounds in ∆Goctanol calculations, the R and R2 values of logPow become small in the order of water, octanol, and vacuum environments (Figure S3b). However, the atomic charges of the octanol in octanol solvent seem to affect the logPow values very slightly (Figure S3c). To briefly summarize these results, the atomic charges of the compound in the ∆Gwater and ∆Goctanol calculations are better in water and vacuum environments, respectively. In free energy, the atomic charges of the octanol molecule in octanol solvent should be reasonable, nonetheless, they do not influence the logPow value much. It should be noted that this tendency will be independent of the functional and basis sets in the calculation of atomic charges, because the RMSE shows the same behavior (Figure S4), and the correlation efficient values of our procedure are very large with the other procedures using the different functionals and basis sets in five compounds (Table S4).
2.5. LogPow for 17 Compounds
The best combination of the parameters of the additional 17 compounds (PlogP{w, {v, w}}) was different from those of the small compounds (PlogP{w, {v, o}}) (Table S5). However, the R and R2 values of PlogP{w, {v, v}}, PlogP{w, {v, o}}, and PlogP{w, {v, w}} were very close to each other. Therefore, we conclude that choosing one of PlogP{w, {v, *}} (* = v, o, and w) parameter set will provide R and R2 values close to the experimental ones.
The calculated logPow values of the additional 17 compounds with the best combination of the atomic charges in water and octanol (PlogP{w, {v, w}}) showed a good agreement with the experimental data (Figure 6). The R and R2 values were 0.93 and 0.86, respectively. These values were similar to those obtained by the other methods based on the compounds’ atoms and fragments [44], though our method needs more computational time than the other methods. The RMSE and MAE were 2.58 and 2.26, respectively, which were slightly larger than those of the small compounds. These errors affect the slope of the regression line, which is somewhat smaller than that of the line obtained for the small compounds when using PlogP{w, {v, o}} (Figure 4). These results indicate that the order of the compounds sorted by the logPow values, is very similar to the order of the compounds sorted by the logPow values measured experimentally. This means that the alchemical free energy calculation method with PlogP{w, {v, w}} will be effective to predict the lipophilicity of compounds, and will be a useful tool for drug design.
The logPow values of the other additional 32 compounds were calculated for comparing our results with the previous results reported by Bannan et al. [45]. The logPow values show strong correlation with the experimentally measured values, of which the R and R2 values were 0.85 and 0.72, respectively (Figure 7). These values are very close to those calculated using GAFF-DC that were reported by Bannan et al., of which R and R2 values are 0.85 and 0.74, respectively. These results reveal that the consideration of the atomic charge is important for improving the logPow values.
3. Materials and Methods
3.1. Setting Parameters of Compounds
The general amber force field (GAFF) [46] was used for bonding parameters (bond length, bond angle, torsion angle, and improper torsion angle) and for nonbonding van der Waals parameters (radius and well-depths) of the compounds and 1-octanol. Atomic charges were calculated by using Gaussian09 software [47] at the B3LYP/6-31G(d) level. The polarizable continuum model (PCM) [48] was used to consider the solvent effect on the calculation of atomic charges. Antechamber software was used for restrained electrostatic potential (RESP) charge-fitting [49,50,51] from the results of quantum chemical calculation. In the preparations for the simulations, biomolecule’s atomic charges are calculated once in one solvent, i.e., in water solvent, in protein, or in vacuum state. When calculating a compound’s free energy in octanol, however, better results can be obtained using the atomic charges calculated for the compound in octanol. Therefore, in this study, the atomic charges of the compounds and 1-octanol were calculated for three states: in vacuum, in 1-octanol, and in water. In the calculation of the free energy in water, the three charge sets were used independently. To calculate the free energy in octanol, nine sets of the atomic charges were used: all combinations of the three sets of atomic charges of the compound (3 sets) and octanol (3 sets). The compound and octanol parameter sets calculated with the atomic charges in octanol is defined as Poctanol{x, y}, in which x and y (for compound and octanol, respectively) are either v(vacuum), o(octanol), or w(water). The logPow values based on these free energies were calculated. The dependency of the calculated atomic charges in the different environments on the free energy and logPow values were examined.
3.2. Initial Models of Compounds in Solvent Box
We constructed the model systems as follows. All the compounds were hydrated using a TIP3P [37] water box consisting of 723 water molecules, and all the compounds were placed into an octanol box consisting of 100 1-octanol molecules.
All the systems were minimized by 10,000 steps of the steepest-descent method, first with constraint on the positions of the heavy atoms, and then without constraint. After the minimization, 7 ns MD simulation for the compound was performed with Gromacs 4.5.3 software suite in a periodic boundary condition, by using particle-mesh Ewald (PME) calculations [52,53] for coulombic interactions. The temperature was set at 298.15 K by using a stochastic dynamics algorithm, and pressure was maintained at 1 bar by scaling the box size in accordance with the Berendsen algorithm [54]. The frictional coefficient γ and time step were 5 ps−1 and 2.0 fs. Snapshot structures were extracted from equilibration runs as initial structures for successive annihilation MD simulations. The snapshot after 2 ns of MD simulation was used as the initial structure for the free energy calculation.
3.3. Free Energy Calculation with BAR Method
Annihilation with the BAR method is widely used in the calculation of ligand/protein binding free energy [25,26,27,28], and is briefly explained here. We consider the changes in free energy for transformation from vacuum to solvent using the BAR method, based on the following formulas:
(2) |
(3) |
(4) |
where ΔGsolvent represents the free energy change from the compound in gas (Sgas) to the compound in solvent (Ssolvent) in Formula (3), and ΔGgas represents the free energy change from the compound in vacuum (S0) to the compound in gas (Sgas) in Formula (4). In Formula (3), to calculate the interaction by considering the reverse processes, the first annihilation is performed for the compound from solvent to gas. And in Formula (4), the second annihilation is performed to remove the internal energy in the compounds. In these processes, the free energy is estimated by summing the free energy differences between neighboring partially annihilated states (here states are denoted as windows). The free-energy difference between the neighboring i-th and (i + 1)-th windows is calculated by using the BAR method [55]. In the BAR method, for Ui representing the potential energy of the i-th window, nonequilibrium work W (=Ui+1–Ui) was sampled in the MD trajectory. According to the maximum likelihood argument of BAR, the free-energy estimator for two equal-length simulations can be written as
(5) |
where ΔGi is the annihilation free-energy change from the i-th window to the (i + 1)-th window, and where a pair of angle brackets indicates an ensemble average over a window. The free energy difference is then defined as follows:
(6) |
In the free energy calculation using the BAR method, the annihilating simulations with the soft core potential [56,57] were independently performed for the sets of λ points composing λC and λLJ for Coulomb and van der Waals interactions, respectively. With λLJ = 0.0, the λC was increased in steps of 0.05 from 0 to 1, and with λC = 1, the λLJ was increased in steps of 0.05 from 0 to 1. Totally, 41 λ-points were considered in this paper.
For each window, 1 ns runs for data collection were performed for water and octanol solvent after 1 ns MD simulation for equilibration. Nonequilibrium work between neighboring windows was measured every 2 ps. Other conditions for each simulation were the same as those for the simulation generating initial structure, as mentioned above. To reduce artifacts due to the initial structure selection and to improve the statistical reliability, three independent sets of annihilation simulations were performed.
3.4. Calculation of LogPow
The logPow values were calculated using the following formula:
(7) |
where R and T are the gas constant and temperature, respectively, and ΔGoctanol and ΔGwater are the free energy differences (calculated using the BAR method as described) between the compound in solvent and the compound in vacuum. Here, the notation for the logPow value calculated from ΔGwater using Pwater{x}, and ΔGoctanol using Poctanol{y, z}, was simplified to PlogP{x, {y, z}}, where x, y, and z are either v(vacuum), o(octanol) or w(water). Equation (6) indicates that we should consider the effects of free energy differences caused by the perturbation of the atomic charge, by the change of the environments, and by the different atomic charges. In this work, these free energy differences were neglected, because they cannot be accurately reproduced using currently available methods.
3.5. Test Compounds
The 58 relatively small sized compounds examined to measure the logPow are listed in Table S1, along with their experimentally measured free energies extracted from a paper by Wang et al. [58]. In each compound, logPow values were calculated by the combination of the free energies of the water and octanol solvents, i.e., three Pwater{x} and nine Poctanol{x, y} parameters. Finally, 27 logPow values were calculated in total.
The logPow values of an additional 17 compounds were calculated using the same combination of the atomic charges in water and octanol for free energy calculations, as mentioned above. These compounds (Table S2) were used as drugs in the work reported by Daina et al. [44]. The calculation conditions used were the same as those used for the other 58 compounds.
As additional tests, the logPow values of 32 compounds tested in the previous work reported by Bannan et al. [45], were calculated using the best combination of the atomic charge calculations. In this case, the Pwater{w} and Poctanol{v, o} showed the best correlation, and were used in additional calculations. All the calculations were performed with the same conditions as the previous work by Bannan et al. [45], except for the set of λ points, for which the values of this work were used, as mentioned above.
4. Conclusions
A logPow calculation method using alchemical free energy calculation with the BAR method was proposed, and the results showed good agreement with the values measured experimentally. The free energies in water and octanol both showed strong correlations with the experimental data. However, the correlation of logPow values was sensitive to the combination of the ΔGwater and ΔGoctanol values. The weak correlation was due to the different slopes of regression line between ΔGwater and ΔGoctanol. Finally, we found that the calculation with PlogP{w, {v, o}} provides the best correlations between the calculated logPow values and experimentally measured values.
We calculated the logPow values for the drugs using the best parameter set in our trials, and obtained values closely correlated with the ones obtained experimentally. The R values of logPow values were better than those obtained in previous studies. These results show that the solvent assumed in the charge calculation greatly influences the logPow values. The calculation of the atomic charges was time consuming, but dramatically increased the accuracy of the logPow values. We expect these accurate results to be useful in the field of drug discovery, and to accelerate drug design processes.
Acknowledgments
All the calculations in this study were performed using the HOKUSAI Greatwave of RIKEN Facility.
Supplementary Materials
The following are available online, Table S1: Test compounds, free energy values, and logPow values, Table S2: Additional test set, Table S3: Additional test set for comparing with our results. Table S4: Correlation coefficient of logPow values using between B3LYP/6-31G* and the different methods and basis sets, Table S5: Summary of logPow calculations for 17 compounds. Figure S1: Atomic charges vs. dielectric constant values in quantum chemistry calculation. Figure S2: Scatter diagram of calculated and experimentally measured logPow values, and Figure S3: Bar graph of R2 values for logPow. Figure S4: RMSE of logPow for five compounds (dimethyl sulfide, ethyl phenyl ether, naphthalene, 1-nitropropane, and tribromomethane) calculated using different methods and basis sets.
Author Contributions
K.O. performed MD simulations, analyzed the results, wrote the paper, and was involved in the design of the study. M.H. collected the data, and wrote the paper. S.N. designed the study, oversaw the overall project strategy and management, and wrote the paper.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Sample Availability: Samples of the compounds not available.
References
- 1.Hansch C., Fujita T. p-σ-π analysis. A method for the correlation of biological activity and chemical structure. J. Am. Chem. Soc. 1964;86:1616–1626. doi: 10.1021/ja01062a035. [DOI] [Google Scholar]
- 2.Leo A., Hansch C., Elkins D. Partition coefficients and their uses. Chem. Rev. 1971;71:525–616. doi: 10.1021/cr60274a001. [DOI] [Google Scholar]
- 3.Sangster J. Octanol–water partition coefficients of simple organic compounds. J. Phys. Chem. Ref. Data. 1989;18:1111–1229. doi: 10.1063/1.555833. [DOI] [Google Scholar]
- 4.Fujita T., Iwasa J., Hansch C. A new substituent constant, π, derived from partition coefficients. J. Am. Chem. Soc. 1964;86:5175–5180. doi: 10.1021/ja01077a028. [DOI] [Google Scholar]
- 5.Devoe H., Miller M.M., Wasik S.P. Generator columns and high pressure liquid chromatography for determining aqueous solubilities and octanol–water partition coefficients of hydrophobic substances. J. Res. Nat. Bur. Stand. 1981;86:361–366. doi: 10.6028/jres.086.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Opperhuizen A., Serne P., Van der Steen J.M.D. Thermodynamics of fish/water and octan-1-ol/water partitioning of some chlorinated benzenes. Environ. Sci. Technol. 1988;22:286–292. doi: 10.1021/es00168a008. [DOI] [PubMed] [Google Scholar]
- 7.Cheng T., Zhao Y., Li X., Lin F., Xu Y., Zhang X., Li Y., Wang R., Lai L. Computation of octanol–water partition coefficients by guiding an additive model with knowledge. J. Chem. Inf. Model. 2007;47:2140–2148. doi: 10.1021/ci700257y. [DOI] [PubMed] [Google Scholar]
- 8.Meylan W.M., Howard P.H. Estimating log P with atom/fragments and water solubility with log P. Perspect. Drug Discov. Des. 2000;19:67–84. doi: 10.1023/A:1008715521862. [DOI] [Google Scholar]
- 9.Lipinski C.A. Drug-like properties and the causes of poor solubility and poor permeability. J. Pharmacol. Toxicol. Methods. 2000;44:235–249. doi: 10.1016/S1056-8719(00)00107-6. [DOI] [PubMed] [Google Scholar]
- 10.Bolton E.E., Wang Y., Thiessen P.A., Bryant S.H. Chapter 12—Pubchem: Integrated platform of small molecules and biological activities. In: Ralph A.W., David C.S., editors. Annual Reports in Computational Chemistry. Volume 4. Elsevier; Amsterdam, The Netherlands: 2008. pp. 217–241. [Google Scholar]
- 11.Irwin J.J., Sterling T., Mysinger M.M., Bolstad E.S., Coleman R.G. Zinc: A free tool to discover chemistry for biology. J. Chem. Inf. Model. 2012;52:1757–1768. doi: 10.1021/ci3001277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kirkwood J.G. Statistical mechanics of fluid mixtures. J. Chem. Phys. 1935;3:300–313. doi: 10.1063/1.1749657. [DOI] [Google Scholar]
- 13.Zwanzig R.W. High-temperature equation of state by a perturbation method. I. Nonpolar gases. J. Chem. Phys. 1954;22:1420–1426. doi: 10.1063/1.1740409. [DOI] [Google Scholar]
- 14.Shirts M.R., Pitera J.W., Swope W.C., Pande V.S. Extremely precise free energy calculations of amino acid side chain analogs: Comparison of common molecular mechanics force fields for proteins. J. Chem. Phys. 2003;119:5740–5761. doi: 10.1063/1.1587119. [DOI] [Google Scholar]
- 15.Wolf M.G., Groenhof G. Evaluating nonpolarizable nucleic acid force fields: A systematic comparison of the nucleobases hydration free energies and chloroform-to-water partition coefficients. J. Comput. Chem. 2012;33:2225–2232. doi: 10.1002/jcc.23055. [DOI] [PubMed] [Google Scholar]
- 16.Chodera J.D., Mobley D.L., Shirts M.R., Dixon R.W., Branson K., Pande V.S. Alchemical free energy methods for drug discovery: Progress and challenges. Curr. Opin. Struct. Biol. 2011;21:150–160. doi: 10.1016/j.sbi.2011.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Steinbrecher T., Labahn A. Towards accurate free energy calculations in ligand protein-binding studies. Curr. Med. Chem. 2010;17:767–785. doi: 10.2174/092986710790514453. [DOI] [PubMed] [Google Scholar]
- 18.Gao J., Kuczera K., Tidor B., Karplus M. Hidden thermodynamics of mutant proteins: A molecular dynamics analysis. Science. 1989;244:1069–1072. doi: 10.1126/science.2727695. [DOI] [PubMed] [Google Scholar]
- 19.Ha S., Gao J., Tidor B., Brady J.W., Karplus M. Solvent effect on the anomeric equilibrium in D-glucose: A free energy simulation analysis. J. Am. Chem. Soc. 1991;113:1553–1557. doi: 10.1021/ja00005a015. [DOI] [Google Scholar]
- 20.Jorgensen W.L., Ravimohan C. Monte carlo simulation of differences in free energies of hydration. J. Chem. Phys. 1985;83:3050–3054. doi: 10.1063/1.449208. [DOI] [Google Scholar]
- 21.Kollman P. Free energy calculations: Applications to chemical and biochemical phenomena. Chem. Rev. 1993;93:2395–2417. doi: 10.1021/cr00023a004. [DOI] [Google Scholar]
- 22.Bash P., Singh U., Langridge R., Kollman P. Free energy calculations by computer simulation. Science. 1987;236:564–568. doi: 10.1126/science.3576184. [DOI] [PubMed] [Google Scholar]
- 23.Wong C.F., McCammon J.A. Dynamics and design of enzymes and inhibitors. J. Am. Chem. Soc. 1986;108:3830–3832. doi: 10.1021/ja00273a048. [DOI] [Google Scholar]
- 24.Merz K.M., Kollman P.A. Free energy perturbation simulations of the inhibition of thermolysin: Prediction of the free energy of binding of a new inhibitor. J. Am. Chem. Soc. 1989;111:5649–5658. doi: 10.1021/ja00197a022. [DOI] [Google Scholar]
- 25.Kitamura K., Tamura Y., Ueki T., Ogata K., Noda S., Himeno R., Chuman H. Binding free-energy calculation is a powerful tool for drug optimization: Calculation and measurement of binding free energy for 7-azaindole derivatives to glycogen synthase kinase-3β. J. Chem. Inf. Model. 2014;54:1653–1660. doi: 10.1021/ci400719v. [DOI] [PubMed] [Google Scholar]
- 26.Okada O., Yamashita H., Takedomi K., Ono S., Sunada S., Kubodera H. Prediction of the binding affinity of compounds with diverse scaffolds by MP-CAFEE. Biophys. Chem. 2013;180–181:119–126. doi: 10.1016/j.bpc.2013.07.005. [DOI] [PubMed] [Google Scholar]
- 27.Fujitani H., Tanida Y., Ito M., Jayachandran G., Snow C.D., Shirts M.R., Sorin E.J., Pande V.S. Direct calculation of the binding free energies of fkbp ligands. J. Chem. Phys. 2005;123:084108. doi: 10.1063/1.1999637. [DOI] [PubMed] [Google Scholar]
- 28.Fujitani H., Tanida Y., Matsuura A. Massively parallel computation of absolute binding free energy with well-equilibrated states. Phys. Rev. E. 2009;79:021914. doi: 10.1103/PhysRevE.79.021914. [DOI] [PubMed] [Google Scholar]
- 29.DeBolt S.E., Kollman P.A. Investigation of structure, dynamics, and solvation in 1-octanol and its water-saturated solution: Molecular dynamics and free-energy perturbation studies. J. Am. Chem. Soc. 1995;117:5316–5340. doi: 10.1021/ja00124a015. [DOI] [Google Scholar]
- 30.Huang W., Blinov N., Kovalenko A. Octanol–water partition coefficient from 3D-RISM-KH molecular theory of solvation with partial molar volume correction. J. Phys. Chem. B. 2015;119:5588–5597. doi: 10.1021/acs.jpcb.5b01291. [DOI] [PubMed] [Google Scholar]
- 31.Bhatnagar N., Kamath G., Chelst I., Potoff J.J. Direct calculation of 1-octanol–water partition coefficients from adaptive biasing force molecular dynamics simulations. J. Chem. Phys. 2012;137:014502. doi: 10.1063/1.4730040. [DOI] [PubMed] [Google Scholar]
- 32.Hansen N., Hünenberger P.H., van Gunsteren W.F. Efficient combination of environment change and alchemical perturbation within the enveloping distribution sampling (eds) scheme: Twin-system eds and application to the determination of octanol–water partition coefficients. J. Chem. Theory Comput. 2013;9:1334–1346. doi: 10.1021/ct300933y. [DOI] [PubMed] [Google Scholar]
- 33.Chen B., Siepmann J.I. Partitioning of alkane and alcohol solutes between water and (dry or wet) 1-octanol. J. Am. Chem. Soc. 2000;122:6464–6467. doi: 10.1021/ja001120+. [DOI] [Google Scholar]
- 34.Pranata J., Jorgensen L.W. Monte carlo simulations yield absolute free energies of binding for guanine—Cytosine and adenine—Uracil base pairs in chloroform. Tetrahedron. 1991;47:2491–2501. doi: 10.1016/S0040-4020(01)81783-X. [DOI] [Google Scholar]
- 35.Jorgensen W.L., Buckner J.K., Boudon S., Tirado-Rives J. Efficient computation of absolute free energies of binding by computer simulations. Application to the methane dimer in water. J. Chem. Phys. 1988;89:3742–3746. doi: 10.1063/1.454895. [DOI] [Google Scholar]
- 36.Bennett C.H. Efficient estimation of free energy differences from monte carlo data. J. Comput. Phys. 1976;22:245–268. doi: 10.1016/0021-9991(76)90078-4. [DOI] [Google Scholar]
- 37.Jorgensen W.L., Chandrasekhar J., Madura J.D., Impey R.W., Klein M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. doi: 10.1063/1.445869. [DOI] [Google Scholar]
- 38.De Leeuw S.W., Perram J.W., Smith E.R. Simulation of electrostatic systems in periodic boundary conditions. I. Lattice sums and dielectric constants. Proc. R. Soc. A. Math. Phys. Sci. 1980;373:27–56. doi: 10.1098/rspa.1980.0135. [DOI] [Google Scholar]
- 39.Zhang C., Hutter J., Sprik M. Computing the kirkwood G-factor by combining constant maxwell electric field and electric displacement simulations: Application to the dielectric constant of liquid water. J. Phys. Chem. Lett. 2016;7:2696–2701. doi: 10.1021/acs.jpclett.6b01127. [DOI] [PubMed] [Google Scholar]
- 40.Vega C., Abascal J.L.F. Simulating water with rigid non-polarizable models: A general perspective. Phys. Chem. Chem. Phys. 2011;13:19663–19688. doi: 10.1039/c1cp22168j. [DOI] [PubMed] [Google Scholar]
- 41.Uematsu M., Frank E.U. Static dielectric constant of water and steam. J. Phys. Chem. Ref. Data. 1980;9:1291–1306. doi: 10.1063/1.555632. [DOI] [Google Scholar]
- 42.Smyth C.P., Stoops W.N. The dielectric polarization of liquids. Vi. Ethyl iodide, ethanol, normal-butanol and normal-octanol. J. Am. Chem. Soc. 1929;51:3312–3329. doi: 10.1021/ja01386a019. [DOI] [Google Scholar]
- 43.Smyth C.P., Stoops W.N. The dielectric polarization of liquids. Vii. Isomeric octyl alcohols and molecular orientation. J. Am. Chem. Soc. 1929;51:3330–3341. doi: 10.1021/ja01386a020. [DOI] [Google Scholar]
- 44.Daina A., Michielin O., Zoete V. iLOGP: A simple, robust, and efficient description of n-octanol/water partition coefficient for drug design using the GB/SA approach. J. Chem. Inf. Model. 2014;54:3284–3301. doi: 10.1021/ci500467k. [DOI] [PubMed] [Google Scholar]
- 45.Bannan C.C., Calabró G., Kyu D.Y., Mobley D.L. Calculating partition coefficients of small molecules in octanol/water and cyclohexane/water. J. Chem. Theory Comput. 2016;12:4015–4024. doi: 10.1021/acs.jctc.6b00449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Wang J., Wolf R.M., Caldwell J.W., Kollman P.A., Case D.A. Development and testing of a general amber force field. J. Comput. Chem. 2004;25:1157–1174. doi: 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
- 47.Frisch M.J., Trucks G.W., Schlegel H.B., Scuseria G.E., Robb M.A., Cheeseman J.R., Scalmani G., Barone V., Mennucci B., Petersson G.A., et al. Gaussian 09. Gaussian, Inc.; Wallingford, CT, USA: 2009. [Google Scholar]
- 48.Tomasi J., Mennucci B., Cammi R. Quantum mechanical continuum solvation models. Chem. Rev. 2005;105:2999–3094. doi: 10.1021/cr9904009. [DOI] [PubMed] [Google Scholar]
- 49.Bayly C.I., Cieplak P., Cornell W., Kollman P.A. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: The RESP model. J. Phys. Chem. 1993;97:10269–10280. doi: 10.1021/j100142a004. [DOI] [Google Scholar]
- 50.Cornell W.D., Cieplak P., Bayly C.I., Kollmann P.A. Application of RESP charges to calculate conformational energies, hydrogen bond energies, and free energies of solvation. J. Am. Chem. Soc. 1993;115:9620–9631. doi: 10.1021/ja00074a030. [DOI] [Google Scholar]
- 51.Cieplak P., Cornell W.D., Bayly C., Kollman P.A. Application of the multimolecule and multiconformational RESP methodology to biopolymers: Charge derivation for DNA, RNA, and proteins. J. Comput. Chem. 1995;16:1357–1377. doi: 10.1002/jcc.540161106. [DOI] [Google Scholar]
- 52.Darden T., York D., Pedersen L. Particle mesh ewald: An N·log(N) method for Ewald sums in large systems. J. Chem. Phys. 1993;98:10089–10092. doi: 10.1063/1.464397. [DOI] [Google Scholar]
- 53.Essmann U., Perera L., Berkowitz M.L., Darden T., Lee H., Pedersen L.G. A smooth particle mesh Ewald method. J. Chem. Phys. 1995;103:8577–8593. doi: 10.1063/1.470117. [DOI] [Google Scholar]
- 54.Berendsen H.J.C., Postma J.P.M., Van Gunsteren W.F., Dinola A., Haak J.R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 1984;81:3684–3690. doi: 10.1063/1.448118. [DOI] [Google Scholar]
- 55.Shirts M.R., Bair E., Hooker G., Pande V.S. Equilibrium free energies from nonequilibrium measurements using maximum-likelihood methods. Phys. Rev. Lett. 2003;91:140601–140604. doi: 10.1103/PhysRevLett.91.140601. [DOI] [PubMed] [Google Scholar]
- 56.Beutler T.C., Mark A.E., van Schaik R.C., Gerber P.R., van Gunsteren W.F. Avoiding singularities and numerical instabilities in free energy calculations based on molecular simulations. Chem. Phys. Lett. 1994;222:529–539. doi: 10.1016/0009-2614(94)00397-1. [DOI] [Google Scholar]
- 57.Zacharias M., Straatsma T.P., McCammon J.A. Separation-shifted scaling, a new scaling method for lennard-jones interactions in thermodynamic integration. J. Chem. Phys. 1994;100:9025–9031. doi: 10.1063/1.466707. [DOI] [Google Scholar]
- 58.Wang J., Wang W., Huo S., Lee M., Kollman P.A. Solvation model based on weighted solvent accessible surface area. J. Phys. Chem. B. 2001;105:5055–5067. doi: 10.1021/jp0102318. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.