Figure - PMC

Skip to main content

View full-text article in PMC

. Author manuscript; available in PMC: 2019 Oct 1.

Published in final edited form as: J Comput Aided Mol Des. 2018 Aug 6;32(10):1203–1216. doi: 10.1007/s10822-018-0138-6

Fig. 4 — Training data set. The pK_a of the training data set compounds are used to derive a simple linear model that relates the free energy correction $Δ G_{corr}^{*}$ to the experimental pK_a. Two linear models were derived: a global linear model (black dashed line), utilizing all data, and a piecewise linear model that applies to either neutral acids (subset QM1, blue) or to positively charged acids (subset QM2, green). a: Correlation between experimental and calculated pK_a of the training data set. The dashed line indicates ideal correlation with the gray band indicating 1 pK_a unit deviation. b: Global linear fit of the calculated $Δ G_{corr}^{*}$ to the experimental pK_a. c: Linear fits of the calculated $Δ G_{corr}^{*}$ to the experimental pK_a, split between the QM1 and the QM2 subsets. In (b) and (c) the dashed lines are linear models to the data, with shaded bands indicating 95% confidence intervals from 1000 bootstrap samples.