Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2008 Oct 10.
Published in final edited form as: J Med Chem. 2005 Aug 25;48(17):5561–5569. doi: 10.1021/jm0501376

QSAR Studies of Copper Azamacrocycles and Thiosemicarbazones

MM3 Parameter Development and Prediction of Biological Properties

Peter Wolohan 1, Jeongsoo Yoo 1, Michael J Welch 1, David E Reichert 1,*
PMCID: PMC2566539  NIHMSID: NIHMS60559  PMID: 16107156

Abstract

Genetic algorithms (GA) were used to develop specific copper metal-ligand force field parameters for the MM3 force field, from a combination of crystallographic structures and ab initio calculations. These new parameters produced results in good agreement with experiment and previously reported copper metal-ligand parameters for the AMBER force field. The MM3 parameters were then used to develop several Quantitative Structure Activity Relationship (QSAR) models. A successful QSAR for predicting the lipophilicity (logPow) of several classes of Cu(II) chelating ligands, was built using a training set of thirty-two Cu(II) radiometal complexes and six simple molecular descriptors. The QSAR exhibited a correlation between the predicted and experimental logPow with a r2 = 0.95, q2 = 0.92. When applied to an external test set of eleven Cu(II) complexes the QSAR preformed with great accuracy; r2 = 0.93 and a q2 = 0.91 utilizing a leave-one-out cross-validation analysis. Additional QSAR models were developed to predict the biodistribution of a smaller set of Cu(II) bis(thiosemicarbazone) complexes.

Introduction

Radioactive copper complexes have been utilized for many years in both a therapeutic mode and as imaging agents for such non-invasive medical techniques as gamma scintigraphy and positron emission tomography (PET).1,2 Macrocyclic polyaminocarboxylate ligands for Cu(II) have been used most successfully as bifunctional chelates (BFCs) where the metal chelate is coupled to a biological targeting molecule such as a monoclonal antibody or a peptide.3-6 Cu(II) bis(thiosemicarbazone) complexes have been used in vivo as radiotracers for the evaluation of blood flow in various organs such as the brain, kidneys and heart as well as for the evaluation of hypoxia in tissue.7-9 In addition Cu(II) thiosemicarbazone complexes are of interest as they have shown significant anti-tumor activity.10 Indeed whether used in a therapeutic mode or as an imaging agent the desire to understand and predict the biodistribution and bioactivity of such Cu(II) complexes is a priority.

A particularly important property to be able to predict for in vivo use is the logarithm of the experimental n-Octanol/Water partition coefficient (logPow). This is an important pharmacological descriptor of bioavailability for both organic compounds and inorganic complexes.11 Predicting and altering logPow is a key element in rational drug design since the correct incorporation of lipophilic groups to a chemical compound can result in an increase in biological activity.12

Quantitative structure-activity relationships (QSAR) have become a common tool in the field of molecular modeling since their introduction.13 Indeed they have found application in both the prediction of biological activity and more recently in the prediction of the Absorption, Distribution, Metabolism, Excretion and Toxicological (ADME/tox) properties of organic drug-like compounds.14-18 QSAR methods have also been applied to the prediction of logPow.19 While QSAR methods have found many applications in the modeling of the properties of organic drug-like compounds; there have been few reports of QSAR studies of metal based pharmaceuticals. One possible factor in this low utilization is the lack of specific metal-ligand parameters for most common force fields; in general most force fields do a poor job of accurately predicting the structure of complexes of interest for in vivo imaging or radiotherapy. With the development of suitable metal-ligand force field parameters, the utilization of QSAR methods requires much less effort and will likely aid the search for novel metal-based pharmaceuticals targeted towards biological receptors.

The MM3* force field developed by Allinger et al20 is one of the most accurate force fields available for the modeling of organic ligands.21 Several researchers have successfully developed specific metal-ligand parameters compatible with this accurate forcefield; Ru22, Re23, Pd24, and a variety of metal amide complexes.25 The MM3* force field describes the energy of a molecule with a simple algebraic expression consisting of terms describing bond stretching, angle bending, dihedral rotation and intermolecular forces, such as electrostatics, van der Waals interactions and hydrogen bonding. The constants in these equations are obtained from experimental data or ab initio calculations. The development of specific metal-ligand force field parameters consists of determining these constants within the framework of the algebraic expression of the force field being used and is commonly achieved via two main methods. The first method utilizes a traditional optimization algorithm such as the Newton-Raphson or Simplex techniques while the second, and probably most promising method, utilizes a Genetic Algorithm (GA).23,26,27 Both methods approach this multidimensional problem from the same starting point in that they attempt to find a common set of parameters which reproduce most accurately the crystallographic X-ray structure of a training set of metal complexes. We have adopted the GA approach of Strassner et al23 within the framework of their automated parameter program FFGenerAtor in this work.

Experimental

Copper ligand complexes were implemented as structures in the MM3* force field using the TINKER28 molecular modeling software running on an SGI Indigo2 workstation. X-ray structures representative of three classes of important copper radiopharmaceuticals (Figure 1) were taken from the Cambridge Structural Database29 (CSD) and manipulated with the program MOLDEN,30 which provides a convenient interface between the CSD and TINKER. The reference data for the Cu(II) parameters involved a total of sixteen copper containing crystal structures, shown in Figures 2 and 3. The first set of ten copper containing crystal structures were predominately azamacrocycle complexes (refcodes CUCJOZ01, SUKGUA, DIRJET, FEKVAS, VOPSAU, DEFJIH, HAFTAJ, JUMMUZ, ZEBJIZ, LEPNOJ) while the second six were predominately bis(thiosemicarbazone) complexes (refcodes BNZSCU01, HOGQUP, VISJIQ, VISNEQ, VISQAP, XOBNIL).

Figure 1.

Figure 1

Classes of Copper (II) chelates studied.

Figure 2.

Figure 2

Reference compounds used to develop the Cu(II)-azamacrocycle parameters.

Figure 3.

Figure 3

Reference compounds used to develop the Cu(II)-bis(thiosemicarbazone) parameters.

From MOLDEN a TINKER input file was generated for each of the copper complexes. At this point a series of new MM3* force field atom types were defined based on details from TINKER’s optimize program which produces a text file consisting of a list of those force field parameters which are missing in the MM3* force field. These new force field parameters included the definition of new MM3* parameters for a four-, five- and six-coordinate copper atom together with definitions for coordinating organic ligands. For example looking at (1,4,7,10-tetramethyl-1,4,7,10-tetra-azacyclododecane)-copper(II) [SUKGUA] a four-coordinate Cu(II) atom type was defined together with a new nitrogen Nsp3 to represent the amines coordinating to the central Cu(II) metal atom. In addition, in the case of the six-coordinate copper azamacrocycles where two of the coordinating groups are carboxylates separate Osp3 parameters were defined in order to represent axial-axial and axial-equatorial configurations.

These force field parameters were optimized using the automated parameter development program FFGenerAtor which utilizes a GA. Following the work of Strassner et al23 a steady state GA using a roulette selector together with a crossover rate of 0.9 and a mutation rate of 0.1 was used for each run. Within the GA the force field parameters were represented by chromosomes of real numbers. Within a population each individual chromosome is constructed of genes that correspond to the missing parameters i.e. bond, angle, torsion and out of plane bending parameters. Initial values for these parameters were taken from an average of the corresponding experimental value following structural analysis of the reference structures from the CSD. Bond lengths were allowed to optimize within a range of ±0.5 Å of this average while angles were allowed to optimize within a range of ±30°. Associated force constants were constrained to a range of 0-250 mydn/Å and 0-30 mydn/(rad2) respectively. Coding of the torsional parameters is more arbitrary because of the shear number of these missing parameters hence the need to impose flexible boundaries for these terms in order to allow the GA to optimize fully. As a result torsional angles were allowed to optimize within a range -180 and 180 and their associated constants were constrained to a range of ±200 kcal/mol.

Initial guesses for the coded parameters from the GA were fed into the force field making it now possible to minimize each starting X-ray structure using TINKER’s optimize program. Once minimized TINKER’s superpose was used to calculate the root-mean-squared deviation (rmsd), most commonly in massweighted coordinates, between the two structures. The rmsd is then used to generate a fitness value for each newly developed parameter set which in turn guides the GA. The cycle is repeated for a number of GA cycles which we set to 750 generations. Electrostatic point charges were added to the final force field parameters following the procedure of Bayly et al and the programs RESP and Jaguar 5.0.31,32

The end result is a set of newly defined force field parameters whose fitness to both the reference set of structures and a blind validation set of structures can be evaluated based on the average rmsd; the goal being to produce a set of robust parameters which accurately reproduce the X-ray structure of a diverse set of related structures with an average rmsd below 0.400 being indicative of accuracy.

Parameter Testing

Seven crystal structures were chosen as a validation set of the Cu(II) azamacrocycle parameters (Figure 4) CEVMAR, LEWCOF, HAFSUC, PIMFEW, POTPUJ, VALVUZ and ZALFUN. These independent structures were minimized with the developed parameter set and compared to the X-ray crystal structure as an accuracy test. In the same way two crystal structures were chosen as a validation set of the Cu(II) bis(thiosemicarbazone) parameters (Figure 5) WUTHEY and WUTHIC. It is important to note that these structures serve as an independent test of the parameters as they were not utilized in developing the parameters.

Figure 4.

Figure 4

Validation complexes for the Cu(II)-azamacrocycle parameters.

Figure 5.

Figure 5

Validation complexes for the Cu(II)-bis(thiosemicarbazone) parameters.

At this point the newly developed MM3* Cu(II) force field parameters from both the copper(II) azamacrocycles and copper(II) bis(thiosemicarbazones) were combined to form a complete MM3* force field parameter file. Exhaustive testing was carried out by reproducing the rmsd for all structures in the reference data sets with the combined parameters to ensure no conflicts.

QSAR Modeling

For the QSAR modeling a total of forty-three structures, representative of the three classes of important copper radiopharmaceuticals (Figure 1), with their experimental logPow’s were constructed in the program MOE.33 Eighteen copper(II) bis(thiosemicarbazone) complexes were taken from John et al.,34 a further nine from Dearling et al.,[Dearling, 2002 #35] and a set of four copper(II) thiosemicarbazone complexes were taken from Easmon et al.36 A total of eight copper(II) regioselectively N-substituted derivatives of cyclen were taken from Yoo et al.37 Finally the cross-bridged analogues of copper(II) 1,4,7,10-tetraazacyclododecane-1,4,8,11-tetracetic acid (DOTA) and copper (II) 1-4-8-11-tetraazacyclotetradecane-1,4,8,11-tetracetic acid (TETA) CBDO2A and CBTE2A respectively were taken from Boswell et al38 and Wong et al.39 Where an experimental structure was not available a related analogue was used as a template. For example the X-ray structure of XOBNIL was utilized as the template for constructing models of the copper(II) bis(thiosemicarbazone) complexes etc. These structures were then minimized using the developed MM3* force field parameters and TINKER’s optimize program. Where necessary the conformational space of the functional groups of any of copper complexes was sampled using TINKER’s scan program. Once a viable model of each of the forty-three structures was obtained these structures were passed back to MOE for QSAR modeling. Two dimensional schematics taken directly from MOE of all these structures are included in Tables 5 and 6.

Table 5.

Comparison of Calculated and Experimental Lipophilicity (logPow) of Copper(II) Complexes in QSAR Training Set

Compound Structure LogPexp logPcalc Residual
ETS graphic file with name nihms-60559-t0006.jpg 1.35 1.27 -0.08
KTS graphic file with name nihms-60559-t0007.jpg 1.71 1.44 -0.27
n-PrTS graphic file with name nihms-60559-t0008.jpg 1.78 1.81 0.03
i-PrTS graphic file with name nihms-60559-t0009.jpg 1.82 2.04 0.22
n-BuTS graphic file with name nihms-60559-t0010.jpg 2.30 2.35 0.05
PhTS graphic file with name nihms-60559-t0011.jpg 2.37 2.27 -0.10
PTSM graphic file with name nihms-60559-t0012.jpg 1.92 1.73 -0.19
n-PrTSM graphic file with name nihms-60559-t0013.jpg 2.86 2.60 -0.26
i-PrTSM graphic file with name nihms-60559-t0014.jpg 2.77 2.83 0.06
n-BuTSM graphic file with name nihms-60559-t0015.jpg 3.16 3.13 -0.03
PhTSM graphic file with name nihms-60559-t0016.jpg 3.06 2.91 -0.15
PTSE graphic file with name nihms-60559-t0017.jpg 2.76 2.49 -0.27
PTSM2 graphic file with name nihms-60559-t0018.jpg 3.18 3.02 -0.16
ETSM graphic file with name nihms-60559-t0019.jpg 2.65 2.05 -0.60
KTSM2 graphic file with name nihms-60559-t0020.jpg 3.46 3.50 0.04
ATSM graphic file with name nihms-60559-t0021.jpg 1.48 2.10 0.62
ATS graphic file with name nihms-60559-t0022.jpg 0.65 1.32 0.67
CTS graphic file with name nihms-60559-t0023.jpg 2.34 1.74 -0.60
DTSM graphic file with name nihms-60559-t0024.jpg 2.34 2.97 0.63
GTSM graphic file with name nihms-60559-t0025.jpg 0.84 1.16 0.32
PTS graphic file with name nihms-60559-t0026.jpg 1.96 2.2 0.25
EPH142 graphic file with name nihms-60559-t0027.jpg 1.72 1.67 -0.05
EPH144 graphic file with name nihms-60559-t0028.jpg 1.92 1.98 0.06
tetraBBC graphic file with name nihms-60559-t0029.jpg 2.36 2.27 -0.09
bis14BBC graphic file with name nihms-60559-t0030.jpg 2.62 2.70 0.08
bis17BBC graphic file with name nihms-60559-t0031.jpg 3.19 2.70 -0.49
BBC4710trisAA graphic file with name nihms-60559-t0032.jpg 1.27 0.45 -0.82
trisBBC10AA graphic file with name nihms-60559-t0033.jpg 2.73 2.86 0.13
bis14BBC710bisAA graphic file with name nihms-60559-t0034.jpg 1.77 2.68 0.91
DOTA graphic file with name nihms-60559-t0035.jpg -3.24 -2.85 0.39
TETA graphic file with name nihms-60559-t0036.jpg -1.67 -1.91 -0.24
CBTE2A graphic file with name nihms-60559-t0037.jpg -2.29 -2.17 0.12

LogPexp refers to the LogP10 of the experimental n-Octanol/Water partition coefficient, LogPcalc refers to the QSAR LogP10 of the calculated n-Octanol/Water partition coefficient and Residual refers to the difference between the predicted and measured values.

Table 6.

Comparison of Calculated and Experimental Lipophilicity logPow’s of Copper(II) Complexes in QSAR Test Set

Structure LogPexp LogPcalc Residual
PTS graphic file with name nihms-60559-t0038.jpg 0.76 0.98 0.22
KTSM graphic file with name nihms-60559-t0039.jpg 2.64 2.23 -0.41
ETSM2 graphic file with name nihms-60559-t0040.jpg 3.32 3.34 0.02
CTSM graphic file with name nihms-60559-t0041.jpg 2.69 2.53 -0.16
DTS graphic file with name nihms-60559-t0042.jpg 1.69 2.18 0.49
GTS graphic file with name nihms-60559-t0043.jpg 0.45 0.47 0.02
EPH143 graphic file with name nihms-60559-t0044.jpg 1.33 2.51 1.18
EPH270 graphic file with name nihms-60559-t0045.jpg 1.34 1.30 -0.04
tris147BBC graphic file with name nihms-60559-t0046.jpg 2.73 3.56 0.83
bis17BBC410bisAA graphic file with name nihms-60559-t0047.jpg 2.28 2.68 0.40
CBDO2A graphic file with name nihms-60559-t0048.jpg -2.42 -3.15 -0.73

LogPexp refers to the LogP10 of the experimental n-Octanol/Water partition coefficient, LogPcalc refers to the QSAR LogP10 of the calculated n-Octanol/Water partition coefficient and Residual refers to the difference between the predicted and measured values.

Within the program MOE one hundred and seventy molecular descriptors were calculated for each of the forty-three structures. These molecular descriptors attempt to describe basic concepts such as shape, size, internal charge distribution and lipophilicity through a myriad of terms which when combined can be used to construct a quantitative relationship to an experimental property such as biological activity.18 For the logPow QSAR a test set of eleven structures was selected at random from the complete set of forty-three compounds leaving a training set of thirty-two structures. The number of molecular descriptors used in the model was drastically reduced by evaluating their individual statistical significance to the experimental data, a process commonly referred to as pruning the descriptors. Similarly QSAR models were constructed to model the experimental biodistribution data of the eighteen copper(II) bis(thiosemicarbazone) complexes from John et al.34 Here because of the small size of the dataset four structures were selected at random for the blind test set leaving a training set of fourteen structures.

In QSAR equations n is the number of observations, r2 is the correlation coefficient utilizing a partial least squares (PLS) analysis, q2 is the cross-validated correlation coefficient utilizing a leave-one-out analysis, RMSE is the root mean square error, SEEcv is the associated cross-validated root mean square error and F is a measure of the statistical significance of the correlation. The q2 coefficient is the traditional measure of the quality of the fit, with a value above 0.50 suggesting a predictive model.

RESULTS AND DISCUSSION

Initially we wanted to investigate whether we could reproduce the accuracy of our previously reported AMBER force field parameters using the approach of Strassner et al.23 As a result we initially used the same Cu(II) reference and validation structures in our initial GA analysis consisting of predominately azamacrocycle complexes as shown in Figures 2 and 4 respectively.40 The massweighted root-mean-squared deviation (rmsd) of each of the structures in the reference azamacrocycle set from both methods are shown in Table 1. As can be seen from Table 1 the GA developed MM3* parameters were able to reproduce the rmsd accuracy of the previously developed AMBER parameters. Indeed overall there is a slight improvement in the accuracy of the force field parameters taking the average rmsd of the entire reference set. If one excludes the azamacrocycles containing carboxylates one can see a significant improvement in the rmsd between the two sets of developed force field parameters also suggesting that it may be more appropriate to treat this class of azamacrocycles as a unique set. Nevertheless when applied to the blind validation set (Table 2) one can see that the level of accuracy is maintained. As a result it is clear to see that it is possible to develop robust and accurate metal-ligand parameters regardless of the force field used or the method used to develop the parameters, be it a traditional minimization algorithm or a GA.

Table 1.

Comparison of Calculated MM3 Structure to X-ray for Cu(II)-azamacrocycle Reference Set and Previously Reported AMBER Calculated Structure

REFCODE rmsd (Å)
AMBER (NewtonRaphson/Simplex)
MM3 (GA)
CUCJOZ01 0.386 0.194
SUKGUA 0.267 0.191
DIRJET 0.155 0.190
FEKVAS 0.329 0.344
VOPSAU 0.313 0.257
DEFJIH 0.475 0.190
HAFTAJ 0.359 0.220
JUMMUZ 1.640 0.398
ZEBJIZ 0.296 0.394
LEPNOJ 0.275 0.244
Average 0.350 0.272

Table 2.

Comparison of Calculated MM3 Structure to X-ray for Cu(II)-azamacrocycle Validation Set

REFCODE rmsd (Å)
MM3 (GA)
CEVMAR 0.176
HAFSUC 0.162
POTPUJ 0.223
LEWCOF 0.404
PIMFEW 0.288
VALVUZ 0.503
ZALFUN 0.163
Average 0.274

Having successfully developed parameters for the copper(II) azamacrocycles we moved on to developing parameters for the copper(II) bis(thiosemicarbazones). The results from the MM3* parameter development for the six X-ray structures (Figure 3) are shown in Table 3. Although a smaller data set, hence requiring a smaller number of interdependent parameters to be developed, the average rmsd of both the reference set and the blind validation set (Figure 5) of structures exhibit excellent accuracy. This is in spite of the diversity of the functional groups of the metal-ligand scaffold in these copper(II) bis(thiosemicarbazones).

Table 3.

Comparison of Calculated MM3 Structure to X-ray for Cu(II)-bis(thiosemicarbazone) Reference and Validation Set

REFCODE rmsd (Å)
MM3 (GA)
BNZSCU01 0.330
HOGQUP 0.221
VISJIQ 0.274
VISNEQ 0.206
VISQAP 0.244
XOBNIL 0.053

Average 0.221

WUTHEY 0.441
WUTHIC 0.189
Average 0.315

QSAR Modeling of logPow and Biodistribution

In order to illustrate the utility of the development of new metal-ligand force field parameters we endeavored to use our new MM3* parameters to build QSAR models of important experimental biological data. The biological data utilized for these studies consisted of logPow measurements and the biodistribution of a subset of the copper(II) bis(thiosemicarbazone) ligands in animal models. As discussed logPow is an important indicator of a compounds ability to permeate through human tissue particularly its ability to pass the blood-brain barrier and exhibit cerebral activity.41 While logPow can give an indication of how a compound might act in vivo, biodistribution studies on animals, most commonly rats, relate the accumulation and clearance of a compound at different time-points directly. Most commonly the retention of the compound in the blood, brain, heart, kidney, liver and lungs at different times is measured thus mapping the distribution of the compound.34 The ratio of the mass of compound found in the blood over that found in the kidney is used to define the blood clearance via urinary excretion, while the ratio of the mass of compound found in the blood over that found in the liver describes blood clearance via hepatobiliary excretion.

The results from the logPow QSAR analysis for both the training set of thirty-two copper radiopharmaceuticals and the test set of eleven copper radiopharmaceuticals are shown in Table 4. The cross-validated correlation coefficient, q2, of 0.92 and small cross-validated error, RMSEcv, indicate a predictive model exceeding the 0.50 benchmark. This conclusion is supported by the performance of the same QSAR in predicting the logPow’s of the eleven compounds in the blind test set. Here the corresponding q2 = 0.91 is also well above the benchmark 0.50. The corresponding QSAR equation is given below:

LogPCalc=4.674+(0.039PEOE_VSA+0)(0.016PEOE_VSA0)+(0.052PEOE_VSA_NEG)+(0.030SlogP_VSA0)(6.677b_1rotR)(0.002weinerPath) Eq. 1

Table 4.

Results from QSAR model of the Lipophilicity (logPow) of the Copper(II) Complexes

QSAR r2 q2 RMSE RMSEcv nmols ndescriptors
logPow Model (Training set) 0.95 0.92 0.35 0.35 32 6
logPow Model (Test set) 0.93 0.91 0.41 0.47 11 6

r2 refers to non-validated correlation, q2 to the cross-validated correlation, RMSE refers to the root mean square error and RMSECV refers to the cross-validated root mean square error.

Three of the predictive descriptors (PEOE) are based on the partial charge analysis of Gasteiger et al.42 The VSA extension refers to the fitting of the calculated partial charges to the van der Waals surface of the molecule while the further extensions i.e. +0, -0 and refer to a summation of the surface area where the partial charge falls between 0.00 - 0.05 for the + label and -0.05 - 0.00 for the - label. The extension NEG refers to the summation of the van der waals surface where the PEOE charge is negative. The SlogP_VSA0 descriptor is based on the logPow prediction by Crippen et al which attempts to predict the experimental logPow of a molecule based on atomic contributions.43 The VSA0 label refers to the summation of the van der Waals surface where SlogP is less then or equal to -0.40. The remaining two molecular descriptors are b_1rotR which is the fraction of rotatable single bonds and weinerPath which is a summation of the entries in a matrix which describes specific internal distance metrics.44 In conclusion our correlation to the experimental logPow is based on a relationship of the van der Waals surface at specific points where particular terms, charge and SlogP, are satisfied together with the fraction of rotatable single bonds and an internal description of distance metrics.

Similarly Tables 5 (Training set) and 6 (Test set) present the results from the biodistribution QSAR analysis eighteen copper(II) bis(thiosemicarbazone) complexes from John et al. Here it was consistently feasible to construct QSAR models exhibiting cross-validated correlations of well above the benchmark 0.50 utilizing a total of eight molecular descriptors. Indeed F numbers above one thousand signified statistically significant QSAR models no doubt aided by the small size of the training set. However, here is a clear example of why the statistical indicators r2, q2 and F are not reliable indicators of the external predictive ability of any model; particularly one with few observations, since upon analysis of the test sets a range of r2 and q2 are observed (Table 8). Despite this overall observation our QSAR models perform generally very well when exposed to the blind test sets. Indeed we are able to achieve q2’s for these test sets above 0.50 four out of six times. The poorest performance comes from the prediction of the kidney data at the 1min and 2hour time points exhibiting q2 = 0.20 and 0.45 respectively when looking at the test set data. The corresponding QSAR equations are given below:

Table 8.

Analysis of Results from QSAR Models of the Biodistribution of the Copper(II) bis(thiosemicarbazones) in the Test Set

QSAR Model r2 q2 RMSE RMSEcv nmols
Liver
Log10(Blood/Liver) @1min 1.00 0.97 0.01 0.03 4
Log10(Blood/Liver) @5min 0.99 0.98 0.02 0.03 4
Log10(Blood/Liver) @2hr 0.02 0.67 0.11 0.20 4
Kidney
Log10(Blood/Kidney) @1min 0.34 0.20 0.06 0.16 4
Log10(Blood/Kidney) @5min 0.26 0.93 0.08 0.19 4
Log10(Blood/Kidney) @2hr 0.01 0.45 0.01 0.01 4

Log10(Blood/Liver) refers to Log10 {(%ID/g Blood) / (%ID/g Liver)}, Log10(Blood/Kidney) refers to Log10 {(%ID/g Blood) / (%ID/g Kidney)}, r2 refers to non-validated correlation, q2 to the cross-validated correlation, RMSE refers to the root mean square error and RMSECV refers to the cross-validated root mean square error.

QSAR Equations (Liver)

Log10(BloodLiver)@1min=5.516+(0.021ASA_P)(30.678Q_RPC)(0.010SMR_VSA7)+(0.013SlogP_VSA0)(0.718a_ICM)+(3.283balabanJ)(0.235petitjeanSC)(0.783std_dim3) Eq. 2
Log10(BloodLiver)@5min=13.656+(0.001ASA_H)+(0.001CASA+)+(3.010VDistMa)+(0.044a_count)(0.283a_heavy)(0.831chi0v)+(2.315density)(0.086std_dim1) Eq. 3
Log10(BloodLiver)@2hr=1.246(0.256Kier2)+(2.202Kier3)+(0.020SMR_VSA5)(0.103a_count)(0.106radius)(0.002weinerPath)+(0.223weinerPol)(0.062zagreb) Eq. 4

QSAR Equations (Kidney)

Log10(BloodKidney)@1min=5.634(0.004CASA+)(2.848FASA)(1.203Kier1)(38.459Q_RPC)+(0.009SlogP_VSA3)(0.013SlogP_VSA7)+(2.334mr)+(1.222std_dim2) Eq. 5
Log10(BloodKidney)@5min=26.965+(0.004CASA+)(0.394KierFlex)+(6.745PC)(33.304PEOE_RPC+)(56.195PEOE_RPC)+(0.008SMR_VSA4)+(0.033radius)(0.031weinerPol) Eq. 6
Log10(BloodKidney)@2hr=0.276+(0.002ASA+)+(0.423KierA1)(1.797PEOE_VSA_FPNEG)+(1.193SlogP)(0.295apol)(0.574chi0_C)+(0.313petitjeanSC)+(0.311weinerPol) Eq. 7

Given the number of descriptors used a full description of each one is not efficient suffice to say that they encapsulate aspects of the accessible van der Waals surface, functionality and charge distribution of the copper(II) bis(thiosemicarbazone) complexes used in the training set.

Conclusions

This work illustrates the effectiveness of the application of a genetic algorithm to the method of developing metal ligand parameters for Allinger’s MM3* force field. The development of new metal ligand parameters provides an important tool for the researcher in the fields of bioinorganic chemistry and nuclear medicine. The availability of specific metal ligand parameters for these important radiopharmaceuticals provides the researcher with a timely opportunity to model their biological properties/action, in doing so aiding the development of radiopharmaceuticals with enhanced therapeutic and imaging characteristics. These new parameters were utilized in the development of Quantitative Structure Activity Relationships to predict firstly the lipophilicity of a series of common copper(II) chelating agents and secondly the biodistribution of eighteen copper(II) bis(thiosemicarbazone) radiometal complexes.

In the case of the logPow QSAR model we were able to predict with excellent accuracy the lipophilicity of several classes of copper(II) complexes. Our biodistribution QSAR models were able to predict with excellent accuracy the clearance of the copper(II) bis(thiosemicarbazones) from the liver based on an analysis of our test set while our QSAR models performed less adequately when predicting the uptake of the copper(II) bis(thiosemicarbazones) in the kidneys. A more rigorous analysis of the application of the QSAR methodology to the prediction of biodistribution would be facilitated by a larger and more diverse dataset, an issue which we hope to address in future work.

Supplementary Material

si20050531_030

Table 7.

Results from Several QSAR Models of the Biodistribution of the Copper(II) bis(thiosemicarbazones) in the Training Set

QSAR Model r2 q2 RMSE RMSEcv nmols F
Liver
Log10(Blood/Liver) @1min 1.00 1.00 0.01 0.01 14 4273
Log10(Blood/Liver) @5min 1.00 1.00 0.01 0.01 14 6879
Log10(Blood/Liver) @2hr 1.00 1.00 0.01 0.01 14 4067
Kidney
Log10(Blood/Kidney) @1min 1.00 0.97 0.01 0.01 14 2456
Log10(Blood/Kidney) @5min 1.00 1.00 0.01 0.01 14 1069
Log10(Blood/Kidney) @2hr 1.00 0.98 0.01 0.01 14 1193

Log10(Blood/Liver) refers to Log10 {(%ID/g Blood) / (%ID/g Liver)}, Log10(Blood/Kidney) refers to Log10 {(%ID/g Blood) / (%ID/g Kidney)}, r2 refers to non-validated correlation, q2 to the cross-validated correlation, F is a measure of the statistical significance of the model, RMSE refers to the root mean square error and RMSECV refers to the cross-validated root mean square error.

Acknowledgment

This work was supported by the National Institute of Biomedical Imaging and Bioengineering (EB00340) and the United States Department of Energy (DEFG02-087ER-60512).

REFERENCES

  • (1).Blower PJ, Lewis JS, Zweit J. Copper radionuclides and radiopharmaceuticals in nuclear medicine. Nucl. Med. Biol. 1996;23:957–980. doi: 10.1016/s0969-8051(96)00130-8. [DOI] [PubMed] [Google Scholar]
  • (2).Reichert DE, Lewis JS, Anderson CJ. Metal Complexes as Diagnostic Tools. Coord. Chem. Rev. 1999;184:3–66. [Google Scholar]
  • (3).Moi MK, Meares CF, McCall MJ, Cole WC, DeNardo SJ. Copper chelates as probes of biological systems: stable copper complexes with a macrocyclic bifunctional chelating agent. Anal. Biochem. 1985;148:249–253. doi: 10.1016/0003-2697(85)90653-0. [DOI] [PubMed] [Google Scholar]
  • (4).Cole WC, DeNardo SJ, Meares CF, McCall MJ, DeNardo GL, Epstein AL, O’Brien HA, Moi MK. Comparative serum stability of radiochelates for antibody radiopharmaceuticals. J. Nucl. Med. 1987;28:83–90. [PubMed] [Google Scholar]
  • (5).Anderson CJ, Pajeau TS, Edwards WB, Sherman EL, Rogers BE, Welch MJ. In vitro and in vivo evaluation of copper-64-octreotide conjugates. J. Nucl. Med. 1995;36:2315–2325. [PubMed] [Google Scholar]
  • (6).Anderson CJ, Dehdashti F, Cutler PD, Schwarz SW, Laforest R, Bass LA, Lewis JS, McCarthy DW. 64Cu-TETA-octreotide64Cu-TETA-octreotide as a PET imaging agent for patients with neuroendocrine tumors. J. Nucl. Med. 2001;42:213–221. [PubMed] [Google Scholar]
  • (7).Green MA. A Potential Copper Radiopharmaceutical for Imaging the Heart and Brain: Copper-Labeled Pyruvaldehyde Bis(N4-methylthiosemicarbazone) Nucl. Med. Biol. 1987;14:59–61. doi: 10.1016/0883-2897(87)90162-0. [DOI] [PubMed] [Google Scholar]
  • (8).Fujibayashi Y, Taniuchi H, Yonekura Y, Ohtani H, Konishi J, Yokoyama A. Copper-62-ATSM: a new hypoxia imaging agent with high membrane permeability and low redox potential. J. Nucl. Med. 1997;38:1155–1160. [PubMed] [Google Scholar]
  • (9).Dearling JLJ, Lewis JS, McCarthy DW, Welch MJ, Blower PJ. Redox-active metal complexes for imaging hypoxic tissues: Structure-activity relationships in copper(II) bis(thiosemicarbazone) complexes. J. Chem. Soc., Chem. Commun. 1998:2531–2532. [Google Scholar]
  • (10).Easmon J, Pürstinger G, Heinisch G, Roth T, Fiebig HH, Holzer W, Jäger W, Jenny M, Hofmann J. Synthesis, Cytotoxicity, and Antitumor Activity of Copper(II) and Iron(II) Complexes of 4N-Azabicyclo[3.2.2]nonane Thiosemicarbazones Derived from Acyl Diazines. J. Med. Chem. 2001;44:2164–2171. doi: 10.1021/jm000979z. [DOI] [PubMed] [Google Scholar]
  • (11).Lipinski CA, Lombardo F, Dominy BW, Feeney PJ. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Advanced Drug Delivery Reviews. 1997;23:3–25. doi: 10.1016/s0169-409x(00)00129-0. [DOI] [PubMed] [Google Scholar]
  • (12).Andrews PR, Craik DJ, Martin JL. Functional group contributions to drug-receptor interactions. J. Med. Chem. 1984;27:1648–1657. doi: 10.1021/jm00378a021. [DOI] [PubMed] [Google Scholar]
  • (13).Hansch C, Maloney PP, Fujita T, Muir RM. Correlation of biological activity of phenoxyacetic acids with Hammett substituent constants and partition coefficients. Nature. 1962;194:178. [Google Scholar]
  • (14).Hansch C, Leo A. Exploring QSAR Fundamentals and Applications in Chemistry and Biology. American Chemical Society; Washington DC: 1995. p. 557. [Google Scholar]
  • (15).Agarwal A, Pearson PP, Taylor EW, Li HB, Dahlgren T, Herslöf M, Yang Y, Lambert G, Nelson DL, Regan JW, Martin AR. Three-Dimensional Quantitative Structure-Activity Relationships of 5-HT Receptor Binding Data for Tetrahydropyridinylindole Derivatives: A Comparison of the Hansch and CoMFA Methods. J. Med. Chem. 1993;36:4006–4014. doi: 10.1021/jm00077a003. [DOI] [PubMed] [Google Scholar]
  • (16).Hansch C, Leo A, Mekapati SB, Kurup A. QSAR and ADME. Bioorg. Med. Chem. 2004;12:3391–3400. doi: 10.1016/j.bmc.2003.11.037. [DOI] [PubMed] [Google Scholar]
  • (17).Klein C, Kaiser D, Kopp S, Chiba P, Ecker GF. Similarity based SAR (SIBAR) as tool for early ADME profiling. J. Comp.-Aided Mol. Des. 2003;16:785–793. doi: 10.1023/a:1023828527638. [DOI] [PubMed] [Google Scholar]
  • (18).Garg R, Kurup A, Mekapati SB, Hansch C. Cyclooxygenase (COX) Inhibitors: A Comparative QSAR Study. Chem. Rev. 2003;103:703–731. doi: 10.1021/cr020464a. [DOI] [PubMed] [Google Scholar]
  • (19).Eros D, Kovesdi I, Orfi L, Takacs-Novak K, Acsady G, Keri G. Reliability of logP predictions based on calculated molecular descriptors: a critical review. Curr. Med. Chem. 2002;9:1819–1829. doi: 10.2174/0929867023369042. [DOI] [PubMed] [Google Scholar]
  • (20).Allinger NL, Yuh YH, Lii JH. Molecular mechanics. The MM3 force field for hydrocarbons. J. Am. Chem. Soc. 1989;111:8551–8566. [Google Scholar]
  • (21).Gundertofte K, Liljefors T, Norrby P-O, Pettersson I. A comparison of conformational energies calculated by several molecular mechanics methods. J. Comput. Chem. 1996;17:429–449. [Google Scholar]
  • (22).Brandt P, Norrby T, Åkermark B, Norrby P-O. Molecular Mechanics (MM3*) Parameters for Ruthenium(II)-Polypyridyl Complexes. Inorg. Chem. 1998;37:4120–4127. doi: 10.1021/ic980021i. [DOI] [PubMed] [Google Scholar]
  • (23).Strassner T, Busold M, Herrmann WA. MM3 parametrization of four- and five-coordinated rhenium complexes by a genetic algorithm--which factors influence the optimization performance? J. Comput. Chem. 2002;23:282–290. doi: 10.1002/jcc.10000. [DOI] [PubMed] [Google Scholar]
  • (24).Hagelin H, Svensson M, Aakermark B, Norrby P-O. Molecular Mechanics (MM3) Force Field Parameters for Calculations on Palladium Olefin Complexes with Phosphorus Ligands. Organometallics. 1999;18:4574–4583. [Google Scholar]
  • (25).Hay BP, Clement O, Sandrone G, Dixon DA. A Molecular Mechanics (MM3(96)) Force Field for Metal-Amide Complexes. Inorg. Chem. 1998;37:5887–5894. [Google Scholar]
  • (26).Norrby P-O, Brandt P. Deriving Force Field Parameters for Coordination Complexes. Coord. Chem. Rev. 2001;212:79–109. [Google Scholar]
  • (27).Cundari TR, Fu W. Genetic algorithm optimization of a molecular mechanics force field for technetium. Inorg. Chim. Acta. 2000;300-302:113–124. [Google Scholar]
  • (28).Ponder JW. TINKER4.2. 2004. [Google Scholar]
  • (29).Allen FH, Kennard O. 3D Search and Research Using the Cambridge Structural Database. Chemical Design Automation News. 1993;8:31–37. [Google Scholar]
  • (30).Schaftenaar G, Noordik JH. Molden: a pre- and post-processing program for molecular and electronic structures. J. Comp.-Aided Mol. Des. 2000;14:123–134. doi: 10.1023/a:1008193805436. [DOI] [PubMed] [Google Scholar]
  • (31).Bayly CI, Cieplak P, Cornell WD, Kollman PA. A Well-Behaved Electrostatic Potential Based Method Using Charge Restraints for Deriving Atomic Charges: The RESP Model. J. Phys. Chem. 1993;97:10269–10280. [Google Scholar]
  • (32).Jaguar; 5.0 ed. Schrödinger, Inc.; Portland, OR: [Google Scholar]
  • (33).Molecular Operating Environment. Chemical Computing Group Inc.; Quebec, Canada: 19972004. [Google Scholar]
  • (34).John EK, Green MA. Structure-activity relationships for metal-labeled blood flow tracers: comparison of keto aldehyde bis(thiosemicarbazonato)copper(II) derivatives. J. Med. Chem. 1990;33:1764–1770. doi: 10.1021/jm00168a035. [DOI] [PubMed] [Google Scholar]
  • (35).Dearling JLJ, Lewis JS, Mullen GED, Welch MJ, Blower PJ. Copper bis(thiosemicarbazone) complexes as hypoxia imaging agents: structure-activity relationships. J. Biol. Inorg. Chem. 2002;7:249–259. doi: 10.1007/s007750100291. [DOI] [PubMed] [Google Scholar]
  • (36).Easmon J, Heinisch G, Holzer W, Rosenwirth B. Pyridazines. 63. Novel thiosemicarbazones derived from formyl- and acyldiazines: synthesis, effects on cell proliferation, and synergism with antiviral agents. J. Med. Chem. 1992;35:3288–3296. doi: 10.1021/jm00095a027. [DOI] [PubMed] [Google Scholar]
  • (37).Yoo J, Reichert DE, Welch MJ. Comparative in Vivo Behavior Studies of Cyclen-Based Copper-64 Complexes: Regioselective Synthesis, X-ray Structure, Radiochemistry, log P, and Biodistribution. J. Med. Chem. 2004;47:6625–6637. doi: 10.1021/jm0496990. [DOI] [PubMed] [Google Scholar]
  • (38).Boswell CA, Sun X, Niu W, Weisman GR, Wong EH, Rheingold AL, Anderson CJ. Comparative in Vivo Stability of Copper-64-Labeled Cross-Bridged and Conventional Tetraazamacrocyclic Complexes. J. Med. Chem. 2004;47:1465–1474. doi: 10.1021/jm030383m. [DOI] [PubMed] [Google Scholar]
  • (39).Wong EH, Weisman GR, Hill DC, Reed DP, Rogers ME, Condon JS, Fagan MA, Calabrese JC, Lam K-C, Guzei IA, Rheingold AL. Synthesis and Characterization of Cross-Bridged Cyclams and Pendant-Armed Derivatives and Structural Studies of Their Copper(II) Complexes. J. Am. Chem. Soc. 2000;122:10561–10572. [Google Scholar]
  • (40).Reichert DE, Norrby P-O, Welch MJ. Molecular Modeling of Bifunctional Chelate Peptide Conjugates. 1. Copper and Indium Parameters for the AMBER Force Field. Inorg. Chem. 2001;40:5223–5230. doi: 10.1021/ic0012118. [DOI] [PubMed] [Google Scholar]
  • (41).Levin VA. Relationship of Octanol/Water Partition Coefficient and Molecular Wieght to Rat Brain Capillary Permeability. J. Med. Chem. 1980;23:682–684. doi: 10.1021/jm00180a022. [DOI] [PubMed] [Google Scholar]
  • (42).Gasteiger J, Marsili M. Iterative Partial Equalization of Orbital Electronegativity - A Rapid Access to Atomic Charges. Tetrahedron. 1980;36:3219–3288. [Google Scholar]
  • (43).Wildman SA, Crippen GM. Prediction of Physicochemical Parameters by Atomic Contributions. J. Chem. Inf. Comput. Sci. 1999;39:868–873. [Google Scholar]
  • (44).Balaban AT. Chemical graphs. XXXIV. Five new topological indexes for the branching of tree-like graphs. Theor. Chim. Acta. 1979;53:355–375. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

si20050531_030

RESOURCES