Skip to main content
Molecules logoLink to Molecules
. 2018 Sep 13;23(9):2348. doi: 10.3390/molecules23092348

QSAR Study of N-Myristoyltransferase Inhibitors of Antimalarial Agents

Letícia Santos-Garcia 1, Marco Antônio de Mecenas Filho 1, Kamil Musilek 2,3, Kamil Kuca 2,*, Teodorico Castro Ramalho 1,4, Elaine Fontes Ferreira da Cunha 1,*
PMCID: PMC6225221  PMID: 30217086

Abstract

Malaria is a disease caused by protozoan parasites of the genus Plasmodium that affects millions of people worldwide. In recent years there have been parasite resistances to several drugs, including the first-line antimalarial treatment. With the aim of proposing new drugs candidates for the treatment of disease, Quantitative Structure–Activity Relationship (QSAR) methodology was applied to 83 N-myristoyltransferase inhibitors, synthesized by Leatherbarrow et al. The QSAR models were developed using 63 compounds, the training set, and externally validated using 20 compounds, the test set. Ten different alignments for the two test sets were tested and the models were generated by the technique that combines genetic algorithms and partial least squares. The best model shows r2 = 0.757, q2adjusted = 0.634, R2pred = 0.746, R2m = 0.716, ∆R2m = 0.133, R2p = 0.609, and R2r = 0.110. This work suggested a good correlation with the experimental results and allows the design of new potent N-myristoyltransferase inhibitors.

Keywords: malaria, N-myristoyltransferase, drug development, QSAR, mosquito-borne protozoal infection

1. Introduction

Malaria is a mosquito-borne protozoal infection caused by five parasites of the genus Plasmodium: P. falciparum, P. ovale, P. malariae, P. vivax, and P. knowlesi [1]. Among these, Plasmodium falciparum is the most prevalent and lethal [2]. Over the past 50 years, the parasite resistance to chloroquine and sulphadoxine-pyrimethamine in endemic countries has been noted [3,4]. In addition, signs of resistance to artemisinin-based combination therapies (ACTs) have been detected. Actually, ACTs are the first-line treatment for malaria and thus, new drugs are constantly required [5,6,7,8]. Researchers have identified the N-myristoyltransferase (NMT) enzyme as an important target for a generation of drugs to be used for the treatment of malaria. NMT catalyzes the transfer of the myristoyl group from a myristoyl coenzyme A (CoA) to the N-terminal glycine residue after the targeted protein [9]. N-terminal myristoylation (MYR) by NMT occurs through the Bi-Bi mechanism, where MYR-CoA binds to the apo-enzyme, inducing a conformational change that allows the NMT substrate to bind [5,10]. P. falciparum has a single NMT isoform and mRNA is expressed in asexual blood-stage forms [11]. The first reported NMT inhibitors were obtained by mimicking the structure of peptide substrates (Figure 1A) [12] or by designing non hydrolysable, methylene-bridged analogue of myristoyl coenzyme A (Figure 1B) [13]. After that, inhibitors based on a quinolone scaffold and furan core were reported (Figure 1C,D) [14]. In this work, we used dimensional quantitative structure-activity relationship analysis of 83 NMT inhibitors based on a phenyl scaffold [15] seeking to propose new candidates for NMT inhibitors. Furthermore, a physicochemical properties evaluation was performed in order to find the most appropriate compound predicted.

Figure 1.

Figure 1

(A) Inhibitor mimicking the structure of substrates (Ki = 5.0, 8.0, and 35.0 μM for S. cerevisiae, C. albicans, and human NMT) [12]; (B) inhibitor methylene-bridged analogue of myristoyl coenzyme A (Ki = 24.0 nM) [13]; (C) inhibitor based on a quinolone scaffold (Ki = 4.7, and >100 μM for Plasmodium vivax and Plasmodium falciparum NMT, respectively); and (D) inhibitor based on a furan core RO-09-4609 (IC50 = 0.1 and >540 μM for C. albicans, and human NMT) [14].

2. Results

The GA-PLS analysis using grid cells of 1.0 A generated several models or equations. The statistical parameters of ten alignments studied for Test Set I (compounds 1, 3, 5, 6, 12, 16, 20, 30, 33, 39, 40, 50, 56, 57, 61, 65, 66, 69, 76, and 80) and Test Set II (compounds 3, 6, 9, 13, 20, 21, 27, 28, 31, 32, 40, 56, 57, 58, 64, 70, 73, 76, 78, and 82) are shown in Table 1 and Table 2, respectively. All tested alignments showed q2 values higher than 0.5. This reveals that the model can be a useful tool for predicting affinities of new compounds based on these structures; r2 greater than 0.7 indicates that the model is correlated and may be considered to represent the training set in the same manner [16]. Alignments 6B and 7B were eliminated from the analysis because it presented a low r2 value (<0.7).

Table 1.

Statistical parameters evaluated in the 4D-QSAR analysis for the ten performed alignments of Test I.

Alignment r2 RMSEC q2adj RMSECV R2Pred RMSEP r2m R2r R2p
A1 0.746 0.481 0.607 0.549 0.532 0.65 0.71 0.312 0.82
A2 0.744 0.478 0.608 0.548 0.548 0.663 0.692 0.343 0.799
A3 0.761 0.469 0.609 0.546 0.508 0.702 0.735 0.182 0.994
A4 0.708 0.508 0.576 0.579 0.588 0.595 0.645 0.287 0.825
A5 0.736 0.511 0.589 0.566 0.477 0.698 0.766 0.245 0.895
A6 0.739 0.477 0.582 0.563 0.567 0.637 0.67 0.286 0.83
A7 0.722 0.503 0.584 0.571 0.555 0.656 0.683 0.291 0.831
A8 0.746 0.445 0.605 0.551 0.62 0.59 0.606 0.216 0.891
A9 0.734 0.491 0.578 0.566 0.547 0.684 0.693 0.25 0.861
A10 0.723 0.519 0.583 0.572 0.503 0.676 0.74 0.311 0.816

r2: Coefficient of determination; RMSEc: root mean square deviation of calibration; q2adj: adjusted cross-validated squared correlation coefficient; RMSEcv: root mean square deviation of cross validation; R2pred: correlation coefficient of external validation; RMSEp: root mean square deviation of prediction; r2m(test): Equation (3) (Materials and Methods); R2r: Y-randomization; R2p: Equation (2) (Materials and Methods).

Table 2.

Statistical parameters evaluated in the 4D-QSAR analysis for the ten performed alignments of Test II.

Alignment r2 RMSEC q2adj RMSECV R2Pred RMSEP R2m R2r R2p
B1 0.728 0.504 0.617 0.544 0.728 0.532 0.688 0.301 0.476
B2 0.728 0.515 0.607 0.553 0.763 0.496 0.749 0.289 0.482
B3 0.757 0.472 0.634 0.527 0.746 0.515 0.716 0.11 0.609
B4 0.704 0.549 0.585 0.573 0.782 0.476 0.765 0.253 0.473
B5 0.725 0.5 0.601 0.55 0.706 0.553 0.692 0.198 0.526
B6 0.692 0.559 0.576 0.581 0.771 0.489 0.755 0.272 0.448
B7 0.69 0.556 0.581 0.577 0.751 0.509 0.735 0.289 0.437
B8 0.73 0.514 0.6 0.55 0.77 0.489 0.75 0.209 0.527
B9 0.723 0.528 0.605 0.555 0.786 0.472 0.773 0.229 0.508
B10 0.744 0.501 0.619 0.542 0.779 0.48 0.744 0.289 0.502

r2: coefficient of determination; RMSEc: root mean square deviation of calibration; q2adj: adjusted cross-validated squared correlation coefficient; RMSEcv: root mean square deviation of cross validation; R2pred: correlation coefficient of external validation; RMSEp: root mean square deviation of prediction; r2m(test): Equation (3) (Materials and Methods); R2r: Y-randomization; R2p: Equation (2) (Materials and Methods).

Evaluating the predictive ability in terms of R2pred (means of an external validation), the alignment 5A was eliminated (R2pred < 0.5). All R2m values were greater than 0.60, and values over 0.5 are acceptable. Analyzing the R2p values, alignments B1, B2, B4, B6, and B7 were excluded because this parameter value was less than 0.5 [17].

Alignment 3 from Test Set II (B3) provides the best 4D-QSAR models as judged by the highest q2adj, in addition to presenting fewer descriptors. Among the alignments with only seven descriptors, this still has the highest values of r2, q2adj, R2p, and the lowest value of R2r. According to these results, we selected Model B3 as the best alignment. We will only present the analysis of the best model derived from B3.

The statistical measures, including the values of r2, q2, q2adj, LSE, LOF, RMSEC, RMSECV, RMSEP, Y-Rand, R2pred, R2m, R2p, and R2r are presented below. Each GCOD (grid cell occupancy descriptors) is labeled as “x, y, z, IPE” which represent the cartesian coordinate positions of the selected grid cell (x, y, z) and the respective atom type (interaction pharmacophore elements, IPE): (i) any type (any); (ii) nonpolar (np); (iii) polar-positive charge density (p+); (iv) polar-negative charge density (p−); (v) hydrogen bond acceptor (hba); (vi) hydrogen bond donor (hbd); and (vii) aromatic systems (ar).

Model B3

pIC50 = 3.997 + 4.942(0,−3,−1, hba) + 2.345(0,−5,−1, any) + 2.100(0,−1,0, any) + 1.692(0,3,−3, any) + 1.191(−1,−4,−3, any) ‒ 8.269(−1,−4,−4, np)

n = 63, GCODs = 7, r2 = 0.757, q2 = 0.702, q2adj = 0.634, LSE = 0.233, LOF = 0.418, RMSEC = 0.472, RMSECV = 0.527, RMSEP = 0.515, RMSEcy-rand = 1.055, R2pred = 0.746, R2m = 0.716, R2p = 0.609, and R2r = 0.110.

Another different variant of R2m metrics was calculated from Model B3 to assess the predictive ability of the test set, ∆R2m. The value of ∆R2m found was 0.133. It has been suggested that to be considered a predictive model, this value should be less than 0.2 [18]. Model B3 generated seven descriptors, where GCODs (−1,−4,−3, any), (0−0, any), (0,6,2, any), (0,−5,−1, any), (0,3,−3, any), and (0,−3, −1, hba) present positive coefficients (Equation (3)) and correspond to favorable interactions between the molecule substituent and amino acid residues in the active site of NMT. Therefore, substituents in these positions increase the effectiveness of the compounds. The GCOD (−1,−4,−4, np) has negative coefficient and correspond to unfavorable interactions between the molecule substituent and amino acid residues in the active site of NMT. Therefore, the occupation of GCOD (−1,−4,−4, np) decreases the compound potency.

3. Discussion

GCODs are related to the coordinates of IPE mapped in a common grid. A graphic representation of the descriptors of Model B3 is shown in Figure 2 using Compound 81 as a reference. Light and dark spheres represent GCODs with positive and negative coefficients, respectively, in accordance with Model B3. GCOD-1 (0,−3,−1, hba) (Figure 3) is the descriptor that most contributes to the increased effectiveness of compounds and presents a coefficient of 4.942. This grid cell represents an acceptor hydrogen bond atom type (IPE) and shows high frequency of occupation for compounds 42, 48, 65, 68, and 69. It is located close to the nitrogen atom of the oxadiazole ring and indicates an amino acid donor hydrogen bond in N-myristoyltransferase.

Figure 2.

Figure 2

Graphic representation of Compound 81 according to the 4D-QSAR Model B3. GCODs occupancy represented by white spheres contributes to increasing the potency of compounds, and black spheres to decrease the potency of the compounds. The GCODs described are: (1) (0,−3,−1, hba), (2) (−1,−4,−4, np), (3) (0,−5,−1, any) (4) (0,−1,0, any), (5) (0,3,−3, any) (6) (−1,−4,−3, any), and (7) (0,6,2, any). The gray and red representations are carbons and oxygen atoms.

Figure 3.

Figure 3

(A) Representation of compound 42 and GCOD-1 (0,−3,−1, hba) (white sphere), and (B) compound 81. GCODs occupancy represented by white spheres contributes to increasing the potency of compounds. The gray and red representations are carbons and oxygen atoms.

The oxadiazole ring in the ortho position allows the nitrogen atom to occupy this grid cell, exemplified by compound 42 (Figure 3). However, the most active molecules of the training set, 81 and 83, do not have this descriptor that contributes most to the increase in the potency of the compounds. Once in this position there is a methyl group and the oxadiazole group that are displaced. In fact, the oxadiazole ring in these compounds does not occupy this grid cell.

GCOD-2 (−1,−4,−4, np) (Figure 4) contributes to decrease compound potency and presents a coefficient of −8.269. This grid cell corresponds to a nonpolar IPE and shows high occupation frequency for Compounds 42, 48, and 55. These molecules present non-polar groups in this local, such as ethyl and methyl groups. Thus, the occupation of this cell is drastically reduced when this position is non-polar substituted, that decrease the activity of these compounds. Meanwhile, if the polar groups, such as OH or N in 81 or 83, respectively, are localized in this grid cell, the GCOD-2 descriptor yields less negative value which does not decrease the predicted pIC50. We can see that compound 42 occupies the descriptor that decreases the activity. However, it also presents occupancy for GCOD-1, which increases the activity. As expected, this compound exhibits an intermediate power.

Figure 4.

Figure 4

Representation of GCOD-2 (−1,−4,−4, np) (black sphere) and (A) compound 42, (B) compound 81, and (C) compound 83. CODs occupancy represented by black spheres to decrease the potency of the compounds. The gray and red representations are carbons and oxygen atoms.

This suggests that the occupation of this cell by acceptor hydrogen bond should be favorable in contrast to nonpolar atoms that are unable to perform hydrogen bond interactions.

GCOD-3 (0,−5,−1, any) is present as a non-specific IPE. The atom type “any” is used when more than one specific atom [IPE] type across the training set is found to satisfy the interaction being captured by a particular GCOD [19]. GCOD (0,−5,−1, any) has a positive coefficient of 2.345 which increases the potency of the compounds (Figure 5). This grid cell is close to methyl and ethyl groups, toward the left side, and shows greater occupation frequency for compounds 26, 30, and 33. On the other hand, compounds 42 and 45 showed no occupancy for this descriptor, because during the molecular dynamics simulation, they assumed a different conformation, facing right. On the other hand, GCOD (0,−5,−1, any) also provides occupation frequency in the benzene ring of molecules 59, 60, 61, and 62.

Figure 5.

Figure 5

Representation of GCOD-3 (0,−5,−1, any) (white sphere) and (A) compound 30, (B) compound 42, and (C) compound 60. CODs occupancy represented by white spheres contributes to increasing the potency of compounds. The gray and red representations are carbons and oxygen atoms.

GCOD-4 (0,−1,0, any) have a positive coefficient and, thus, also greatly influence the increase in inhibitor potency (Figure 6). It is located near the methyl group in benzofuran, 2,3-dihidro-3-methyl and represents a non-specific IPE. It shows the highest occupation frequency for the most active compounds in this series, compounds 1, 81, and 83. The GCOD (0,−3,−1, hba) reflects the importance of occupation of this receptor region for the effectiveness of the inhibitors.

Figure 6.

Figure 6

Representation of GCOD-4 (0,−1,0, any) (white sphere) and (A) compound 81, and (B) compound 83. CODs occupancy represented by white spheres contributes to increasing the potency of compounds, and the gray and red representations are carbons and oxygen atoms.

GCOD-5 (0,3,−3, any) have a positive coefficient, and so, improves the effectiveness of inhibitors (Figure 7). The GCOD (0,3,−3, any) is situated near the piperidine ring and has occupancy for a large majority of molecules, such as compounds 65 and 79. The change of position of the piperidine ring in compounds 15, 16, and 30 is not favorable, so these do not have this GCOD.

Figure 7.

Figure 7

Representation of GCOD-5 (0,3,−3, any) (white sphere) and (A) compound 79, and (B) compound 30. CODs occupancy represented by white spheres contributes to increasing the potency of compounds, and the gray and red representations are carbons and oxygen atoms.

GCOD-6 (−1,−4,−3, any) represents a non-specific IPE and also has a positive coefficient, indicating an increase in potency of compounds which have high occupation frequency for this descriptor (Figure 8). The molecules 14, 19, and 22 have a high occupation frequency for this GCOD, located near the benzene group. This grid cell suggests a hydrophobic region in the receptor close to the benzene ring, which should be making a π–π staking interaction between the aromatic ring of an inhibitor and one aromatic amino acid residue.

Figure 8.

Figure 8

Representation of GCOD-6 (−1,−4,−3, any) (white sphere) and (A) compound 14, (B) compound 19, and (C) compound 22. CODs occupancy represented by white spheres contributes to increasing the potency of compounds, and the gray and red representations are carbons and oxygen atoms.

Lastly, GCOD-7 (0,6,2, any) (Figure 9) has a positive coefficient and shows a non-specific class. Molecules 5963 have a high occupation frequency for this descriptor. The presence of naphthalene group in this grid cell increase the activity. In fact, it shows that the aromatic substituents in this position should be preferred.

Figure 9.

Figure 9

Representation of GCOD-7 (0,6,2, any) (white sphere) and (A) compound 62, and (B) compound 59, and the gray and red representations are carbons and oxygen atoms.

In order to find new active structures, the information of the descriptors obtained by the Model B3 was used. Modifications to the structure of compounds 60, 65, and 81 are suggested and compounds AE were proposed.

The proposed compounds CE, exhibited predicted pIC50 higher than 81 (the best experimental compound). The structure of the five compounds and their predicted pIC50 values are shown in Table 3. The ADME (absorption, distribution, metabolism, and excretion) of a drug is an important property that can determine the utilization of molecules proposed in the therapeutic usage. For the evaluation of pharmacokinetic parameters for molecules AE we used the Lipinski’s Rule of Five, where molecular properties are closely related to the oral bioavailability of a drug [20], wherein compounds should not violate more than one rule. In this rule, the compounds should present logP no more than 5, molecular weight of 500 Daltons, number of hydrogen bond acceptors (nON) of 10, number of hydrogen bond donors (nOHNH) of 5, and number of rotatable bonds (nrotb) no more than 10. The proposed molecules have been designed in the Molinspiration Online Property Calculation Software Toolkit [21] to evaluate the criteria discussed above.

Table 3.

Structures of the compounds proposed and the predicted pIC50 values based on Model B3.

No. Structure pIC50 No. Structure pIC50
A graphic file with name molecules-23-02348-i001.jpg 7.014 B graphic file with name molecules-23-02348-i002.jpg 7.171
C graphic file with name molecules-23-02348-i003.jpg 7.622 D graphic file with name molecules-23-02348-i004.jpg 8.161
E graphic file with name molecules-23-02348-i005.jpg 7.894

The Molinspiration Online Property Calculation Software Toolkit [21] was used to analyze drug likeness (Lipinski’s Rule of Five) and the results are shown in Table 4. According to the data in Table 4, no compound violated the Lipinski’s Rule of Five.

Table 4.

Calculated parameters of the Lipinski rule of five for the proposed molecules.

Molecule miLogP MW n ON n OHNH n n violations
A 3.13 514.97 10 1 8 1
B 2.74 430.89 8 1 7 0
C 3.13 415.88 7 1 6 0
D 2.41 356.81 6 1 4 0
E 3.52 415.88 7 1 6 0

4. Materials and Methods

4.1. Biological Data

In order to build QSAR models, 83 Plasmodium falciparum inhibitors were retrieved from Leatherbarrow et al. [15]. Twenty compounds (25%) were randomly selected to compose the test set (external validation). Two test groups were chosen. The first (Test Set I) has the following molecules: 1, 3, 5, 6, 12, 16, 20, 30, 33, 39, 40, 50, 56, 57, 61, 65, 66, 69, 76, and 80; Test Set II has the following molecules: 3, 6, 9, 13, 20, 21, 27, 28, 31, 32, 40, 56, 57, 58, 64, 70, 73, 76, 78, and 82 (Table 5).

Table 5.

Chemical structures and experimental pIC50Exp (M) values of Plasmodium falciparum inhibitors. Test Set I compound numbers are marked with an asterisk. Test Set II compound numbers are underlined.

No. Structure pIC50 No. Structure pIC50
1 * graphic file with name molecules-23-02348-i006.jpg 6.155 2 graphic file with name molecules-23-02348-i007.jpg 4.000
3 * graphic file with name molecules-23-02348-i008.jpg 4.000 4 graphic file with name molecules-23-02348-i009.jpg 4.000
5 * graphic file with name molecules-23-02348-i010.jpg 4.000 6 * graphic file with name molecules-23-02348-i011.jpg 4.000
7 graphic file with name molecules-23-02348-i012.jpg 4.000 8 graphic file with name molecules-23-02348-i013.jpg 4.000
9 graphic file with name molecules-23-02348-i014.jpg 4.000 10 graphic file with name molecules-23-02348-i015.jpg 4.000
11 graphic file with name molecules-23-02348-i016.jpg 4.000 12 * graphic file with name molecules-23-02348-i017.jpg 5.721
13 graphic file with name molecules-23-02348-i018.jpg 4.785 14 graphic file with name molecules-23-02348-i019.jpg 5.113
15 graphic file with name molecules-23-02348-i020.jpg 4.000 16 * graphic file with name molecules-23-02348-i021.jpg 4.000
17 graphic file with name molecules-23-02348-i022.jpg 4.000 18 graphic file with name molecules-23-02348-i023.jpg 4.745
19 graphic file with name molecules-23-02348-i024.jpg 5.215 20 * graphic file with name molecules-23-02348-i025.jpg 4.366
21 graphic file with name molecules-23-02348-i026.jpg 4.000 22 graphic file with name molecules-23-02348-i027.jpg 4.000
23 graphic file with name molecules-23-02348-i028.jpg 4.000 24 graphic file with name molecules-23-02348-i029.jpg 4.000
25 graphic file with name molecules-23-02348-i030.jpg 4.000 26 graphic file with name molecules-23-02348-i031.jpg 5.699
27 graphic file with name molecules-23-02348-i032.jpg 6.400 28 graphic file with name molecules-23-02348-i033.jpg 6.102
29 graphic file with name molecules-23-02348-i034.jpg 5.780 30 * graphic file with name molecules-23-02348-i035.jpg 6.398
31 graphic file with name molecules-23-02348-i036.jpg 5.796 32 graphic file with name molecules-23-02348-i037.jpg 5.420
33 * graphic file with name molecules-23-02348-i038.jpg 5.292 34 graphic file with name molecules-23-02348-i039.jpg 6.456
35 graphic file with name molecules-23-02348-i040.jpg 6.678 36 graphic file with name molecules-23-02348-i041.jpg 6.468
37 graphic file with name molecules-23-02348-i042.jpg 5.131 38 graphic file with name molecules-23-02348-i043.jpg 5.585
39 * graphic file with name molecules-23-02348-i044.jpg 4.730 40 * graphic file with name molecules-23-02348-i045.jpg 5.585
41 graphic file with name molecules-23-02348-i046.jpg 5.886 42 graphic file with name molecules-23-02348-i047.jpg 5.284
43 graphic file with name molecules-23-02348-i048.jpg 6.000 44 graphic file with name molecules-23-02348-i049.jpg 5.602
45 graphic file with name molecules-23-02348-i050.jpg 4.876 46 graphic file with name molecules-23-02348-i051.jpg 6.319
47 graphic file with name molecules-23-02348-i052.jpg 6.215 48 graphic file with name molecules-23-02348-i053.jpg 6.051
49 graphic file with name molecules-23-02348-i054.jpg 4.445 50 * graphic file with name molecules-23-02348-i055.jpg 4.958
51 graphic file with name molecules-23-02348-i056.jpg 4.086 52 graphic file with name molecules-23-02348-i057.jpg 4.217
53 graphic file with name molecules-23-02348-i058.jpg 6.398 54 graphic file with name molecules-23-02348-i059.jpg 5.569
55 graphic file with name molecules-23-02348-i060.jpg 4.663 56 * graphic file with name molecules-23-02348-i061.jpg 6.229
57 * graphic file with name molecules-23-02348-i062.jpg 4.182 58 graphic file with name molecules-23-02348-i063.jpg 5.056
59 graphic file with name molecules-23-02348-i064.jpg 5.009 60 graphic file with name molecules-23-02348-i065.jpg 5.149
61 * graphic file with name molecules-23-02348-i066.jpg 5.886 62 graphic file with name molecules-23-02348-i067.jpg 5.886
63 graphic file with name molecules-23-02348-i068.jpg 4.801 64 graphic file with name molecules-23-02348-i069.jpg 5.201
65 * graphic file with name molecules-23-02348-i070.jpg 6.959 66 * graphic file with name molecules-23-02348-i071.jpg 5.538
67 graphic file with name molecules-23-02348-i072.jpg 6.482 68 graphic file with name molecules-23-02348-i073.jpg 5.921
69* graphic file with name molecules-23-02348-i074.jpg 6.769 70 graphic file with name molecules-23-02348-i075.jpg 5.569
71 graphic file with name molecules-23-02348-i076.jpg 6.824 72 graphic file with name molecules-23-02348-i077.jpg 6.051
73 graphic file with name molecules-23-02348-i078.jpg 7.222 74 graphic file with name molecules-23-02348-i079.jpg 6.000
75 graphic file with name molecules-23-02348-i080.jpg 6.620 76 * graphic file with name molecules-23-02348-i081.jpg 6.638
77 graphic file with name molecules-23-02348-i082.jpg 5.495 78 graphic file with name molecules-23-02348-i083.jpg 5.959
79 graphic file with name molecules-23-02348-i084.jpg 6.181 80 * graphic file with name molecules-23-02348-i085.jpg 5.187
81 graphic file with name molecules-23-02348-i086.jpg 7.301 82 graphic file with name molecules-23-02348-i087.jpg 6.921
83 graphic file with name molecules-23-02348-i088.jpg 6.201

* Test set group of compounds.

The biological activities of these compounds were reported as the negative logarithm of concentration capable of inhibiting 50% of the enzyme activity (IC50), measured using an adapted version of the sensitive fluorescence-based assay based on detection of CoA by 7-diethylamino-3-(4-maleimido-phenyl)-4-methylcoumarin [15]. Furthermore, in an effort to eliminate the potential noise that might have been introduced by the pooling of data sets from different sources, all pharmacological data are obtained from the same laboratory. The IC50 (µM) values were converted into molar units and then expressed in negative logarithmic units (pIC50), and are depicted in Table 4. The range of pIC50 values for the training and test set spans at least three orders of magnitude (4.00 to 7.30), and the biological activity values show a regular distribution over the whole range.

4.2. Molecular Dynamic Simulation (MDS)

The three-dimensional structure from 83 analogues (Table 5) were optimized in vacuum, without any restriction, and the partial atomic charges assigned using RM1 semiempirical Hamiltonian [22]. The MDS was carried out at 300 K, close to the temperature assays, with a simulation sampling time of 100 ps, and intervals of 0.001 ps. Thus, a total sample of 100,000 conformations of each compound was produced. MDS have been performed using the GROMACS 5.1 package [23].

4.3. Alignment Definition

As the compounds are structural analogs, we will assume that all molecules bind to the receptor in a similar mode. In general, the alignments are chosen to span the common framework of the molecules in the training and test sets [24,25,26]. In this work, ten alignments were performed using atoms of the (common) benzene ring. Three-ordered atom trial alignments were selected: (1) a-b-c, (2) a-b-d, (3) b-c-d, (4) c-d-f, (5) b-c-a, (6) b-a-c, (7) a-c-b, (8) c-b-a, (9) d-a-f, and (10) a-b-e (Figure 10). The order of the three ordered-atoms is important: the first atom specified for a molecule might be expected to occupy a similar location in space as the first atom specified for the second molecule. The conformational ensemble profile (CEP) for each compound obtained after the MDS step was overlaid onto a cubic lattice with grid cell size of 1A.

Figure 10.

Figure 10

Ordered atom letter codes (a–f) used in the 4D-QSAR analysis defines the three trial alignments. Compound 81 (pIC50 = 7.301) is used to define the atom letter code.

4.4. Interaction Pharmacophore Elements

According to the 4D-QSAR methodology, atoms of each compound are defined by seven types of interaction pharmacophore elements (IPEs). IPEs correspond to the interactions that may occur between ligand and the active site: (i) any type (any); (ii) nonpolar (np); (iii) polar-positive charge density (p+); 9iv) polar-negative charge density (p−); (v) hydrogen bond acceptor (hba); (vi) hydrogen bond donor (hbd); and (vii) aromatic systems (ar). The occupancy of the grid cells by each IPE type are recorded over the conformational assembly profile, and forms the set of grid cell occupancy descriptors (GCOD) to be utilized as the pool of trial descriptors in the model building and optimization process [16]. The idea underlying a 4D-QSAR analysis is that variations in biological responses were related to differences in the Boltzmann average spatial distribution of molecular shape with respect to the IPE. Thus, the normalized grid cell absolute occupancy, defined as the number of times that a cell was occupied by an atom type during the MDS, divided by the size of the CEP (1000 conformations), was used to define the GCODs.

4D-QSAR model calculation. In order to exclude data noise from databases generated by the alignments, partial least-squares (PLS) regression analysis was performed as a data reduction fit between the observed dependent variable measures and the corresponding set of GCOD values. Additionally, PLS identifies the most highly weighted GCODs from data set of local grid cells [16].

The two hundred GCODs with the highest weight from the data reduction were chosen to form the trial descriptor basis sets for model optimization by genetic function approximation (GFA) analysis [16]. Optimizations were initiated using 100 randomly generated models and 10,000–100,000 crossover operations. Mutation probability over the crossover optimization cycle was set at 10–30%. The smoothing factor, the variable that specifies the number of descriptors in the QSAR models, was varied between 1.0 and 3.0, in order to determine equations with no more than twelve terms. Each alignment was evaluated using the procedure described above.

The best models, resulting from the 4D-QSAR study were based on different criteria [18,26,27,28]:

  • (1)

    Coefficient of determination (r2): is a measure of how well the regression line represents the data.

  • (2)

    Adjusted cross-validated squared correlation coefficient (q2adj): allows the comparison between models with different number of variables.

  • (3)
    Correlation coefficient of external validation set (R2pred): reflects the degree of correlation between the observed (YExp(test))and predicted (YPred(test)) activity data of the test set:
     RPred2=11n(YExp(test)YPred(test))21n(YExp(test)Y¯Training)2  (1)
    where Y¯Training is average value for the dependent variable for the training set.
  • (4)
    Modified r2 (r2m(test)) equation determining the proximity between the observed and predicted values with the zero axis intersection:
     rm(test)2=r2(1|r2r02|)  (2)
  • (5)

    Y-randomization (R2r) consists of the random exchange of the independent variable values. Thus, the R2r value must be less than the correlation coefficient of the non-randomized models.

  • (6)
    R2p penalizes the model R2 for the difference between the squared mean correlation coefficient (R2r) of randomized models and the square correlation coefficient (r2) of the non-randomized model:
     Rp2=r2*r2Rr2  (3)

4.5. Conformational Selection

In the 4D-QSAR method, the conformation of each compound can be postulated as the lowest-energy conformer state from the set sampled for each compound, which predicted the maximum activity using the optimum 4D-QSAR model [16,29,30,31,32].

5. Conclusions

In summary, 4D-QSAR models for NMT inhibitors were built and evaluated. Two test groups were evaluated for the ten tested alignments. The best model was obtained from Alignment B3, and generated an equation with seven descriptors, six of which have positive coefficients and one a negative coefficient. Model B3 showed a satisfactory statistical quality and predictive abilities as shown by r2 = 0.757, q2 = 0.702, q2adjusted = 0.634, R2pred = 0.746, R2m = 0.716, and R2p = 0.609. Furthermore, it showed low values of R2r = 0.110, and ∆R2m(test) = 0.133. 4D-QSAR analysis indicated an important role of acceptor hydrogen bonding groups and aromatic groups, allowing to propose five structures. These have proved more active than compound 81, in addition to being assessed by the Lipinski’s Rule. Accordingly, these molecules may be considered promising prototypes against malaria.

Acknowledgments

We thank the Brazilian agencies “Conselho Nacional de Desenvolvimento Científico e Tecnológico” (CNPq, Brazil), “Coordenação de Aperfeiçoamento de Pessoal de Nível Superior” (CAPES) and “Fundação de Amparo à Pesquisa do Estado de Minas Gerais” (FAPEMIG, Brazil) for their support. We thank A. J. Hopfinger who kindly supplied the 4D-QSAR program for academic use. This work was also supported by Excellence Project FIM, Long-Term Development Plan UHK, and the Ministry of Health (Czech Republic) project NV16-34390A.

Author Contributions

Conceptualization E.F.F.d.C., K.K. and K.M.; L.S.-G. and M.A.d.M.F. did the computational experiments; T.C.R. did data interpretation; E.F.F.d.C. and L.S.-G. Writing-Original Draft Preparation; E.F.F.d.C., and K.M. Writing-Review & Editing.

Funding

This research was funded by Conselho Nacional de Desenvolvimento Científico e Tecnológico (307837/2014-9).

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Sample Availability: Samples of all compounds are available from the authors.

References

  • 1.Njoroge M., Njuguna N.M., Mutai P., Ongarora D.S.B., Smith P.W., Chibale K. Recent Approaches to Chemical Discovery and Development against Malaria and the Neglected Tropical Diseases Human African Trypanosomiasis and Schistosomiasis. Chem. Rev. 2014;114:11138–11163. doi: 10.1021/cr500098f. [DOI] [PubMed] [Google Scholar]
  • 2.Liu W., Li Y., Learn G.H., Rudicell R.S. Origin of the human malaria parasite Plasmodium falciparum in gorillas. Nature. 2010;467:420–467. doi: 10.1038/nature09442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Krungkrai S.R., Krungkrai J. Insights into the pyrimidine biosynthetic pathway of human malaria parasite Plasmodium falciparum as chemotherapeutic target. Asian Pac. J. Trop. Med. 2016;9:525–534. doi: 10.1016/j.apjtm.2016.04.012. [DOI] [PubMed] [Google Scholar]
  • 4.Ridley R.G. Medical need, scientific opportunity and the drive for antimalarial drugs. Nature. 2002;415:686–693. doi: 10.1038/415686a. [DOI] [PubMed] [Google Scholar]
  • 5.Tate E.W., Bell A.S., Rackham M.D., Wright A.H. N-yristoyltransferase as a potential drug target in malaria and leishmaniasis. Parasitology. 2014;141:37–49. doi: 10.1017/S0031182013000450. [DOI] [PubMed] [Google Scholar]
  • 6.Fidock D.A., Nomura T., Talley A.K., Cooper R.A., Dzekunov S.M., Ferdig M.T., Ursos L.M.B., Sidhu A.B.S., Naude B., Deitsch K.W. Mutations in the P-falciparum digestive vacuole transmembrane protein PfCRT and evidence for their role in chloroquine resistance. Mol. Cell. 2000;6:861–871. doi: 10.1016/S1097-2765(05)00077-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dondorp A.M., Nosten F., Yi P., Das D., Phyo A.P., Tarning J., Lwin K.M., Ariey F., Hanpithakpong W., Lee S.J. Artemisinin Resistance in Plasmodium falciparum Malaria. N. Engl. J. Med. 2009;361:455–467. doi: 10.1056/NEJMoa0808859. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Okombo J., Chibale K. Antiplasmodial drug targets: A patent review (2000–2013) Expert.Opin. Ther. Pat. 2016;26:107–130. doi: 10.1517/13543776.2016.1113258. [DOI] [PubMed] [Google Scholar]
  • 9.Goldston A.M., Sharma A.I., Paul K.S., Engman D.M. Acylation in trypanosomatids: An essential process and potential drug target. Trends Parasitol. 2014;30:350–360. doi: 10.1016/j.pt.2014.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wright M.H., Heal W.P., Mann D.J., Tate E.W. Protein myristoylation in health and disease. J. Chem. Biol. 2010;3:19–35. doi: 10.1007/s12154-009-0032-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gunaratne R.S., Sajid M., Ling I.T., Tripathi R., Pachebat J.A., Holder A.A. Characterization of N-myristoyltransferase from Plasmodium falciparum. Biochem. J. 2000;348:459–463. doi: 10.1042/bj3480459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Devadas B., ZUpec M.E., Freeman S.K., Brown D.L., Nagarajan S., Sikorski J.A., McWherter C.A., Getman D.P., Gordon J.I. Design and Syntheses of Potent and Selective Dipeptide Inhibitors of Candida albicans Myristoyl-CoA: Protein N-Myristoyltransferase. Med. Chem. 1995;38:1837–1840. doi: 10.1021/jm00011a001. [DOI] [PubMed] [Google Scholar]
  • 13.Paige L.A., Zheng G., DeFrees S.A., Cassady J.M., Geahlen R.L. S-(2-Oxopentadecyl)-CoA, a Nonhydrolyzable Analogue of Myristoyl-CoA, Is a Potent Inhibitor of Myristoyl-CoA: Protein N-Myristoyltransferase. J. Med. Chem. 1989;32:1665–1667. doi: 10.1021/jm00128a001. [DOI] [PubMed] [Google Scholar]
  • 14.Gonçalves V., Brannigan J.A., Laporte A., Bell A.S., Robert S.M., Wilkinson A.J., Leatherbarrow R.J., Tate E.W. Structure-guided optimization of quinoline inhibitors of Plasmodium N-myristoyltransferase. Med. Chem. Comm. 2016;8:191–197. doi: 10.1039/C6MD00531D. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Leatherbarrow R., Tate E., Yu Z., Racklam M. Novel Compounds and Their Use in Therapy. WO 2013/083991. London Patent. 2013 Jun 13;
  • 16.Hopfinger A.J., Wang S., Tokarski J.S., Jin B.Q., Albuquerque M.G., Madhav P.J., Duraiswami C. Construction of 3D-QSAR models using the 4D-QSAR analysis formalism. J. Am. Chem. Soc. 1997;119:10509–10524. doi: 10.1021/ja9718937. [DOI] [Google Scholar]
  • 17.Roy P.P., Paul S., Mitra I., Roy K. On Two Novel Parameters for Validation of Predictive QSAR Models. Molecules. 2009;14:1660–1701. doi: 10.3390/molecules14051660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Roy K., Chakraborty P., Mitra I., Ojha P.K., Kar S., Das R.N. Some case studies on application of “rm2” metrics for judging quality of quantitative structure-activity relationship predictions: Emphasis on scaling of response data. J. Comput. Chem. 2013;34:1071–1082. doi: 10.1002/jcc.23231. [DOI] [PubMed] [Google Scholar]
  • 19.Pan D.H., Liu J.Z., Senese C., Hopfinger A.J., Tseng Y. Characterization of a ligand-receptor binding event using receptor-dependent four-dimensional quantitative structure-activity relationship analysis. J. Med. Chem. 2004;47:3075–3088. doi: 10.1021/jm030586a. [DOI] [PubMed] [Google Scholar]
  • 20.Patel H.M., Sing B., Bhardwaj V., Palkar M., Shaikh M.S., Rane R., Alwan W.S., Gadad A.K., Noolvi M.N., Karpoormath R. Design, synthesis and evaluation of small molecule imidazo [2,1-b][1,3,4] thiadiazoles as inhibitors of transforming growth factor-beta type-I receptor kinase (ALK5) Eur. J. Med. Chem. 2015;93:599–613. doi: 10.1016/j.ejmech.2014.09.002. [DOI] [PubMed] [Google Scholar]
  • 21.Molinspiration Cheminformatics (Nova ulica, Slovensky Grob, Slovak Republic) [(accessed on 13 September 2018)]; Available online: http://www.molinspiration.com/
  • 22.Rocha G.B., Freire R.O., Simas A.M., Stewart J.J.P. RM1: A reparameterization of AM1 for H, C, N, O, P, S, F, Cl, Br, and I. J. Comput. Chem. 2006;27:1101–1111. doi: 10.1002/jcc.20425. [DOI] [PubMed] [Google Scholar]
  • 23.Páll S., Abraham M.J., Kutzner C., Hess B., Lindahl E. International Conference on Exascale Applications and Software. Springer International Publishing; Cham, Switzerland: 2015. Tackling Exascale Software Challenges in Molecular Dynamics Simulations with GROMACS; pp. 3–27. [Google Scholar]
  • 24.Oliveira A.A., Ramalho T.C., da Cunha E.F.F. QSAR Study of Androstenedione Analogs as Aromatase Inhibitors. Lett. Drug Des. Discov. 2009;6:554–562. doi: 10.2174/157018009789353464. [DOI] [Google Scholar]
  • 25.da Cunha E.F.F., Albuquerque M.G., Antunes O.A.C., de Alencastro R.B. 4D-QSAR models of HOE/BAY-793 analogues as HIV-1 protease inhibitors. QSAR Comb. Sci. 2015;24:240–253. doi: 10.1002/qsar.200430893. [DOI] [Google Scholar]
  • 26.Caldas G.B., Ramalho T.C., da Cunha E.F.F. Application of 4D-QSAR studies to a series of benzothiophene analogs. J. Mol. Mod. 2014;20:2420. doi: 10.1007/s00894-014-2420-4. [DOI] [PubMed] [Google Scholar]
  • 27.Kiralj R., Ferreira M.M.C. Basic Validation Procedures for Regression Models in QSAR and QSPR Studies: Theory and Application. J. Braz. Chem. Soc. 2009;20:770–787. doi: 10.1590/S0103-50532009000400021. [DOI] [Google Scholar]
  • 28.Roy K., Paul S. Exploring 2D and 3D QSARs of 2,4-Diphenyl-1,3-oxazolines for Ovicidal Activity against Tetranychus urticae. QSAR Comb. Sci. 2009;28:406–425. doi: 10.1002/qsar.200810130. [DOI] [Google Scholar]
  • 29.Veerasamy R., Rajak H., Jain A., Sivadasan S., Varghese C.P., Agrawal R.K. Validation of QSAR Models—Strategies and Importance. Int. J. Drug Des. Discov. 2011;3:511–519. [Google Scholar]
  • 30.Cormanich R.A., Moreira M.A., Freitas M.P., Ramalho T.C., Anconi C.P.A., Rittner R., Contreras R.H., Tormena C.F. 1hJFH coupling in 2-fluorophenol revisited: Is intramolecular hydrogen bond responsible for this long-range coupling? Magn. Reson. Chem. 2011;49:763–767. doi: 10.1002/mrc.2838. [DOI] [PubMed] [Google Scholar]
  • 31.Caetano M.S., Ramalho T.C., Botrel D.F., da Cunha E.F.F., de Mello W.C. Understanding the inactivation process of organophosphorus herbicides: A DFT study of glyphosate metallic complexes with Zn2+, Ca2+, Mg2+, Cu2+, Co3+, Fe3+, Cr3+, and Al3+ Int. J. Quantum Chem. 2012;112:2752–2762. doi: 10.1002/qua.23222. [DOI] [Google Scholar]
  • 32.De Castro A.A., Prandi I.G., Kuca K., Ramalho T.C. Organophosphorus degrading enzymes: Molecular basis and perspectives for enzymatic bioremediation of agrochemicals. Cienc. Agrotecnol. 2017;41:471–482. [Google Scholar]

Articles from Molecules : A Journal of Synthetic Chemistry and Natural Product Chemistry are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES